3 months ago

Why Is It Hate Speech? Masked Rationale Prediction for Explainable Hate Speech Detection

Jiyun Kim Byounghan Lee Kyung-Ah Sohn

Abstract

In a hate speech detection model, we should consider two critical aspects in addition to detection performance: bias and explainability. Hate speech cannot be identified based solely on the presence of specific words: the model should be able to reason like humans and be explainable. To improve the performance concerning the two aspects, we propose Masked Rationale Prediction (MRP) as an intermediate task. MRP is a task to predict the masked human rationales (snippets of a sentence that are grounds for human judgment) by referring to surrounding tokens combined with their unmasked rationales. As the model learns its reasoning ability based on rationales by MRP, it performs hate speech detection robustly in terms of bias and explainability. The proposed method generally achieves state-of-the-art performance in various metrics, demonstrating its effectiveness for hate speech detection.
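
The masking step described in the abstract can be illustrated with a minimal sketch. It is not taken from the official repository: the function name `mask_rationales`, the masking rate, the `MASK_ID` value, and the `IGNORE` marker are all assumptions made for illustration. Each token carries a 0/1 rationale label; some labels are hidden, and the hidden labels become the prediction targets while the visible ones remain available as input alongside the tokens.

```python
import random

MASK_ID = 2    # hypothetical id standing in for a hidden rationale label
IGNORE = -100  # hypothetical marker for positions that are not predicted


def mask_rationales(rationales, mask_prob=0.5, seed=None):
    """Hide a random subset of per-token rationale labels.

    rationales: list of 0/1 labels, 1 meaning the token is part of a
    human rationale. Returns (inputs, targets): inputs holds MASK_ID at
    hidden positions, targets holds the original label there and IGNORE
    everywhere else.
    """
    rng = random.Random(seed)
    inputs, targets = [], []
    for label in rationales:
        if rng.random() < mask_prob:
            inputs.append(MASK_ID)   # the model must predict this label
            targets.append(label)
        else:
            inputs.append(label)     # visible rationale label, used as context
            targets.append(IGNORE)
    return inputs, targets


# Placeholder tokens; the rationale labels are made up for the example.
tokens = ["w1", "w2", "w3", "w4"]
rationales = [0, 1, 0, 1]
inputs, targets = mask_rationales(rationales, mask_prob=0.5, seed=0)
for tok, inp, tgt in zip(tokens, inputs, targets):
    print(tok, inp, tgt)
```

In a real model the `inputs` sequence would typically be embedded and combined with the token embeddings, and a loss would be computed only at positions whose target is not `IGNORE`. How the paper performs that combination is not stated in the abstract.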

Code Repositories

alatteaday/mrp_hate-speech-detection (official, PyTorch)

Benchmarks

Benchmark                              Methodology  AUROC  Accuracy  Macro F1
hate-speech-detection-on-hatexplain    BERT-RP      0.853  0.707     0.693
hate-speech-detection-on-hatexplain    BERT-MRP     0.862  0.704     0.699
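
For readers comparing the Accuracy and Macro F1 columns, the sketch below shows how the two metrics are commonly computed. It is a generic illustration, not the paper's evaluation script; the class names and the toy predictions are invented for the example. Macro F1 averages per-class F1 scores with equal weight, so a class the model never predicts pulls the score down even when accuracy stays high.

```python
def accuracy(y_true, y_pred):
    """Fraction of examples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over every label seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in labels:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class has no hits.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy labels, invented for illustration only.
y_true = ["hate", "offensive", "normal", "normal"]
y_pred = ["hate", "normal", "normal", "normal"]
print(accuracy(y_true, y_pred))  # 0.75
print(macro_f1(y_true, y_pred))  # approximately 0.6
```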
