Revisiting Hate Speech Benchmarks: From Data Curation to System Deployment

Kulkarni, Atharva; Masud, Sarah; Goyal, Vikram; Chakraborty, Tanmoy

doi:10.1145/3580305.3599896

Cited by 3 publications

(1 citation statement)

References 60 publications

(96 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Along similar lines, we have hate and offense datasets from diverse online forums like Wikipedia [64], Stormfront [16], Facebook [46], Reddit [37] etc. On the other hand, to overcome the use of hate lexicons in curating the datasets, large-scale neutrally seeded datasets have also been proposed [19,28,55]. The initial research in hate speech datasets focused on multi-class text classification assuming English posts.…”

Section: Related Workmentioning

confidence: 99%

Proactively Reducing the Hate Intensity of Online Posts via Hate Speech Normalization

Masud

Bedi

Khan

et al. 2022

Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

View full text Add to dashboard Cite

Although pre-trained large language models (PLMs) have achieved state-of-the-art on many NLP tasks, they lack understanding of subtle expressions of implicit hate speech. Such nuanced and implicit hate is often misclassified as non-hate. Various attempts have been made to enhance the detection of (implicit) hate content by augmenting external context or enforcing label separation via distance-based metrics. We combine these two approaches and introduce FiADD, a novel Focused Inferential Adaptive Density Discrimination framework. FiADD enhances the PLM finetuning pipeline by bringing the surface form of an implicit hate speech closer to its implied form while increasing the inter-cluster distance among various class labels. We test FiADD on three implicit hate datasets and observe significant improvement in the two-way and three-way hate classification tasks. We further experiment on the generalizability of FiADD on three other tasks, namely detecting sarcasm, irony, and stance, in which surface and implied forms differ, and observe similar performance improvement. We analyze the generated latent space to understand its evolution under FiADD, which corroborates the advantage of employing FiADD for implicit hate speech detection. 1

show abstract