Binny Mathew scite author profile

The present online social media platform is afflicted with several issues, with hate speech being on the predominant forefront. The prevalence of online hate speech has fuelled horrific real-world hate-crime such as the mass-genocide of Rohingya Muslims, communal violence in Colombo and the recent massacre in the Pittsburgh synagogue. Consequently, It is imperative to understand the diffusion of such hateful content in an online setting. We conduct the first study that analyses the flow and dynamics of posts generated by hateful and non-hateful users on Gab (gab.com) over a massive dataset of 341K users and 21M posts. Our observations confirms that hateful content diffuse farther, wider and faster and have a greater outreach than those of non-hateful users. A deeper inspection into the profiles and network of hateful and nonhateful users reveals that the former are more influential, popular and cohesive. Thus, our research explores the interesting facets of diffusion dynamics of hateful users and broadens our understanding of hate speech in the online world.

show abstract

HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection

Mathew

Saha

Yimam

et al. 2021

AAAI

207

View full text Add to dashboard Cite

Hate speech is a challenging issue plaguing the online social media. While better models for hate speech detection are continuously being developed, there is little research on the bias and interpretability aspects of hate speech. In this paper, we introduce HateXplain, the first benchmark hate speech dataset covering multiple aspects of the issue. Each post in our dataset is annotated from three different perspectives: the basic, commonly used 3-class classification (i.e., hate, offensive or normal), the target community (i.e., the community that has been the victim of hate speech/offensive speech in the post), and the rationales, i.e., the portions of the post on which their labelling decision (as hate, offensive or normal) is based. We utilize existing state-of-the-art models and observe that even models that perform very well in classification do not score high on explainability metrics like model plausibility and faithfulness. We also observe that models, which utilize the human rationales for training, perform better in reducing unintended bias towards target communities. We have made our code and dataset public for other researchers.

show abstract

HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection

Mathew¹,

Saha²,

Yimam³

et al. 2020

Preprint

View full text Add to dashboard Cite

show abstract

Deep Learning Models for Multilingual Hate Speech Detection

Aluru¹,

Mathew²,

Saha³

et al. 2020

Preprint

View full text Add to dashboard Cite

A Deep Dive into Multilingual Hate Speech Classification

Aluru

Mathew

Saha

et al. 2021

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Binny Mathew

Spread of Hate Speech in Online Social Media

HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection

HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection

Deep Learning Models for Multilingual Hate Speech Detection

A Deep Dive into Multilingual Hate Speech Classification

Contact Info

Product

Resources

About