2018
DOI: 10.48550/arxiv.1805.04661
Preprint

Examining a hate speech corpus for hate speech detection and popularity prediction

Abstract: As research on hate speech becomes more and more relevant every day, most of it is still focused on hate speech detection. By attempting to replicate a hate speech detection experiment performed on an existing Twitter corpus annotated for hate speech, we highlight some issues that arise from doing research in the field of hate speech, which is essentially still in its infancy. We take a critical look at the training corpus in order to understand its biases, while also using it to venture beyond hate speech det…

Cited by 1 publication (1 citation statement) · References 14 publications
“…The authors labeled the tweets as racist, sexist, or neither, following guidelines inspired by critical race theory, and had a domain expert review their labels. However, this dataset has received significant criticism from scholars [23,30], who note that most of the racist tweets are anti-Muslim and that the sexist tweets largely relate to a debate over an Australian television show. The dataset also introduces author bias: two users wrote 70% of the sexist tweets, and 99% of the racist tweets were written by a single other user.…”
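The author-bias figures quoted above can be checked mechanically for any labeled corpus. Below is a minimal sketch of such a check; the file name and the `user_id`/`label` columns are hypothetical placeholders, not taken from the paper or the dataset's actual schema.

```python
import pandas as pd

# Hypothetical corpus layout: one row per tweet, with the author's ID
# and the annotated label (e.g. "racism", "sexism", or "none").
df = pd.read_csv("hate_speech_corpus.csv")  # columns: user_id, label

# For each label, compute the share of tweets contributed by the most
# prolific authors. A handful of users dominating a class is exactly
# the author bias criticized in the citation statement above.
for label, group in df.groupby("label"):
    shares = group["user_id"].value_counts(normalize=True)
    top1 = shares.iloc[0]
    top2 = shares.iloc[:2].sum()
    print(f"{label}: top author wrote {top1:.0%} of tweets, "
          f"top two authors wrote {top2:.0%}")
```

If the top one or two authors account for most of a class, as reported for this corpus, a classifier trained on it may learn author-specific style rather than the target phenomenon.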