2022
DOI: 10.48550/arxiv.2202.00126
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Handling Bias in Toxic Speech Detection: A Survey

Abstract: The massive growth of social media usage has witnessed a tsunami of online toxicity in teams of hate speech, abusive posts, cyberbullying, etc. Detecting online toxicity is challenging due to its inherent subjectivity. Factors such as the context of the speech, geography, socio-political climate, and background of the producers and consumers of the posts play a crucial role in determining if the content can be flagged as toxic. Adoption of automated toxicity detection models in production can lead to a sidelin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(9 citation statements)
references
References 69 publications
0
2
0
Order By: Relevance
“…Toxicity detection is inherently complex and subjective, with different definitions and interpretations among researchers (Garg et al, 2022;Kowert, 2020). Biases also vary across communities, influenced by culture, origin and socio-political context.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Toxicity detection is inherently complex and subjective, with different definitions and interpretations among researchers (Garg et al, 2022;Kowert, 2020). Biases also vary across communities, influenced by culture, origin and socio-political context.…”
Section: Related Workmentioning
confidence: 99%
“…In this study, we define biases as "prejudice in favour of or against one thing, person, or group compared with another usually in a way that's considered to be unfair" (University of California). Natural language processing encompasses a wide range of types of biases, categorized by their sources or the type of harm they cause (Sap et al, 2019;Garg et al, 2022). Our focus lies specifically on lexical identity biases, which refer to biases conveyed by terms related to one's identity or characteristic (Zhou et al, 2021).…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The COMPAS dataset contains outcomes within 2 years of the decision, for over 10,000 criminal defendants in Broward County, Florida. The ''Stop, Question and Frisk'' database 5 contains data from NYPD officers' interactions with potential suspects of committing a crime. Features include locality-based information like time, street name, area code, etc.…”
Section: ) Racial and Religious Bias Datasetsmentioning
confidence: 99%
“…Levitin [4] highlights that as data is collected by humans, they decide what to collect and what not. The objective for which the data is collected and its respective planning leads to wrong analysis/conclusions, e.g., which population/features to select and what to label, also called lexical bias [5]. At the learning stage, it is the bias that exists due to the transfer of bias in the model and how much it affects certain groups while proposing a generalised model that will work for all groups in the data [6].…”
Section: Introductionmentioning
confidence: 99%