RANLP 2017 - Recent Advances in Natural Language Processing Meet Deep Learning 2017
DOI: 10.26615/978-954-452-049-6_036

Detecting Online Hate Speech Using Context Aware Models

Abstract: In the wake of a polarizing election, the cyber world is laden with hate speech. Context accompanying a hate speech text is useful for identifying hate speech, which however has been largely overlooked in existing datasets and hate speech detection models. In this paper, we provide an annotated corpus of hate speech with context information well kept. Then we propose two types of hate speech detection models that incorporate context information, a logistic regression model with context features and a neural network model.
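
As a concrete illustration of the first model family the abstract names, the sketch below trains a logistic regression classifier on bag-of-words features extracted separately from a comment and from its context (e.g., the title of the article it responds to), then concatenated. This is a minimal sketch under assumed features; it is not the paper's actual feature set, and the toy data, labels, and names are hypothetical.

# Minimal sketch: context-aware logistic regression for hate speech.
# Assumed setup, not Gao and Huang's exact features: tf-idf features
# are built separately for the comment and its context, then stacked.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import FeatureUnion, Pipeline
from sklearn.preprocessing import FunctionTransformer

# Each example is a (comment_text, context_text) pair (toy data).
pairs = [("you people are a plague", "City council election results"),
         ("great point, thanks for sharing", "City council election results")]
labels = [1, 0]  # 1 = hateful, 0 = non-hateful (hypothetical labels)

pick_comment = FunctionTransformer(lambda X: [p[0] for p in X])
pick_context = FunctionTransformer(lambda X: [p[1] for p in X])

model = Pipeline([
    ("features", FeatureUnion([
        ("comment", Pipeline([("pick", pick_comment),
                              ("tfidf", TfidfVectorizer())])),
        ("context", Pipeline([("pick", pick_context),
                              ("tfidf", TfidfVectorizer())])),
    ])),
    ("clf", LogisticRegression(max_iter=1000)),
])
model.fit(pairs, labels)
print(model.predict([("you people are a plague", "Weather report")]))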

Cited by 162 publications (122 citation statements). References 20 publications.
“…The task of toxic comment classification lacks a consistently labeled standard dataset for comparative evaluation (Schmidt and Wiegand, 2017). While there are a number of annotated public datasets in adjacent fields, such as hate speech (Ross et al., 2016; Gao and Huang, 2017), racism/sexism (Waseem, 2016; Waseem and Hovy, 2016) or harassment (Golbeck et al., 2017) detection, most of them follow different definitions for labeling and therefore often constitute different problems.…”
Section: Datasets and Tasks (mentioning)
confidence: 99%
“…Bidirectional GRU with Attention Layer. Gao and Huang (2017) phrase that "attention mechanisms are suitable for identifying specific small regions indicating hatefulness in long comments". In order to detect these small regions in our comments, we add an attention layer to our bidirectional GRU-based network following the work of Yang et al (2016).…”
Section: Recurrent Neural Network (mentioning)
confidence: 99%
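
For readers unfamiliar with the architecture the excerpt describes, here is a minimal PyTorch sketch of a bidirectional GRU with a word-level attention layer in the style of Yang et al. (2016). The dimensions, vocabulary size, and two-class output head are placeholder assumptions, not values taken from the cited papers.

# Sketch of a BiGRU encoder with additive word-level attention
# (Yang et al., 2016 style); all sizes are illustrative assumptions.
import torch
import torch.nn as nn

class BiGRUAttention(nn.Module):
    def __init__(self, vocab_size=10000, embed_dim=100,
                 hidden_dim=64, attn_dim=64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.gru = nn.GRU(embed_dim, hidden_dim,
                          bidirectional=True, batch_first=True)
        # Project each hidden state, score it against a learned
        # context vector, and softmax the scores over time.
        self.proj = nn.Linear(2 * hidden_dim, attn_dim)
        self.ctx = nn.Parameter(torch.randn(attn_dim))
        self.out = nn.Linear(2 * hidden_dim, 2)  # hateful vs. not

    def forward(self, token_ids):                        # (B, T)
        h, _ = self.gru(self.embed(token_ids))           # (B, T, 2H)
        u = torch.tanh(self.proj(h))                     # (B, T, A)
        weights = torch.softmax(u @ self.ctx, dim=1)     # (B, T)
        pooled = (weights.unsqueeze(-1) * h).sum(dim=1)  # (B, 2H)
        return self.out(pooled)                          # (B, 2)

logits = BiGRUAttention()(torch.randint(0, 10000, (4, 20)))
print(logits.shape)  # torch.Size([4, 2])

The attention weights also make the model inspectable: high-weight timesteps mark the "small regions indicating hatefulness" the excerpt mentions.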
“…A bewildering plethora of different types of abusive language can be found online. Some of the types dealt with in related work include but are not limited to sexism, racism (Waseem and Hovy, 2016; Waseem, 2016), toxicity (Kolhatkar et al., 2018), hatefulness (Gao and Huang, 2017), aggression (Kumar et al., 2018), attack (Wulczyn et al., 2017), obscenity, threats, and insults. A typology of abusive language detection subtasks was recently proposed by Waseem et al. (2017).…”
Section: Related Work (mentioning)
confidence: 99%
“…Traditional machine learning approaches to detecting abusive language include the naive Bayes classifier (Kwok and Wang, 2013; Chen et al., 2012; Dinakar et al., 2011), logistic regression (Waseem and Hovy, 2016; Wulczyn et al., 2017; Burnap and Williams, 2015), and support vector machines (SVM) (Dadvar et al., 2013; Schofield and Davidson, 2017). The best performance is most often attained by deep learning models, the most popular being convolutional neural networks (Gambäck and Sikdar, 2017; Potapova and Gordeev, 2016; Pavlopoulos et al., 2017) and variants of recurrent neural networks (Pavlopoulos et al., 2017; Gao and Huang, 2017; Pitsilis et al., 2018; Zhang et al., 2018). Some approaches (Badjatiya et al., 2017; Park and Fung, 2017; Mehdad and Tetreault, 2016) also rely on combining different types of models.…”
Section: Related Work (mentioning)
confidence: 99%
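
To make the contrast concrete, the sketch below runs two of the traditional baselines named in the excerpt, multinomial naive Bayes and a linear SVM, over the same bag-of-n-grams features. The toy data and setup are hypothetical and do not reproduce any cited paper's experiments.

# Two classic baselines for abusive-language detection (illustrative
# toy data; the cited works use different datasets and features).
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

texts = ["go back to where you came from", "have a nice day",
         "you are all animals", "interesting article, thanks"]
labels = [1, 0, 1, 0]  # 1 = abusive, 0 = benign (hypothetical)

for clf in (MultinomialNB(), LinearSVC()):
    model = make_pipeline(CountVectorizer(ngram_range=(1, 2)), clf)
    model.fit(texts, labels)
    print(type(clf).__name__, model.predict(["you people are animals"]))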
“…Many varieties of toxic language have been considered in NLP research, including sexism, racism (Waseem and Hovy, 2016a; Waseem, 2016), toxicity (Kolhatkar et al., 2018), hatefulness (Gao and Huang, 2017a), aggression (Kumar et al., 2018), attack (Wulczyn et al., 2017a), obscenity, threats, and insults. Waseem et al. (2017) proposed a systematic typology of toxic language.…”
Section: Related Work (mentioning)
confidence: 99%