Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics 2019
DOI: 10.18653/v1/p19-1271
|View full text |Cite
|
Sign up to set email alerts
|

CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech

Abstract: Although there is an unprecedented effort to provide adequate responses in terms of laws and policies to hate content on social media platforms, dealing with hatred online is still a tough problem. Tackling hate speech in the standard way of content deletion or user suspension may be charged with censorship and overblocking. One alternate strategy, that has received little attention so far by the research community, is to actually oppose hate content with counter-narratives (i.e. informed textual responses). I… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
115
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 126 publications
(132 citation statements)
references
References 53 publications
1
115
0
Order By: Relevance
“…Regarding the languages, as expected, most of the resources use English data, although in some cases they are collected along with texts in Hindi (Bohra et al 2018;Kumar et al 2018a;Mathur et al 2018) or they are part of even larger multilingual collections (Chung et al 2019;Ousidhoum et al 2019;Steinberger et al 2017). It is also worth pointing out that less-resourced languages such as Amharic, Bengali, Slovene and Swedish, are also represented in the corpora we found, thus enabling a greater linguistic diversity in this field.…”
Section: Tablesupporting
confidence: 58%
“…Regarding the languages, as expected, most of the resources use English data, although in some cases they are collected along with texts in Hindi (Bohra et al 2018;Kumar et al 2018a;Mathur et al 2018) or they are part of even larger multilingual collections (Chung et al 2019;Ousidhoum et al 2019;Steinberger et al 2017). It is also worth pointing out that less-resourced languages such as Amharic, Bengali, Slovene and Swedish, are also represented in the corpora we found, thus enabling a greater linguistic diversity in this field.…”
Section: Tablesupporting
confidence: 58%
“…They were then extended with minor linguistic variants. This report 15 from Moonshot CVE was used as a guide to the overall conspiracy landscape within COVID-19. They provide some hashtags, and variants were then acquired, again, from looking down the list of hashtags appearing in the dataset for other variants, and including linguistic variations.…”
Section: Findings: General Trends and Comparisons (Rq1)mentioning
confidence: 99%
“…Agonism argues that the contestations of the time can be used to renew democracy and strengthen public discourse [47]. Promising work on recognising an highlighting counter-speech in online communication is already on the horizon [15,50].…”
Section: Power Affect and Vitriolmentioning
confidence: 99%
“…We made use of the latter three datasets in our research. Mathew et al [31] and Chung et al [11] provided counter speech datasets for better analysis of hate speech.…”
Section: Related Work 21 Hate Speech Detectionmentioning
confidence: 99%