EVALITA Evaluation of NLP and Speech Tools for Italian 2018
DOI: 10.4000/books.aaccademia.4503
|View full text |Cite
|
Sign up to set email alerts
|

Overview of the EVALITA 2018 Hate Speech Detection Task

Abstract: The Hate Speech Detection (HaSpeeDe) task is a shared task on Italian social media (Facebook and Twitter) for the detection of hateful content, and it has been proposed for the first time at EVALITA 2018. Providing two datasets from two different online social platforms differently featured from the linguistic and communicative point of view, we organized the task in three tasks where systems must be trained and tested on the same resource or using one in training and the other in testing: HaSpeeDe-FB, HaSpeeD… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
87
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 95 publications
(87 citation statements)
references
References 1 publication
0
87
0
Order By: Relevance
“…sono zavorre e tutti uomini (refugees? They are deadweights and all men) Source: (Bosco et al 2018 Warner and Hirschberg (2012) Use of a sexist or racial slur, attack a minority, promotes hate speech or violent crime, blatantly misrepresents truth, shows support of problematic hashtags, defends xenophobia or sexism, or contains a screen name that is offensive Waseem and Hovy (2016) Act of offending, insulting or threatening a person or a group of similar people on the basis of religion, race, caste, sexual orientation, gender or belongingness to a specific stereotyped community Schmidt and Wiegand (2017) Language that is used to express hatred towards a targeted group or is intended to be derogatory, to humiliate, or to insult the members of the group Davidson et al (2017) Any communication that disparages a target group of people based on some characteristic such as race, colour, ethnicity, gender, sexual orientation, nationality, religion, or other characteristic Nockleby (2000) Aggressiveness Intention to be aggressive, harmful, or even to incite, in various forms, to violent acts against a given target Sanguinetti et al (2018) Offensiveness Any form of non-acceptable language (profanity) or a targeted offense, which can be veiled or direct Zampieri et al (2019a) Profanity, strongly impolite, rude or vulgar language expressed with fighting or hurtful words in order to insult a targeted individual or group Fortuna and Nunes (2018) Abusiveness/ toxicity Hurtful language, including hate speech, derogatory language and also profanity Founta et al (2018) Any strongly impolite, rude or hurtful language using profanity, that can show a debasement of someone or something, or show intense emotion Fortuna and Nunes (2018) Extremely offensive and insulting; engaging in or characterized by habitual violence and cruelty Oxford English Dictionary (2019)…”
Section: Inclusion and Exclusion Criteriamentioning
confidence: 99%
See 2 more Smart Citations
“…sono zavorre e tutti uomini (refugees? They are deadweights and all men) Source: (Bosco et al 2018 Warner and Hirschberg (2012) Use of a sexist or racial slur, attack a minority, promotes hate speech or violent crime, blatantly misrepresents truth, shows support of problematic hashtags, defends xenophobia or sexism, or contains a screen name that is offensive Waseem and Hovy (2016) Act of offending, insulting or threatening a person or a group of similar people on the basis of religion, race, caste, sexual orientation, gender or belongingness to a specific stereotyped community Schmidt and Wiegand (2017) Language that is used to express hatred towards a targeted group or is intended to be derogatory, to humiliate, or to insult the members of the group Davidson et al (2017) Any communication that disparages a target group of people based on some characteristic such as race, colour, ethnicity, gender, sexual orientation, nationality, religion, or other characteristic Nockleby (2000) Aggressiveness Intention to be aggressive, harmful, or even to incite, in various forms, to violent acts against a given target Sanguinetti et al (2018) Offensiveness Any form of non-acceptable language (profanity) or a targeted offense, which can be veiled or direct Zampieri et al (2019a) Profanity, strongly impolite, rude or vulgar language expressed with fighting or hurtful words in order to insult a targeted individual or group Fortuna and Nunes (2018) Abusiveness/ toxicity Hurtful language, including hate speech, derogatory language and also profanity Founta et al (2018) Any strongly impolite, rude or hurtful language using profanity, that can show a debasement of someone or something, or show intense emotion Fortuna and Nunes (2018) Extremely offensive and insulting; engaging in or characterized by habitual violence and cruelty Oxford English Dictionary (2019)…”
Section: Inclusion and Exclusion Criteriamentioning
confidence: 99%
“…A second key distinction concerns the source from which data are retrieved. The microblogging platform Twitter 11 is by far the most exploited source, due to the relatively reduced length of texts and to a friendly policy on making data publicly available: 32 resources contain tweets, one of which (Olteanu et al 2018) also features posts from the social aggregator Reddit 12 , one (Nascimento et al 2019) also retrieves comments from the 55chan 13 imageboard, while in two works (Bosco et al 2018;Mandl et al 2019 2018use sentences from the well-known white-suprematist forum Stormfront; the dataset released for the Hate Speech Hackathon 15 contains posts from the Wikipedia Topical focus: Abusiveness (5); Aggressiveness (2); Anti-Roma (1); Child sexual abuse (1); Cyberbullying (2); Flames (1); Harassment (1); Homophobia (4); HS (36); Islamophobia (2); Obscenity, Profanity (3); Offensiveness (13); Personal Attacks (1); Racism (6); Sexism, Misogyny (9); Threats, Violence (1); Toxicity (1); White supremacy (1). Nearly all the resources feature user-generated public contents, mostly microblog posts, often retrieved with a keyword-based approach and mostly using words with a negative polarity.…”
Section: Data Sourcementioning
confidence: 99%
See 1 more Smart Citation
“…The different models and features presented in the literature are difficult to compare effectively because the results are evaluated on individual datasets that are often not public, hence the survey advocates for broader availability of publicly available data. This evaluation gap is being bridged recently by evaluation campaigns for English, Spanish (SemEval [10]), German [11], and Italian (EVALITA [12]), whose shared tasks released annotated datasets for hate speech detection. The availability of benchmarks for system evaluation and datasets for hate speech detection in different languages made the challenge of investigating architectures, which are also stable and well-performing across different languages, an exciting issue to research [13,14].…”
Section: Related Workmentioning
confidence: 99%
“…This result is confirmed in [23], where AlBERTo is applied to hate speech detection on Italian social media. We trained AlBERTo on data that also encompassed the train and reference set from Haspeede [12], the first shared task on hate speech on Italian organized within EVALITA2018 evaluation campaign (http://www.di.unito.it/~tutreeb/ haspeede-evalita18/index.html).…”
Section: Related Workmentioning
confidence: 99%