Forum for Information Retrieval Evaluation 2021
DOI: 10.1145/3503162.3503176
|View full text |Cite
|
Sign up to set email alerts
|

Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages and Conversational Hate Speech

Abstract: The widespread of offensive content online such as hate speech poses a growing societal problem. AI tools are necessary for supporting the moderation process at online platforms. For the evaluation of these identification tools, continuous experimentation with data sets in different languages are necessary. The HASOC track (Hate Speech and Offensive Content Identification) is dedicated to develop benchmark data for this purpose. This paper presents the HASOC subtrack for English, Hindi, and Marathi. The data s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
34
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 57 publications
(42 citation statements)
references
References 15 publications
0
34
0
Order By: Relevance
“…Its performance, averaged on the two Hindi problems and the Marathi problem, ranks it in first place among the teams that proposed systems for at least two of these problems. These performances suggest that it is an interesting reference level to evaluate the benefits of using more complex approaches that are frequently used to address this type of task such as deep learning or taking into account complementary resources (Mandl et al, 2019;Mandl et al, 2020;. However, it is essential to note that the proposed system never ranked first in any specific task.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Its performance, averaged on the two Hindi problems and the Marathi problem, ranks it in first place among the teams that proposed systems for at least two of these problems. These performances suggest that it is an interesting reference level to evaluate the benefits of using more complex approaches that are frequently used to address this type of task such as deep learning or taking into account complementary resources (Mandl et al, 2019;Mandl et al, 2020;. However, it is essential to note that the proposed system never ranked first in any specific task.…”
Section: Discussionmentioning
confidence: 99%
“…Not surprisingly, a lot of research is being done to develop automatic detection systems. As in many NLP domains, deep learning approaches and the use of pre-computed embeddings have proven to be the most efficient, even in languages with few resources (Mandl et al, 2019;Mandl et al, 2020). However, traditional machine learning systems have sometimes proven to be very competitive (Mujadia et al, 2019;Saroj et al, 2019).…”
mentioning
confidence: 99%
“…The Hate speech and Offensive Content Identification in English and Indo-Aryan Languages HASOC 2021 [5,6] purposes two different tasks, in 3 different languages English, Hindi, Marathi. The authors participated in both tasks for English and Hindi languages.…”
Section: Languagesmentioning
confidence: 99%
“…The authors would like to thank the organizers of Hate Speech and Offensive Content Identification in Indo-Aryan Languages 2021 [5] for conducting this data challenge. The authors gratefully acknowledge google colab for providing GPU's to do the computation.…”
Section: Acknowledgmentsmentioning
confidence: 99%
See 1 more Smart Citation