Forum for Information Retrieval Evaluation 2021
DOI: 10.1145/3503162.3505241
|View full text |Cite
|
Sign up to set email alerts
|

UrduThreat@ FIRE2021: Shared Track on Abusive Threat Identification in Urdu

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(5 citation statements)
references
References 2 publications
0
2
0
Order By: Relevance
“…For the purpose of this study, two cross-platform datasets have been gathered for detecting offensive language, one from Twitter platform ( Amjad et al, 2022 ) (referred to as D 1 ) and the other from YouTube (referred to as D 2 ). The type of text on both platforms is different so these datasets are ideal for answering the research question discussed in the introduction.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…For the purpose of this study, two cross-platform datasets have been gathered for detecting offensive language, one from Twitter platform ( Amjad et al, 2022 ) (referred to as D 1 ) and the other from YouTube (referred to as D 2 ). The type of text on both platforms is different so these datasets are ideal for answering the research question discussed in the introduction.…”
Section: Methodsmentioning
confidence: 99%
“…As in Akhter et al (2021) researchers have used machine learning and deep learning techniques to understand which technique performed well on Roman Urdu and Nastaliq Urdu scripts. In order to improve collaboration and contribution in the field of Urdu abusive language detection, a competition was arranged to come up with novel ways to detect the abusive language in Urdu Nastaliq script ( Amjad et al, 2022 ). Urdu, a language spoken mainly in Pakistan and India, is considered a low-resource language in terms of natural language processing (NLP) research.…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…The shared tasks [19] present in this competition are divided into two parts. Where in one part participants have to focus on detecting Abusive language using twitter tweets in Urdu language (Subtask A) 13 and in other part mainly focusing on detecting Threatening language using Twitter tweets in Urdu language (Subtask B) 14 . The presented data has been collected and annotated from Natural Language and Text Processing Laboratory 15 at Center for Computing Research 16 of Instituto Politécnico Nacional, Mexico.…”
Section: Dataset Descriptionmentioning
confidence: 99%
“…There is also one sub-task in HASOC 2020 9 which aimed to identify offensive post in code-mixed dataset. Extending that task further, the organisers of this shared task [13] have build two datasets of 3400, 9950 posts to detect abusive and threatening language in Urdu. Twitter's definition has been followed to describe , whether a post is abusive/non-abusive 10 , and threading/non-threatening 11 .…”
Section: Introductionmentioning
confidence: 99%