2021
DOI: 10.3390/informatics8040069
|View full text |Cite
|
Sign up to set email alerts
|

Arabic Offensive and Hate Speech Detection Using a Cross-Corpora Multi-Task Learning Model

Abstract: As social media platforms offer a medium for opinion expression, social phenomena such as hatred, offensive language, racism, and all forms of verbal violence have increased spectacularly. These behaviors do not affect specific countries, groups, or communities only, extending beyond these areas into people’s everyday lives. This study investigates offensive and hate speech on Arab social media to build an accurate offensive and hate speech detection system. More precisely, we develop a classification system f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
12
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 50 publications
(27 citation statements)
references
References 60 publications
0
12
0
Order By: Relevance
“…Hugely influenced by (Aldjanabi et al, 2021) work, we were able to explore many previous approaches to Arabic (HS) and (OFF) detection using (MTL). The first Arabic Religious (HS) Twitter dataset was collected by (Albadi et al, 2018).…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Hugely influenced by (Aldjanabi et al, 2021) work, we were able to explore many previous approaches to Arabic (HS) and (OFF) detection using (MTL). The first Arabic Religious (HS) Twitter dataset was collected by (Albadi et al, 2018).…”
Section: Related Workmentioning
confidence: 99%
“…Their proposed model achieved 0.904, 0.737 F1-score in the (OFF) and (HS) sub-tasks respectively. Moving from OSACT2020 submissions, (Aldjanabi et al, 2021) explores (MTL) more widely. They use dataset from OSACT2020 (HS) and (OFF), T-HSAB (Haddad et al, 2019), and (L-HSAB) (Mulki et al, 2019).…”
Section: Related Workmentioning
confidence: 99%
“…TC is a machine learning challenge that tries to classify new written content into a conceptual group from a predetermined classification collection [1]. It is crucial in a variety of applications, including sentiment analysis [2,3], spam email filtering [4,5], hate speech detection [6], text summarization [7], website classification [8], authorship attribution [9], information retrieval [10], medical diagnostics [11], emotion detection on smart phones [12], online recommendations [13], fake news detection [14,15], crypto-ransomware early detection [16], semantic similarity detection [17], part-of-speech tagging [18], news classification [19], and tweet classification [20].…”
Section: Introductionmentioning
confidence: 99%
“…Detecting hate speech in Arabic, used by Islamicists, is even more challenging due to the lack of high-quality labeled datasets that can be used to train models to detect hatred content automatically [8]. This is because of the Arabic language's complex grammatical structure and extensive morphology.…”
Section: Introductionmentioning
confidence: 99%