Proceedings of the Fourteenth Workshop on Semantic Evaluation 2020
DOI: 10.18653/v1/2020.semeval-1.283
|View full text |Cite
|
Sign up to set email alerts
|

SalamNET at SemEval-2020 Task 12: Deep Learning Approach for Arabic Offensive Language Detection

Abstract: This paper describes SalamNET, an Arabic offensive language detection system that has been submitted to SemEval 2020 shared task 12: Multilingual Offensive Language Identification in Social Media. Our approach focuses on applying multiple deep learning models and conducting in depth error analysis of results to provide system implications for future development considerations. To pursue our goal, a Recurrent Neural Network (RNN), a Gated Recurrent Unit (GRU), and Long-Short Term Memory (LSTM) models with diffe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 18 publications
0
7
0
Order By: Relevance
“…On the other hand, deep learning techniques eliminate the need for handcrafted features. Deep learning has gained significant popularity for HS identification in Arabic Twitter data since 2017 ( Badjatiya et al, 2017 ), primarily due to its capacity to research classification appropriate to data representations ( Husain, 2020 ; Mansur, Omar & Tiun, 2023 ). Well-known deep learning techniques include CNNs and LSTM networks ( Duwairi, Hayajneh & Quwaider, 2021 ).…”
Section: Review Findings and Discussionmentioning
confidence: 99%
“…On the other hand, deep learning techniques eliminate the need for handcrafted features. Deep learning has gained significant popularity for HS identification in Arabic Twitter data since 2017 ( Badjatiya et al, 2017 ), primarily due to its capacity to research classification appropriate to data representations ( Husain, 2020 ; Mansur, Omar & Tiun, 2023 ). Well-known deep learning techniques include CNNs and LSTM networks ( Duwairi, Hayajneh & Quwaider, 2021 ).…”
Section: Review Findings and Discussionmentioning
confidence: 99%
“…A dataset consisting of 5% hate speech was presented at OSACT 2020 shared task. The best system performed extensive preprocessing including normalizing emojis (translate their English description to Arabic) and dialectal Arabic (DA) to modern standard Arabic (MSA) conversion among others (Husain 2020). ASAD (Hassan et al 2021a) is an online tool that utilizes the shared task datasets for offensiveness and hate speech detection in tweets along with other social media analysis components such as emotion (Hassan et al 2021b) and spam detection (Mubarak et al 2020a).…”
Section: Related Workmentioning
confidence: 99%
“…Using (2), IDF determines if a term (t) is frequent or rare in all documents (n) in order to know its importance. The document frequency (d) is the number of documents (d) that include the term (t) [26], [27].…”
Section: Feature Extractionmentioning
confidence: 99%
“…There are 6,839 tweets in the training dataset, including 1,371 offensive tweets and 350 hate speech tweets. There are 1,000 tweets in the development dataset, including 179 offensive tweets and 44 hate speech tweets [16], [27].…”
Section: Arabic Hate Speech Datasetmentioning
confidence: 99%