Proceedings of the Fourth Arabic Natural Language Processing Workshop 2019
DOI: 10.18653/v1/w19-4621
|View full text |Cite
|
Sign up to set email alerts
|

Mazajak: An Online Arabic Sentiment Analyser

Abstract: Sentiment analysis (SA) is one of the most useful natural language processing applications. Literature is flooding with many papers and systems addressing this task, but most of the work is focused on English. In this paper, we present "Mazajak", an online system for Arabic SA. The system is based on a deep learning model, which achieves state-of-theart results on many Arabic dialect datasets including SemEval 2017 and ASTD. The availability of such system should assist various applications and research that r… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
62
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 87 publications
(94 citation statements)
references
References 30 publications
0
62
1
Order By: Relevance
“…In [38], the authors used an ensemble approach to combine the performance of a CNN with an LSTM model using the ASTD dataset with three sentiment classes: positive, negative and neutral. The accuracy produced by the CNN model was 64.3%, the accuracy produced by LSTM was 64.75%, and the combined ensemble accuracy is 65.05%, which was much lower than the results produced by other papers using the same data (for example, the accuracies produced in [36]). The authors used pretrained word embeddings to train the models.…”
Section: Literature Review and Related Work: Models And Datasetscontrasting
confidence: 56%
See 1 more Smart Citation
“…In [38], the authors used an ensemble approach to combine the performance of a CNN with an LSTM model using the ASTD dataset with three sentiment classes: positive, negative and neutral. The accuracy produced by the CNN model was 64.3%, the accuracy produced by LSTM was 64.75%, and the combined ensemble accuracy is 65.05%, which was much lower than the results produced by other papers using the same data (for example, the accuracies produced in [36]). The authors used pretrained word embeddings to train the models.…”
Section: Literature Review and Related Work: Models And Datasetscontrasting
confidence: 56%
“…In [36], the authors applied a CNN combined with long short-term memory (LSTM) to analyze sentiments in three public datasets: SemEval 2017, ASTD, and ARSAS. They used the word2vec model to generate the required word embeddings.…”
Section: Literature Review and Related Work: Models And Datasetsmentioning
confidence: 99%
“…The reason for the difference in accuracy should be attributed to the insufficient number of samples in the training set, and the quality is also a factor limiting the accuracy. In terms of recall, the Bi-LSTM model is superior to the CNN-LSTM model [58] on an average level. In terms of F1-Score, the Bi-LSTM model far exceeds the two types of models proposed by the papers [59] [60].In summary, compared with the SOTA, the Bi-LSTM model proposed in this paper has a certain gap in some indicators, such as accuracy and recall.…”
Section: Comparative Experiments With Sotamentioning
confidence: 92%
“…It is worth to mention that for PBLM and HATN, we have used an extra 4000 unlabeled sentences from each domain/dialect. For HTAN, we have used Mazjak word embedding model (Abu Farha and Magdy, 2019)…”
Section: Compared Methodsmentioning
confidence: 99%