Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing 2015
DOI: 10.18653/v1/d15-1299
|View full text |Cite
|
Sign up to set email alerts
|

ASTD: Arabic Sentiment Tweets Dataset

Abstract: This paper introduces ASTD, an Arabic social sentiment analysis dataset gathered from Twitter. It consists of about 10,000 tweets which are classified as objective, subjective positive, subjective negative, and subjective mixed. We present the properties and the statistics of the dataset, and run experiments using standard partitioning of the dataset. Our experiments provide benchmark results for 4 way sentiment classification on the dataset.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
197
0
2

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 283 publications
(225 citation statements)
references
References 4 publications
1
197
0
2
Order By: Relevance
“…Our data collection method is most similar to Abdul-Mageed et al (2016), who also use phrase seeds to acquire tweets for Ekman's 6 basic emotions, but we extend the work to 8 emotions, expand the list of seed expressions used, improve on the manual annotation study, and empirically validate the method on the practical emotion modeling task both on our data and on an external dataset. Our work also has affinity to works on Arabic text classification (Abdul-Mageed et al, 2011;Refaee and Rieser, 2014;Abdul-Mageed et al, 2014;Nabil et al, 2015;Salameh et al, 2015;Abdul-Mageed, 2017, 2018Alshehri et al, 2018;), but we focus on emotion.…”
Section: Related Workmentioning
confidence: 99%
“…Our data collection method is most similar to Abdul-Mageed et al (2016), who also use phrase seeds to acquire tweets for Ekman's 6 basic emotions, but we extend the work to 8 emotions, expand the list of seed expressions used, improve on the manual annotation study, and empirically validate the method on the practical emotion modeling task both on our data and on an external dataset. Our work also has affinity to works on Arabic text classification (Abdul-Mageed et al, 2011;Refaee and Rieser, 2014;Abdul-Mageed et al, 2014;Nabil et al, 2015;Salameh et al, 2015;Abdul-Mageed, 2017, 2018Alshehri et al, 2018;), but we focus on emotion.…”
Section: Related Workmentioning
confidence: 99%
“…The researchers on [33] and [16] used the manual annotation method with the help of tools that were used to facilitate the annotation process and to reduce time and workload for the annotators. The first research used the Amazon Mechanical Turk service using an API called Boto, to annotate the dataset manually [34].…”
Section: Annotation Processmentioning
confidence: 99%
“…Manually on a sentence level 21 [30], [21], [23], [10], [11], [31], [28], [29], [2], [15], [14], [20], [8], [13], [35], [26], [33], [16], [27], [18], [12].…”
Section: Annotation Process Type Paper Count Paperunclassified
See 2 more Smart Citations