2018
DOI: 10.1177/0165551518761014

TREMO: A dataset for emotion analysis in Turkish

Abstract: This study presents a new dataset to be used in emotion extraction studies in Turkish text. We consider emotion extraction as a supervised text classification problem, which thereby requires a dataset for the training process. To satisfy this requirement, we aim to create a new dataset containing data for the six emotion categories: happiness, fear, anger, sadness, disgust and surprise. To gather this dataset, we conducted a survey and collected 27,350 entries from 4,709 individuals. In the next step, we perfor…

Citations: cited by 24 publications (20 citation statements)
References: 15 publications (16 reference statements)
“…On the other hand, the anger emotion category obtained the lowest accuracy value for SVM, RF, and lexicon-based approach. This is because anger emotion category is likely to be confused with fear and sadness emotion categories [35]. Table 12 shows the overall evaluation results in terms of accuracy, precision, recall, and f-measure for each classification method.…”
Section: Experiments and Discussion (mentioning)
confidence: 99%
“…In this article, we proposed a Turkish emotion lexicon (TEL) 1 that can be used in emotion analysis in Turkish text for six emotion categories. To the best of our knowledge, it is the first Turkish lexicon which is generated from an original Turkish dataset, TREMO, in the literature [35]. To create the lexicon, we examined the effects of a lemmatizer and a stemmer, two term-weighting schemes, four different lexicon enrichment methods, and a term selection process for lexicon-based emotion analysis, respectively.…”
Section: Introduction (mentioning)
confidence: 99%
“…Several research to analyze sentiment from Turkish texts have been carried out specifically on stemmed data [54], [62], [63], [66].…”
Section: B Rating the Stemmed Turkish Data (mentioning)
confidence: 99%
“…As a second method is supervised LSTM [33]. As the training data, the results accepted by unanimous or majority vote from 34.397 classified comments in the TREMO [34] data set were used. The labels in this training data have been converted as positive, negative and neutral.…”
Section: Figure 8 Cumulative Data Graph Of Death and Interpretation By Days (mentioning)
confidence: 99%