2016
DOI: 10.1109/taffc.2015.2476456
|View full text |Cite
|
Sign up to set email alerts
|

SentiWords: Deriving a High Precision and High Coverage Lexicon for Sentiment Analysis

Abstract: Deriving prior polarity lexica for sentiment analysis -where positive or negative scores are associated with words out of context -is a challenging task. Usually, a trade-off between precision and coverage is hard to find, and it depends on the methodology used to build the lexicon. Manually annotated lexica provide a high precision but lack in coverage, whereas automatic derivation from pre-existing knowledge guarantees high coverage at the cost of a lower precision. Since the automatic derivation of prior po… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
50
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 98 publications
(55 citation statements)
references
References 40 publications
2
50
0
Order By: Relevance
“…In order to perform sentiment analysis, specialized lexica is used, which is mostly a list of positive and negative words for assigning scores to the input text (Gatti, Guerini, & Turchi, ). For performing Urdu sentiment analysis, a wide coverage Urdu lexicon is developed first.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In order to perform sentiment analysis, specialized lexica is used, which is mostly a list of positive and negative words for assigning scores to the input text (Gatti, Guerini, & Turchi, ). For performing Urdu sentiment analysis, a wide coverage Urdu lexicon is developed first.…”
Section: Methodsmentioning
confidence: 99%
“…In order to perform sentiment analysis, specialized lexica is used, which is mostly a list of positive and negative words for assigning scores to the input text (Gatti, Guerini, & Turchi, 2016 . For having maximum reliability, Urdu lughat is reused for validating the different POS assigned by the tagger.…”
Section: Methodsmentioning
confidence: 99%
“…The above 14 calculated feature values can be used as the sentiment strength score. At the same time, in [4], these features are input to support vector machine model to obtain the best prior polarity dictionary. The next section of our paper will apply Random Forest model to obtain a priori polarity dictionary which has superior performance than the dictionary obtained in [4].…”
Section: Featuresmentioning
confidence: 99%
“…Inspired by the paper [4], we put forward a novel method to obtain prior polarity dictionary with Random Forest algorithm. Combined with various sentiment calculation formulas the algorithm can get a sentiment dictionary with higher performance.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation