2020
DOI: 10.1007/978-3-030-52856-0_31
|View full text |Cite
|
Sign up to set email alerts
|

An Investigation and Evaluation of N-Gram, TF-IDF and Ensemble Methods in Sentiment Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 15 publications
(7 citation statements)
references
References 16 publications
0
6
0
Order By: Relevance
“…N represents the number of adjacent words considered as a sequence. In the unigram, each word is considered as a single sequence, whereas in bigram every two words are a sequence [ 27 , 28 ].…”
Section: Methodsmentioning
confidence: 99%
“…N represents the number of adjacent words considered as a sequence. In the unigram, each word is considered as a single sequence, whereas in bigram every two words are a sequence [ 27 , 28 ].…”
Section: Methodsmentioning
confidence: 99%
“…Therefore, a features extraction approach is implemented to convert text data into numerical vectors that the algorithms can process and work with. N-gram and the Term Frequency/Inverse Document Frequency (TF-IDF) are the most used feature extraction approaches [ 18 ].…”
Section: Methodsmentioning
confidence: 99%
“…The N-gram technique represents the text as an N-words sequence; it can be simple or complex, based on the value of N. In unigrams, it considers each word a sequence, while in bigrams it considers each pair of words a sequence. Then, the vectorizer calculates the occurrences of each sequence to generate the sentences' vectors [23].…”
Section: Feature Extractionmentioning
confidence: 99%