2019 11th International Conference on Knowledge and Systems Engineering (KSE) 2019
DOI: 10.1109/kse.2019.8919368
|View full text |Cite
|
Sign up to set email alerts
|

Effective Text Data Preprocessing Technique for Sentiment Analysis in Social Media Data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
36
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 77 publications
(37 citation statements)
references
References 8 publications
0
36
1
Order By: Relevance
“…In this study, we reviewed and selected a text data preprocessing technique. Ten preprocessing techniques are frequently used [ 33 ]. The data preprocessing has four steps: data cleansing, similar word matching, stop word removal, and tokenization.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In this study, we reviewed and selected a text data preprocessing technique. Ten preprocessing techniques are frequently used [ 33 ]. The data preprocessing has four steps: data cleansing, similar word matching, stop word removal, and tokenization.…”
Section: Methodsmentioning
confidence: 99%
“…A stop word removal step was then performed. Stop words are common words with no semantics and do not aggregate relevant information to the task, such as “the” and “a” [ 33 ]. Lastly, the tokenization step divides each accident situation description sentence into token units, which are small chunks such as words and attached parts of speech.…”
Section: Methodsmentioning
confidence: 99%
“…The authors (Jianqiang & Xiaolin, 2017) have conducted experiments to prove that the use of text preprocessing techniques results in better accuracy for Twitter sentiment analysis. The concept of lemmatization and stemming was jointly used by the authors (Pradha, Halgamuge, & Tran Quoc Vinh, 2019) on the Twitter dataset to perform text-based sentiment analysis.…”
Section: Preprocessing Of Textmentioning
confidence: 99%
“…Furthermore, Pradha et al [ 71 ] proposed an effective technique for pre-processing text data and developed an algorithm to train Support Vector Machine (SVM), Deep Learning (DL) and Naïve Bayes (NB) classifiers for processing Twitter data, developing an algorithm to weight the feeling evaluation in relation to the weight of the hashtag and clean text. Sohrabi and Hemmatian [ 72 ] presented an efficient pre-processing method for opinion mining, testing it on Twitter user comments, and demonstrated how its use in combination with SVM and ANNs achieves the highest accuracy scores compared to other methods.…”
Section: Background and Related Workmentioning
confidence: 99%