2022
DOI: 10.3390/info13100484
|View full text |Cite
|
Sign up to set email alerts
|

A Semi-Supervised Approach to Sentiment Analysis of Tweets during the 2022 Philippine Presidential Election

Abstract: With the increasing popularity of Twitter as both a social media platform and a data source for companies, decision makers, advertisers, and even researchers alike, data have been so massive that manual labeling is no longer feasible. This research uses a semi-supervised approach to sentiment analysis of both English and Tagalog tweets using a base classifier. In this study involving the Philippines, where social media played a central role in the campaign of both candidates, the tweets during the widely conte… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
13
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 16 publications
(13 citation statements)
references
References 11 publications
0
13
0
Order By: Relevance
“…The data cleaning and preprocessing technique employed natural language processing (NLP) to attain classification outcomes with high accuracy, as this processing method is crucial for the computer's understanding of data [45]. In this phase, various libraries, including Google Colab, nltk, pandas, spacy, and the Indonesian Sastrawi library were utilized for preprocessing.…”
Section: Preprocessingmentioning
confidence: 99%
See 1 more Smart Citation
“…The data cleaning and preprocessing technique employed natural language processing (NLP) to attain classification outcomes with high accuracy, as this processing method is crucial for the computer's understanding of data [45]. In this phase, various libraries, including Google Colab, nltk, pandas, spacy, and the Indonesian Sastrawi library were utilized for preprocessing.…”
Section: Preprocessingmentioning
confidence: 99%
“…For instance, "Saya" and "saya" were considered the same. This stage aimed to reduce the differences between lowercase, uppercase, and capital letters when vectoring [45]. (3) Stop-word removal eliminated meaningless words and was carried out using the stopwords() library provided by NLTK through the Sastrawi tool.…”
Section: Preprocessingmentioning
confidence: 99%
“…Additionally, up to 250 replies from each tweet were queried which resulted in a total number of 8,362,555 replies. Moreover, Macrohon et al [14] proposed a semi-supervised sentiment analysis using multinomial Naive Bayes of tweets during the 2022 Philippine Presidential Election. A total of 150,792 raw tweets were collected from Twitter API.…”
Section: A Political Discourse In the Internetmentioning
confidence: 99%
“…The complex and monographically rich nature of the Nepali language makes Natural Language Processing (NLP) tasks particularly challenging for this language [12]. Unlike languages such as English, which has ample resources and a plethora of NLP studies [13], [14], research in the field of low-resource languages such as Nepali is scarce [15], [16]. There have been some works involving sentimental analysis in Nepali [12], [17], [18].…”
Section: Introductionmentioning
confidence: 99%
“…This study used a semi-supervised approach in machine learning, which involved the use of the supervised machine learning models and augmenting the said model, especially in some circumstances when data are weak or where vast quantities of data are unlabeled [ 6 , 7 ].…”
Section: Introductionmentioning
confidence: 99%