Analisa Sentimen Tweet Berbahasa Indonesia dengan Menggunakan Metode Pembobotan Hybrid TF-IDF pada Topik Transportasi Online

Wahyunita, Sari; Azhar, Yufiz; Hayatin, Nur

doi:10.22219/repositor.v2i2.238

Cited by 3 publications

(5 citation statements)

References 2 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…In this weighting, each Twitter dataset that has been preprocessed will be given a weighting value using TF-IDF which will produce a value for each text dataset from the tweet (Wahyunita et al, 2020).…”

Section: Tf-idf Weightingmentioning

confidence: 99%

Classification of Traffic Congestion in Indonesia Using the Naive Bayes Classification Method

Robi Padri,

Asro,

Indra

2023

JWS

View full text Add to dashboard Cite

The purpose of this research is to analyze the accuracy of congestion data using Google Colab in detecting congestion by the province in Indonesia the author tries to test strategies for dealing with congestion in the Indonesian region by utilizing the Naïve Bayes method. In this journal, apply with Google Collab . This research uses data that comes from crawling data on Twitter. Using the Naive Bayes method to find the shortest route is efficient and not congested. Implementation of online school transportation using the naive Bayes method in minimizing travel costs to pick up students can reduce traffic jams, reduce accidents, reduce student tardiness, and minimize travel costs. The Naive Bayes method can be used to identify relevant information about traffic jams in Indonesia through Twitter data with a good degree of accuracy. These results can assist decision-making and strategic planning in overcoming the problem of traffic congestion in Indonesia. Therefore, this research implies that it can help improve the accuracy of traffic congestion data in Indonesia. By using Google Colab, more advanced analysis methods and machine learning algorithms can be applied to process the existing traffic data. Additionally, utilizing Google Colab allows for fast and efficient data processing.

show abstract

Section: Tf-idf Weightingmentioning

confidence: 99%

Classification of Traffic Congestion in Indonesia Using the Naive Bayes Classification Method

Robi Padri,

Asro,

Indra

2023

JWS

View full text Add to dashboard Cite

show abstract

“…In the dataset obtained, not all letters are consistent with lower case or lower case. This Case Folding stage is needed in converting the entire text in the dataset into a standard form or into lowercase letters by using the lower() function to generalize text that is not yet in lowercase form [14]. The results of the case folding process are presented in Figure 3.…”

Section: Case Foldingmentioning

confidence: 99%

“…The tokenization stage is used to separate a sentence into word for word which is usually called a token so that it can be analyzed and facilitate data processing in the next stage [14]. The results of the tokenizing process are presented in Figure 5.…”

Section: Tokenizingmentioning

confidence: 99%

See 1 more Smart Citation

Comparison of Classification Algorithms for Predicting Indonesian Fake News using Balanced and Imbalanced Datasets

Karima,

Mutiara

2023

FaktorExacta

View full text Add to dashboard Cite

Kemajuan teknologi informasi memberikan dampak yang besar, seperti penyebaran berita online. Namun, kabar yang tersebar belum tentu benar adanya. Dalam beberapa penelitian, pendeteksian berita hoax telah dilakukan. Namun, terdapat perbedaan hasil dari beberapa algoritma yang digunakan. Oleh karena itu, dalam penelitian ini dilakukan perbandingan antara algoritma Logistic Regression, Naïve Bayes, Random Forest dan Support Vector Machine untuk memprediksi berita hoax khusus Indonesia dengan dataset seimbang dan tidak seimbang. Tahapan perancangan sistem dimulai dari pengumpulan dataset, pelabelan data, pre-processing, pembobotan TF-IDF, klasifikasi model hingga pengujian. Hasil akurasi tertinggi baik dari jumlah dataset yang tidak seimbang maupun dataset yang seimbang didapatkan dari SVM dengan perbandingan 80:20. Dataset tidak seimbang memiliki akurasi 85,47% dan F1-score 90% dan dataset seimbang memiliki akurasi 84,36% dan F1-score 84,80%. Pada penelitian ini dataset tidak seimbang mendapatkan hasil akurasi yang lebih baik dengan menggunakan algoritma SVM dan jika jumlah dataset yang menjadi target kelas utama lebih banyak maka akan memberikan hasil yang lebih baik.

show abstract

“…Hasil penelitian menunjukkan polarisasi sentimen positif 44,51% dan negatif 45,80%. Dalam penelitian yang dilakukan Wahyunita [7] tentang transportasi online, analisis sentimen dilakukan pada tweet berbahasa Indonesia dengan metode pembobotan Hybrid TF-IDF dan metode algoritma K-Nearest Neighbor (KNN) sebagai metode klasifikasi sentimen. Dengan menggunakan pengujian cross validation mendapatkan hasil terbaik dengan nilai akurasi 70% untuk k=5, presisi kelas sentimen positif 68%, negatif 75%, recall kelas sentimen positif 82%, negatif 59%, F-measure kelas sentimen positif 74% dan negatif 65%.…”

Section: Pendahuluanunclassified

Implementasi Bidirectional LSTM untuk Analisis Sentimen Terhadap Layanan Grab Indonesia

Alghifari

Edi²,

Firmansyah³

2022

JAMIKA

View full text Add to dashboard Cite

Grab Indonesia is one of the leading online motorcycle taxi companies in Indonesia and has a large number of customers in Indonesia. The level of customer satisfaction varies with the services provided, so there must be suggestions and complaints from customers. Sentiment analysis can be used as a solution to determine the level of service satisfaction in order to improve the system and service. This study aims to determine the level of satisfaction of Grab Indonesia users through the Grab application in the Playstore. One of the approaches that can be used is LSTM. LSTM is an RNN algorithm development to solve the vanishing gradient problem. LSTM has the disadvantage of only running can only capture information from one direction. Bidirectional LSTM (BiLSTM) is an LSTM method that has been developed, where BiLSTM can capture information from two directions. In this BiLSTM method, the more data, the better the algorithm's performance. The test results show that BiLSTM is more reliable than LSTM in the case of sentiment analysis on the Indonesian Grab service. BiLSTM produces the best accuracy of 91% and training loss of 28%. Suggestions for future research can produce more and varied word representations by considering the word embedding combinations.

show abstract

Analisa Sentimen Tweet Berbahasa Indonesia dengan Menggunakan Metode Pembobotan Hybrid TF-IDF pada Topik Transportasi Online

Cited by 3 publications

References 2 publications

Classification of Traffic Congestion in Indonesia Using the Naive Bayes Classification Method

Classification of Traffic Congestion in Indonesia Using the Naive Bayes Classification Method

Comparison of Classification Algorithms for Predicting Indonesian Fake News using Balanced and Imbalanced Datasets

Implementasi Bidirectional LSTM untuk Analisis Sentimen Terhadap Layanan Grab Indonesia

Contact Info

Product

Resources

About