Abstract:Abstrak Beberapa tahun belakangan ini, muncul perusahaan-perusahaan penyedia jasa transportasi yang menggunakan aplikasi berbasis android dalam proses pelayanannya atau biasa disebut dengan transportasi online. Hal ini dilakukan untuk meningkatkan pelayanan terhadap pengguna jasa transportasi. Hadirnya transportasi online seperti Gojek, Grab dan Uber menimbulkan masalah sosial antara supir transportasi online dan supir transportasi non aplikasi. Penyebabnya dikarenakan sebagian besar masyarakat beralih menggu… Show more
“…In this weighting, each Twitter dataset that has been preprocessed will be given a weighting value using TF-IDF which will produce a value for each text dataset from the tweet (Wahyunita et al, 2020).…”
The purpose of this research is to analyze the accuracy of congestion data using Google Colab in detecting congestion by the province in Indonesia the author tries to test strategies for dealing with congestion in the Indonesian region by utilizing the Naïve Bayes method. In this journal, apply with Google Collab . This research uses data that comes from crawling data on Twitter. Using the Naive Bayes method to find the shortest route is efficient and not congested. Implementation of online school transportation using the naive Bayes method in minimizing travel costs to pick up students can reduce traffic jams, reduce accidents, reduce student tardiness, and minimize travel costs. The Naive Bayes method can be used to identify relevant information about traffic jams in Indonesia through Twitter data with a good degree of accuracy. These results can assist decision-making and strategic planning in overcoming the problem of traffic congestion in Indonesia. Therefore, this research implies that it can help improve the accuracy of traffic congestion data in Indonesia. By using Google Colab, more advanced analysis methods and machine learning algorithms can be applied to process the existing traffic data. Additionally, utilizing Google Colab allows for fast and efficient data processing.
“…In this weighting, each Twitter dataset that has been preprocessed will be given a weighting value using TF-IDF which will produce a value for each text dataset from the tweet (Wahyunita et al, 2020).…”
The purpose of this research is to analyze the accuracy of congestion data using Google Colab in detecting congestion by the province in Indonesia the author tries to test strategies for dealing with congestion in the Indonesian region by utilizing the Naïve Bayes method. In this journal, apply with Google Collab . This research uses data that comes from crawling data on Twitter. Using the Naive Bayes method to find the shortest route is efficient and not congested. Implementation of online school transportation using the naive Bayes method in minimizing travel costs to pick up students can reduce traffic jams, reduce accidents, reduce student tardiness, and minimize travel costs. The Naive Bayes method can be used to identify relevant information about traffic jams in Indonesia through Twitter data with a good degree of accuracy. These results can assist decision-making and strategic planning in overcoming the problem of traffic congestion in Indonesia. Therefore, this research implies that it can help improve the accuracy of traffic congestion data in Indonesia. By using Google Colab, more advanced analysis methods and machine learning algorithms can be applied to process the existing traffic data. Additionally, utilizing Google Colab allows for fast and efficient data processing.
“…In the dataset obtained, not all letters are consistent with lower case or lower case. This Case Folding stage is needed in converting the entire text in the dataset into a standard form or into lowercase letters by using the lower() function to generalize text that is not yet in lowercase form [14]. The results of the case folding process are presented in Figure 3.…”
Section: Case Foldingmentioning
confidence: 99%
“…The tokenization stage is used to separate a sentence into word for word which is usually called a token so that it can be analyzed and facilitate data processing in the next stage [14]. The results of the tokenizing process are presented in Figure 5.…”
Section: Tokenizingmentioning
confidence: 99%
“…The stemming stage is done to change the word into its basic word form by removing the initial and final suffixes. The stemming process uses a special library for Indonesian language processing, namely the Python Sastrawi library [14]. The results of the stemming process are presented in Figure 7.…”
Kemajuan teknologi informasi memberikan dampak yang besar, seperti penyebaran berita online. Namun, kabar yang tersebar belum tentu benar adanya. Dalam beberapa penelitian, pendeteksian berita hoax telah dilakukan. Namun, terdapat perbedaan hasil dari beberapa algoritma yang digunakan. Oleh karena itu, dalam penelitian ini dilakukan perbandingan antara algoritma Logistic Regression, Naïve Bayes, Random Forest dan Support Vector Machine untuk memprediksi berita hoax khusus Indonesia dengan dataset seimbang dan tidak seimbang. Tahapan perancangan sistem dimulai dari pengumpulan dataset, pelabelan data, pre-processing, pembobotan TF-IDF, klasifikasi model hingga pengujian. Hasil akurasi tertinggi baik dari jumlah dataset yang tidak seimbang maupun dataset yang seimbang didapatkan dari SVM dengan perbandingan 80:20. Dataset tidak seimbang memiliki akurasi 85,47% dan F1-score 90% dan dataset seimbang memiliki akurasi 84,36% dan F1-score 84,80%. Pada penelitian ini dataset tidak seimbang mendapatkan hasil akurasi yang lebih baik dengan menggunakan algoritma SVM dan jika jumlah dataset yang menjadi target kelas utama lebih banyak maka akan memberikan hasil yang lebih baik.
“…Hasil penelitian menunjukkan polarisasi sentimen positif 44,51% dan negatif 45,80%. Dalam penelitian yang dilakukan Wahyunita [7] tentang transportasi online, analisis sentimen dilakukan pada tweet berbahasa Indonesia dengan metode pembobotan Hybrid TF-IDF dan metode algoritma K-Nearest Neighbor (KNN) sebagai metode klasifikasi sentimen. Dengan menggunakan pengujian cross validation mendapatkan hasil terbaik dengan nilai akurasi 70% untuk k=5, presisi kelas sentimen positif 68%, negatif 75%, recall kelas sentimen positif 82%, negatif 59%, F-measure kelas sentimen positif 74% dan negatif 65%.…”
Grab Indonesia is one of the leading online motorcycle taxi companies in Indonesia and has a large number of customers in Indonesia. The level of customer satisfaction varies with the services provided, so there must be suggestions and complaints from customers. Sentiment analysis can be used as a solution to determine the level of service satisfaction in order to improve the system and service. This study aims to determine the level of satisfaction of Grab Indonesia users through the Grab application in the Playstore. One of the approaches that can be used is LSTM. LSTM is an RNN algorithm development to solve the vanishing gradient problem. LSTM has the disadvantage of only running can only capture information from one direction. Bidirectional LSTM (BiLSTM) is an LSTM method that has been developed, where BiLSTM can capture information from two directions. In this BiLSTM method, the more data, the better the algorithm's performance. The test results show that BiLSTM is more reliable than LSTM in the case of sentiment analysis on the Indonesian Grab service. BiLSTM produces the best accuracy of 91% and training loss of 28%. Suggestions for future research can produce more and varied word representations by considering the word embedding combinations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.