The Formula E racing series has become one of the world's most prestigious competitions. In 2022, Indonesia hosted the famous Formula E race. The event possesses the potential for economic benefits for Indonesia worth 78 million euros through the arrival of 35,000 spectators. Indonesians are enthusiastic about Formula E since it allows their nation to encourage tourists and gain international prominence. However, some people do not support this event. Since they regard that amid the COVID-19 pandemic, it is preferable for the government to focus on people affected by the pandemic rather than support a Formula E event. This study compares the Support Vector Machine and Naive Bayes algorithms in classifying public opinion in the Formula E race. This study gets its information from user comments on social media platforms, especially Twitter. The stages start with text preprocessing and include cleaning, case folding, tokenization, filtering, and stemming. Proceed with weighting using the TF-IDF approach. Data testing uses a confusion matrix to evaluate the classification results by testing accuracy, precision, and recall. Categorizing public opinion using the SVM algorithm has an accuracy of 82 percent, a precision of 97.86 percent, and a recall of 77.90 percent. On the other hand, the accuracy of the Naive Bayes technique is more limited, at 87.54 percent. Society's opinion on Twitter shows positive sentiment towards implementing Formula E.
Politeknik Tri Mitra Karya Mandiri adalah salah satu perguruan tinggi vokasi yang berada di wilayah Cikampek Kabupaten Karawang yang pada tahun akademik 2017/2018 mempunyai jumlah mahasiswa mencapai 987 orang mahasiswa.Namun sayangnya dari total jumlah mahasiswa tidak seluruhnya mempunyai orientasi minat yang sesungguhnya untuk kuliah, banyak factor yang mempengaruhinya. Tinginya tingkat orientasi minat mahasiswa yang tidak memilih kuliah, inilah yang membuat diadakan penelitian tentang sebab-sebab mengapa mahasiswa berkuliah dikampus ini serta mecari solusi guna mengurangi jumlah mahasiswa yang menjadi non aktif ketika diketahui mempunyai orientasi minat yang bukan untuk kuliah. Dengan melakukan komparasi menggunakan 3 algoritma yang termasuk dalam metode klasifikasi data mining yaitu; Decision Tree C4.5, Naïve Bayes dan K-Nearest Neighbor penelitian ini mencari nilai akurasi dan Area Under Curve (AUC) yang terbaik dari ketiga algoritma yang dikomparasi guna ditentukan model yang digunakan pada penentuan orientasi minat mahasiswa. Hasil dari komparasi yang dilakukan dalam penelitian ini adalah; algoritma Decision Tree C4.5 mempunyai nilai akurasi sebesar 91,75% dan AUC sebesar 0,969, Naïve Bayes mempunyai nilai akurasi sebesar 86,77% dan AUC sebesar 0,930 sedangngkan K-Nearest Neighbor mempunyai nilai akurasi sebesar 88,61% dan AUC sebesar 0,500. Melalui uji beda yang dilakukan menggunakan operator T-test pada Rapid Miner ditemukan hasil bahwa algoritma Decision Tree C4.5 merupakan algoritma terbaik dari 3 buah algoritma yang digunakan, maka dalam penelitian ini digunakan rule Decision Tree C4.5 untuk diterapkan pada deployment yang dilakukan.
The company produces sales data every day. Over time, the data increases, and the amount becomes very large. The data is only stored without understanding the benefits that exist from these data due to limitations in proper knowledge in analyzing the data, especially transaction data. Sale. To overcome these problems, a study focused on reprocessing sales transaction data in 2018 with a data mining technique approach using the Knowledge Discovery in Database (KDD) concept using the association method and apriori algorithm and a supporting application, namely RapidMiner. This study aims to help companies find customer buying habits or patterns based on 2018 sales transaction data. The results of this study produce 316 association rules where the best rules are generated on record 309 with PRO 889 & PRO 868 PRO 869 rules.
Customer complaints about the company can be used as a form of self-evaluation and performance that has been carried out by the company, based on customer complaints the company can find out the weaknesses that exist in the company and fix them. The forms of submitting customer complaints are very diverse, currently not only by telephone, but customers also submit suggestions or complaints, customers can submit suggestions or complaints via electronic mail or e-mail or forums in cyberspace that are indeed created by product-producing companies to accommodate various complaints, suggestions, and direct criticism from consumers, especially social media that are free to express opinions on the delivery services used. Instagram is a social media that is more inclined towards images and on the other hand, has captions and comments text, a study is needed for the problem of customer complaints from shipping service users on an Instagram account of a delivery service company. Based on this background, a solution is needed in solving problems for text mining classification using Naïve Bayes with SMOTE techniques and N-Gram feature extraction with the usual process for text mining so that it can produce Naïve Bayes and SMOTE accuracy with an accuracy of 88.54%, before implementation. N-Gram and the accuracy rate increased by 1.44% after the N-Gram Term was applied to 89.98% by using a dataset of 776 Instagram comment text records that had to preprocess text.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.