Abstract. Spam is an abuse of messaging undesired by recipients. Those who send spam are called spammers. Popularity of Twitter has attracted spammers to use it as a means to disseminate spam messages. The spams are characterized by a neutral emotional sentiment or no particular users’ preference perspective. In addition, the regularity of tweeting behavior periodically shows automation performed by bot. This study proposes a new method to differentiate between bot spammer and legitimate user accounts by integrating the sentiment analysis (SA) based on emotions and time interval entropy (TIE). The combination of knowledge-based and machine learning-based were used to classify tweets with positive, negative and neutral sentiments. Furthermore, the collection of timestamp is used to calculate the time interval entropy of each account. The results show that the precision and recall of the proposed method reach up to 83% and 91%. This proves that the merging SA and TIE can optimize overall system performance in detecting Bot Spammer.Keywords: bot spammer, twitter, sentiment analysis, polarity, entropy Abstrak. Spam merupakan penyalahgunaan pengiriman pesan tanpa dikehendaki oleh penerimanya, orang yang mengirimkan spam disebut spammer. Ketenaran Twitter mengundang spammer untuk menggunakannya sebagai sarana menyebarluaskan pesan spam. Karakteristik dari tweet yang dikategorikan spam memiliki sentimen emosi netral atau tidak ada preferensi tertentu terhadap suatu perspektif dari user yang memposting tweet. Selain itu keteraturan waktu perilaku saat memposting tweet secara periodik menunjukkan otomatisasi yang dilakukan bot. Pada penelitian ini diusulkan metode baru untuk mendeteksi antara bot spammer dan legitimate user dengan mengintegrasikan sentimen analysis berdasarkan emosi dan time interval entropy. Pendekatan gabungan knowledge-based dan machine learning-based digunakan untuk mengklasifikasi tweet yang memiliki sentimen positif, negatif dan tweet netral. Selanjutnya kumpulan timestamp digunakan untuk menghitung time interval entropy dari tiap akun. Hasil percobaan menunjukan bahwa precision dan recall dari metode yang diusulkan mencapai 83% dan 91%. Hal ini membuktikan penggabungan Sentiment Analysis (SA) dan Time Interval Entropy (TIE) dapat mengoptimalkan performa sistem secara keseluruhan dalam mendeteksi Bot Spammer.Kata Kunci: bot spammer, twitter, sentiment analysis, polarity, entropy
Madrasah Muhammadiyah Al-Munawarroh adalah madrasah yang terletak di Malang dan salah satu madrasah yang berkembang. Untuk menunjang perkembangan tersebut, dibuat sebuah website profile yang dapat memberikan informasi kepada masyarakat secara cepat. Metode yang digunakan dalam pembuatan website adalah metode RAD (Rapid Application Development). Metode ini digunakan karena kami ingin melibatkan pihak madrasah dalam pembuatan website, agar website yang dibuat dapat bermanfaat secara penuh dan sesuai dengan kebutuhan penyebaran informasi bagi madrasah. Hasil dari pembuatan website profile menunjukkan bahwa pihak madrasah dapat dengan mudah menyebarkan informasi-informasi penting seperti jadwal pendaftaran siswa, informasi mengenai madrasah hingga informasi lowongan pekerjaan yang ada di madrasah.
In mid-2019 President of the Republic of Indonesia officially decided that the capital city be moved outside of Java. This has caused many responses from the public who responded to this decision. We have seen many of these community responses on social media, especially in Twitter. To see the reality of the response of the Indonesian people requires a study that can draw conclusions from the number of community responses. So from this problem this study was conducted to find the truth of the community response related to the decision to move the Indonesian capital by using the lexicon method. This study also wants to see a comparison of the effect of the stemming process on sentiment analysis. To measure the performance of the Lexicon method, this research will be tested by an expert. Then the results of the experts will be entered into the confusion matrix. From the calculations with the confusion matrix, the results showed that the response of many Indonesian people who agree with the decision to move the Indonesian capital.
Support Vector Machine (SVM) is one of the most widely used classification algorithms for sentiment analysis and has been shown to provide satisfactory performance. However, despite its advantages, the SVM algorithm still has weaknesses in selecting the right SVM parameters to optimize the performance. In this study, sentiment analysis was done with the use of data called tweets about Undang-Undang Cipta Kerja which reap many pros and cons by the people in Indonesia, especially the laborers. The classification method used in this study is the Support Vector Machine algorithm which is optimized using the Particle Swarm Optimization method for the SVM parameters selection in the hope of optimizing the performance generated by the SVM algorithm in sentiment analysis. The results of the study using 10 k-fold cross-validations using the SVM algorithm resulted in an accuracy of 92,99%, a precision of 93,24%, and a recall of 93%. Meanwhile, the SVM and PSO algorithms produce an accuracy of 95%, precision of 95,08%, and recall of 94,97%. The results show that the Particle Swarm Optimization method can overcome the weaknesses of the Support Vector Machine algorithm in the problem of parameter selection and has succeeded in improving the resulting performance where the SVM-PSO is more superior to SVM without optimization in sentiment analysis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.