Scholarships are one form of study assistance given to university students. One such scholarship is the government-funded Bantuan Belajar Mahasiswa (BBM). Grouping the data of scholarship applicants is useful for determining which students are eligible, to be considered, or not eligible. This grouping makes it easier for the administrative office to determine scholarship recipients, particularly for the BBM scholarship. The grouping can be performed with a partition-based clustering technique, namely the K-Medoids algorithm. The data to be grouped consist of the attributes SKS (credit units), IPK (grade point average), number of parental dependents, and parental income. The values in the data vary widely, and the ranges of the attributes lie far apart from one another. Three scenarios were therefore carried out: (1) all data are clustered with K-Medoids as obtained, (2) part of the data is codified, and (3) all of the data is codified. A Cubic Clustering Criterion (CCC) value was obtained for each of the three scenarios. The fully codified dataset yields a CCC value between 2 and 3, which indicates that it has good uniformity. This is because the values in every attribute are nearly the same.
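As an illustration of partition-based clustering with K-Medoids, the following is a minimal sketch of a greedy PAM-style swap procedure. It is not the authors' implementation: the Manhattan distance, the choice of the first k points as initial medoids, and the small codified sample points are all assumptions made for the example.

```python
def manhattan(a, b):
    """Manhattan distance between two equal-length tuples (assumed metric)."""
    return sum(abs(x - y) for x, y in zip(a, b))

def total_cost(points, medoids):
    """Sum of distances from every point to its nearest medoid."""
    return sum(min(manhattan(p, points[m]) for m in medoids) for p in points)

def k_medoids(points, k):
    """Greedy swap search: replace a medoid whenever the total cost drops."""
    medoids = list(range(k))  # assumption: first k points seed the search
    best = total_cost(points, medoids)
    improved = True
    while improved:
        improved = False
        for pos in range(k):
            for cand in range(len(points)):
                if cand in medoids:
                    continue
                trial = medoids[:pos] + [cand] + medoids[pos + 1:]
                cost = total_cost(points, trial)
                if cost < best:  # strict decrease guarantees termination
                    medoids, best, improved = trial, cost, True
    labels = [
        min(range(k), key=lambda c: manhattan(p, points[medoids[c]]))
        for p in points
    ]
    return medoids, labels

# Hypothetical codified records (two attributes each), not data from the study.
points = [(1, 1), (1, 2), (5, 5), (5, 6), (9, 1), (9, 2)]
medoids, labels = k_medoids(points, k=3)
```

Because every medoid is an actual record, the result stays interpretable when attributes are codified into small integer categories, which is the situation the third scenario describes.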
Classification of data with unbalanced classes is a major problem in machine learning and data mining. On unbalanced data, almost all classification algorithms produce much higher accuracy for the majority class than for the minority class. This research implements the Synthetic Minority Over-sampling Technique (SMOTE) to address the imbalance in credit customer data from the Rawamerta Teachers Cooperative. The research methodology is SEMMA, with the phases Sample, Explore, Modify, Model, and Assess. The Sample phase selects the credit customer data of the Rawamerta Teachers Cooperative for 2015-2017, a total of 878 records, with the attributes income, total deposits, loan amount, duration of installments, services, installments, and credit status. The Explore phase analyzes the classes: the "current" class is the majority class with 813 records, while the "traffic" class is the minority class with 65 records, which shows the imbalance between the two classes. The Modify phase applies SMOTE at 500%. The Model phase classifies using Naïve Bayes. With SMOTE, Naïve Bayes classified 1,131 records correctly and 72 incorrectly; without SMOTE, it classified 818 records correctly and 60 incorrectly. Keywords: Naïve Bayes, SMOTE, unbalanced data
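To make the 500% oversampling step concrete, here is a minimal sketch of the SMOTE idea: each minority record produces five synthetic records by interpolating toward one of its nearest minority neighbours. The function name, the neighbour count `k`, the fixed seed, and the sample points are assumptions for illustration and not the study's actual tooling.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def smote(minority, percent=500, k=5, seed=0):
    """Create percent/100 synthetic points per minority point."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    per_sample = percent // 100
    synthetic = []
    for i, x in enumerate(minority):
        others = [p for j, p in enumerate(minority) if j != i]
        neighbours = sorted(others, key=lambda p: sq_dist(x, p))[:k]
        for _ in range(per_sample):
            nb = rng.choice(neighbours)
            gap = rng.random()  # position along the segment from x to nb
            synthetic.append(tuple(a + gap * (b - a) for a, b in zip(x, nb)))
    return synthetic

# Hypothetical minority-class records with two numeric attributes.
minority = [(1.0, 1.0), (2.0, 1.5), (1.5, 2.0), (2.5, 2.5)]
synthetic = smote(minority, percent=500, k=2)

# Record counts implied by the abstract: 65 minority records at 500%.
minority_after = 65 + 65 * (500 // 100)   # 390
total_after = 813 + minority_after        # 1203
```

The resulting total of 1,203 records matches the 1,131 correctly and 72 incorrectly classified records reported for the SMOTE run.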
Violence is an action or threat directed against oneself, a group of people, or a community that results in psychological harm, trauma, or deprivation of rights. Karawang is one of the districts in the province of Jawa Barat. Violence against children and women is widespread in the Karawang area, one reason being victims' lack of awareness about following up on the cases that occur. The purpose of this study is to cluster cases of violence against children and women, which are scattered across every sub-district of Karawang District, into three clusters with low, medium, or high levels of violence, so that the Karawang government can provide differentiated, better-targeted treatment based on the analysis results for each sub-district. Data mining is the process of extracting data to obtain new information. This study uses the CRISP-DM methodology and applies the k-means clustering algorithm to data on cases of violence against children and women from 2016 to 2020. Testing with WEKA 3.8 produced three clusters, corresponding to three levels of violence: cluster 0 has 4 members categorized as high, cluster 1 has 2 members categorized as medium, and cluster 2 has 24 members categorized as low. The clustering results were evaluated with the purity measure, yielding a purity of 0.617, which indicates that the clusters are quite good.
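The k-means step can be sketched as below. This is a plain Lloyd-style iteration written for illustration, not the WEKA 3.8 implementation used in the study; the seeding with the first k points and the per-sub-district case counts are hypothetical.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(point, centroids):
    """Index of the centroid closest to the point."""
    return min(range(len(centroids)), key=lambda c: sq_dist(point, centroids[c]))

def k_means(points, k, max_iter=100):
    """Alternate assignment and mean update until the centroids stop moving."""
    centroids = [points[i] for i in range(k)]  # assumption: first k points as seeds
    for _ in range(max_iter):
        labels = [nearest(p, centroids) for p in points]
        updated = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                updated.append(tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                updated.append(centroids[c])  # keep an empty cluster's centroid
        if updated == centroids:
            break
        centroids = updated
    labels = [nearest(p, centroids) for p in points]
    return centroids, labels

# Hypothetical case counts per sub-district (one attribute), not the study's data.
cases = [(2,), (3,), (40,), (1,), (20,), (42,), (22,), (4,)]
centroids, labels = k_means(cases, k=3)
```

Sorting the final centroids by value gives a natural way to name the clusters low, medium, and high.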
WeTV is an online streaming application widely used by people in Indonesia as an entertainment medium while at home. The application has been downloaded more than 50 million times on the official Google Play Store website. With so many users, its reviews are abundant as well. Large numbers of reviews are very difficult to read manually, so sentiment analysis is needed to classify reviews into positive and negative classes. This study uses a support vector machine algorithm with a linear kernel to classify review data from the WeTV application, following the KDD process. Four scenarios were carried out: the first uses 60% training data and 40% test data, the second 70% and 30%, the third 80% and 20%, and the fourth 90% and 10%. The highest test accuracy, 85%, was obtained in the second (70%/30%), third (80%/20%), and fourth (90%/10%) scenarios. The model was evaluated with a confusion matrix. The first scenario shows an accuracy of 83%, precision of 83%, recall of 89%, and f1-score of 86%. The second scenario shows an accuracy of 85%, precision of 86%, recall of 89%, and f1-score of 87%. The third scenario shows an accuracy of 85%, precision of 85%, recall of 90%, and f1-score of 88%. The fourth scenario shows an accuracy of 85%, precision of 86%, recall of 90%, and f1-score of 90%.
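The four split scenarios and the confusion-matrix metrics can be sketched as follows. The helper names, the seed, the placeholder list of 100 reviews, and the confusion-matrix counts are hypothetical; they only show how the reported quantities are derived and do not reproduce the study's numbers.

```python
import random

def split(data, train_fraction, seed=0):
    """Shuffle a copy of the data and cut it into training and test parts."""
    shuffled = data[:]
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]

def binary_metrics(tp, fp, fn, tn):
    """Accuracy, precision, recall, and F1 for the positive class."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)  # harmonic mean
    return accuracy, precision, recall, f1

# Placeholder review IDs; the real dataset size is not stated in the abstract.
reviews = list(range(100))
sizes = {
    fraction: tuple(len(part) for part in split(reviews, fraction))
    for fraction in (0.6, 0.7, 0.8, 0.9)
}

# Hypothetical confusion-matrix counts for one scenario.
accuracy, precision, recall, f1 = binary_metrics(tp=90, fp=15, fn=10, tn=85)
```

Since F1 is the harmonic mean of precision and recall, it always lies between the two values.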
Applicants for the Bantuan Belajar Mahasiswa (BBM) scholarship are grouped into three categories: students who are eligible to receive the scholarship, students to be considered, and students who are not eligible. Grouping into three categories makes it easier to determine the BBM scholarship recipients. K-Medoids is a partition-based clustering algorithm that can be used to group the scholarship applicant data. The purpose of this study was to measure the performance of the algorithm by evaluating the clustering results with the purity measure computed for each resulting cluster. The data used are those of the 36 students who applied for the scholarship. The data were converted into three datasets with different formats: the original data, data with partially codified attributes, and data with fully codified attributes. The fully codified dataset gave the highest purity, 91.67%, so it can be concluded that the K-Medoids algorithm is better suited to datasets whose attributes are all codified.
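The purity measure counts, for each cluster, the records that belong to the cluster's most common true category, and divides the sum by the total number of records. The sketch below uses a made-up assignment of 36 students, constructed so that 33 fall in their cluster's majority category; the category names and the assignment are assumptions, not the study's data.

```python
from collections import Counter

def purity(cluster_labels, true_labels):
    """Fraction of records that match the majority category of their cluster."""
    per_cluster = {}
    for cluster, truth in zip(cluster_labels, true_labels):
        per_cluster.setdefault(cluster, Counter())[truth] += 1
    majority_total = sum(max(counts.values()) for counts in per_cluster.values())
    return majority_total / len(true_labels)

# Hypothetical result: 3 clusters of 12 students, each with one stray record.
cluster_labels = [0] * 12 + [1] * 12 + [2] * 12
true_labels = (
    ["eligible"] * 11 + ["considered"]
    + ["considered"] * 11 + ["not eligible"]
    + ["not eligible"] * 11 + ["eligible"]
)
score = purity(cluster_labels, true_labels)  # 33 / 36
```

With 36 records, 33 majority matches give 33/36, which rounds to the 91.67% reported for the fully codified dataset.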