The increase in the use of energy sources causes air pollution. The Air Pollutant Index (API) is information about the air quality of a place and at a certain time. API has several parameters, namely SO2, PM10, NO2, O3, and CO. In this study, the KNN method was used to assist categorize air quality. However, all training data were used during the classification process with KNN causes a long prediction process. Another problem with KNN is difficult to determine the optimal value of the K parameter in KNN. The Particle Swarm Optimization (PSO) method can be used for problems on KNN. Therefore, the aim of this study is to predict air quality based on the API by combining the KNN-PSO method. The dataset used is the API dataset for the DKI Jakarta area 2017-2019 totaling 1075 data. The results showed the accuracy for the KNN-PSO method was 98.42% with a precision value of 97.75% and a recall value of 98.13%. To further analyze the results on the combined method, the results of this study were compared with the KNN method only. The results obtained from the KNN method are lower than the KNN-PSO method. So it can be concluded that the KNN-PSO method is great and robust in air quality classification or prediction.
The purpose of this study was to determine how the effect of using Bootstrapping Samples for resampling the Harlev dataset in improving the performance of single-cell pap smear classification by dealing with the data imbalance problem. The Harlev dataset used in this study consists of 917 data with 20 attributes. The number of classes on the label had data imbalance in the dataset that affected single-cell pap smear classification performance. The data imbalance in the classification causes machine learning algorithms to produce poor performance in the minority class because they were overwhelmed by the majority class. To overcome it, The resampling data could be used with Sample Bootstrapping. The results of the Sample Bootstrapping were evaluated using the Artificial Neural Network and K-Nearest Neighbors classification methods. The classification used was seven classes and two classes. The classification results using these two methods showed an increase in accuracy, precision, and recall values. The performance improvement reached 10.82% for the two classes classification and 35% for the seven classes classification. It was concluded that Sample Boostrapping was good and robust in improving the classification method.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.