Feature Selection (F.S.) reduces the number of features by removing unnecessary, redundant, and noisy information while keeping a relatively decent classification accuracy. F.S. can be considered an optimization problem. As the problem is challenging and there are many local solutions, stochastic optimization algorithms may be beneficial .This paper proposes a novel approach to dimension reduction in feature selection. As a seminal attempt, this work uses binary variants of the recent Marine Predators Algorithm (MPA) to select the optimal feature subset to improve classification accuracy. MPA is a new and novel nature-inspired metaheuristic. This research proposes an algorithm that is a hybridization between MPA and k-Nearest Neighbors (k-NN) called MPA-KNN. k-Nearest Neighbors (k-NN) is used to evaluate the selected features on medical datasets with feature sizes ranging from tiny to massive. The proposed methods are evaluated on 18 well-known UCI medical dataset benchmarks and compared with eight well-regarded metaheuristic wrapper-based approaches. The core exploratory and exploitative processes are adapted in MPA to select the optimal and meaningful features for achieving the most accurate classification. The results show that the proposed MPA-KNN approach had a remarkable capability to select the optimal and significant features. It performed better than the well-established metaheuristic algorithms we tested. The algorithms we used for comparison are Grey Wolf Optimizer (GWO), MothFlame Optimization Algorithm (MFO), Sine Cosine Algorithm (SCA), Whale Optimization Algorithm (WOA), Slap Swarm Algorithm (SSA), Butterfly Optimization Algorithm (BFO), and Harris Hawks Optimization (HHO). This paper is the first work that implements MPA for Feature Selection problems. The results ensure that the proposed MPA-KNN approach has a remarkable capability to select the optimal and significant features and performed better than several metaheuristic algorithms. MPA-KNN achieves the best averages accuracy, Sensitivity, and Specificity rates of all datasets.
In comparison to the competitors, engineers must provide quick, low-cost, and dependable solutions. The advancement of intelligence generated by machines and its application in almost every field has created a need to reduce the human role in image processing while also making time and labor profit. Lepidopterology is the discipline of entomology dedicated to the scientific analysis of caterpillars and the three butterfly superfamilies. Students studying lepidopterology must generally capture butterflies with nets and dissect them to discover the insect’s family types and shape. This research work aims to assist science students in correctly recognizing butterflies without harming the insects during their analysis. This paper discusses transfer-learning-based neural network models to identify butterfly species. The datasets are collected from the Kaggle website, which contains 10,035 images of 75 different species of butterflies. From the available dataset, 15 unusual species were selected, including various butterfly orientations, photography angles, butterfly lengths, occlusion, and backdrop complexity. When we analyzed the dataset, we found an imbalanced class distribution among the 15 identified classes, leading to overfitting. The proposed system performs data augmentation to prevent data scarcity and reduce overfitting. The augmented dataset is also used to improve the accuracy of the data models. This research work utilizes transfer learning based on various convolutional neural network architectures such as VGG16, VGG19, MobileNet, Xception, ResNet50, and InceptionV3 to classify the butterfly species into various categories. All the proposed models are evaluated using precision, recall, F-Measure, and accuracy. The investigation findings reveal that the InceptionV3 architecture provides an accuracy of 94.66%, superior to all other architectures.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.