In cardiology, as in other medical specialties, early and accurate diagnosis of heart disease is crucial as it has been the leading cause of death over the past few decades. Early prediction of heart disease is now more crucial than ever. However, the state-of-the-art heart disease prediction strategy put more emphasis on classifier selection in enhancing the accuracy and performance of heart disease prediction, and seldom considers feature reduction techniques. Furthermore, there are several factors that lead to heart disease, and it is critical to identify the most significant characteristics in order to achieve the best prediction accuracy and increase prediction performance. Feature reduction reduces the dimensionality of the information, which may allow learning algorithms to work quicker and more efficiently, producing predictive models with the best rate of accuracy. In this study, we explored and suggested a hybrid of two distinct feature reduction techniques, chi-squared and analysis of variance (ANOVA). In addition, using the ensemble stacking method, classification is performed on selected features to classify the data. Using the optimal features based on hybrid features combination, the performance of a stacking ensemble based on logistic regression yields the best result with 93.44%. This can be summarized as the feature selection method can take into account as an effective method for the prediction of heart disease.
The rise in heart disease among the general population is alarming. This is because cardiovascular disease is the leading cause of death, and several studies have been conducted to assist cardiologists in identifying the primary cause of heart disease. The classification accuracy of single classifiers utilised in most recent studies to predict heart disease is quite low. The accuracy of classification can be enhanced by integrating the output of multiple classifiers in an ensemble technique. Even though they can deliver the best classification accuracy, the existing ensemble approaches that integrate all classifiers are quite resource-intensive. This study thus proposes a stacking ensemble that selects the optimal subset of classifiers to produce meta-classifiers. In addition, the research compares the effectiveness of several meta-classifiers to further enhance classification. There are ten types of algorithms, including logistic regression (LR), support vector classifier (SVC), random forest (RF), extra tree classifier (ETC), naïve bayes (NB), extreme gradient boosting (XGB), decision tree (DT), k-nearest neighbor (KNN), multilayer perceptron (MLP), and stochastic gradient descent (SGD) are used as a base classifier. The construction of the meta-classifier utilised three different algorithms consisting of LR, MLP, and SVC. The prediction results from the base classifier are then used as input for the stacking ensemble. The study demonstrates that using a stacking ensemble performs better than any other single algorithm in the base classifier. The meta-classifier of logistic regression yielded 90.16% results which is better than any base classifiers. In conclusion, we could assume that the ensemble stacking approach can be considered an additional means of achieving better accuracy and has improved the performance of the classification.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.