Background
Cardiovascular disorders (CVDs) are widely considered the leading cause of death worldwide. Lower and middle-income countries (LMICs) like Bangladesh are also affected by several types of CVDs such as heart failure and stroke. The leading factors of death in Bangladesh have increasingly switched from severe infections and parasitic illness to CVDs recently.
Materials and methods
The study dataset is a random sample of the 391 CVD patients' medical records collected between August 2022 and April 2023 using simple random sampling. Moreover, 260 data are also collected from individuals with no CVD problem for comparison purposes. Crosstabs and chi-square are used to find the association between CVD and explanatory variables. Logistic regression, Naïve Bayes classifier, Decision Tree, AdaBoost classifier, Random Forest, Bagging Tree, and Ensemble learning classifiers are used to predict CVD in this study. The performance evaluations encompassed accuracy, sensitivity, specificity, and the area under the receiver operator characteristic (AU-ROC) curve.
Result
Random Forest has the highest precision among the five techniques considered. The precision rates for the mentioned classifiers are as follows: Logistic Regression (93.67%), Naïve Bayes (94.87%), Decision Tree (96.1%), AdaBoost (94.94%), Random Forest (96.15%), and Bagging Tree (94.87%). The Random Forest classifier maintains the highest balance between correct and incorrect predictions. With 98.04% accuracy, the Random Forest Classifier achieves the best precision (96.15%), robust recall (100%), and a high F1 score (97.7%). In contrast, the Logistic Regression model achieves the lowest accuracy at 95.42%. Remarkably, the Random Forest classifier attains the highest AUC value (0.989).
Conclusion
This research is mainly focused on identifying factors that are critical in impacting CVD patients and predicting CVD risk. It is strongly advised that the Random Forest technique be implemented in the system for predicting cardiac disease. This research may change clinical practice by giving doctors a new instrument to determine a patient's prognosis for CVD.