2022
DOI: 10.3389/fcvm.2022.1025705
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning outperforms traditional logistic regression and offers new possibilities for cardiovascular risk prediction: A study involving 143,043 Chinese patients with hypertension

Abstract: IntroductionIdentifying people at risk of cardiovascular diseases (CVD) is a cornerstone of preventive cardiology. We developed machine learning (ML) algorithms and investigated their performance in predicting patients’ current CVD risk (coronary heart disease and stroke in this study).Materials and methodsWe compared traditional logistic regression (LR) with five ML algorithms LR with Elastic-Net, Random Forest (RF), XGBoost (XGB), Support Vector Machine, Deep Learning, and an Ensemble model averaging predict… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 29 publications
(25 reference statements)
0
3
0
Order By: Relevance
“…The predictive model will employ the Logistic Regression algorithm to estimate the mobility gradient of hospitals in 2021, categorizing them into three classes: LOW, MEDIUM, and HIGH mobility, based on the distance patients travel to reach the hospital. This choice is supported by studies demonstrating the effectiveness of Logistic Regression in predicting mobility behaviors and health risks, such as the analysis of travel behavior during the COVID-19 pandemic [19] and the assessment of cardiovascular risk [32], highlighting its applicability in complex healthcare contexts.…”
Section: Prediction Modelmentioning
confidence: 95%
“…The predictive model will employ the Logistic Regression algorithm to estimate the mobility gradient of hospitals in 2021, categorizing them into three classes: LOW, MEDIUM, and HIGH mobility, based on the distance patients travel to reach the hospital. This choice is supported by studies demonstrating the effectiveness of Logistic Regression in predicting mobility behaviors and health risks, such as the analysis of travel behavior during the COVID-19 pandemic [19] and the assessment of cardiovascular risk [32], highlighting its applicability in complex healthcare contexts.…”
Section: Prediction Modelmentioning
confidence: 95%
“…A longitudinal study involving 143,043 patients with hypertension was performed to predict long-term CVD risk. The study reported that advanced machine learning algorithms using RF performed better than traditional LR [ 19 ]. A longitudinal cohort study compared clinical risk predictions among patients with CVD using 19 prediction techniques.…”
Section: Introductionmentioning
confidence: 99%
“…Wu et al [13] developed a carotid plaque risk prediction tool among asymptomatic population based on machine learning (ML) algorithms, including extreme gradient boosting (XGBoost), gradient boosting decision tree, random forest (RF), and support vector machine, and achieved good performance, but the substantial complexity of the model may limit its practical use, while the commonly used risk prediction tools in the cardiovascular field, including Framingham risk score [15] and its modified model [16], were mostly based on traditional statistical models, including logistic regression and cox proportional-hazards regression. Although numerous studies have demonstrated that ML algorithms outperformed traditional statistical models in predictive performance throughout medical fields [17][18][19] due to their capability to analyze and learn the complex interactions and nonlinear associations among variables [17,20,21], the latter still own irreplaceable strengths, including their natural transparency, interpretability, and robustness, which boost their practicality in clinical research [22]. Therefore, using ML algorithms alone or traditional regression methods alone to train prediction models usually results in either accurate but complicated black boxes or practical but unsatisfactory-performed scoring systems.…”
Section: Introductionmentioning
confidence: 99%