2019
DOI: 10.3390/jcm8071050
|View full text |Cite
|
Sign up to set email alerts
|

Physician-Friendly Machine Learning: A Case Study with Cardiovascular Disease Risk Prediction

Abstract: Machine learning is often perceived as a sophisticated technology accessible only by highly trained experts. This prevents many physicians and biologists from using this tool in their research. The goal of this paper is to eliminate this out-dated perception. We argue that the recent development of auto machine learning techniques enables biomedical researchers to quickly build competitive machine learning classifiers without requiring in-depth knowledge about the underlying algorithms. We study the case of pr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
25
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 55 publications
(33 citation statements)
references
References 27 publications
0
25
0
Order By: Relevance
“…Meghana et al [21] used "auto-sklearn", an automatic machine learning (AutoML) library for developing classifiers of CVDs. They experimented on both the heart UCI dataset and a cardiovascular disease dataset consisting of 70,000 records of patients and, as a result, AutoML outperformed traditional machine learning classifiers.…”
Section: Related Workmentioning
confidence: 99%
“…Meghana et al [21] used "auto-sklearn", an automatic machine learning (AutoML) library for developing classifiers of CVDs. They experimented on both the heart UCI dataset and a cardiovascular disease dataset consisting of 70,000 records of patients and, as a result, AutoML outperformed traditional machine learning classifiers.…”
Section: Related Workmentioning
confidence: 99%
“…This dataset contains in total 303 patient records with 76 attributes for each one, but only 14 of them are used for our evaluation to make our scores comparable to previous works. In particular, the Cleveland dataset is the only one that has been used by ML researchers to this date [6], [7], [12], [13], [22], [23], [30][31][32][33]. Tab.…”
Section: Dataset Descriptionmentioning
confidence: 99%
“…Therefore, the Gini coefficient or information entropy is also needed to evaluate the importance of features. This experiment ranks the importance of all features by using the feature importance method in Python's Sklearn library [ 27 ]. The feature importance method ranks the features according to the number of Gini coefficient drops.…”
Section: Experiments Designmentioning
confidence: 99%