2019
DOI: 10.1371/journal.pone.0226765
|View full text |Cite
|
Sign up to set email alerts
|

Predicting breast cancer risk using personal health data and machine learning models

Abstract: Among women, breast cancer is a leading cause of death. Breast cancer risk predictions can inform screening and preventative actions. Previous works found that adding inputs to the widely-used Gail model improved its ability to predict breast cancer risk. However, these models used simple statistical architectures and the additional inputs were derived from costly and / or invasive procedures. By contrast, we developed machine learning models that used highly accessible personal health data to predict five-yea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

5
64
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 76 publications
(70 citation statements)
references
References 28 publications
(62 reference statements)
5
64
0
1
Order By: Relevance
“…Third, our models were trained and tested on the derivation cohort, and validated on another validation cohort randomly split from the entire dataset. Such external validation verifies the generalizability of our models to new data 36 . Finally, the superiority of ML in terms of developing predictive models becomes clearer on larger populations with greater numbers of predictors.…”
Section: Discussionsupporting
confidence: 56%
“…Third, our models were trained and tested on the derivation cohort, and validated on another validation cohort randomly split from the entire dataset. Such external validation verifies the generalizability of our models to new data 36 . Finally, the superiority of ML in terms of developing predictive models becomes clearer on larger populations with greater numbers of predictors.…”
Section: Discussionsupporting
confidence: 56%
“…Stark et al [21] developed six models of machine learning by applying Gaussian naïve Bayes, decision tree, discriminant analysis, logistic regression analysis, support vector machine, and feed-forward ANN. They yielded five-year breast cancer risk estimates that are more accurate than those resulted using only BCRAT tools.…”
Section: Related Workmentioning
confidence: 99%
“…Following the progress in the computational power and data storage systems, such techniques are applied extensively in the healthcare eld 18,19 . Researchers have implemented machine learning to address several aspects of breast cancer management [20][21][22][23] and some works have previously used statistical or machine learning algorithms on various categories of predictors to provide breast cancer risk scoring systems [24][25][26] . However, each speci c clinical problem requires a scale that receives its inputs from the data that is available to its intended end-users, and performs well in the relevant patient population and healthcare setting 27,28 .…”
Section: Introductionmentioning
confidence: 99%