2023
DOI: 10.3390/cancers15102741
|View full text |Cite
|
Sign up to set email alerts
|

Machine Learning Algorithms to Predict Breast Cancer Recurrence Using Structured and Unstructured Sources from Electronic Health Records

Abstract: Recurrence is a critical aspect of breast cancer (BC) that is inexorably tied to mortality. Reuse of healthcare data through Machine Learning (ML) algorithms offers great opportunities to improve the stratification of patients at risk of cancer recurrence. We hypothesized that combining features from structured and unstructured sources would provide better prediction results for 5-year cancer recurrence than either source alone. We collected and preprocessed clinical data from a cohort of BC patients, resultin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 11 publications
(6 citation statements)
references
References 50 publications
0
6
0
Order By: Relevance
“…The ML model based on extreme gradient boosting (XGB) was selected in our study because of its generalizability, low risk of overfitting, high interpretability [25], and high scalability [34]. XGB has been confirmed to be a reliable method for recognizing patterns in other diseases such as lupus erythematosus [16], traumatic brain injury-induced coagulopathy [35], epilepsy [36], diabetes [37], Alzheimer's disease [38,39], HIV [40,41], or different types of cancer [42][43][44][45][46]. We, therefore, used the aforementioned ML technique to determine which factors were most predictive of disease severity in a closed group of patients hospitalized for COVID-19 during the first two months of the pandemic, a time when the population did not yet have herd immunity and had not yet been vaccinated.…”
Section: Discussionmentioning
confidence: 99%
“…The ML model based on extreme gradient boosting (XGB) was selected in our study because of its generalizability, low risk of overfitting, high interpretability [25], and high scalability [34]. XGB has been confirmed to be a reliable method for recognizing patterns in other diseases such as lupus erythematosus [16], traumatic brain injury-induced coagulopathy [35], epilepsy [36], diabetes [37], Alzheimer's disease [38,39], HIV [40,41], or different types of cancer [42][43][44][45][46]. We, therefore, used the aforementioned ML technique to determine which factors were most predictive of disease severity in a closed group of patients hospitalized for COVID-19 during the first two months of the pandemic, a time when the population did not yet have herd immunity and had not yet been vaccinated.…”
Section: Discussionmentioning
confidence: 99%
“…Several ML/AI studies have been conducted in oncology [29][30][31][32][33][34][35]50,51]. Some of these have attempted to predict 5-10-year breast cancer recurrences using both structured and unstructured clinicopathological data from Electronic Health Records (EHRs) [50,51]; however, these studies focused primarily on predicting local rather than distant recurrences.…”
Section: Discussionmentioning
confidence: 99%
“…Machine learning serves as a pivotal tool in our study enabling us to delve into complex medical data and make precise predictions regarding survival outcomes in patients who have undergone breast-conserving surgery (BCS) [19]. Unlike conventional prognostic models, machine learning facilitates the creation of personalized prognostic models by considering a broad spectrum of patient-speci c data, including clinical, pathological, and demographic information [20].…”
Section: Discussionmentioning
confidence: 99%