2018
DOI: 10.1007/s40471-018-0165-9
|View full text |Cite
|
Sign up to set email alerts
|

Using Machine Learning to Identify Health Outcomes from Electronic Health Record Data

Abstract: Purpose of review: Electronic health records (EHRs) contain valuable data for identifying health outcomes, but these data also present numerous challenges when creating computable phenotyping algorithms. Machine learning methods could help with some of these challenges. In this review, we discuss four common scenarios that researchers may find helpful for thinking critically about when and for what tasks machine learning may be used to identify health outcomes from EHR data. Recent findings: We first conside… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
32
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 64 publications
(32 citation statements)
references
References 64 publications
0
32
0
Order By: Relevance
“…Performance in the development of such algorithms demands a strong collaboration between clinicians and data scientists from the precise formulation of problems to a common and balanced interpretation of results . Moreover, the data‐driven processes inherent to machine learning applications need multiple internal and external validations before dissemination to multisite application and generalization . An important limitation associated with such Big Data analyses is related to data complexity and heterogeneity in quality with significant rates of anomalies that have been identified when assessing the quality of metadata records .…”
Section: The Challenges Of Big Datamentioning
confidence: 99%
“…Performance in the development of such algorithms demands a strong collaboration between clinicians and data scientists from the precise formulation of problems to a common and balanced interpretation of results . Moreover, the data‐driven processes inherent to machine learning applications need multiple internal and external validations before dissemination to multisite application and generalization . An important limitation associated with such Big Data analyses is related to data complexity and heterogeneity in quality with significant rates of anomalies that have been identified when assessing the quality of metadata records .…”
Section: The Challenges Of Big Datamentioning
confidence: 99%
“…With signi cant evolution in recent years [10], machine learning methods are powerful tools in supporting medical diagnoses. Studies [11,9,12] have shown that these methods are capable of predicting and identifying diseases based on laboratory tests and clinical data with similar accuracy to a human specialist. Other studies [13,14,15] have also been able to assist in the diagnosis of diabetes by making use of machine learning techniques.…”
Section: Introductionmentioning
confidence: 99%
“…Because inconsistencies and inaccuracies are a reality when using EHR to gather data, depending solely on one indication -like a particular diagnostic code or test result -for even well-defined health outcomes (e.g. hypertension) does not always lead to accurate classifications(Wong, Horwitz, et al, 2018). Two proposed solutions to allow incomplete EHR data to train more predictive models are to include more features in the training data and to use multiple data types to identify a single target(Wong, Horwitz et al, 2018).…”
mentioning
confidence: 99%
“…hypertension) does not always lead to accurate classifications(Wong, Horwitz, et al, 2018). Two proposed solutions to allow incomplete EHR data to train more predictive models are to include more features in the training data and to use multiple data types to identify a single target(Wong, Horwitz et al, 2018). Including more features is particularly useful when the relationship between the target and the variables is intricate.…”
mentioning
confidence: 99%
See 1 more Smart Citation