2022
DOI: 10.1093/jamia/ocac216
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning approaches for electronic health records phenotyping: a methodical review

Abstract: Objective Accurate and rapid phenotyping is a prerequisite to leveraging electronic health records for biomedical research. While early phenotyping relied on rule-based algorithms curated by experts, machine learning (ML) approaches have emerged as an alternative to improve scalability across phenotypes and healthcare settings. This study evaluates ML-based phenotyping with respect to (1) the data sources used, (2) the phenotypes considered, (3) the methods applied, and (4) the reporting and … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
12
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
4

Relationship

1
9

Authors

Journals

citations
Cited by 48 publications
(28 citation statements)
references
References 132 publications
0
12
1
Order By: Relevance
“…2021. Development of Machine Learning approaches and Electronic Health Record Data algorithms for the identification of CPPD ( 51 ).…”
Section: Cppd Disease Timelinementioning
confidence: 99%
“…2021. Development of Machine Learning approaches and Electronic Health Record Data algorithms for the identification of CPPD ( 51 ).…”
Section: Cppd Disease Timelinementioning
confidence: 99%
“…It is very challenging to extract knowledge from the electronic medical system 1 . Various approaches, including the use of structured data 2 , natural language processing toolboxes 35 , and others have been shown to hold some promise.…”
Section: Introductionmentioning
confidence: 99%
“…Recent efforts have been directed towards enhancing the accuracy of assigning disease phenotypes while maintaining scalability of EHR driven studies. This has led to the development of label-efficient, high-throughput, semi-supervised or weakly supervised machine learning algorithms that can precisely identify disease phenotypes (4; 9). Since relevant information pertaining to a phenotype is often dispersed across different fields in the EHR, including unstructured clinical notes, weakly supervised techniques that utilize multiple features have proven to be effective across diverse healthcare systems and disease phenotypes.…”
Section: Introductionmentioning
confidence: 99%