2017
DOI: 10.1093/jamia/ocx111

Enabling phenotypic big data with PheNorm

Abstract: The accuracy of the PheNorm algorithms is on par with algorithms trained with annotated samples. PheNorm fully automates the generation of accurate phenotyping algorithms and demonstrates the capacity for EHR-driven annotations to scale to the next level - phenotypic big data.

Citations: cited by 92 publications (111 citation statements)
References: 48 publications
“…Finally, the claims‐based outcome is likely imperfect and may be particularly so for earlier onset or less common forms of dementia; however, we note that dementia is typically underdiagnosed (which would bias the present study toward the null hypothesis) and prior formal validation has found positive predictive value of dementia codes to be >75% in most health systems. Further work is needed on highly scalable computed phenotypes as many existing approaches consume a full medical record and thus are poorly suited to time‐to‐event analysis. The NLP approach reported here could contribute to that effort.…”
Section: Discussion (mentioning)
confidence: 97%
“…52,53 Further work is needed on highly scalable computed phenotypes as many existing approaches consume a full medical record and thus are poorly suited to time-to-event analysis. 54,55 The NLP approach reported here could contribute to that effort.…”
Section: Discussion (mentioning)
confidence: 99%
“…Unstructured data have also been used to define phenotypes, particularly for diseases with unreliable ICD9 classifications such as some psychiatric diseases, using natural language processing methods. Such methods can also be used to obtain patient measures such as smoking status.…”
Section: Statistical Issues Related To Biobank Research (mentioning)
confidence: 99%
“…Some challenges include dealing with misspellings, tenses, alternative phrasing, negation, and defining a trained dictionary of words and phrases that may correspond to a particular phenotype. Algorithms are usually trained using expert annotations, but new methods have attempted to automate this step as well. Additional machine learning methods have also been used to define phenotypes (eg, imaging analytics from medical imaging datasets) using a broad spectrum of patient information…”
Section: Statistical Issues Related To Biobank Research (mentioning)
confidence: 99%