2015
DOI: 10.1176/appi.ajp.2014.14030423
|View full text |Cite
|
Sign up to set email alerts
|

Validation of Electronic Health Record Phenotyping of Bipolar Disorder Cases and Controls

Abstract: Objective To validate the use of electronic health records (EHRs) for the diagnosis of bipolar disorder (BD) and controls. Methods EHR data were obtained from a healthcare system of more than 4.2 million patients spanning more than 20 years. Chart review by experienced clinicians was used to identify text features and coded data consistent or inconsistent with a diagnosis of BD. Natural language processing (NLP) was used to train a diagnostic algorithm with 95% specificity for classifying BD. Filtered coded … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

2
144
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
9
1

Relationship

5
5

Authors

Journals

citations
Cited by 129 publications
(146 citation statements)
references
References 31 publications
2
144
0
Order By: Relevance
“…However, our head-to-head comparison of the heritability estimates between self-reported illness and ICD-10 codes showed largely consistent results, indicating that both phenotypic approaches at least captured comparable variations in these phenotypes. Prior research evaluating phenotypes derived from electronic health records (EHR) indicate that greater phenotypic validity can be achieved when diagnostic codes are supplemented with text mining methods [55][56][57][58]. The specificity of the disease codes might also be improved by leveraging the medication records in the UK Biobank.…”
Section: Discussionmentioning
confidence: 99%
“…However, our head-to-head comparison of the heritability estimates between self-reported illness and ICD-10 codes showed largely consistent results, indicating that both phenotypic approaches at least captured comparable variations in these phenotypes. Prior research evaluating phenotypes derived from electronic health records (EHR) indicate that greater phenotypic validity can be achieved when diagnostic codes are supplemented with text mining methods [55][56][57][58]. The specificity of the disease codes might also be improved by leveraging the medication records in the UK Biobank.…”
Section: Discussionmentioning
confidence: 99%
“…Specifically, we apply latent Dirichilet allocation (LDA), a means of identifying commonly cooccurring features, to derive a set of 50 disease topics. Then we test those topics for association with common genetic variation and compare this approach to capture a given diagnosis varies widely, even when diagnosis-specific classifiers are applied to augment single codes (2,3). As such, approaches that focus on individual diagnostic codes are limited by inaccurate, missing or heterogeneous diagnoses; eg, where individuals with cystic fibrosis might be represented by male infertility, diabetes and chronic rhinosinusitis even in the absence of a diagnostic code for cystic fibrosis (4).…”
mentioning
confidence: 99%
“…Specifically, given the lack of treatments in neurological disorders, the use of algorithms in the identification of new patents is a pressing need for the biopharmaceutical sector. One study outlined the effective application of semiautomated mining of EHRs to ascertain bipolar disorder patients and control subjects with high specificity and predictive value when compared with diagnostic interviews [28]. This technique could have broad applicability across many research areas in neurology.…”
Section: Discovering or Validating New Markers For Patient Stratificamentioning
confidence: 99%