2018
DOI: 10.1016/j.jbi.2018.01.004
|View full text |Cite
|
Sign up to set email alerts
|

Estimating summary statistics for electronic health record laboratory data for use in high-throughput phenotyping algorithms

Abstract: We study the question of how to represent or summarize raw laboratory data taken from an electronic health record (EHR) using parametric model selection to reduce or cope with biases induced through clinical care. It has been previously demonstrated that the health care process (Hripcsak and Albers, 2012, 2013), as defined by measurement context (Hripcsak and Albers, 2013; Albers et al., 2012) and measurement patterns (Albers and Hripcsak, 2010, 2012), can influence how EHR data are distributed statistically (… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 21 publications
(14 citation statements)
references
References 60 publications
0
14
0
Order By: Relevance
“…[61][62][63] Recent works propose phenotyping strategies to overcome hurdles using multiple data sources to more accurately ascertain disease status. [64][65][66][67][68][69][70][71][72] However, future work is needed to provide statistical methods for incorporating data of different types for phenome generation. For a detailed review of phenotyping procedures, see Bush et al 7 Figure S8 provides some examples of the types of structured and unstructured EHR information that can be used to construct phenotypes.…”
Section: 13mentioning
confidence: 99%
See 1 more Smart Citation
“…[61][62][63] Recent works propose phenotyping strategies to overcome hurdles using multiple data sources to more accurately ascertain disease status. [64][65][66][67][68][69][70][71][72] However, future work is needed to provide statistical methods for incorporating data of different types for phenome generation. For a detailed review of phenotyping procedures, see Bush et al 7 Figure S8 provides some examples of the types of structured and unstructured EHR information that can be used to construct phenotypes.…”
Section: 13mentioning
confidence: 99%
“…Recent works propose phenotyping strategies to overcome hurdles using multiple data sources to more accurately ascertain disease status . However, future work is needed to provide statistical methods for incorporating data of different types for phenome generation.…”
Section: Statistical Issues Related To Biobank Researchmentioning
confidence: 99%
“…Traditional predictive modeling can detect some environmental and system modifiers through latent factors, but HPM-ExpertSignals encourage explicit modeling of difficult-to-detect modifiers using a top-down modeling approach. 6 , 47 , 48 For example, our team has demonstrated the implicit identification of contextual factors and data-driven covariates for modeling laboratory data 47 and survival models of chronic kidney disease 48 for healthcare process biases and effects.…”
Section: Resultsmentioning
confidence: 99%
“…The variability in the presence of SLP referrals and definitive diagnoses describes our diverse set of cases that met our definition for inclusion (DLD classification) and highlights the need for efficient identification of LDs (see Supplemental Material S3 ). Other recently developed EHR data mining algorithms employ natural language processing to automatically classify phenotypes using text recognition from medical notes and other clinical information ( Albers et al, 2018 ; Deferio et al, 2018 ; Wei et al, 2016 ). APT-DLD, however, is automated, is rule based, and does not depend on natural language processing or access to medical notes.…”
Section: Discussionmentioning
confidence: 99%