2019
DOI: 10.1371/journal.pone.0212999
|View full text |Cite
|
Sign up to set email alerts
|

Generating and evaluating a propensity model using textual features from electronic medical records

Abstract: Background Propensity score (PS) methods are commonly used to control for confounding in comparative effectiveness studies. Electronic health records (EHRs) contain much unstructured data that could be used as proxies for potential confounding factors. The goal of this study was to assess whether the unstructured information can also be used to construct PS models that would allow to properly deal with confounding. We used an example of coxibs (Cox-2 inhibitors) vs. traditional NSAIDs and the risk… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 35 publications
(38 reference statements)
0
4
0
Order By: Relevance
“…Studies have also shown that models that optimize prediction can underfit for purposes of confounding adjustment and are not necessarily optimal for reducing bias in estimated treatment effects 43 . Recent work independent of ours has explored the use of large‐scale feature engineering from unstructured EHR text for improved confounding control for propensity score analyses 46 . This work found benefits confounding control, but the benefits were incremental and findings were limited to a single empirical study 46 .…”
Section: Discussionmentioning
confidence: 95%
See 2 more Smart Citations
“…Studies have also shown that models that optimize prediction can underfit for purposes of confounding adjustment and are not necessarily optimal for reducing bias in estimated treatment effects 43 . Recent work independent of ours has explored the use of large‐scale feature engineering from unstructured EHR text for improved confounding control for propensity score analyses 46 . This work found benefits confounding control, but the benefits were incremental and findings were limited to a single empirical study 46 .…”
Section: Discussionmentioning
confidence: 95%
“…Recent work independent of ours has explored the use of large‐scale feature engineering from unstructured EHR text for improved confounding control for propensity score analyses 46 . This work found benefits confounding control, but the benefits were incremental and findings were limited to a single empirical study 46 . Determining to what extent the addition of these EHR‐generated features can improve confounding control is a more difficult problem that we leave to future research.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Natural language processing (NLP) is a subfield of machine learning that can be used to generate variables from unstructured free text 31 . NLP methods are increasingly used to identify health outcomes from EHRs, but the application of NLP algorithms for purposes of identifying high‐dimensional sets of confounding factors is limited 32 . More research is needed on the use of NLP algorithms for generating high‐dimensional sets of proxy confounders and the value of unstructured EHR data in proxy adjustment.…”
Section: Generating Features For Proxy Confounder Adjustmentmentioning
confidence: 99%