2022
DOI: 10.1002/pds.5500
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning for improving high‐dimensional proxy confounder adjustment in healthcare database studies: An overview of the current literature

Abstract: Purpose Supplementing investigator‐specified variables with large numbers of empirically identified features that collectively serve as ‘proxies’ for unspecified or unmeasured factors can often improve confounding control in studies utilizing administrative healthcare databases. Consequently, there has been a recent focus on the development of data‐driven methods for high‐dimensional proxy confounder adjustment in pharmacoepidemiologic research. In this paper, we survey current approaches and recent advancemen… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 110 publications
0
2
0
Order By: Relevance
“…While logistic regression is the traditional approach for estimating the propensity score, least absolute shrinkage and selection operator (LASSO) and other regularized regression tools have been recommended for hdPS 24,25 . Additionally, machine learning models like random forest and neural networks have shown promising results in prior studies utilizing alternative models for estimating the propensity score 25–28 . Future research should explore the incorporation of regularized regression and machine learning models, in addition to logistic regression, in the SL‐hdPS ensemble.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…While logistic regression is the traditional approach for estimating the propensity score, least absolute shrinkage and selection operator (LASSO) and other regularized regression tools have been recommended for hdPS 24,25 . Additionally, machine learning models like random forest and neural networks have shown promising results in prior studies utilizing alternative models for estimating the propensity score 25–28 . Future research should explore the incorporation of regularized regression and machine learning models, in addition to logistic regression, in the SL‐hdPS ensemble.…”
Section: Discussionmentioning
confidence: 99%
“…24,25 Additionally, machine learning models like random forest and neural networks have shown promising results in prior studies utilizing alternative models for estimating the propensity score. [25][26][27][28] Future research should explore the incorporation of regularized regression and machine learning models, in addition to logistic regression, in the SL-hdPS ensemble. This is particularly pertinent given SL's ability to incorporate a wide variety of models.…”
Section: Discussionmentioning
confidence: 99%