2022
DOI: 10.48550/arxiv.2202.02096
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

To Impute or not to Impute? Missing Data in Treatment Effect Estimation

Abstract: Missing data is a systemic problem in practical scenarios that causes noise and bias when estimating treatment effects. This makes treatment effect estimation from data with missingness a particularly tricky endeavour. A key reason for this is that standard assumptions on missingness are rendered insufficient due to the presence of an additional variable, treatment, besides the individual and the outcome. Having a treatment variable introduces additional complexity with respect to why some variables are missin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 21 publications
0
3
0
Order By: Relevance
“…Depending on which imputation strategy we use, we may introduce additional parametric assumptions into our pipeline. Furthermore, depending on which mechanism governs the missingness patterns, we may have to make additional structural assumptions before imputation can even begin [14,16,77]. Making such a structural assumptions, before learning a graph using Peters et al [73] would make our solution for the practitioner's problem invalid once again.…”
Section: Definition 2 (Transition)mentioning
confidence: 99%
See 1 more Smart Citation
“…Depending on which imputation strategy we use, we may introduce additional parametric assumptions into our pipeline. Furthermore, depending on which mechanism governs the missingness patterns, we may have to make additional structural assumptions before imputation can even begin [14,16,77]. Making such a structural assumptions, before learning a graph using Peters et al [73] would make our solution for the practitioner's problem invalid once again.…”
Section: Definition 2 (Transition)mentioning
confidence: 99%
“…CDL can help define interaction effects of missingness and covariates to improve the accuracy of imputation methods. For example, CDL-based approaches allow deep learning to accurately impute missing values, respecting the causal interaction between missingness indicators and treatment selection [14][15][16]. CDL-based methods outperform existing imputation techniques in terms of both imputation accuracy and unbiased estimation from data with missing values.…”
Section: Real-world Applicationsmentioning
confidence: 99%
“…However, few literature investigated these strategies in the estimation of medical treatment effect. Berrevoets et al found that selective imputation performed better than naively impute all data in causal context 11 , but it’s difficult to identify either missingness causing treatment or missingness caused by treatment pattern in practice. Thus our study aimed to compare various MI strategies with complete case analysis in treatment effect estimation systematically to evaluate whether imputing data works better and which imputation strategy should be adopted in RCT emulation using RWD.…”
Section: Introductionmentioning
confidence: 99%