2010
DOI: 10.1002/sim.4124
|View full text |Cite
|
Sign up to set email alerts
|

Estimating propensity scores with missing covariate data using general location mixture models

Abstract: Abstract[In many observational studies, researchers estimate causal effects using propensity scores, e.g., by matching or sub-classifying on the scores. Estimation of propensity scores is complicated when some values of the covariates are missing. We propose to use multiple imputation to create completed datasets, from which propensity scores can be estimated, with a general location mixture model. The model assumes that the control units are a latent mixture of (i) units whose covariates are drawn from the sa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
50
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
9

Relationship

3
6

Authors

Journals

citations
Cited by 36 publications
(50 citation statements)
references
References 58 publications
0
50
0
Order By: Relevance
“…To account for a few covariates with a small fraction of missing data and to avoid bias if these data were not missing completely at random, we used the general location mixture model proposed by Mitra and Reiter. 20 This approach uses multiply imputed data to handle the missing values with an additional covariate that assists in identifying patients who switched to RAL-sparing ART but would have been good candidates for RAL-containing ART. Furthermore, we incorporated a maximum likelihood-based estimation procedure into the logistic regression model 21 to account for baseline HIV RNA values that were undetectable.…”
Section: Methodsmentioning
confidence: 99%
“…To account for a few covariates with a small fraction of missing data and to avoid bias if these data were not missing completely at random, we used the general location mixture model proposed by Mitra and Reiter. 20 This approach uses multiply imputed data to handle the missing values with an additional covariate that assists in identifying patients who switched to RAL-sparing ART but would have been good candidates for RAL-containing ART. Furthermore, we incorporated a maximum likelihood-based estimation procedure into the logistic regression model 21 to account for baseline HIV RNA values that were undetectable.…”
Section: Methodsmentioning
confidence: 99%
“…A straightforward approach consists of two independent steps: first impute the missing data and then draw causal inferences from the imputed complete data (e.g. Mitra and Reiter, 2011). However, how the missing values are imputed may have a nontrivial impact on the subsequent causal analysis (e.g.…”
Section: Unintentional Missing Datamentioning
confidence: 99%
“…We assume that an analyst is interested in determining the relationship between PIATM and the effect of treatment after adjusting for relevant pre-treatment variables. We use the same fourteen background covariates used in Mitra and Reiter (2011). These are the child's race (Hispanic, black or other), the mother's race (Hispanic, black, Asian, white, Hawaiian/Pacific Islander, American Indian or other), the child's sex, indicator variable on whether the child's grandparents were present at birth and another variable indicating the presence of the mother's spouse at birth, the number of years between 1979 and when the mother gave birth (square root transformed), the mother's score on the Armed Forces Qualification Test (square root transformed), the mother's highest educational achievement, the child's birth weight, the number of days spent by the mother in the hospital (log transformed), the number of days spent by the child in the hospital (log transformed), the number of weeks the mother worked in the year preceding to child birth categorised into four groups (0 weeks, 1-47 weeks, 48-51 weeks and 52 weeks), the number of weeks the child was born premature categorised into three groups (0 weeks, 1-4 weeks and >5 weeks preterm), and family income (log transformed) at the time of the birth of the child.…”
Section: Nlsy Data Setmentioning
confidence: 99%
“…Following Mitra and Reiter (2011) we dichotomise the variable that measures duration of breastfeeding so as to split units into two groups; the control group, comprises those units who were breastfed for less than 24 weeks, while the treatment group comprises those units who were breastfed for 24 weeks or more. We assume that an analyst is interested in determining the relationship between PIATM and the effect of treatment after adjusting for relevant pre-treatment variables.…”
Section: Nlsy Data Setmentioning
confidence: 99%