Estimating propensity scores with missing covariate data using general location mixture models

Mitra, Robin; Reiter, Jerome P.

doi:10.1002/sim.4124

Cited by 36 publications

(50 citation statements)

References 58 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To account for a few covariates with a small fraction of missing data and to avoid bias if these data were not missing completely at random, we used the general location mixture model proposed by Mitra and Reiter. 20 This approach uses multiply imputed data to handle the missing values with an additional covariate that assists in identifying patients who switched to RAL-sparing ART but would have been good candidates for RAL-containing ART. Furthermore, we incorporated a maximum likelihood-based estimation procedure into the logistic regression model 21 to account for baseline HIV RNA values that were undetectable.…”

Section: Methodsmentioning

confidence: 99%

Long-term immunologic and virologic responses on raltegravir-containing regimens among ART-experienced participants in the HIV Outpatient Study

Wiegand¹,

Armon

Chmiel

et al. 2015

HIV Clinical Trials

View full text Add to dashboard Cite

Objectives Raltegravir (RAL)-containing antiretroviral therapy (ART) produced better immunologic and virologic responses than optimized background ART in clinical trials of heavily ART-experienced patients, but few data exist on long-term outcomes in routine HIV care. Methods We studied ART-experienced HIV outpatient study (HOPS) participants seen at 10 US HIV-specialty clinics during 2007–2011. We identified patients who started (baseline date) either continuous ≥30 days of RAL-containing or RAL-sparing ART, and used propensity score (PS) matching methods to account for baseline clinical and demographic differences. We used Kaplan–Meier methods and log-rank tests for the matched subsets to evaluate probability of death, achieving HIV RNA <50 copies/ml, and CD4 cell count (CD4) increase of ≥50 cells mm−3 during follow-up. Results Among 784 RAL-exposed and 1062 RAL-unexposed patients, 472 from each group were matched by PS. At baseline, the 472 RAL-exposed patients (mean nadir CD4, 205 cells mm−3; mean baseline CD4, 460 cells mm−3; HIV RNA <50 copies ml−1 in 61%; mean years on prescribed ART, 7.5) were similar to RAL unexposed. During a mean follow-up of over 3 years, mortality rates and immunologic and virologic trajectories did not differ between the two groups. Among patients with detectable baseline HIV RNA levels, 76% of RAL-exposed and 63% of RAL-unexposed achieved HIV RNA <50 copies ml−1 (P=0.51); 69 and 58%, respectively, achieved a CD4 increase ≥50 cells mm−3 (P=0.70). Discussion In our large cohort of US ART-experienced patients with a wide spectrum of clinical history, similar outcomes were observed when prescribed RAL containing versus other contemporary ART.

show abstract

Section: Methodsmentioning

confidence: 99%

Long-term immunologic and virologic responses on raltegravir-containing regimens among ART-experienced participants in the HIV Outpatient Study

Wiegand¹,

Armon

Chmiel

et al. 2015

HIV Clinical Trials

View full text Add to dashboard Cite

show abstract

“…A straightforward approach consists of two independent steps: first impute the missing data and then draw causal inferences from the imputed complete data (e.g. Mitra and Reiter, 2011). However, how the missing values are imputed may have a nontrivial impact on the subsequent causal analysis (e.g.…”

Section: Unintentional Missing Datamentioning

confidence: 99%

Causal Inference: A Missing Data Perspective

Ding¹,

Li²

2018

Statist. Sci.

View full text Add to dashboard Cite

Inferring causal effects of treatments is a central goal in many disciplines. The potential outcomes framework is a main statistical approach to causal inference, in which a causal effect is defined as a comparison of the potential outcomes of the same units under different treatment conditions. Because for each unit at most one of the potential outcomes is observed and the rest are missing, causal inference is inherently a missing data problem. Indeed, there is a close analogy in the terminology and the inferential framework between causal inference and missing data. Despite the intrinsic connection between the two subjects, statistical analyses of causal inference and missing data also have marked differences in aims, settings and methods. This article provides a systematic review of causal inference from the missing data perspective.Focusing on ignorable treatment assignment mechanisms, we discuss a wide range of causal inference methods that have analogues in missing data analysis, such as imputation, inverse probability weighting and doubly-robust methods. Under each of the three modes of inference-Frequentist, Bayesian, and Fisherian randomization-we present the general structure of inference for both finite-sample and superpopulation estimands, and illustrate via specific examples. We identify open questions to motivate more research to bridge the two fields.

show abstract

“…We assume that an analyst is interested in determining the relationship between PIATM and the effect of treatment after adjusting for relevant pre-treatment variables. We use the same fourteen background covariates used in Mitra and Reiter (2011). These are the child's race (Hispanic, black or other), the mother's race (Hispanic, black, Asian, white, Hawaiian/Pacific Islander, American Indian or other), the child's sex, indicator variable on whether the child's grandparents were present at birth and another variable indicating the presence of the mother's spouse at birth, the number of years between 1979 and when the mother gave birth (square root transformed), the mother's score on the Armed Forces Qualification Test (square root transformed), the mother's highest educational achievement, the child's birth weight, the number of days spent by the mother in the hospital (log transformed), the number of days spent by the child in the hospital (log transformed), the number of weeks the mother worked in the year preceding to child birth categorised into four groups (0 weeks, 1-47 weeks, 48-51 weeks and 52 weeks), the number of weeks the child was born premature categorised into three groups (0 weeks, 1-4 weeks and >5 weeks preterm), and family income (log transformed) at the time of the birth of the child.…”

Section: Nlsy Data Setmentioning

confidence: 99%

“…Following Mitra and Reiter (2011) we dichotomise the variable that measures duration of breastfeeding so as to split units into two groups; the control group, comprises those units who were breastfed for less than 24 weeks, while the treatment group comprises those units who were breastfed for 24 weeks or more. We assume that an analyst is interested in determining the relationship between PIATM and the effect of treatment after adjusting for relevant pre-treatment variables.…”

Section: Nlsy Data Setmentioning

confidence: 99%

Using mixtures oftdensities to make inferences in the presence of missing data with a small number of multiply imputed data sets

Rashid

Mitra

Steele

2015

Computational Statistics & Data Analysis

View full text Add to dashboard Cite

Estimating propensity scores with missing covariate data using general location mixture models

Cited by 36 publications

References 58 publications

Long-term immunologic and virologic responses on raltegravir-containing regimens among ART-experienced participants in the HIV Outpatient Study

Long-term immunologic and virologic responses on raltegravir-containing regimens among ART-experienced participants in the HIV Outpatient Study

Causal Inference: A Missing Data Perspective

Using mixtures oftdensities to make inferences in the presence of missing data with a small number of multiply imputed data sets

Contact Info

Product

Resources

About