In this paper, we focus on the estimation of a high-dimensional inverse covariance (precision) matrix. We propose a simple improvement of the graphical lasso (glasso) framework that is able to attain better statistical performance without increasing significantly the computational cost. The proposed improvement is based on computing a root of the sample covariance matrix to reduce the spread of the associated eigenvalues. Through extensive numerical results, using both simulated and real datasets, we show that the proposed modification improves the glasso procedure. Our results reveal that the square-root improvement can be a reasonable choice in practice.
The presence of confounding by high-dimensional variables complicates estimation of the average effect of a point treatment. On the one hand, it necessitates the use of variable selection strategies or more general data-adaptive high-dimensional statistical methods. On the other hand, the use of such techniques tends to result in biased estimators with a non-standard asymptotic behaviour. Double-robust estimators are vital for offering a resolution because they possess a so-called small bias property. This means that their bias vanishes faster than the bias in the nuisance parameter estimators when the relevant smoothing parameter goes to zero, provided that certain sparsity assumptions hold. This property has been exploited to achieve valid (uniform) inference of the average causal effect when data-adaptive estimators of the propensity score and conditional outcome mean both converge to their respective truths at sufficiently fast rate (e.g., Farrell, 2015;Belloni et al., 2016). In this article, we extend this work in order to retain valid (uniform) inference when one of these estimators does not converge to the truth, regardless of which. This is done by generalising prior work by Vermeulen and Vansteelandt (2015) to incorporate regularisation. The proposed penalised bias-reduced double-robust estimation strategy exhibits promising performance in extensive simulation studies and a data analysis, relative to competing proposals.
After variable selection, standard inferential procedures for regression parameters may not be uniformly valid; there is no finite-sample size at which a standard test is guaranteed to approximately attain its nominal size. This problem is exacerbated in high-dimensional settings, where variable selection becomes unavoidable. This has prompted a flurry of activity in developing uniformly valid hypothesis tests for a lowdimensional regression parameter (eg, the causal effect of an exposure on an outcome) in high-dimensional models. So far there has been limited focus on model misspecification, although this is inevitable in high-dimensional settings. We propose tests of the null that are uniformly valid under sparsity conditions weaker than those typically invoked in the literature, assuming working models for the exposure and outcome are both correctly specified. When one of the models is misspecified, by amending the procedure for estimating the nuisance parameters, our tests continue to be valid; hence, they are doubly robust. Our proposals are straightforward to implement using existing software for penalized maximum likelihood estimation and do not require sample splitting. We illustrate them in simulations and an analysis of data obtained from the Ghent University intensive care unit.
We consider estimation of the average effect of time‐varying dichotomous exposure on outcome using inverse probability weighting (IPW) under the assumption that there is no unmeasured confounding of the exposure–outcome association at each time point. Despite the popularity of IPW, its performance is often poor due to instability of the estimated weights. We develop an estimating equation‐based strategy for the nuisance parameters indexing the weights at each time point, aimed at preventing highly volatile weights and ensuring the stability of IPW estimation. Our proposed approach targets the estimation of the counterfactual mean under a chosen treatment regime and requires fitting a separate propensity score model at each time point. We discuss and examine extensions to enable the fitting of marginal structural models using one propensity score model across all time points. Extensive simulation studies demonstrate adequate performance of our approach compared with the maximum likelihood propensity score estimator and the covariate balancing propensity score estimator.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.