Robust learning for optimal treatment decision with NP-dimensionality

Shi, Chengchun; Song, Rui

doi:10.1214/16-ejs1178

Cited by 30 publications

(28 citation statements)

References 26 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The proposed method can be extended to observation studies with high-dimensional covariates, where the propensity score model needs to be correctly specified from data. Related work on propensity score estimation can be found in [16]. However, the derivation of the limiting distribution of our desparsified estimator for the treatment-covariates interaction coefficients would be much more involved and requires further investigation.…”

Section: Discussionmentioning

confidence: 99%

“…We adopt the robust regression approach of [16] and transform the interaction from

A_{i} (β_{0}^{T} {\overset{X}{true}}_{i})

(A_{i} - π (X_{i})) (β_{0}^{T} {\overset{X}{true}}_{i})

, where π ( X i ) = E ( A i | X i ) is the propensity score. Because E ( A i − π | X i ) = 0, the transformed interaction is orthogonal to the baseline function μ ( X i ) given X i .…”

Section: Methods and Theorymentioning

confidence: 99%

“…Because E ( A i − π | X i ) = 0, the transformed interaction is orthogonal to the baseline function μ ( X i ) given X i . By doing so, we can protect the estimation of β 0 from the effect of baseline function misspecification [8, 16]. For simplicity of presentation, we consider a completely randomized study with prespecified propensity score, i.e.…”

Section: Methods and Theorymentioning

confidence: 99%

“…Existing methods can be broadly partitioned into regression-based or classification-based approaches. Popular regression-based approaches include Q-learning [21, 11, 3, 6, 7, 17] and A-learning [13, 10, 8, 16, 15]. Q-learning models the conditional mean of the outcome given covariates and treatment while A-learning directly models the interaction between treatment and covariates that is sufficient for treatment decisions.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

High-dimensional inference for personalized treatment decision

Jeng¹,

Peng²

2018

Electron. J. Statist.

Self Cite

View full text Add to dashboard Cite

Recent development in statistical methodology for personalized treatment decision has utilized high-dimensional regression to take into account a large number of patients’ covariates and described personalized treatment decision through interactions between treatment and covariates. While a subset of interaction terms can be obtained by existing variable selection methods to indicate relevant covariates for making treatment decision, there often lacks statistical interpretation of the results. This paper proposes an asymptotically unbiased estimator based on Lasso solution for the interaction coefficients. We derive the limiting distribution of the estimator when baseline function of the regression model is unknown and possibly misspecified. Confidence intervals and p-values are derived to infer the effects of the patients’ covariates in making treatment decision. We confirm the accuracy of the proposed method and its robustness against misspecified function in simulation and apply the method to STAR*D study for major depression disorder.

show abstract

Section: Discussionmentioning

confidence: 99%

“…We adopt the robust regression approach of [16] and transform the interaction from

A_{i} (β_{0}^{T} {\overset{X}{true}}_{i})

(A_{i} - π (X_{i})) (β_{0}^{T} {\overset{X}{true}}_{i})

, where π ( X i ) = E ( A i | X i ) is the propensity score. Because E ( A i − π | X i ) = 0, the transformed interaction is orthogonal to the baseline function μ ( X i ) given X i .…”

Section: Methods and Theorymentioning

confidence: 99%

Section: Methods and Theorymentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

High-dimensional inference for personalized treatment decision

Jeng¹,

Peng²

2018

Electron. J. Statist.

Self Cite

View full text Add to dashboard Cite

show abstract

“…Lu et al [2013] considered model selection for estimating optimal treatment regime via penalized least square. Shi et al [2016] extended Lu’s method to cases where the propensity score is unknown. They studied the theoretical properties of the proposed estimator given the number of covariates is of the non-polynomial (NP) order of the sample size.…”

Section: Literature Reviewmentioning

confidence: 99%

Deep advantage learning for optimal dynamic treatment regime

Liang

Song

2018

Statistical Theory and Related Fields

Self Cite

View full text Add to dashboard Cite

Recently deep learning has successfully achieved state-of-the-art performance on many difficult tasks. Deep neural network outperforms many existing popular methods in the field of reinforcement learning. It can also identify important covariates automatically. Parameter sharing of convolutional neural network (CNN) greatly reduces the amount of parameters in the neural network, which allows for high scalability. However few research has been done on deep advantage learning (A-learning). In this paper, we present a deep A-learning approach to estimate optimal dynamic treatment regime. A-learning models the advantage function, which is of direct relevance to the goal. We use an inverse probability weighting (IPW) method to estimate the difference between potential outcomes, which does not require to make any model assumption on the baseline mean function. We implemented different architectures of deep CNN and convexified convolutional neural networks (CCNN). The proposed deep A-learning methods are applied to a data from the STAR*D trial and are shown to have better performance compared with the penalized least square estimator using a linear decision rule.

show abstract

Multithreshold change plane model: Estimation theory and applications in subgroup identification

Jin

et al. 2021

Statistics in Medicine

View full text Add to dashboard Cite

We propose a multithreshold change plane regression model which naturally partitions the observed subjects into subgroups with different covariate effects. The underlying grouping variable is a linear function of observed covariates and thus multiple thresholds produce change planes in the covariate space. We contribute a novel two‐stage estimation approach to determine the number of subgroups, the location of thresholds, and all other regression parameters. In the first stage we adopt a group selection principle to consistently identify the number of subgroups, while in the second stage change point locations and model parameter estimates are refined by a penalized induced smoothing technique. Our procedure allows sparse solutions for relatively moderate‐ or high‐dimensional covariates. We further establish the asymptotic properties of our proposed estimators under appropriate technical conditions. We evaluate the performance of the proposed methods by simulation studies and provide illustrations using two medical data examples. Our proposal for subgroup identification may lead to an immediate application in personalized medicine.

show abstract

Robust learning for optimal treatment decision with NP-dimensionality

Cited by 30 publications

References 26 publications

High-dimensional inference for personalized treatment decision

High-dimensional inference for personalized treatment decision

Deep advantage learning for optimal dynamic treatment regime

Multithreshold change plane model: Estimation theory and applications in subgroup identification

Contact Info

Product

Resources

About