2019
DOI: 10.1093/biostatistics/kxz042
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning in the estimation of causal effects: targeted minimum loss-based estimation and double/debiased machine learning

Abstract: Summary In recent decades, the fields of statistical and machine learning have seen a revolution in the development of data-adaptive regression methods that have optimal performance under flexible, sometimes minimal, assumptions on the true regression functions. These developments have impacted all areas of applied and theoretical statistics and have allowed data analysts to avoid the biases incurred under the pervasive practice of parametric model misspecification. In this commentary, I discuss… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
27
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 38 publications
(32 citation statements)
references
References 21 publications
0
27
0
Order By: Relevance
“…In this paper, we focused on the estimation of a causal effect given that the identifiability conditions were satisfied. In practice, the predictive performance of the Q-model is not sufficient to ensure the absence of bias in the estimation of the causal effect, which requires a precise conceptual knowledge of the causal model 35 .…”
Section: Discussionmentioning
confidence: 99%
“…In this paper, we focused on the estimation of a causal effect given that the identifiability conditions were satisfied. In practice, the predictive performance of the Q-model is not sufficient to ensure the absence of bias in the estimation of the causal effect, which requires a precise conceptual knowledge of the causal model 35 .…”
Section: Discussionmentioning
confidence: 99%
“…It is an effective approach by using machine learning methods to predict active antibacterial compounds [ 26 , 28 , 36 ]. The accuracy of the prediction model is affected by many factors, such as the quality of the benchmark datasets [ 37 ], the representative molecular characteristics of the compounds [ 16 ], the applicable machine learning models [ 9 ], and the optimized model parameters [ 38 ]. This study collected a large amount of experimental data on the antibacterial activity of compounds from the ChEMBL and PubChem databases.…”
Section: Discussionmentioning
confidence: 99%
“…First, while our simulations aimed at providing new empirical evidence about the operating characteristics of state‐of‐the‐art machine learning techniques for estimating TEH with survival data, we only provided frequentist coverage probability for AFT‐BART‐NP because the credible intervals are readily available from the MCMC output. On the other hand, it may be challenging to precisely estimate the variance of the ISTE using DL, RSF, and TSHEE, without resorting to more computationally intensive sample‐splitting methods 2,15,58,59 . Developing and investigating new methods for the variance and interval estimation for frequentist machine learning represent an important avenue for future research.…”
Section: Discussionmentioning
confidence: 99%