Response-Adaptive Randomization in Clinical Trials: From Myths to Practical Considerations

Robertson, David S.; Lee, Kim May; Lopez-Kolkovska, Boryana C.; Villar, Sofía S.

doi:10.1214/22-sts865

Cited by 16 publications

(8 citation statements)

References 142 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In this paper, we propose a metric for comparing group sequential designs based on the cohort most acutely impacted by the choice of design and illustrate how this metric may be applied to select a design in the ARREST and ACCESS contexts. RAR designs are commonly compared using inferential and estimation metrics (e.g., type I error, power, and bias) rather than measures of patient benefit, which remain underreported and have received little attention in the RAR literature (Robertson et al., 2020). This is in part because existing patient benefit metrics, including the expected number of trial failures, the proportion of patients assigned to the inferior arm, and the probability of a treatment imbalance in the wrong direction, are often limited by failures to hold type I and II error rates constant or to account for the different sample size requirements of the designs under consideration (Karrison et al., 2003; Morgan and Coad, 2007; Zhu and Hu, 2010; Robertson et al., 2020).…”

Section: Introductionmentioning

confidence: 99%

“…RAR designs are commonly compared using inferential and estimation metrics (e.g., type I error, power, and bias) rather than measures of patient benefit, which remain underreported and have received little attention in the RAR literature (Robertson et al., 2020). This is in part because existing patient benefit metrics, including the expected number of trial failures, the proportion of patients assigned to the inferior arm, and the probability of a treatment imbalance in the wrong direction, are often limited by failures to hold type I and II error rates constant or to account for the different sample size requirements of the designs under consideration (Karrison et al., 2003; Morgan and Coad, 2007; Zhu and Hu, 2010; Robertson et al., 2020). One approach to correct for the latter issue is to compare designs with respect to the expected number of failures within a finite patient horizon (Villar et al., 2015, a) (Villar et al., 2015, b).…”

Section: Introductionmentioning

confidence: 99%

“…However, as far as we are aware, no specific guidance exists for selecting an appropriate horizon and there is a need, as suggested by Robertson et al. (2020), for patient benefit metrics that clearly quantify the ethical properties of RAR designs while considering patient benefit both within and outside of a trial. Our proposed metric improves on existing patient benefit metrics by considering a set of feasible group sequential designs with equal type I and II error rates and measuring the expected number of failures in the fixed group of individuals who are directly impacted by the design choice.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

An Alternative Metric for Evaluating the Potential Patient Benefit of Response-Adaptive Randomization Procedures

Proper

Murray

2022

Biometrics

View full text Add to dashboard Cite

When planning a two‐arm group sequential clinical trial with a binary primary outcome that has severe implications for quality of life (e.g., mortality), investigators may strive to find the design that maximizes in‐trial patient benefit. In such cases, Bayesian response‐adaptive randomization (BRAR) is often considered because it can alter the allocation ratio throughout the trial in favor of the treatment that is currently performing better. Although previous studies have recommended using fixed randomization over BRAR based on patient benefit metrics calculated from the realized trial sample size, these previous comparisons have been limited by failures to hold type I and II error rates constant across designs or consider the impacts on all individuals directly affected by the design choice. In this paper, we propose a metric for comparing designs with the same type I and II error rates that reflects expected outcomes among individuals who would participate in the trial if enrollment is open when they become eligible. We demonstrate how to use the proposed metric to guide the choice of design in the context of two recent trials in persons suffering out of hospital cardiac arrest. Using computer simulation, we demonstrate that various implementations of group sequential BRAR offer modest improvements with respect to the proposed metric relative to conventional group sequential monitoring alone.

show abstract

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

An Alternative Metric for Evaluating the Potential Patient Benefit of Response-Adaptive Randomization Procedures

Proper

Murray

2022

Biometrics

View full text Add to dashboard Cite

show abstract

“…There has been intense debate in the clinical trials literature regarding the merits and perils of adaptive randomization. A recent review by Robertson et al 4 provides an extensive summary of the available methods including concerns raised for their use in clinical trials and potential approaches for mitigation. Commonly cited areas of concern include sample size imbalance in the opposite direction, loss of statistical power, biased effect estimates, and potential for invalid inferences with small samples in the frequentist framework because of the correlation between treatment assignment and outcome induced by the adaptation.…”

mentioning

confidence: 99%

“…The jury seems to be out on this question with numerous influential voices in the statistical and clinical trials community landing on opposite sides of the debate. [4][5][6][7] The authors of the current manuscript include a simulation study in the supplemental materials with the intent to demonstrate that the use of response-adaptive randomization resulted in fewer patients being randomly assigned to CC-115 relative to a conventional 1:1:1:1 randomized design. This is misleading as the enrollment to the CC-115 arm was restricted by the 3 1 3 design used for the safety lead-in.…”

mentioning

confidence: 99%

Bayesian Adaptive Randomization: Full of Promise With a Helping of Caution

Onar-Thomas

2023

JCO

View full text Add to dashboard Cite

Multinomial Thompson sampling for rating scales and prior considerations for calibrating uncertainty

Deliu

2023

Stat Methods Appl

View full text Add to dashboard Cite

Bandit algorithms such as Thompson sampling (TS) have been put forth for decades as useful tools for conducting adaptively-randomised experiments. By skewing the allocation toward superior arms, they can substantially improve particular outcomes of interest for both participants and investigators. For example, they may use participants’ ratings for continuously optimising their experience with a program. However, most of the bandit and TS variants are based on either binary or continuous outcome models, leading to suboptimal performances in rating scale data. Guided by behavioural experiments we conducted online, we address this problem by introducing Multinomial-TS for rating scales. After assessing its improved empirical performance in unique optimal arm scenarios, we explore potential considerations (including prior’s role) for calibrating uncertainty and balancing arm allocation in scenarios with no unique optimal arms.

show abstract

Response-Adaptive Randomization in Clinical Trials: From Myths to Practical Considerations

Cited by 16 publications

References 142 publications

An Alternative Metric for Evaluating the Potential Patient Benefit of Response-Adaptive Randomization Procedures

An Alternative Metric for Evaluating the Potential Patient Benefit of Response-Adaptive Randomization Procedures

Bayesian Adaptive Randomization: Full of Promise With a Helping of Caution

Multinomial Thompson sampling for rating scales and prior considerations for calibrating uncertainty

Contact Info

Product

Resources

About