Reinforcement learning and Bayesian data assimilation for model‐informed precision dosing in oncology

Maier, Corinna; Hartung, Niklas; Kloft, Charlotte; Huisinga, Wilhelm; Wiljes, Jana de

doi:10.1002/psp4.12588

Cited by 29 publications

(37 citation statements)

References 39 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…A possible way to deal with sparse rewards in the light of multiple goals is hindsight experience replay, where different learning episodes are replayed with different goals and the agent can derive reward signals regarding different outcomes [ 66 ]. In most applications of RL in healthcare, rewards are coded quantitatively rather than qualitatively, which can be useful for certain use cases where the outcome, in fact, is a metric variable (such as absolute neutrophile count [ 34 ]); however, it remains challenging when the outcome first has to be transformed or a priori model building has to be performed manually [ 29 ]. Alternatively, preference models can be used as a representation of qualitative feedback to rank the agent’s behavioral trajectories [ 67 , 68 ].…”

Section: Discussionmentioning

confidence: 99%

“…Their method ranks drug sensitivity prediction algorithms and recommends the optimal algorithms for a given drug–cell line pair in order to achieve optimal responses. To account for chemotherapy-associated toxicity, Maier et al [ 34 ] proposed an RL-based framework that is guided by absolute neutrophil counts for adjusting subsequent drug doses. Using simulated reinforcement trials [ 35 ], Zhao et al [ 36 ] applied Q-learning to stage IIIB/IV non-small cell lung cancer and reported optimized first and second treatment lines as well as optimal selection for initiating second-line therapy.…”

Section: Recent Studies Of Reinforcement Learning In Malignant Diseasementioning

confidence: 99%

See 1 more Smart Citation

Reinforcement Learning for Precision Oncology

Wendt

Bornhäuser

Middeke

2021

Cancers

View full text Add to dashboard Cite

Precision oncology is grounded in the increasing understanding of genetic and molecular mechanisms that underly malignant disease and offer different treatment pathways for the individual patient. The growing complexity of medical data has led to the implementation of machine learning techniques that are vastly applied for risk assessment and outcome prediction using either supervised or unsupervised learning. Still largely overlooked is reinforcement learning (RL) that addresses sequential tasks by exploring the underlying dynamics of an environment and shaping it by taking actions in order to maximize cumulative rewards over time, thereby achieving optimal long-term outcomes. Recent breakthroughs in RL demonstrated remarkable results in gameplay and autonomous driving, often achieving human-like or even superhuman performance. While this type of machine learning holds the potential to become a helpful decision support tool, it comes with a set of distinctive challenges that need to be addressed to ensure applicability, validity and safety. In this review, we highlight recent advances of RL focusing on studies in oncology and point out current challenges and pitfalls that need to be accounted for in future studies in order to successfully develop RL-based decision support systems for precision oncology.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Recent Studies Of Reinforcement Learning In Malignant Diseasementioning

confidence: 99%

Reinforcement Learning for Precision Oncology

Wendt

Bornhäuser

Middeke

2021

Cancers

View full text Add to dashboard Cite

show abstract

“…Applications of ML to MIPD to date have found that ML models are often able to accurately estimate past drug exposure, 24,25 predict future drug exposure, 26–28 or select doses 29–32 . However, the improvement in accuracy from these earlier approaches comes at the expense of pharmacological interpretability and the ability to simulate patient response to alternative dosing regimens 24,33,34 .…”

Section: Discussionmentioning

confidence: 99%

“…Applications of ML to MIPD to date have found that ML models are often able to accurately estimate past drug exposure, 24 , 25 predict future drug exposure, 26 , 27 , 28 or select doses. 29 , 30 , 31 , 32 However, the improvement in accuracy from these earlier approaches comes at the expense of pharmacological interpretability and the ability to simulate patient response to alternative dosing regimens. 24 , 33 , 34 An advantage of the combination of ML and PK models as described here is that clinical decision making is augmented by ML while maintaining the ability to forecast patient PKs and extract mechanistic insight from PK parameter estimates.…”

Section: Discussionmentioning

confidence: 99%

A hybrid machine learning/pharmacokinetic approach outperforms maximum a posteriori Bayesian estimation by selectively flattening model priors

Hughes¹,

Keizer²

2021

CPT Pharmacom & Syst Pharma

View full text Add to dashboard Cite

Model-informed precision dosing (MIPD) approaches typically apply maximum a posteriori (MAP) Bayesian estimation to determine individual pharmacokinetic (PK) parameters with the goal of optimizing future dosing regimens. This process combines knowledge about the individual, in the form of drug levels or pharmacodynamic biomarkers, with prior knowledge of the drug PK in the general population. Use of "flattened priors" (FP), in which the weight of the model priors is reduced relative to observations about the patient, has been previously proposed to estimate individual PK parameters in instances where the patient is poorly described by the PK model. However, little is known about the predictive performance of FP and when to apply FP in MIPD. Here, FP is evaluated in a data set of 4679 adult patients treated with vancomycin.Depending on the PK model, prediction error could be reduced by applying FP in 42-55% of PK parameter estimations. Machine learning (ML) models could identify instances where FP would outperform MAP with a specificity of 81-86%, reducing overall root mean squared error (RMSE) of PK model predictions by 12-22% (0.5-1.2 mg/L) relative to MAP alone. The factors most indicative of the use of FP were past prediction residuals and bias in past PK predictions. A more clinically practical minimal model was developed using only these two features, reducing RMSE by 5-18% (0.20-0.93 mg/L) relative to MAP. This hybrid ML/PK approach advances the precision dosing toolkit by leveraging the power of ML while maintaining the mechanistic insight and interpretability of pharmacokinetic models.

show abstract

“…As an example of high clinical relevance, we focus on paclitaxel causing neutropenia as the most frequent and life‐threatening toxicity in oncology. Models describing paclitaxel‐induced neutropenia build the basis for neutrophil‐guided MIPD to individualize chemotherapy dosing 18–21 . Since the publication of the gold‐standard model for neutropenia, 22 many model variants have been developed, which differ not only in parameter estimates, 23–26 but also in their structure 17,27–29 …”

Section: Introductionmentioning

confidence: 99%

A continued learning approach for model‐informed precision dosing: Updating models in clinical practice

Maier

Wiljes

Hartung

et al. 2021

CPT Pharmacom & Syst Pharma

Self Cite

View full text Add to dashboard Cite

Model‐informed precision dosing (MIPD) is a quantitative dosing framework that combines prior knowledge on the drug‐disease‐patient system with patient data from therapeutic drug/ biomarker monitoring (TDM) to support individualized dosing in ongoing treatment. Structural models and prior parameter distributions used in MIPD approaches typically build on prior clinical trials that involve only a limited number of patients selected according to some exclusion/inclusion criteria. Compared to the prior clinical trial population, the patient population in clinical practice can be expected to also include altered behavior and/or increased interindividual variability, the extent of which, however, is typically unknown. Here, we address the question of how to adapt and refine models on the level of the model parameters to better reflect this real‐world diversity. We propose an approach for continued learning across patients during MIPD using a sequential hierarchical Bayesian framework. The approach builds on two stages to separate the update of the individual patient parameters from updating the population parameters. Consequently, it enables continued learning across hospitals or study centers, because only summary patient data (on the level of model parameters) need to be shared, but no individual TDM data. We illustrate this continued learning approach with neutrophil‐guided dosing of paclitaxel. The present study constitutes an important step toward building confidence in MIPD and eventually establishing MIPD increasingly in everyday therapeutic use.

show abstract

Reinforcement learning and Bayesian data assimilation for model‐informed precision dosing in oncology

Cited by 29 publications

References 39 publications

Reinforcement Learning for Precision Oncology

Reinforcement Learning for Precision Oncology

A hybrid machine learning/pharmacokinetic approach outperforms maximum a posteriori Bayesian estimation by selectively flattening model priors

A continued learning approach for model‐informed precision dosing: Updating models in clinical practice

Contact Info

Product

Resources

About