Enabling risk-aware Reinforcement Learning for medical interventions through uncertainty decomposition

Festor, Paul; Luise, Giulia; Komorowski, Matthieu; Faisal, A. Aldo

doi:10.48550/arxiv.2109.07827

Cited by 3 publications

(3 citation statements)

References 16 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…An emerging new avenue in the field is to augment AI models so that they can quantify their own confidence or uncertainty over their recommendations. 19 Going forward, it may be helpful to algorithmically combine the communication of uncertainty that a system has about itself, which reflects the risk of unwanted behaviour as we have shown in other domains of risk-aware control by medical devices, 20 with its safety features, that we have shown here.…”

Section: Discussionmentioning

confidence: 84%

Assuring the safety of AI-based clinical decision support systems: a case study of the AI Clinician for sepsis treatment

Festor

Jia

Gordon

et al. 2022

BMJ Health Care Inform

Self Cite

View full text Add to dashboard Cite

ObjectivesEstablishing confidence in the safety of Artificial Intelligence (AI)-based clinical decision support systems is important prior to clinical deployment and regulatory approval for systems with increasing autonomy. Here, we undertook safety assurance of the AI Clinician, a previously published reinforcement learning-based treatment recommendation system for sepsis.MethodsAs part of the safety assurance, we defined four clinical hazards in sepsis resuscitation based on clinical expert opinion and the existing literature. We then identified a set of unsafe scenarios, intended to limit the action space of the AI agent with the goal of reducing the likelihood of hazardous decisions.ResultsUsing a subset of the Medical Information Mart for Intensive Care (MIMIC-III) database, we demonstrated that our previously published ‘AI clinician’ recommended fewer hazardous decisions than human clinicians in three out of our four predefined clinical scenarios, while the difference was not statistically significant in the fourth scenario. Then, we modified the reward function to satisfy our safety constraints and trained a new AI Clinician agent. The retrained model shows enhanced safety, without negatively impacting model performance.DiscussionWhile some contextual patient information absent from the data may have pushed human clinicians to take hazardous actions, the data were curated to limit the impact of this confounder.ConclusionThese advances provide a use case for the systematic safety assurance of AI-based clinical systems towards the generation of explicit safety evidence, which could be replicated for other AI applications or other clinical contexts, and inform medical device regulatory bodies.

show abstract

Section: Discussionmentioning

confidence: 84%

Assuring the safety of AI-based clinical decision support systems: a case study of the AI Clinician for sepsis treatment

Festor

Jia

Gordon

et al. 2022

BMJ Health Care Inform

Self Cite

View full text Add to dashboard Cite

show abstract

“…, (x L−1 , u L−1 )} generated by following the clinician's policy H ∼ µ CP . Approaches for off-policy evaluation are an active topic of research in the context of related reinforcement learning frameworks (Hanna et al, 2017;Thomas et al, 2015;Festor et al, 2021). It should be noted that the factors driving µ CP may differ from the cost function chosen to derive µ OP , which is why effective calibration is important to objectively compare the two policies.…”

Section: Policy Cost Estimationmentioning

confidence: 99%

Optimal discharge of patients from intensive care via a data-driven policy learning framework

Lejarza¹,

Calvert²,

Attwood³

et al. 2021

Preprint

View full text Add to dashboard Cite

Clinical decision support tools rooted in machine learning and optimization can provide significant value to healthcare providers, including through better management of intensive care units. In particular, it is important that the patient discharge task addresses the nuanced trade-off between decreasing a patient's length of stay (and associated hospitalization costs) and the risk of readmission or even death following the discharge decision. This work introduces an end-to-end general framework for capturing this trade-off to recommend optimal discharge timing decisions given a patient's electronic health records. A data-driven approach is used to derive a parsimonious, discrete state space representation that captures a patient's physiological condition. Based on this model and a given cost function, an infinitehorizon discounted Markov decision process is formulated and solved numerically to compute an optimal discharge policy, whose value is assessed using off-policy evaluation strategies. Extensive numerical experiments are performed to validate the proposed framework using real-life intensive care unit patient data.

show abstract

“…Recently, there has been an increased volume of research which try to learn optimal treatment strategies for critically ill and in particular for septic patients (Komorowski et al, 2018;Chen et al, 2019;Raghu et al, 2017;Li et al, 2019;Peng et al, 2018;Festor et al, 2021;Nanayakkara et al, 2022b), using Reinforcement Learning (RL) methods. Given the enormous mortality, morbidity and economic burden (Liu et al, 2014;Rhee et al, 2017;Paoli et al, 2018), the ambiguity regarding optimal treatment strategies and lack of accepted guidelines for treatment (Marik, 2015;Jarczak et al, 2021), such attempts are certainly justified.…”

Section: Introductionmentioning

confidence: 99%

Reinforcement Learning For Survival: A Clinically Motivated Method For Critically Ill Patients

Nanayakkara¹

2022

Preprint

View full text Add to dashboard Cite

There has been considerable interest in leveraging RL and stochastic control methods to learn optimal treatment strategies for critically ill patients, directly from observational data. However, there is significant ambiguity on the control objective and on the best reward choice for the standard RL objective. In this work, we propose a clinically motivated control objective for critically ill patients, for which the value functions have a simple medical interpretation. Further, we present theoretical results and adapt our method to a practical Deep RL algorithm, which can be used alongside any value based Deep RL method. We experiment on a large sepsis cohort and show that our method produces results consistent with clinical knowledge.

show abstract

Enabling risk-aware Reinforcement Learning for medical interventions through uncertainty decomposition

Cited by 3 publications

References 16 publications

Assuring the safety of AI-based clinical decision support systems: a case study of the AI Clinician for sepsis treatment

Assuring the safety of AI-based clinical decision support systems: a case study of the AI Clinician for sepsis treatment

Optimal discharge of patients from intensive care via a data-driven policy learning framework

Reinforcement Learning For Survival: A Clinically Motivated Method For Critically Ill Patients

Contact Info

Product

Resources

About