2019
DOI: 10.1038/s41591-018-0310-5
|View full text |Cite
|
Sign up to set email alerts
|

Guidelines for reinforcement learning in healthcare

Abstract: In this Comment, we provide guidelines for reinforcement learning for patient treatment decisions that we hope will accelerate the rate at which observational cohorts can inform healthcare practice in a safe, risk-conscious manner.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
175
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 266 publications
(175 citation statements)
references
References 4 publications
0
175
0
Order By: Relevance
“…As a result, long-term effects are harder to estimate. 8 The main difference between supervised and unsupervised learnings is whether the training data set has labeled outputs corresponding to input data. The supervised learning infers a mathematical relationship between the inputs and the labeled outputs while the unsupervised learning infers a function that expresses hidden characteristics reside in input data.…”
Section: Supervised and Unsupervised Learningmentioning
confidence: 99%
“…As a result, long-term effects are harder to estimate. 8 The main difference between supervised and unsupervised learnings is whether the training data set has labeled outputs corresponding to input data. The supervised learning infers a mathematical relationship between the inputs and the labeled outputs while the unsupervised learning infers a function that expresses hidden characteristics reside in input data.…”
Section: Supervised and Unsupervised Learningmentioning
confidence: 99%
“…Similarly to the way we used drivers’ gaze in a driving simulator to train a self-driving car algorithm in that simulator, we can use rich neurobehavioural data from an expert driver in extreme conditions to train a real self-driving car algorithm to response successfully in extreme conditions. Likewise, on the side of control systems, we showed for example how using ethomic data obtained from natural tasks (movement data 41 , electrophysiological data 42 , decision making data 43 ) can be harnessed to boost AI system performance. The neurobehavioural approach demonstrated here suggests how we can succeed in future to close-the-loop between person and vehicle.…”
Section: Discussionmentioning
confidence: 99%
“…A relatively new approach to optimising treatment of critically ill patients with sepsis in the ICU is with the use of computerised programs that utilise artificial intelligence (AI) in generating their recommendations. Unlike dosing software with Bayesian forecasting which uses PK and statistical modelling to individualise therapy to patients, AI software uses reinforcement learning to generate recommendations on appropriate interventions required to achieve predetermined targets for patients [104]. Artificial intelligence software examines data from large patient population databases to identify interventions associated with the target outcome and combines this information with an individual patient's characteristics to determine the most appropriate intervention that will maximise the probability of achieving the predefined outcome [105,106].…”
Section: Artificial Intelligence Softwarementioning
confidence: 99%