2023
DOI: 10.1038/s41591-023-02552-9
|View full text |Cite
|
Sign up to set email alerts
|

Optimized glycemic control of type 2 diabetes with reinforcement learning: a proof-of-concept trial

Guangyu Wang,
Xiaohong Liu,
Zhen Ying
et al.

Abstract: The personalized titration and optimization of insulin regimens for treatment of type 2 diabetes (T2D) are resource-demanding healthcare tasks. Here we propose a model-based reinforcement learning (RL) framework (called RL-DITR), which learns the optimal insulin regimen by analyzing glycemic state rewards through patient model interactions. When evaluated during the development phase for managing hospitalized patients with T2D, RL-DITR achieved superior insulin titration optimization (mean absolute error (MAE)… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(4 citation statements)
references
References 41 publications
(38 reference statements)
0
3
0
Order By: Relevance
“…First, we used a fine-tuned DistilBERT model to quantify the semantic similarity between reference and model-generated diagnosis statements. 15–17 Next, we employed natural language generation (NLG) metrics conventionally used to evaluate models designed for image captioning or translation tasks to assess the syntactic similarity between statements. These included ROUGE, BLEU, and METEOR, ranging from 0 to 1, and CIDEr, ranging from 0 to 5, with higher values indicating better overlap between reference and generated text.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…First, we used a fine-tuned DistilBERT model to quantify the semantic similarity between reference and model-generated diagnosis statements. 15–17 Next, we employed natural language generation (NLG) metrics conventionally used to evaluate models designed for image captioning or translation tasks to assess the syntactic similarity between statements. These included ROUGE, BLEU, and METEOR, ranging from 0 to 1, and CIDEr, ranging from 0 to 5, with higher values indicating better overlap between reference and generated text.…”
Section: Resultsmentioning
confidence: 99%
“…To evaluate the semantic similarity between original and model-generated diagnosis statements, we fine-tuned a lightweight DistilBERT model, 15,36 pretrained on a large corpus of electronic health record notes, 16,17 in the same set of cardiologist-confirmed diagnosis statements used to train the vision-text transformer model. The training mirrored the standard approach for training masked language models, with a chunk size of 128 tokens and a masking probability of 0.15.…”
Section: Methodsmentioning
confidence: 99%
“…Sepsis resuscitation systems face particular regulatory challenges due to uncertainty over the gold standard doses of IV fluids and vasopressors in each resuscitation scenario for comparison with the system’s recommendations. A recent study of a reinforcement learning system for insulin prescriptions addressed similar problems by using a blinded panel of expert endocrinologists to evaluate the quality of treatment recommendations ( 57 ). The generalization issues highlighted above have led some to advocate that models be recalibrated in each new setting where they are deployed ( 58 , 59 ), but current medical device regulations lack well-defined pathways for approving machine learning systems that update (“retrain”) their prediction engines.…”
Section: Current Approaches: Promise and Pitfallsmentioning
confidence: 99%
“…Artificial Intelligence (AI) driven systems are set to take an increasingly prominent supportive role in decision-making including in high-stakes settings 1,2 . While the final decision remains in human hands, understanding how AI recommendations impact their user's behaviour is crucial.…”
Section: Introductionmentioning
confidence: 99%