2021
DOI: 10.1016/j.bspc.2021.102847

Personalized vital signs control based on continuous action-space reinforcement learning with supervised experience

Cited by 10 publications
(18 citation statements)
References 32 publications
“…Because only the patient's survival is concerned, the reward is observed after a long sequence of decisions. We also apply intermediate rewards and final reward in the form of SOFA change and survival after 90 days respectively [25]. SOFA represents the evidence of organ dysfunction and has been recommended by experts as a screening tool for sepsis [34].…”
Section: Methods (mentioning)
confidence: 99%
“…Off-policy evaluation. In experiments, we use the intermediate reward parameter β_s = 0.6 and the terminal reward parameter β_T = 24, following the setting in existing works [25]. Specifically, the terminal reward is 24 if the patient survives, otherwise -24.…”
Section: Methods (mentioning)
confidence: 99%
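
The reward scheme described in the citation statements above can be sketched roughly as follows. This is only an illustration, not the citing authors' implementation: the statements give the intermediate reward parameter (0.6), the terminal reward parameter (24), and the rule that the terminal reward is 24 on survival and -24 otherwise, but they do not give the exact form of the intermediate reward. The sketch assumes it is the parameter times the drop in SOFA score between consecutive steps, and that the terminal reward is appended after the last transition. All function names are hypothetical.

```python
BETA_S = 0.6   # intermediate reward parameter, as quoted in the statement
BETA_T = 24.0  # terminal reward parameter, as quoted in the statement


def intermediate_reward(sofa_now, sofa_next, beta_s=BETA_S):
    """Assumed form: positive reward when the SOFA score falls (improvement)."""
    return -beta_s * (sofa_next - sofa_now)


def terminal_reward(survived, beta_t=BETA_T):
    """+beta_t if the patient survives, -beta_t otherwise (stated in the source)."""
    return beta_t if survived else -beta_t


def episode_rewards(sofa_scores, survived):
    """Per-step rewards for one trajectory of SOFA scores.

    Assumption: one intermediate reward per transition, followed by
    a single terminal reward at the end of the episode.
    """
    rewards = [
        intermediate_reward(now, nxt)
        for now, nxt in zip(sofa_scores, sofa_scores[1:])
    ]
    rewards.append(terminal_reward(survived))
    return rewards
```

For example, `episode_rewards([8, 6, 6], survived=True)` produces one reward for each of the two transitions plus the terminal reward, so the survival signal dominates while SOFA changes give feedback along the way.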