2021 IEEE International Conference on Mechatronics and Automation (ICMA) 2021
DOI: 10.1109/icma52036.2021.9512675
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of a Reinforcement Learning Algorithm for Vascular Intervention Surgery

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
24
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(24 citation statements)
references
References 21 publications
0
24
0
Order By: Relevance
“…As shown in Figure 1, 462 studies met the search criteria, and 21 full-text studies were assessed against the eligibility criteria. A total of 14 were identified for review (Rafii-Tari et al, 2013Chi et al, 2018aChi et al, ,b, 2020Behr et al, 2019;You et al, 2019;Zhao et al, 2019;Kweon et al, 2021;Meng et al, 2021Meng et al, , 2022Cho et al, 2022;Karstensen et al, 2022;Wang et al, 2022). The characteristics of the fourteen studies are listed in Table 2.…”
Section: Studiesmentioning
confidence: 99%
See 2 more Smart Citations
“…As shown in Figure 1, 462 studies met the search criteria, and 21 full-text studies were assessed against the eligibility criteria. A total of 14 were identified for review (Rafii-Tari et al, 2013Chi et al, 2018aChi et al, ,b, 2020Behr et al, 2019;You et al, 2019;Zhao et al, 2019;Kweon et al, 2021;Meng et al, 2021Meng et al, , 2022Cho et al, 2022;Karstensen et al, 2022;Wang et al, 2022). The characteristics of the fourteen studies are listed in Table 2.…”
Section: Studiesmentioning
confidence: 99%
“…. RL methods RL was used in nine studies (9/14, 64%) with algorithms including A3C, DDPG, DQN, Dueling DQN, HER, PI 2 , PPO, and Rainbow (Chi et al, 2018a(Chi et al, , 2020Behr et al, 2019;You et al, 2019;Kweon et al, 2021;Meng et al, 2021Meng et al, , 2022Cho et al, 2022;Karstensen et al, 2022). Demonstrator data in some form (GAIL, Behavior Cloning, or HD) was used as a precursor in four of the studies (4/14, 29%) during training (LfD), in conjunction with other RL algorithms (Chi et al, 2018a;Behr et al, 2019;Kweon et al, 2021;Cho et al, 2022).…”
Section: Yolo Supervised Learningmentioning
confidence: 99%
See 1 more Smart Citation
“…Analogous to physicians using X-ray fluoroscopy for intraoperative navigation [28], it is more realistic to use images for instrument-manipulation skills. Several state-of-the-art RL algorithms, such as Dueling Deep Q-Network [29] and Asynchronous Advantage Actor-Critic [30], have been applied with preoperative vascular models to learn instrument-manipulation skills with high-dimension images [31,32]. Due to notorious sample inefficiency of RL, existing research is limited to digital simulation environments [31,32].…”
Section: Introductionmentioning
confidence: 99%
“…Several state-of-the-art RL algorithms, such as Dueling Deep Q-Network [29] and Asynchronous Advantage Actor-Critic [30], have been applied with preoperative vascular models to learn instrument-manipulation skills with high-dimension images [31,32]. Due to notorious sample inefficiency of RL, existing research is limited to digital simulation environments [31,32]. However, there is a non-negligible gap between digital simulations and real environments, which limits the clinical value of learned instrument-manipulation skills.…”
Section: Introductionmentioning
confidence: 99%