2022
DOI: 10.1017/s0263574722001527
|View full text |Cite
|
Sign up to set email alerts
|

Discrete soft actor-critic with auto-encoder on vascular robotic system

Abstract: Instrument delivery is critical part in vascular intervention surgery. Due to the soft-body structure of instruments, the relationship between manipulation commands and instrument motion is non-linear, making instrument delivery challenging and time-consuming. Reinforcement learning has the potential to learn manipulation skills and automate instrument delivery with enhanced success rates and reduced workload of physicians. However, due to the sample inefficiency when using high-dimensional images, existing re… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 36 publications
0
3
0
Order By: Relevance
“…For each aortic arch the pathlength is calculated based on the centerlines of the individual geometry. This is an adaption of the reward utilized in [ 12 ] enhanced by the dense feature from [ 14 ] and [ 15 ]. Parameters for the constant penalty and change in pathlength are chosen such that they approximately equalize each other with an optimal action.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…For each aortic arch the pathlength is calculated based on the centerlines of the individual geometry. This is an adaption of the reward utilized in [ 12 ] enhanced by the dense feature from [ 14 ] and [ 15 ]. Parameters for the constant penalty and change in pathlength are chosen such that they approximately equalize each other with an optimal action.…”
Section: Methodsmentioning
confidence: 99%
“…Recent research regarding autonomous control of endovascular instruments during navigation in interventions utilize learning-based approaches [ 8 15 ]. They utilize supervised learning [ 8 ], deep-q-networks [ 9 , 10 ], asynchronous advantage actor-critic [ 11 ], deep deterministic policy gradients with hindsight experience replay [ 12 ], generative adversarial imitation learning and proximal policy optimization [ 13 ], soft actor critic [ 14 ] or discrete soft actor critic with auto-encoder [ 15 ]. These approaches are all trained on one vessel geometry and show reduced success [ 13 ] on a different geometry.…”
Section: Introductionmentioning
confidence: 99%
“…Reinforcement learning is used by Li et al [ 148 ] to solve the problem of automating instrument delivery in vascular intervention surgery. Existing reinforcement learning techniques are constrained by the non-linear connection between manipulation commands and instrument movements in actual vascular robotic systems.…”
Section: Deep Rl For Robotic Manipulationmentioning
confidence: 99%