Our system is currently under heavy load due to increased usage. We're actively working on upgrades to improve performance. Thank you for your patience.
2021 IEEE International Conference on Robotics and Automation (ICRA) 2021
DOI: 10.1109/icra48506.2021.9561620
|View full text |Cite
|
Sign up to set email alerts
|

Deep Reinforcement Learning for Concentric Tube Robot Control with a Goal-Based Curriculum

Abstract: Concentric Tube Robots (CTRs), a type of continuum robot, are a collection of concentric, pre-curved tubes composed of super elastic nickel titanium alloy. CTRs can bend and twist from the interactions between neighboring tubes causing the kinematics and therefore control of the end-effector to be very challenging to model. In this paper, we develop a control scheme for a CTR end-effector in Cartesian space with no prior kinematic model using a deep reinforcement learning (DRL) approach with a goal-based curri… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
17
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 7 publications
(17 citation statements)
references
References 21 publications
0
17
0
Order By: Relevance
“…Using deep deterministic policy gradient (DDPG) [6] with hindsight experience replay (HER) [7], the training parameters are as follows. The number of training timesteps was 3 million, buffer size was 500, 000 with the policy network having 3 hidden networks with 256 units per layer, the initial goal tolerance and final goal tolerance were 20 mm and 1 mm applied over 1.5 million steps using a decay function [3]. Zero-mean Gaussian noise of 1.8 mm was applied to Î”đ›œ 𝑖 and 0.025 radians to Î”đ›Œ 𝑖 .…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Using deep deterministic policy gradient (DDPG) [6] with hindsight experience replay (HER) [7], the training parameters are as follows. The number of training timesteps was 3 million, buffer size was 500, 000 with the policy network having 3 hidden networks with 256 units per layer, the initial goal tolerance and final goal tolerance were 20 mm and 1 mm applied over 1.5 million steps using a decay function [3]. Zero-mean Gaussian noise of 1.8 mm was applied to Î”đ›œ 𝑖 and 0.025 radians to Î”đ›Œ 𝑖 .…”
Section: Methodsmentioning
confidence: 99%
“…This motivates the development of an end-to-end model-free control framework for CTRs. We extend our previous model-free deep reinforcement learning (deepRL) method [3] with an initial proof of concept for generalization. The task we give the agent then is to control the end-effector Cartesian robot tip position by means of actions that represent changes in joint values to reach a desired position in the robot workspace whilst considering a specific CTR system.…”
Section: Introductionmentioning
confidence: 99%
“…For the simulation problem, several works estimate the tip's pose or the entire CTCR shape in dependency of the actuation parameters [11][12][13]. The problem of path planning or control is addressed by the inverse relation with similar methods [12,[14][15][16]. Such approaches rely on a densely sampled and valid data set of high measurement accuracy, which is time-consuming to establish [17].…”
Section: Concentric Tube Continuum Robotsmentioning
confidence: 99%
“…Such approaches rely on a densely sampled and valid data set of high measurement accuracy, which is time-consuming to establish [17]. For that reason, some authors stick with learning other physical models [14][15][16]. Kuntz et al are using a combination of measurements and simulation data [13].…”
Section: Concentric Tube Continuum Robotsmentioning
confidence: 99%
“…For learning the kinematics of a CTCR task, we show in [4] that the accuracy and convergence are significantly improved by using this simple yet effective transformation to decorrelate the joint space. As of now, this approach has been used in publications [4], [5], [6] on machine learning. Yet, we are confident that disentanglement will also be useful for other research fields in the continuum robotics research community.…”
Section: Introductionmentioning
confidence: 99%