Deep Reinforcement Learning for Concentric Tube Robot Control with a Goal-Based Curriculum

Iyengar, Keshav; Stoyanov, Danail

doi:10.1109/icra48506.2021.9561620

Cited by 7 publications

(17 citation statements)

References 21 publications

“…Using deep deterministic policy gradient (DDPG) [6] with hindsight experience replay (HER) [7], the training parameters are as follows. The number of training timesteps was 3 million, buffer size was 500,000 with the policy network having 3 hidden networks with 256 units per layer, the initial goal tolerance and final goal tolerance were 20 mm and 1 mm applied over 1.5 million steps using a decay function [3]. Zero-mean Gaussian noise of 1.8 mm was applied to Δβ_i and 0.025 radians to Δα_i.…”

Section: Methods (mentioning)

confidence: 99%
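The goal-tolerance curriculum quoted above (20 mm tightening to 1 mm over 1.5 million steps) can be sketched as a simple schedule. This is a minimal illustration, not the authors' implementation: the statement only says "a decay function", so the linear and exponential shapes below are assumptions, and the names `goal_tolerance` and `is_success` are hypothetical.

```python
import math

INITIAL_TOL_MM = 20.0      # initial goal tolerance, from the quoted statement
FINAL_TOL_MM = 1.0         # final goal tolerance, from the quoted statement
DECAY_STEPS = 1_500_000    # steps over which the tolerance decays, from the quoted statement


def goal_tolerance(step: int, schedule: str = "linear") -> float:
    """Return the goal tolerance in mm at a given training timestep.

    The decay shape ("linear" or "exponential") is an assumption; the
    quoted statement does not specify which decay function was used.
    """
    if step < 0:
        raise ValueError("step must be non-negative")
    # Fraction of the decay completed, clamped so the tolerance
    # stays at its final value for the rest of training.
    frac = min(step / DECAY_STEPS, 1.0)
    if schedule == "linear":
        return INITIAL_TOL_MM + frac * (FINAL_TOL_MM - INITIAL_TOL_MM)
    if schedule == "exponential":
        # Geometric interpolation between the initial and final tolerance.
        return INITIAL_TOL_MM * (FINAL_TOL_MM / INITIAL_TOL_MM) ** frac
    raise ValueError(f"unknown schedule: {schedule}")


def is_success(tip_mm, goal_mm, step, schedule="linear"):
    """Hypothetical success check: tip lies within the current tolerance of the goal."""
    return math.dist(tip_mm, goal_mm) <= goal_tolerance(step, schedule)
```

With a schedule like this, a tip position 5 mm from the goal would count as a success early in training and as a failure once the tolerance has tightened, which is the sense in which the curriculum makes the task progressively harder.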

“…This motivates the development of an end-to-end model-free control framework for CTRs. We extend our previous model-free deep reinforcement learning (deepRL) method [3] with an initial proof of concept for generalization. The task we give the agent then is to control the end-effector Cartesian robot tip position by means of actions that represent changes in joint values to reach a desired position in the robot workspace whilst considering a specific CTR system.…”

Section: Introduction (mentioning)

confidence: 99%

Generalization for Deep Reinforcement Learning for Inverse Kinematics of Concentric Tube Robots

Iyengar, Spurgeon, Stoyanov, 2022

Proceedings of the 14th Hamlyn Symposium on Medical Robotics 2022

Concentric tube robots (CTRs) are a class of continuum robot that depend on the interactions between neighbouring, concentrically aligned tubes to produce the curvilinear shapes of the robot backbone [1]. The main application of these unique robots is that of minimally invasive surgery (MIS), where most of the developments for CTRs have been focused. Due to the confined workspaces and resulting extended learning times for surgeons in MIS, dexterous, compliant continuum robots such as CTRs have been under development in preference to the mechanically rigid and limited degrees-of-freedom (DOF) robots used in interventional medicine today. The precurved tubes in CTRs, sometimes referred to as active cannulas or catheters, are manufactured from super-elastic materials like Nickel-Titanium alloys with each tube nested concentrically. From the base, the individual tubes can be actuated through extension and rotation, which results in the bending and twisting of the backbone as well as access to the surgical site through the channel and robot tip. Clinically, CTRs are motivated for use in brain, cardiac, and gastric surgery as well as other procedures [2]. Due to tube interactions, modelling and control is non-trivial. Position control for CTRs has relied on model development, and although a balance between computation and accuracy has been reached in the literature [1], there remain issues such as performance in the presence of tube parameter discrepancies and the impact of unmodelled physical phenomena such as friction and permanent plastic deformation. This motivates the development of an end-to-end model-free control framework for CTRs. We extend our previous model-free deep reinforcement learning (deepRL) method [3] with an initial proof of concept for generalization. The task we give the agent then is to control the end-effector Cartesian robot tip position by means of actions that represent changes in joint values to reach a desired position in the robot workspace whilst considering a specific CTR system.

“…For the simulation problem, several works estimate the tip's pose or the entire CTCR shape in dependency of the actuation parameters [11][12][13]. The problem of path planning or control is addressed by the inverse relation with similar methods [12], [14][15][16]. Such approaches rely on a densely sampled and valid data set of high measurement accuracy, which is time-consuming to establish [17].…”

Section: Concentric Tube Continuum Robots (mentioning)

confidence: 99%

“…Such approaches rely on a densely sampled and valid data set of high measurement accuracy, which is time-consuming to establish [17]. For that reason, some authors stick with learning other physical models [14][15][16]. Kuntz et al. are using a combination of measurements and simulation data [13].…”

Section: Concentric Tube Continuum Robots (mentioning)

confidence: 99%

Data augmentation for design of concentric tube continuum robots by generative adversarial networks

Hoffmann, Gulakala, Mühlenhoff et al., 2023

Proc Appl Math and Mech

Concentric tube continuum robots are a promising type of robot for various medical applications. Their application in neurosurgery poses challenging requirements for design and control that can be addressed by physics‐informed data‐based approaches. A prerequisite to data‐based modeling is an informative, rich data set. However, limited access to experimental data raises interest in partially or entirely synthetic data sets. In this contribution, we study the application of generative adversarial networks (GANs) for data augmentation in a data‐based design process of such robots. We propose a GAN framework suitable for curve‐fitting to generate synthetic trajectories of robots along with their corresponding control parameters. Our evaluation shows that the GANs can efficiently produce meaningful synthetic trajectories and control parameter pairs that show a good agreement with simulated trajectories.

“…For learning the kinematics of a CTCR task, we show in [4] that the accuracy and convergence are significantly improved by using this simple yet effective transformation to decorrelate the joint space. As of now, this approach has been used in publications [4], [5], [6] on machine learning. Yet, we are confident that disentanglement will also be useful for other research fields in the continuum robotics research community.…”

Section: Introduction (mentioning)

confidence: 99%

A Dataset and Benchmark for Learning the Kinematics of Concentric Tube Continuum Robots

Grassmann, Chen, Liang et al., 2022

2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Concentric tube continuum robots utilize nested tubes, which are subject to a set of inequalities. Current approaches to account for inequalities rely on branching methods such as if-else statements. It can introduce discontinuities, may result in a complicated decision tree, has a high wall-clock time, and cannot be vectorized. This affects the behavior and result of downstream methods in control, learning, workspace estimation, and path planning, among others. In this paper, we investigate a mapping to mitigate branching methods. We derive a lower triangular transformation matrix to disentangle the inequalities and provide proof for the unique existence. It transforms the interdependent inequalities into independent box constraints. Further investigations are made for sampling, control, and workspace estimation. Approaches utilizing the proposed mapping are at least 14 times faster (up to 176 times faster), generate always valid joint configurations, are more interpretable, and are easier to extend.
