2017 IEEE International Conference on Robotics and Automation (ICRA)
DOI: 10.1109/icra.2017.7989079
Deep reinforcement learning for tensegrity robot locomotion

Abstract: Tensegrity robots, composed of rigid rods connected by elastic cables, have a number of unique properties that make them appealing for use as planetary exploration rovers. However, control of tensegrity robots remains a difficult problem due to their unusual structures and complex dynamics. In this work, we show how locomotion gaits can be learned automatically using a novel extension of mirror descent guided policy search (MDGPS) applied to periodic locomotion movements, and we demonstrate the effectiveness o…

Cited by 73 publications (49 citation statements); References 26 publications.
“…Due to the inherent coupled, nonlinear dynamics of the robot, multi-cable actuation policies render robotic control a challenging intellectual task, providing a launch point for future work. We look forward to exploring the integration of artificial intelligence (particularly evolutionary algorithms and deep reinforcement learning architectures) in this robotic platform to optimize locomotive gaits on varied inclines, and even generate optimal tensegrity topologies, areas which have proven promising in prior work [17], [18]. We hope to leverage learning algorithms to achieve more fluid and efficient locomotion using a robust and fully autonomous control policy.…”
Section: Discussion
confidence: 99%
“…Model-based closed-loop control has been mostly limited to low-dimensional structures [23], [24], [25], [13], [26]. More complex and high dimensional systems have been addressed with model-free methods [16], [22], [18], [27], [28], [29] or open-loop control [30], [31], [32], [20]. In order to use a tensegrity spine with Laika, a model-based closed-loop tracking controller was developed by the authors in [12] and is improved upon in this work.…”
Section: A. Tensegrity Robots and Control
confidence: 99%
“…(27)–(29), in the context of robotics and control systems, did not address these rank issues when implemented for this 2D spine. Reducing (27)–(29) to a cables-only formulation by optimizing only over q_s as suggested in [20] only exacerbates these rank issues by defining A with fewer columns. Additionally, the relaxation of this equality-constrained problem to an inequality-constrained formulation, as used in [20], did not make the problem feasible.…”
Section: Existence of Solutions and Rank Deficiency
confidence: 99%
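The rank issue described in the statement above can be checked numerically: an equality constraint A x = b is solvable for arbitrary b only when A has full row rank, and dropping rod columns to obtain a cables-only matrix can only lower the rank, never raise it. The sketch below illustrates this; the matrix sizes and the name `A` are illustrative assumptions, not values from the cited work.

```python
import numpy as np

# Hypothetical equilibrium matrix A for a small tensegrity: 12 node-balance
# equations (rows) over 8 member force densities (columns). Values are
# random placeholders, not a real structure.
rng = np.random.default_rng(0)
A_full = rng.standard_normal((12, 8))
A_cables = A_full[:, :5]  # cables-only formulation: drop the 3 rod columns

def has_full_row_rank(A):
    """A x = b admits an exact solution for every b only if A has full
    row rank; otherwise the equality constraint can be infeasible."""
    return np.linalg.matrix_rank(A) == A.shape[0]

# With more equations than members, the equality constraint is generically
# infeasible, and removing columns only makes the rank situation worse.
full_ok = has_full_row_rank(A_full)
rank_full = np.linalg.matrix_rank(A_full)
rank_cables = np.linalg.matrix_rank(A_cables)
```

This mirrors the statement's point: relaxing such an infeasible equality constraint to an inequality does not by itself restore feasibility if the row space still cannot reach b.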
“…Further details are presented in Alg. 2. 2) Adversarial transfer of encoder from sim-to-real: Once we have a policy that is performing well in the simulator, we aim to learn an encoder that generates the same distribution of latent states over real images as the pre-trained encoder. To achieve this, we begin by freezing the source encoder's learned weights.…”
Section: B. Policy Transfer to the Real Robot
confidence: 99%
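The freeze-then-align step quoted above can be sketched numerically. The snippet below is a simplified illustration, not the cited paper's implementation: the encoders are plain linear maps, a mean-matching loss stands in for the adversarial discriminator, and all names, dimensions, and data are assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)

# Frozen source encoder, "trained" in simulation (a linear map here).
W_src = rng.standard_normal((4, 16)) * 0.1      # latent dim 4, image dim 16
sim_imgs = rng.standard_normal((256, 16))       # simulated images
real_imgs = rng.standard_normal((256, 16)) + 0.5  # real images, domain shift

z_src = sim_imgs @ W_src.T  # W_src is frozen: it is never updated below

# Target encoder, trained so its latent distribution over real images
# matches the frozen source latents over sim images. In place of an
# adversarial discriminator we minimize the squared gap between latent
# means, which keeps the sketch short and self-contained.
W_tgt = W_src.copy()
lr = 1e-2
x_bar = real_imgs.mean(axis=0)
for _ in range(500):
    diff = (real_imgs @ W_tgt.T).mean(axis=0) - z_src.mean(axis=0)
    # gradient of 0.5 * ||diff||^2 with respect to W_tgt
    W_tgt -= lr * np.outer(diff, x_bar)

gap = np.linalg.norm((real_imgs @ W_tgt.T).mean(0) - z_src.mean(0))
```

Because only W_tgt receives gradient updates, the source latent distribution stays fixed while the target encoder is pulled toward it, which is the role the frozen weights play in the quoted passage.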