2020
DOI: 10.1109/lra.2020.2979660
DeepGait: Planning and Control of Quadrupedal Gaits Using Deep Reinforcement Learning

Abstract: This paper addresses the problem of legged locomotion in non-flat terrain. As legged robots such as quadrupeds are to be deployed in terrains with geometries which are difficult to model and predict, the need arises to equip them with the capability to generalize well to unforeseen situations. In this work, we propose a novel technique for training neural-network policies for terrain-aware locomotion, which combines state-of-the-art methods for model-based motion planning and reinforcement learning. Our approac…

Cited by 147 publications (85 citation statements)
References 26 publications
“…We decide to tackle these challenges using model-free control via Reinforcement Learning (RL), which has shown impressive results when it comes to the control of complex motions. It has been used to solve a Rubik's cube with a robotic hand [16], learn locomotion on complex terrains, [17], [18], play table tennis [19], teach robots to imitate animals [20] and stand up from arbitrary initial conditions [21]. In addition to its capacity to solve complex tasks, once trained, RL has the advantage of requiring much less computation than optimization methods.…”
Section: High Inertia Feet
confidence: 99%
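A rough illustration of the excerpt's last point (not taken from any of the cited works): once trained, a policy reduces each control step to a single network forward pass. The observation dimension, layer sizes, and tanh squashing below are assumptions.

```python
# Illustrative sketch only: at deployment, a learned policy is a single
# forward pass through a small network, which is why it is far cheaper per
# control step than solving an optimization problem online. The network
# shape (48-D observation -> 12 joint targets) is an assumption.
import numpy as np

rng = np.random.default_rng(0)

# Stand-in "trained" weights for a two-layer MLP policy.
W1, b1 = 0.1 * rng.standard_normal((64, 48)), np.zeros(64)
W2, b2 = 0.1 * rng.standard_normal((12, 64)), np.zeros(12)

def policy(obs):
    """One control step: a single forward pass, no online optimization."""
    h = np.tanh(W1 @ obs + b1)
    return np.tanh(W2 @ h + b2)  # joint position targets in [-1, 1]

obs = rng.standard_normal(48)    # placeholder proprioceptive observation
print(policy(obs).shape)         # (12,)
```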
“…Every module, with the exception of the burst generators, independently and continuously seeks to match its perception with its goal value through corrective outputs in real time. Unlike designs that only implement feedback control at the level of joint control and some form of feed-forward computation above that to generate behavior (11,14, 22, 25, 28, 32, 43, 45, 50), our design uses closed loop negative feedback control at every level of the hierarchy. This feature replicates the purposive nature of animal behavior, which is often mistakenly assumed to be a feedforward process whereby a stimulus input is transformed by the nervous system and results in motor output (51).…”
Section: Discussion
confidence: 99%
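A minimal sketch of the hierarchy described above, assuming a toy plant and made-up gains (this is not the authors' implementation): every level runs closed-loop negative feedback, and the upper loop's corrective output becomes the goal of the loop below it.

```python
# Illustrative sketch, not the authors' implementation: a two-level hierarchy
# in which every module runs closed-loop negative feedback, continuously
# correcting the gap between its goal and its perception; the upper module's
# output is the goal of the module below it. Gains, the toy plant, and the
# signal names are assumptions.
class Loop:
    """One control module reducing (goal - perception). An integrating module
    accumulates its corrective output; otherwise the output is proportional."""
    def __init__(self, gain, integrating=False, dt=0.01):
        self.gain, self.integrating, self.dt = gain, integrating, dt
        self.output = 0.0

    def step(self, goal, perception):
        error = goal - perception
        if self.integrating:
            self.output += self.gain * error * self.dt
        else:
            self.output = self.gain * error
        return self.output

upper = Loop(gain=2.0, integrating=True)  # e.g. body-height loop (assumed)
lower = Loop(gain=5.0)                    # e.g. joint loop (assumed)

height_goal = 0.45            # desired body height [m] (illustrative)
height, joint, dt = 0.30, 0.0, 0.01

for _ in range(5000):
    joint_goal = upper.step(height_goal, height)   # upper loop sets the lower goal
    joint_vel_cmd = lower.step(joint_goal, joint)  # lower loop acts on the plant
    joint += joint_vel_cmd * dt                    # toy joint dynamics
    height += 0.5 * (joint - height) * dt          # toy body-height dynamics
print(round(height, 3))  # ~0.45: the goal is reached purely by nested feedback
```

Each module here corrects only its own perceived variable in real time; no module predicts future states or plans, which is the contrast the excerpt draws with feed-forward architectures.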
“…The ‘nervous system’ of the robot operates through a hierarchical network of simple control system modules. Unlike other robot control architectures that perform model-based control and planning (11, 12, 13, 14, 22, 23, 25, 33, 45, 50), our control architecture generates robust and adaptive goal-directed behavior through a simple feedback process requiring no model of the environment, prediction of future states, or learning. Unlike architectures in which behavior is generated by environmental stimuli or internal system dynamics (1, 2, 3, 4, 18, 24, 28, 29, 32, 36, 35, 37, 43, 46), our architecture generates adaptive behavior by automatically achieving continuously changing internal goals in the control hierarchy.…”
Section: Introduction
confidence: 99%
“…The two-layer CPG network is chosen as the locomotion generator instead of learning joint position commands directly like most of the other studies (Hwangbo et al, 2019 ; Tsounis et al, 2020 ). There are three reasons for this: (1) the CPG network constrains the basic locomotion of the robot, which reduces the search space and accelerates the learning; (2) compared to 18 joint position or joint torque commands, learning symmetric CPG coupling parameters lowers the dimension of the action space; (3) the CPG network outputs smooth joint position commands, which are easier to be realized in the real robot.…”
Section: Locomotion Optimization Via Reinforcement Learning
confidence: 99%
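A minimal sketch of such a CPG-style generator, assuming simple coupled phase oscillators and a trot phase pattern (this is not the cited two-layer network): the few symmetric parameters (frequency, coupling strength, amplitudes) are the kind of low-dimensional action an RL policy could adjust instead of 18 raw joint commands, and the outputs are smooth joint position targets.

```python
# Illustrative sketch, not the cited implementation: four coupled phase
# oscillators (one per leg) whose phases are mapped to smooth hip/knee
# position targets. Frequency, coupling, amplitudes, and the trot phase
# offsets are assumed parameters; an RL action could adjust these few
# symmetric values rather than 18 joint commands directly.
import numpy as np

trot_offsets = np.array([0.0, np.pi, np.pi, 0.0])  # LF, RF, LH, RH; diagonals in phase

def cpg_step(phases, freq_hz, coupling, dt):
    """Advance leg phases; the coupling pulls each pair toward its desired
    relative offset (Kuramoto-style phase locking)."""
    new = phases.copy()
    for i in range(4):
        dphi = 2.0 * np.pi * freq_hz
        for j in range(4):
            desired = trot_offsets[i] - trot_offsets[j]
            dphi += coupling * np.sin(phases[j] - phases[i] + desired)
        new[i] = phases[i] + dphi * dt
    return new

def joint_targets(phases, hip_amp=0.3, knee_amp=0.5):
    """Map phases to smooth hip swing and knee flexion targets (rad)."""
    hips = hip_amp * np.sin(phases)
    knees = knee_amp * np.maximum(0.0, np.sin(phases))  # flex during swing only
    return np.stack([hips, knees], axis=1)  # (4 legs, 2 joints)

phases = trot_offsets.copy()          # start phase-locked in the trot pattern
for _ in range(500):                  # 1 s at 500 Hz
    phases = cpg_step(phases, freq_hz=1.5, coupling=2.0, dt=0.002)
print(joint_targets(phases).round(2))
```

The point of the excerpt is visible in the sketch: the oscillator structure constrains the search space to a handful of coupling and amplitude parameters, and the sinusoidal outputs are smooth by construction, which eases transfer to the physical robot.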