Learning an Efficient Gait Cycle of a Biped Robot Based on Reinforcement Learning and Artificial Neural Networks

Morales, Gil; Rufino, Cristyan

doi:10.3390/app9030502

Cited by 36 publications

(22 citation statements)

References 24 publications

(22 reference statements)


“…The platform is employed here to simulate the complex and dynamic environment, where oscillations are introduced to the system to imitate the real external disturbances. Compared with other studies where the robots are trained on a flat surface or a platform with fixed angle under the experimental environment [15][16][17], the proposed platform is able to provide a more complex and dynamic environment where the robot is capable of learning a more robust and efficient controller. Thus, the experiments in this paper will not only show the convergence of the learning procedure, but also involve the robustness of the learned controller to adapt to different complex and dynamic environments.…”

Section: Formulation of the Problem (mentioning)

confidence: 99%

“…Many attempts based on model-free RL frameworks have been made recently to involve RL into biped robot walking control to avoid calculating the mathematical model. Gil [15] utilized Q-Learning to find a sequence of pose that allows a NAO robot to reach the furthest distance in the shortest time, while still keeping a straight path without falling down. However, the actions were discrete, thus it lacked a smooth transfer between two poses.…”

Section: Introduction (mentioning)

confidence: 99%
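
The citation statement above describes Q-Learning over a discrete set of poses. As a rough illustration only, the sketch below runs tabular Q-learning on a toy chain of poses. The environment, the reward values, and every name in it (`N_POSES`, `step`, `train`, `greedy_policy`) are hypothetical and are not taken from the cited paper or from any NAO software.

```python
import random

# Hypothetical toy setup: poses are indexed 0..N_POSES-1 and the last one is the goal.
N_POSES = 6
ACTIONS = (0, 1)  # 0 = move back one pose, 1 = advance one pose
GOAL = N_POSES - 1


def step(state, action):
    """Toy deterministic transition between discrete poses (an assumption)."""
    nxt = min(state + 1, GOAL) if action == 1 else max(state - 1, 0)
    # A small per-step penalty favours short pose sequences.
    reward = 1.0 if nxt == GOAL else -0.01
    return nxt, reward, nxt == GOAL


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0 for _ in ACTIONS] for _ in range(N_POSES)]
    for _ in range(episodes):
        s = 0
        for _ in range(100):  # cap the episode length
            if rng.random() < epsilon:
                a = rng.choice(ACTIONS)
            else:
                a = max(ACTIONS, key=lambda x: q[s][x])
            s2, r, done = step(s, a)
            # Q-learning target: bootstrap from the best next action.
            target = r if done else r + gamma * max(q[s2])
            q[s][a] += alpha * (target - q[s][a])
            s = s2
            if done:
                break
    return q


def greedy_policy(q):
    """Best action for every non-terminal pose."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_POSES - 1)]


if __name__ == "__main__":
    print(greedy_policy(train()))
```

Because the action set is discrete, the learned policy is a lookup from pose to pose, which is consistent with the statement's remark that such a controller lacks a smooth transfer between two poses.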


Walking Control of a Biped Robot on Static and Rotating Platforms Based on Hybrid Reinforcement Learning

Chen

2020

IEEE Access


In this paper, we proposed a novel Hybrid Reinforcement Learning framework to maintain the stability of a biped robot (NAO) while it is walking on static and dynamic platforms. The reinforcement learning framework consists of the model-based off-line estimator, the Actor Network pre-training scheme, and the model-free on-line optimizer. We proposed Hierarchical Gaussian Processes as the model-based estimator to predict a rough model of the system and to obtain the initial control input. The initial control input is then used to pre-train the Actor Network. Finally, a model-free optimizer based on the Deep Deterministic Policy Gradient framework is introduced to fine-tune the Actor Network and to generate the best actions. The proposed reinforcement learning framework not only avoids the distribution mismatch problem that arises when combining a model-based scheme with a model-free structure, but also improves the sample efficiency of the on-line learning procedure. Simulation results show that the proposed Hybrid Reinforcement Learning mechanism enables the NAO robot to maintain balance while walking on static and dynamic platforms. The robustness of the learned controllers in adapting to platforms with different angles, different magnitudes, and different frequencies is tested.


“…Even after designing the bipedal stability and smooth trajectories, online strategies (Table 3) are required for the stable landing of the foot on the ground and avoid sudden jerks while walking which will harm the bipedal [10][11][12][13]. The force/ torque sensor at ankle results in sustained oscillations in SSP which is overcome by damping oscillator parameters.…”

Section: Proposed Model of Bipedal Robot (mentioning)

confidence: 99%

Comparative Study of Learning and Execution of Bipedal by Using Forgetting Mechanism in Reinforcement Learning Algorithm

Singh, Prateek, Pasricha

2020

JESA


A bipedal robot that resembles a human is programmed to perform specific tasks. The scope of the proposed work was to design, program, and validate RL-based algorithms for bipedal walking navigation. The bipedal navigation implements a forgetting mechanism in the traditional Q-learning algorithm, which results in learning to walk without previously learned dynamics of the system. Simulations were carried out for all three joints of each leg to evaluate the feasibility of the forgetting-mechanism algorithm, the optimal policy, and the optimal actions for navigation. The reinforcement control algorithms were applied so that the bipedal can make its own decisions. The bipedal senses the current state and moves to the goal state by learning or, in the execution phase, by using data stored in the lookup table. This considerably reduces the number of learning and execution iterations, but the total learning and execution time remains approximately the same. Simulation is done on the MATLAB platform with the Simscape Multibody toolbox to verify the results. The bipedal model performs object identification and object localization in the dynamic environment, learns through the RL controller, and then executes to reach the identified object.


“…Reinforcement learning can be applied to efficient gait control of a biped robot. Gil et al [37] showed a reinforcement learning mechanism to handle stability and efficiency of movement, thus improving speed and precision of the trajectory. Yang et al [38] showed an interesting work to transform the complex motion of robot turning into a simple translational motion.…”

Section: Advanced Mobile Robotics (mentioning)

confidence: 99%

Special Feature on Advanced Mobile Robotics

Kim

2019

Applied Sciences

