2020
DOI: 10.3390/s20164468
|View full text |Cite
|
Sign up to set email alerts
|

Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning

Abstract: In this work, we introduced a novel hybrid reinforcement learning scheme to balance a biped robot (NAO) on an oscillating platform, where the rotation of the platform is considered as the external disturbance to the robot. The platform had two degrees of freedom in rotation, pitch and roll. The state space comprised the position of center of pressure, and joint angles and joint velocities of two legs. The action space consisted of the joint angles of ankles, knees, and hips. By adding the inverse kinematics te… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 33 publications
0
5
0
Order By: Relevance
“…Cristyan et al [11] proposed a multilevel system and used the Q-learning method in this system to make the NAO robot walk quickly in simulation. Ao et al [12] proposed a hybrid reinforcement learning method to keep the NAO robot balanced on an oscillation platform with different frequencies and amplitudes in the simulation. Takamitsu et al [13] proposed a learning framework for central pattern generation (CPG)-based biped locomotion with a policy gradient method and applied it to real robots.…”
Section: Biped Robot Controlled By Drlmentioning
confidence: 99%
See 1 more Smart Citation
“…Cristyan et al [11] proposed a multilevel system and used the Q-learning method in this system to make the NAO robot walk quickly in simulation. Ao et al [12] proposed a hybrid reinforcement learning method to keep the NAO robot balanced on an oscillation platform with different frequencies and amplitudes in the simulation. Takamitsu et al [13] proposed a learning framework for central pattern generation (CPG)-based biped locomotion with a policy gradient method and applied it to real robots.…”
Section: Biped Robot Controlled By Drlmentioning
confidence: 99%
“…The purpose of this review is to improve the estimation of the value function. The loss function is constructed as loss = Q − V(s t ; w) 2 (12) and the single-step update w is…”
Section: Policy Trainingmentioning
confidence: 99%
“…We also implemented the velocity loop control for experiments. We applied the PI controller of Equation (15) to simplify hardware implementation, because it provided similar responses to the standard robust controller but with a much simpler form. We set the TWIP system at the two locations (see Figure 5a) with an initial angle of ψ(0) = −9.5 • and a reference velocity V re f = 0.…”
Section: The Velocity Loop Controlmentioning
confidence: 99%
“…Liu et al [14] proposed Machines 2021, 9, 205 2 of 21 a real-time balance control for a small-sized biped robot and applied a gyroscope and an accelerometer to detect the robot inclination and balance it when being pushed. Xi and Chen [15] applied inverse kinematics and reinforcement learning to balance a biped robot on an oscillating platform.…”
Section: Introductionmentioning
confidence: 99%
“…In [32], the analytical procedure is extended. See [33][34][35][36][37][38] for further information on bipedal robot balance or stability control. As a result, the current study proposes a high-level control system to maintain the ZMP trajectory within a stable zone while following the intended COM references.…”
Section: Introductionmentioning
confidence: 99%