2008
DOI: 10.1177/0278364907084980
|View full text |Cite
|
Sign up to set email alerts
|

Learning CPG-based Biped Locomotion with a Policy Gradient Method: Application to a Humanoid Robot

Abstract: In this paper we describe a learning framework for a central pattern generator (CPG)-based biped locomotion controller using a policy gradient method. Our goals in this study are to achieve CPG-based biped walking with a 3D hardware humanoid and to develop an efficient learning algorithm with CPG by reducing the dimensionality of the state space used for learning. We demonstrate that an appropriate feedback controller can be acquired within a few thousand trials by numerical simulations and the controller obta… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
135
0

Year Published

2009
2009
2024
2024

Publication Types

Select...
7
2
1

Relationship

2
8

Authors

Journals

citations
Cited by 191 publications
(135 citation statements)
references
References 25 publications
0
135
0
Order By: Relevance
“…However, finding the correct coupling weights to generate a desired gait is a difficult task. Hence, some researchers focused on the design of architectures for CPGs constructed by coupled oscillators, and in this way succeeded to generate common gaits in legged robots such as bipeds [13][14], and quadrupeds [15][16][17], even in legless robots such as fish [18], and snake robots [19][20].…”
Section:  Applications Of Cpg-based Control In Roboticsmentioning
confidence: 99%
“…However, finding the correct coupling weights to generate a desired gait is a difficult task. Hence, some researchers focused on the design of architectures for CPGs constructed by coupled oscillators, and in this way succeeded to generate common gaits in legged robots such as bipeds [13][14], and quadrupeds [15][16][17], even in legless robots such as fish [18], and snake robots [19][20].…”
Section:  Applications Of Cpg-based Control In Roboticsmentioning
confidence: 99%
“…Both posture control and biped locomotion are challenging research topics for a human-like and autonomous humanoid robot. Our exploration focused on biologically inspired control algorithms for locomotion using three different humanoid robots (DB-chan [72,76,77], Fujitsu Automation HOAP-2 [48] and CB-i [78]) as well as a small humanoid robot provided by the SONY Corp. [79].…”
Section: Creating a Brain By Brain-motivated Humanoid-motor-learning mentioning
confidence: 99%
“…For rhythmic behaviors half-elliptical locuses have been used as a representation of the gait pattern of a robot dog [Kohl and Stone, 2004]. Neural Networks: Instead of analytically describing rhythmic movements, neural networks can be used as oscillators to learn gaits of a a two legged robot [Geng et al, 2006, Endo et al, 2008. Also a peg-in-hole (see Figure 2.1b) and a ball-balancing task as well as a navigation task [Hailu and Sommer, 1998] have been learned with neural networks as policy function approximators.…”
Section: Pre-structured Policiesmentioning
confidence: 99%