2014
DOI: 10.3389/fncom.2014.00062
|View full text |Cite
|
Sign up to set email alerts
|

Learning modular policies for robotics

Abstract: A promising idea for scaling robot learning to more complex tasks is to use elemental behaviors as building blocks to compose more complex behavior. Ideally, such building blocks are used in combination with a learning algorithm that is able to learn to select, adapt, sequence and co-activate the building blocks. While there has been a lot of work on approaches that support one of these requirements, no learning algorithm exists that unifies all these properties in one framework. In this paper we present our w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 17 publications
(12 citation statements)
references
References 29 publications
(61 reference statements)
0
12
0
Order By: Relevance
“…Along the same direction, the Probabilistic Movement Primitive (ProMP) approach in [4], [5], uses a model-free approach to encode a distribution over trajectories and analytically derive a stochastic feedback controller to reproduce the given trajectory distribution. This allows for flexibility over the possible motion generation such as spatial and temporal rescaling, combination and blending of the modeled motion primitives (MPs).…”
Section: Related Workmentioning
confidence: 99%
“…Along the same direction, the Probabilistic Movement Primitive (ProMP) approach in [4], [5], uses a model-free approach to encode a distribution over trajectories and analytically derive a stochastic feedback controller to reproduce the given trajectory distribution. This allows for flexibility over the possible motion generation such as spatial and temporal rescaling, combination and blending of the modeled motion primitives (MPs).…”
Section: Related Workmentioning
confidence: 99%
“…Each cell in this architecture can also be modeled as a complicated neural system and more neural systems are coupled in this network. DMPs, as a mathematical dynamics modifier, work with actor-critic RL mechanism (Kober and Peters, 2010 ) to optimize/learn a locomotor system. In the future work, a sophisticated memory system, which includes short-term memory and long-term memory, is required in our system to map the contextual factors into parameter space.…”
Section: Discussionmentioning
confidence: 99%
“…In this paper, we unify and complement our prior work [33,36,37] on ProMPs. Note that the reference [33] contains only a brief summary of our work on ProMPs presented in the context of an overview paper that spans over multiple topics. Therefore, [33] provides less information than the corresponding conference papers.…”
Section: Introductionmentioning
confidence: 96%
“…Note that the reference [33] contains only a brief summary of our work on ProMPs presented in the context of an overview paper that spans over multiple topics. Therefore, [33] provides less information than the corresponding conference papers. In this paper, we present much more details which are necessary to reproduce the results.…”
Section: Introductionmentioning
confidence: 99%