2019 International Conference on Robotics and Automation (ICRA) 2019
DOI: 10.1109/icra.2019.8793808
|View full text |Cite
|
Sign up to set email alerts
|

Factored Contextual Policy Search with Bayesian optimization

Abstract: Scarce data is a major challenge to scaling robot learning to truly complex tasks, as we need to generalize locally learned policies over different task contexts. Contextual policy search offers data-efficient learning and generalization by explicitly conditioning the policy on a parametric context space. In this paper, we further structure the contextual policy representation. We propose to factor contexts into two components: target contexts that describe the task objectives, e.g. target position for throwin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 20 publications
0
4
0
Order By: Relevance
“…In this context, hindsight refers to the capacity to exploit information about the degree to which an arbitrary goal has been achieved while another goal was intended. Prior to our work, hindsight has been limited to off-policy reinforcement learning algorithms that rely on experience replay (Andrychowicz et al, 2017) and policy search based on Bayesian optimization (Karkus et al, 2016;Pinsler et al, 2019).…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…In this context, hindsight refers to the capacity to exploit information about the degree to which an arbitrary goal has been achieved while another goal was intended. Prior to our work, hindsight has been limited to off-policy reinforcement learning algorithms that rely on experience replay (Andrychowicz et al, 2017) and policy search based on Bayesian optimization (Karkus et al, 2016;Pinsler et al, 2019).…”
Section: Discussionmentioning
confidence: 99%
“…In earlier work, Karkus, Kupcsik, Hsu, and Lee (2016) introduced hindsight to policy search based on Bayesian optimization (Metzen, Fabisch, & Hansen, 2015). This work was recently extended by Pinsler, Karkus, Kupcsik, Hsu, and Lee (2019).…”
Section: Introductionmentioning
confidence: 88%
See 1 more Smart Citation
“…Subsequently, the robot skills can be honed by updating the MP parameters through trial-and-error within the framework of Policy Search (PS), a branch of Reinforcement Learning (RL) [2] responsible for resolving which trajectories to evaluate in consideration of the rewards of each execution. Thus, PS algorithms have proved successful in several robotic applications [3], including the contextual case, in which robots are required to adapt to changing environments [4], [5], [6], [7].…”
Section: Introductionmentioning
confidence: 99%