2021
DOI: 10.48550/arxiv.2112.00597
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation

Abstract: Complex sequential tasks in continuous-control settings often require agents to successfully traverse a set of "narrow passages" in their state space. Solving such tasks with a sparse reward in a sample-efficient manner poses a challenge to modern reinforcement learning (RL) due to the associated long-horizon nature of the problem and the lack of sufficient positive signal during learning. Various tools have been applied to address this challenge. When available, large sets of demonstrations can guide agent ex… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 28 publications
(42 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?