2020
DOI: 10.48550/arxiv.2002.12336
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Hallucinative Topological Memory for Zero-Shot Visual Planning

Kara Liu,
Thanard Kurutach,
Christine Tung
et al.

Abstract: In visual planning (VP), an agent learns to plan goal-directed behavior from observations of a dynamical system obtained offline, e.g., images obtained from self-supervised robot interaction. Most previous works on VP approached the problem by planning in a learned latent space, resulting in low-quality visual plans, and difficult training algorithms. Here, instead, we propose a simple VP method that plans directly in image space and displays competitive performance. We build on the semi-parametric topological… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(7 citation statements)
references
References 17 publications
0
7
0
Order By: Relevance
“…Gao et al [16] also proposes a deep neural network for navigation with a local 2-D map provided as additional signal. Liu et al [17] requires a global context (e.g., an accurate global layout) of the environment to guide the local controller in novel domains. All of the above methods require accurate high-level information; however, they all illustrate promising zero-shot generalization performance in novel tasks, suggesting that high-level maps and contexts are useful for navigation.…”
Section: Related Workmentioning
confidence: 99%
“…Gao et al [16] also proposes a deep neural network for navigation with a local 2-D map provided as additional signal. Liu et al [17] requires a global context (e.g., an accurate global layout) of the environment to guide the local controller in novel domains. All of the above methods require accurate high-level information; however, they all illustrate promising zero-shot generalization performance in novel tasks, suggesting that high-level maps and contexts are useful for navigation.…”
Section: Related Workmentioning
confidence: 99%
“…Planning through latent space with gradient descent is a core concept shared by several previous works, in which the same embedding is used for both the goal and the current state. The basic idea is that the distance between the latent representation of the current state and the latent representation of the goal could be exploited for planning purposes [10]. The idea can be extended by using a dynamics model to predict the latent state resulting from an action.…”
Section: Planning By Backpropagationmentioning
confidence: 99%
“…In the limit, zero demonstrations of the new task are provided. This is a special case of zero-shot learning in policy domain, which has become an emerging sub-field within meta-learning [10]. Meta-imitation learning refers to meta-learning approaches that use imitation learning during the meta-training phase.…”
Section: Introductionmentioning
confidence: 99%
“…Some are constrained to use local control policies [14,2] or use rule-based algorithms such as beam or A * search [2,15,14] to perform localized path corrections. Others focus on processing long-range observations instead of actions [16] or employ offline pre-training schemes to learn topological structures [12,13,17]. This is challenging since accurate construction of graphs is non-trivial and requires special adaptation to work during real-time navigation [13,17].…”
Section: Introductionmentioning
confidence: 99%
“…Others focus on processing long-range observations instead of actions [16] or employ offline pre-training schemes to learn topological structures [12,13,17]. This is challenging since accurate construction of graphs is non-trivial and requires special adaptation to work during real-time navigation [13,17]. Under the guidance of natural language instruction, the autonomous agent needs to navigate through the environment from the start state to the target location (red flag).…”
Section: Introductionmentioning
confidence: 99%