2017 IEEE International Conference on Computer Vision (ICCV) 2017
DOI: 10.1109/iccv.2017.60
|View full text |Cite
|
Sign up to set email alerts
|

Visual Semantic Planning Using Deep Successor Representations

Abstract: A crucial capability of real-world intelligent agents is their ability to plan a sequence of actions to achieve their goals in the visual world. In this work, we address the problem of visual semantic planning: the task of predicting a sequence of actions from visual observations that transform a dynamic environment from an initial state to a goal state. Doing so entails knowledge about objects and their affordances, as well as actions and their preconditions and effects. We propose learning these through inte… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

3
110
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 128 publications
(113 citation statements)
references
References 36 publications
3
110
0
Order By: Relevance
“…Gandhi et al [19] collect a dataset of drone crashes and train self-supervised agents to avoid obstacles. A number of new challenging tasks have been proposed including instruction-based navigation [6,7], target-driven navigation [2,4], embodied/interactive question answering [1,9], and task planning [5].…”
Section: Related Workmentioning
confidence: 99%
“…Gandhi et al [19] collect a dataset of drone crashes and train self-supervised agents to avoid obstacles. A number of new challenging tasks have been proposed including instruction-based navigation [6,7], target-driven navigation [2,4], embodied/interactive question answering [1,9], and task planning [5].…”
Section: Related Workmentioning
confidence: 99%
“…Autonomous agents, controlled by neural network policies and trained with reinforcement learning algorithms, have been used in a wide range of robot navigation applications [1,2,3,23,28,32,47,50,51]. In many of these applications, the agent needs to perform tasks over long time horizons in unseen environments.…”
Section: Introductionmentioning
confidence: 99%
“…7 These properties should be replicated in neural networks if they are to serve as accurate models of natural intelligence. New neural network architectures are slowly starting to take steps in this direction (e.g., (Santoro et al, 2017;Zhu et al, 2017;Louizos et al, 2017)).…”
Section: Reasoningmentioning
confidence: 99%