2018
DOI: 10.1177/0278364918781001
|View full text |Cite
|
Sign up to set email alerts
|

Data-driven planning via imitation learning

Abstract: Robot planning is the process of selecting a sequence of actions that optimize for a task specific objective. For instance, the objective for a navigation task would be to find collision free paths, while the objective for an exploration task would be to map unknown areas. The optimal solutions to such tasks are heavily influenced by the implicit structure in the environment, i.e. the configuration of objects in the world. State-of-the-art planning approaches, however, do not exploit this structure, thereby ex… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
39
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 49 publications
(39 citation statements)
references
References 87 publications
0
39
0
Order By: Relevance
“…Given the structured, unified nature of FLYBO, the proposed data and framework can be used to support the development of Reinforcement-Learning (RL) based approaches to autonomous exploration. Methods that build on RL typically require a moderate amount of training data [13] and an efficient means of scaling the number of experiments which could naturally benefit from the flexibility of our system.…”
Section: Discussion and Perspectivesmentioning
confidence: 99%
“…Given the structured, unified nature of FLYBO, the proposed data and framework can be used to support the development of Reinforcement-Learning (RL) based approaches to autonomous exploration. Methods that build on RL typically require a moderate amount of training data [13] and an efficient means of scaling the number of experiments which could naturally benefit from the flexibility of our system.…”
Section: Discussion and Perspectivesmentioning
confidence: 99%
“…The realizability gap between the two is vast, resulting in a trivially large regret bound [14]. Instead, Choudhury et al [7] show that imitating the clairvoyant oracle is in fact equivalent to imitating a corresponding hallucinating oracle, that computes an instantaneous posterior over worlds given the edge evaluations so far and computes the expected clairvoyant oracle action value over this posterior i.e,…”
Section: Component 1: Unrealizability Of the Clairvoyant Oraclementioning
confidence: 99%
“…Interestingly, if we were to reveal the status of all the edges during training, we can conceive of a clairvoyant oracle [7] that can select the optimal sequence of edges to invalidate. In fact, we show that the oracular selector is equivalent to set cover, for which greedy approximations exist.…”
Section: Introductionmentioning
confidence: 99%
“…It has to be noted that, similar to [12], motion planning for data generation is conducted with full environmental information in order to guarantee fast convergence to a cost-minimizing solution. In contrast to that, the recorded occupancy grid only fuses the current history of sensor observations resulting in unobserved areas due to occlusions.…”
Section: A Data Generationmentioning
confidence: 99%