Data-driven planning via imitation learning

Choudhury, Sanjiban; Bhardwaj, Mohak; Arora, Sankalp; Kapoor, Ashish; Ranade, Gireeja; Scherer, Sebastian; Dey, Debadeepta

doi:10.1177/0278364918781001

Cited by 49 publications

(39 citation statements)

References 87 publications

“…Given the structured, unified nature of FLYBO, the proposed data and framework can be used to support the development of Reinforcement-Learning (RL) based approaches to autonomous exploration. Methods that build on RL typically require a moderate amount of training data [13] and an efficient means of scaling the number of experiments which could naturally benefit from the flexibility of our system.…”

Section: Discussion and Perspectives (mentioning)

confidence: 99%

FLYBO: A Unified Benchmark Environment for Autonomous Flying Robots

Brunel, Bourki, Strauss et al. 2021

2021 International Conference on 3D Vision (3DV)

Figure 1: An MAV equipped with odometry and active depth sensors autonomously explores a complex synthetic area from FLYBO (a) while gradually mapping the scene throughout different exploration stages and planning trajectories online (b-d). Simultaneously, the perceived surfaces are also reconstructed online (close-up views). FLYBO provides datasets, references and a framework to benchmark such systems with respect to their volumetric exploration and online surface reconstruction capabilities.

“…The realizability gap between the two is vast, resulting in a trivially large regret bound [14]. Instead, Choudhury et al [7] show that imitating the clairvoyant oracle is in fact equivalent to imitating a corresponding hallucinating oracle, that computes an instantaneous posterior over worlds given the edge evaluations so far and computes the expected clairvoyant oracle action value over this posterior i.e,…”

Section: Component 1: Unrealizability of the Clairvoyant Oracle (mentioning)

confidence: 99%

“…Interestingly, if we were to reveal the status of all the edges during training, we can conceive of a clairvoyant oracle [7] that can select the optimal sequence of edges to invalidate. In fact, we show that the oracular selector is equivalent to set cover, for which greedy approximations exist.…”

Section: Introduction (mentioning)

confidence: 99%

Leveraging Experience in Lazy Search

Bhardwaj, Choudhury, Boots et al. 2019

Robotics: Science and Systems XV

Self Cite

Lazy graph search algorithms are efficient at solving motion planning problems where edge evaluation is the computational bottleneck. These algorithms work by lazily computing the shortest potentially feasible path, evaluating edges along that path, and repeating until a feasible path is found. The order in which edges are selected is critical to minimizing the total number of edge evaluations: a good edge selector chooses edges that are not only likely to be invalid, but also eliminates future paths from consideration. We wish to learn such a selector by leveraging prior experience. We formulate this problem as a Markov Decision Process (MDP) on the state of the search problem. While solving this large MDP is generally intractable, we show that we can compute oracular selectors that can solve the MDP during training. With access to such oracles, we use imitation learning to find effective policies. If new search problems are sufficiently similar to problems solved during training, the learned policy will choose a good edge evaluation ordering and solve the motion planning problem quickly. We evaluate our algorithms on a wide range of 2D and 7D problems and show that the learned selector outperforms baseline commonly used heuristics.
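The lazy search loop described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names (`shortest_path`, `lazy_shortest_path`), the dictionary-based undirected graph, and the `select_edge` callback are all hypothetical, and `select_edge` stands in for whatever selector (heuristic or learned) picks the next edge to evaluate.

```python
import heapq


def shortest_path(graph, start, goal, known_invalid):
    """Dijkstra over every edge that is not yet known to be invalid."""
    dist = {start: 0.0}
    parent = {}
    frontier = [(0.0, start)]
    while frontier:
        d, u = heapq.heappop(frontier)
        if u == goal:
            # Walk parent pointers back to the start.
            path = [goal]
            while path[-1] != start:
                path.append(parent[path[-1]])
            return path[::-1]
        if d > dist.get(u, float("inf")):
            continue  # stale heap entry
        for v, w in graph.get(u, {}).items():
            if frozenset((u, v)) in known_invalid:
                continue
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                parent[v] = u
                heapq.heappush(frontier, (nd, v))
    return None


def lazy_shortest_path(graph, start, goal, is_valid, select_edge):
    """Return (path, number of edge evaluations), or (None, count) if no path exists.

    `is_valid(edge)` is the expensive check (assumed, e.g. a collision test);
    `select_edge(edges)` picks one unevaluated edge on the candidate path.
    """
    known_valid, known_invalid = set(), set()
    num_evals = 0
    while True:
        # Shortest path that is still potentially feasible.
        path = shortest_path(graph, start, goal, known_invalid)
        if path is None:
            return None, num_evals
        unevaluated = [
            frozenset(e) for e in zip(path, path[1:])
            if frozenset(e) not in known_valid
        ]
        if not unevaluated:
            return path, num_evals  # every edge on the path was checked
        edge = select_edge(unevaluated)
        num_evals += 1
        (known_valid if is_valid(edge) else known_invalid).add(edge)
```

On a toy graph where the cheapest route contains a blocked edge, a selector that checks the blocked edge first needs fewer evaluations than one that checks edges in path order, which is the effect the abstract attributes to a good edge selector.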

“…It has to be noted that, similar to [12], motion planning for data generation is conducted with full environmental information in order to guarantee fast convergence to a cost-minimizing solution. In contrast to that, the recorded occupancy grid only fuses the current history of sensor observations resulting in unobserved areas due to occlusions.…”

Section: A Data Generation (mentioning)

confidence: 99%

Learning to Predict Ego-Vehicle Poses for Sampling-Based Nonholonomic Motion Planning

Banzhaf, Sanzenbacher, Baumann et al. 2019

IEEE Robot. Autom. Lett.

This work is an extended version of [1] and is therefore partially copyrighted by IEEE: © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Abstract: Sampling-based motion planning is an effective tool to compute safe trajectories for automated vehicles in complex environments. However, fast convergence to the optimal solution can only be ensured with the use of problem-specific sampling distributions. Due to the large variety of driving situations within the context of automated driving, it is very challenging to manually design such distributions. This paper therefore introduces a data-driven approach utilizing a deep convolutional neural network (CNN): given the current driving situation, future ego-vehicle poses can be directly generated from the output of the CNN, which guides the motion planner efficiently towards the optimal solution. A benchmark highlights that the CNN predicts future vehicle poses with a higher accuracy compared to uniform sampling and a state-of-the-art A*-based approach. Combining this CNN-guided sampling with the motion planner Bidirectional RRT* reduces the computation time by up to an order of magnitude and yields a faster convergence to a lower cost as well as a success rate of 100% in the tested scenarios.
