PRM-RL: Long-range Robotic Navigation Tasks by Combining Reinforcement Learning and Sampling-Based Planning

Faust, Aleksandra; Oslund, Kenneth; Ramírez, Óscar; Francis, Anthony G.; Tapia, Lydia; Fišer, Marek; Davidson, James

doi:10.1109/icra.2018.8461096

Cited by 267 publications

(173 citation statements)

References 28 publications

Supporting

Mentioning

157

Contrasting

Unclassified

Order By: Relevance

“…Overall, we show improved performance, better roadmap generation, and easier on-robot transfer, including a relative success rate increase of 40% over [21], and 94% over [11], while maintaining good performance despite increasing noise. We also show that only adding edges when agents can always navigate them makes roadmaps cheaper to build and improves navigation success; denser roadmaps also have higher simulated success rates but at substantial roadmap construction cost.…”

Section: Introductionmentioning

confidence: 79%

“…In other words, PRM-RL is a tool for generating paths which an RL agent can reliably satisfy without violating the constraints of its task. In [21] we demonstrated PRM-RL's success on tasks with constraints, but in this work, we focus solely on the navigation task, which collapses the task predicate L(x) to remaining within C f ree , and collapses the full configuration space available to the robot to a task space T limited to the robot's position and orientation.…”

Section: Problem Statementmentioning

confidence: 99%

“…The selected baselines are AutoRL [12], PRM-SL [34], PRM-GAPF (a modification of [11]), and our original PRM-RL with a hand-tuned RL policy [21]. The AutoRL [12] baseline policy is not guided by a PRM.…”

Section: ) Baselinesmentioning

confidence: 99%

See 2 more Smart Citations

Long-Range Indoor Navigation With PRM-RL

et al. 2020

Self Cite

View full text Add to dashboard Cite

Long-range indoor navigation requires guiding robots with noisy sensors and controls through cluttered environments along paths that span a variety of buildings. We achieve this with PRM-RL, a hierarchical robot navigation method in which reinforcement learning agents that map noisy sensors to robot controls learn to solve short-range obstacle avoidance tasks, and then sampling-based planners map where these agents can reliably navigate in simulation; these roadmaps and agents are then deployed on-robot, guiding the robot along the shortest path where the agents are likely to succeed. Here we use Probabilistic Roadmaps (PRMs) as the sampling-based planner and AutoRL as the reinforcement learning method in the indoor navigation context. We evaluate the method in simulation for kinematic differential drive and kinodynamic car-like robots in several environments, and on-robot for differential-drive robots at two physical sites. Our results show PRM-RL with AutoRL is more successful than several baselines, is robust to noise, and can guide robots over hundreds of meters in the face of noise and obstacles in both simulation and on-robot, including over 3.3 kilometers of physical robot navigation. The video is available at https://youtu.be/xN-OWX5gKvQ

show abstract

Section: Introductionmentioning

confidence: 79%

Section: Problem Statementmentioning

confidence: 99%

See 1 more Smart Citation

Long-Range Indoor Navigation With PRM-RL

et al. 2020

Self Cite

View full text Add to dashboard Cite

show abstract

“…Hence, it promising to combine these approaches and merge the advantages of both. Faust et al [15] use a reinforcement learning agent to learn short-range, point-to-point navigation policies for 2D and 3D action spaces which capture the robot dynamic and task constraint without considering the large-scale topology. Sampling-based planning is used to plan waypoints which give the planning a long-range goal-directed behavior.…”

Section: Related Workmentioning

confidence: 99%

Towards Learning Abstract Representations for Locomotion Planning in High-dimensional State Spaces

Klamt

Behnke

2019

2019 International Conference on Robotics and Automation (ICRA)

View full text Add to dashboard Cite

Ground robots which are able to navigate a variety of terrains are needed in many domains. One of the key aspects is the capability to adapt to the ground structure, which can be realized through movable body parts coming along with additional degrees of freedom (DoF). However, planning respective locomotion is challenging since suitable representations result in large state spaces. Employing an additional abstract representation-which is coarser, lower-dimensional, and semantically enriched-can support the planning.While a desired robot representation and action set of such an abstract representation can be easily defined, the cost function requires large tuning efforts. We propose a method to represent the cost function as a CNN. Training of the network is done on generated artificial data, while it generalizes well to the abstraction of real world scenes. We further apply our method to the problem of search-based planning of hybrid drivingstepping locomotion. The abstract representation is used as a powerful informed heuristic which accelerates planning by multiple orders of magnitude. Abstract representation HeuristicPlanner (e.g., A*, RRT, PRM) Path

show abstract

“…A bad reward function may lead to local minima or may lead an agent to wander around forever. To overcome this issue in long-range navigation tasks [20] presents a hybrid approach that combines sampling based path planning with RL. [14], [15] adds a separate pre-step supervised learning(SL) phase to the RL approach to teach agents how to reach their goal location without caring about colliding with other agents.…”

Section: Introductionmentioning

confidence: 99%

Multi-Agent Motion Planning for Dense and Dynamic Environments via Deep Reinforcement Learning

Semnani

Liu

Everett

et al. 2020

IEEE Robot. Autom. Lett.

View full text Add to dashboard Cite

This paper introduces a hybrid algorithm of deep reinforcement learning (RL) and Force-based motion planning (FMP) to solve distributed motion planning problem in dense and dynamic environments. Individually, RL and FMP algorithms each have their own limitations. FMP is not able to produce time-optimal paths and existing RL solutions are not able to produce collision-free paths in dense environments. Therefore, we first tried improving the performance of recent RL approaches by introducing a new reward function that not only eliminates the requirement of a pre supervised learning (SL) step but also decreases the chance of collision in crowded environments. That improved things, but there were still a lot of failure cases. So, we developed a hybrid approach to leverage the simpler FMP approach in stuck, simple and high-risk cases, and continue using RL for normal cases in which FMP can't produce optimal path. Also, we extend GA3C-CADRL algorithm to 3D environment. Simulation results show that the proposed algorithm outperforms both deep RL and FMP algorithms and produces up to 50% more successful scenarios than deep RL and up to 75% less extra time to reach goal than FMP.

show abstract

PRM-RL: Long-range Robotic Navigation Tasks by Combining Reinforcement Learning and Sampling-Based Planning

Cited by 267 publications

References 28 publications

Long-Range Indoor Navigation With PRM-RL

Long-Range Indoor Navigation With PRM-RL

Towards Learning Abstract Representations for Locomotion Planning in High-dimensional State Spaces

Multi-Agent Motion Planning for Dense and Dynamic Environments via Deep Reinforcement Learning

Contact Info

Product

Resources

About