Motion Planning Among Dynamic, Decision-Making Agents with Deep Reinforcement Learning

Everett, Michael; Chen, Yu Fan; How, Jonathan P.

doi:10.1109/iros.2018.8593871

Cited by 487 publications

(336 citation statements)

References 22 publications

Supporting

Mentioning

295

Contrasting

Unclassified

Order By: Relevance

“…This RL framework applies a reward function, R col s jn , u , to penalize the agent in case of collision, and reward in case of reaching its goal. Two different types of RL algorithms are used in this RL framework, value-based [22], [15] and policybased [14] learning. Value-based algorithm assumes that other agents continue their current velocities until next step, ∆t, to be able to extract policy from the value function, V s jn t .…”

Section: A Collision Avoidance With Deep Rl (Ga3c-cadrl)mentioning

confidence: 99%

“…Many different curriculum training paradigms used in the literature. For example, [14] starts training with SL, then runs two RL phases: one with 2-4 agents in the environment and next with 4-10 agents. [16] uses a two-stage training process, the first stage has 20 agents placed randomly in a simple environment without any obstacle.…”

Section: B Training the Policymentioning

confidence: 99%

“…There are a number of recent studies looking to learningbased approaches for multi-agent motion planning problem. Learning-based techniques offload the expensive real-time motion planning computations to an off-line training procedure in which a policy function that implicitly encodes cooperative behavior among agents is learned [13], [14], [15], [16], [17], [18]. The learned policy will be used by each agent later in real-time to select the best action at each time.…”

Section: Introductionmentioning

confidence: 99%

“…However, in practice, it is useful to extract agent-level information from multiple sensors data to have more precise decisions. For example, if the Lidar sensor data show the presence of an object in the surrounding area, an agent reaction may differ if the object is another agent or if it is just an obstacle [14]. GA3C-CADRL [14] is another recent RL-based algorithm that has shown good performance in solving the path planning problem.…”

Section: Introductionmentioning

confidence: 99%

“…For example, if the Lidar sensor data show the presence of an object in the surrounding area, an agent reaction may differ if the object is another agent or if it is just an obstacle [14]. GA3C-CADRL [14] is another recent RL-based algorithm that has shown good performance in solving the path planning problem. However, RL approach presented in GA3C-CADRL is hard to train as its reward is sparse.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Multi-Agent Motion Planning for Dense and Dynamic Environments via Deep Reinforcement Learning

Semnani

Liu

Everett

et al. 2020

IEEE Robot. Autom. Lett.

Self Cite

View full text Add to dashboard Cite

This paper introduces a hybrid algorithm of deep reinforcement learning (RL) and Force-based motion planning (FMP) to solve distributed motion planning problem in dense and dynamic environments. Individually, RL and FMP algorithms each have their own limitations. FMP is not able to produce time-optimal paths and existing RL solutions are not able to produce collision-free paths in dense environments. Therefore, we first tried improving the performance of recent RL approaches by introducing a new reward function that not only eliminates the requirement of a pre supervised learning (SL) step but also decreases the chance of collision in crowded environments. That improved things, but there were still a lot of failure cases. So, we developed a hybrid approach to leverage the simpler FMP approach in stuck, simple and high-risk cases, and continue using RL for normal cases in which FMP can't produce optimal path. Also, we extend GA3C-CADRL algorithm to 3D environment. Simulation results show that the proposed algorithm outperforms both deep RL and FMP algorithms and produces up to 50% more successful scenarios than deep RL and up to 75% less extra time to reach goal than FMP.

show abstract

Section: A Collision Avoidance With Deep Rl (Ga3c-cadrl)mentioning

confidence: 99%

Section: B Training the Policymentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Multi-Agent Motion Planning for Dense and Dynamic Environments via Deep Reinforcement Learning

Semnani

Liu

Everett

et al. 2020

IEEE Robot. Autom. Lett.

Self Cite

View full text Add to dashboard Cite

show abstract

Social Interaction‐Aware Dynamical Models and Decision‐Making for Autonomous Vehicles

Crosato,

Tian,

Shum

et al. 2023

Advanced Intelligent Systems

View full text Add to dashboard Cite

Interaction‐aware autonomous driving (IAAD) is a rapidly growing field of research that focuses on the development of autonomous vehicles (AVs) that are capable of interacting safely and efficiently with human road users. This is a challenging task, as it requires the AV to be able to understand and predict the behaviour of human road users. In this literature review, the current state of IAAD research is surveyed. Commencing with an examination of terminology, attention is drawn to challenges and existing models employed for modeling the behaviour of drivers and pedestrians. Next, a comprehensive review is conducted on various techniques proposed for interaction modeling, encompassing cognitive methods, machine‐learning approaches, and game‐theoretic methods. The conclusion is reached through a discussion of potential advantages and risks associated with IAAD, along with the illumination of pivotal research inquiries necessitating future exploration.

show abstract

Effects of a Social Force Model Reward in Robot Navigation Based on Deep Reinforcement Learning

Viyuela¹,

Sanfeliu²

2019

Advances in Intelligent Systems and Computing

View full text Add to dashboard Cite

In this paper is proposed an inclusion of the Social Force Model (SFM) into a concrete Deep Reinforcement Learning (RL) framework for robot navigation. These types of techniques have demonstrated to be useful to deal with different types of environments to achieve a goal. In Deep RL, a description of the world to describe the states and a reward adapted to the environment are crucial elements to get the desire behaviour and achieve a high performance. For this reason, this work adds a dense reward function based on SFM and uses the forces in the states like an additional description. Furthermore, obstacles are added to improve the behaviour of works that only consider moving agents. This SFM inclusion can offer a better description of the obstacles for the navigation. Several simulations have been done to check the effects of these modifications in the average performance.

show abstract

Motion Planning Among Dynamic, Decision-Making Agents with Deep Reinforcement Learning

Cited by 487 publications

References 22 publications

Multi-Agent Motion Planning for Dense and Dynamic Environments via Deep Reinforcement Learning

Multi-Agent Motion Planning for Dense and Dynamic Environments via Deep Reinforcement Learning

Social Interaction‐Aware Dynamical Models and Decision‐Making for Autonomous Vehicles

Effects of a Social Force Model Reward in Robot Navigation Based on Deep Reinforcement Learning

Contact Info

Product

Resources

About