2019
DOI: 10.1016/j.neucom.2019.05.001

TDPP-Net: Achieving three-dimensional path planning via a deep neural network architecture

Cited by 45 publications (18 citation statements)
References 24 publications (25 reference statements)

“…According to the target Q-values and their current values estimated by the online network, our loss function is then defined in Equation (13), where α₁, α₂, and α₃ are three scaling factors. The first two terms in Equation (13) aim to minimize the differences between the predicted Q-values and their corresponding target values. Because the angular and linear velocity commands are executed concurrently during the interaction, the last term is designed to minimize the difference between the Q-value estimates corresponding to the two commands.…”
Section: Training Framework (citation type: mentioning)
confidence: 99%
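
The body of Equation (13) is not reproduced in this excerpt. A plausible form consistent with the description above, assuming separate Q-value heads Q_ang and Q_lin for the angular- and linear-velocity commands with target values y_ang and y_lin from the target network (these symbols are assumptions, not necessarily the cited paper's notation), would be:

\mathcal{L} = \alpha_1 \bigl( y_{\mathrm{ang}} - Q_{\mathrm{ang}}(s, a_{\mathrm{ang}}; \theta) \bigr)^2
            + \alpha_2 \bigl( y_{\mathrm{lin}} - Q_{\mathrm{lin}}(s, a_{\mathrm{lin}}; \theta) \bigr)^2
            + \alpha_3 \bigl( Q_{\mathrm{ang}}(s, a_{\mathrm{ang}}; \theta) - Q_{\mathrm{lin}}(s, a_{\mathrm{lin}}; \theta) \bigr)^2

Here the first two squared errors pull each head toward its target, and the third penalizes disagreement between the two heads' Q-value estimates, matching the three-term structure the quote describes.
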
“…Learning-based planning algorithms have become increasingly common [23], [24], [25], [26]. In most learning-based path planning algorithms, imitation learning [27] plays a key role [28]. Neural networks have been used to improve the classic algorithms, for instance by adaptively sampling a particular region of a configuration space in sampling-based algorithms [29].…”
Section: Related Work (citation type: mentioning)
confidence: 99%
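
As a toy illustration (not taken from any of the cited works) of the adaptive-sampling idea mentioned above, the sketch below biases a sampling-based planner's random samples toward a region that a learned model might propose; the region center, spread, and mixing probability are all illustrative assumptions.

# Toy sketch: biased sampling for a sampling-based planner over the unit square.
import numpy as np

rng = np.random.default_rng(0)

def biased_sample(region_center, region_std=0.1, p_region=0.5):
    """With probability p_region, sample near the model-suggested region;
    otherwise fall back to uniform sampling over the free space."""
    if rng.random() < p_region:
        return np.clip(rng.normal(region_center, region_std), 0.0, 1.0)
    return rng.uniform(0.0, 1.0, size=2)

# e.g. a learned model suggests concentrating samples around (0.7, 0.3):
samples = [biased_sample(np.array([0.7, 0.3])) for _ in range(100)]
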
“…Reinforcement Learning (RL) approaches such as value iteration networks (VINs) [13], [36], learning-from-demonstration (LfD) [37], guided policy search (GPS) [36], and universal planning networks (UPN) [38] have also been used for path planning. Wu et al. present the three-dimensional path planning network (TDPP-Net) [28], an end-to-end network that predicts 3D actions via 2D CNNs. TDPP-Net learns a policy via supervised imitation learning from Dijkstra's algorithm.…”
Section: Related Work (citation type: mentioning)
confidence: 99%
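
The quote above describes the core recipe of supervised imitation learning from Dijkstra's algorithm. The sketch below shows that recipe in minimal form on a 2D grid: Dijkstra provides optimal next-step action labels, and a small CNN is trained by cross-entropy to imitate them. The network shape, map encoding, and 4-action set are illustrative assumptions, not TDPP-Net's actual 3D architecture.

# Minimal sketch: imitation learning from a Dijkstra expert on a 2D grid.
import heapq
import numpy as np
import torch
import torch.nn as nn

ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right

def dijkstra_field(grid, goal):
    """Distance-to-goal field via Dijkstra; cells with value 1 are obstacles."""
    h, w = grid.shape
    dist = np.full((h, w), np.inf)
    dist[goal] = 0.0
    pq = [(0.0, goal)]
    while pq:
        d, (r, c) = heapq.heappop(pq)
        if d > dist[r, c]:
            continue
        for dr, dc in ACTIONS:
            nr, nc = r + dr, c + dc
            if 0 <= nr < h and 0 <= nc < w and grid[nr, nc] == 0 and d + 1.0 < dist[nr, nc]:
                dist[nr, nc] = d + 1.0
                heapq.heappush(pq, (d + 1.0, (nr, nc)))
    return dist

def expert_action(dist, pos):
    """Greedy descent on the Dijkstra field gives the optimal action label."""
    candidates = []
    for i, (dr, dc) in enumerate(ACTIONS):
        nr, nc = pos[0] + dr, pos[1] + dc
        if 0 <= nr < dist.shape[0] and 0 <= nc < dist.shape[1]:
            candidates.append((dist[nr, nc], i))
    return min(candidates)[1]

# Small CNN mapping (obstacle map, goal map, position map) -> action logits.
net = nn.Sequential(
    nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
    nn.Conv2d(16, 16, 3, padding=1), nn.ReLU(),
    nn.Flatten(), nn.Linear(16 * 8 * 8, len(ACTIONS)),
)
opt = torch.optim.Adam(net.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

for step in range(200):  # imitation: cross-entropy against expert labels
    grid = (np.random.rand(8, 8) < 0.2).astype(np.float32)
    goal = (np.random.randint(8), np.random.randint(8))
    grid[goal] = 0.0
    dist = dijkstra_field(grid, goal)
    free = np.argwhere((grid == 0) & np.isfinite(dist))
    pos = tuple(free[np.random.randint(len(free))])
    if pos == goal:
        continue
    label = expert_action(dist, pos)
    goal_map = np.zeros_like(grid); goal_map[goal] = 1.0
    pos_map = np.zeros_like(grid); pos_map[pos] = 1.0
    x = torch.from_numpy(np.stack([grid, goal_map, pos_map]))[None]
    loss = loss_fn(net(x), torch.tensor([label]))
    opt.zero_grad(); loss.backward(); opt.step()

At test time the trained network replaces the expert: the agent repeatedly feeds in the current maps and executes the argmax action, without running Dijkstra online.
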