2021
DOI: 10.1109/access.2021.3073704
UAV Path Planning Based on Multi-Layer Reinforcement Learning Technique

Abstract: Unmanned aerial vehicles (UAVs) have been widely used in many applications due to their small size, swift mobility and low cost. Therefore, the study of guidance, navigation and control (GNC) systems for UAVs has become a popular research direction. Path planning plays an important role in the GNC system. In this paper, a multi-layer path planning algorithm based on the reinforcement learning (RL) technique is proposed. Compared to classic Q-learning, the proposed multi-layer algorithm has a distinct advantage th…
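The abstract is truncated before the algorithmic details, so the sketch below does not reproduce the paper's method. As a point of reference for the comparison the abstract draws, here is a minimal tabular Q-learning baseline on a hypothetical grid world; the grid size, obstacle set, rewards and hyperparameters are illustrative assumptions, not the paper's experimental setup.

```python
import numpy as np

# Minimal tabular Q-learning path planner on a grid world (baseline sketch).
# GRID, OBSTACLES, rewards and hyperparameters are illustrative assumptions.
GRID = 10
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]      # up, down, left, right
START, GOAL = (0, 0), (GRID - 1, GRID - 1)
OBSTACLES = {(3, 3), (3, 4), (6, 7)}
ALPHA, GAMMA, EPSILON = 0.1, 0.95, 0.1            # step size, discount, exploration

Q = np.zeros((GRID, GRID, len(ACTIONS)))          # the Q-table

def step(state, a):
    """Apply action a; return (next_state, reward)."""
    r, c = state[0] + ACTIONS[a][0], state[1] + ACTIONS[a][1]
    if not (0 <= r < GRID and 0 <= c < GRID) or (r, c) in OBSTACLES:
        return state, -10.0                       # blocked: stay put, penalty
    return (r, c), (100.0 if (r, c) == GOAL else -1.0)

rng = np.random.default_rng(0)
for _ in range(2000):                             # training episodes
    s = START
    for _ in range(500):                          # cap episode length
        a = (int(rng.integers(len(ACTIONS))) if rng.random() < EPSILON
             else int(np.argmax(Q[s[0], s[1]])))
        s2, reward = step(s, a)
        # one-step Q-learning update
        Q[s[0], s[1], a] += ALPHA * (reward + GAMMA * np.max(Q[s2[0], s2[1]])
                                     - Q[s[0], s[1], a])
        s = s2
        if s == GOAL:
            break
```

A greedy rollout over the learned Q-table then yields the planned path. Per the abstract, the paper's multi-layer scheme is positioned as an improvement over this kind of flat Q-learning, though the specifics lie behind the truncation.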

Cited by 49 publications (18 citation statements)
References 36 publications
“…Yao and Zhao [20] suggested the model predictive control algorithm to search for optimal or sub-optimal collision-free trajectories for a UAV in the midst of dynamic obstacle conditions. However, Besada-Portas et al. [18], Cui and Wang [19], and Yao and Zhao [20] have shown software-based computer simulation results of UAVs in their studies. Heidari and Saska [9] developed a heuristic-approach-based open-loop control system to select optimal values of dynamic control parameters, such as the thrust force and torque of a quadcopter, for trajectory optimisation.…”
Section: Background and Literature Review
confidence: 99%
“…Comparative performance analysis of different multiobjective evolutionary algorithms as path planners for UAVs has been studied by Besada-Portas et al. [18]. Cui and Wang [19] implemented the reinforcement learning algorithm to design local and global path planners for a UAV among static and moving obstacles. Yao and Zhao [20] suggested the model predictive control algorithm to search for optimal or sub-optimal collision-free trajectories for a UAV in the midst of dynamic obstacle conditions.…”
Section: Background and Literature Review
confidence: 99%
“…In the literature, there are several proposals to address path planning problems using reinforcement learning [11], [12]. Some proposals based on reinforcement learning have been combined with other techniques to improve performance.…”
Section: Related Work
confidence: 99%
“…Some maps oversimplify the environment representation: the map is divided into a grid of equally-sized smaller cells that store information about the environment [68,73,93]. Others oversimplify the environment's structure by simplifying object representation or by using 1D/2D models to represent the environment [34,36,39,41,47,55,65,67,74,86,87].…”
Section: Map-based Navigation
confidence: 99%
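The quoted passage describes grid maps that divide the environment into equally-sized cells, each storing local information. A minimal sketch of such a structure follows; the class name, the metric parameters, and the binary free/occupied payload are assumptions for illustration, since real maps may store costs or semantic labels per cell.

```python
import numpy as np

# Minimal occupancy-grid sketch: the environment is divided into
# equally-sized cells, each storing information about its region
# (here just free/occupied). Names and parameters are hypothetical.
class OccupancyGrid:
    def __init__(self, width_m, height_m, resolution_m):
        self.res = resolution_m
        self.cells = np.zeros(
            (int(height_m / resolution_m), int(width_m / resolution_m)),
            dtype=np.uint8,                 # 0 = free, 1 = occupied
        )

    def world_to_cell(self, x, y):
        """Map continuous world coordinates to a grid-cell index."""
        return int(y / self.res), int(x / self.res)

    def mark_obstacle(self, x, y):
        self.cells[self.world_to_cell(x, y)] = 1

    def is_free(self, x, y):
        return self.cells[self.world_to_cell(x, y)] == 0

# Usage: a 20 m x 20 m area discretized at 0.5 m per cell
grid = OccupancyGrid(20.0, 20.0, 0.5)
grid.mark_obstacle(5.2, 7.9)
assert not grid.is_free(5.2, 7.9)
```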
“…• Step 1 - Define State Type: When assessing an RL task, it is essential to comprehend the state that can be obtained from the surrounding environment. For instance, some navigation tasks simplify the environment's states using grid-cell representations [68,73,93], where the agent has a limited and predetermined set of states, whereas in other tasks the environment can have unlimited states [34,38,40]. Therefore, this step involves a decision between limited vs. unlimited states.…”
Section: Problem Formulation and Algorithm Selection
confidence: 99%
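The limited-vs-unlimited distinction in that quoted Step 1 can be made concrete. The sketch below contrasts a grid-cell state (a finite index, amenable to tabular methods like the Q-table above) with a raw continuous feature vector (unbounded, requiring function approximation); the function names, the cell size, and the four-feature layout are hypothetical.

```python
import numpy as np

# Two representations of the same hypothetical UAV state; only the
# encoding handed to the RL agent differs. Names are illustrative.
GRID, CELL = 10, 1.0   # assumed grid width (cells) and cell size (m)

def discrete_state(x, y):
    """Limited states: one integer per grid cell, in {0, ..., GRID*GRID - 1}.
    A finite state set permits tabular methods such as Q-learning."""
    return int(y / CELL) * GRID + int(x / CELL)

def continuous_state(x, y, vx, vy):
    """Unlimited states: a raw feature vector over position and velocity.
    An unbounded state space calls for function approximation
    (e.g. a neural network) instead of a lookup table."""
    return np.array([x, y, vx, vy], dtype=np.float32)

print(discrete_state(3.4, 7.9))                   # -> 73
print(continuous_state(3.4, 7.9, 0.5, -0.2))      # -> [3.4 7.9 0.5 -0.2]
```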