Multiple UAVs Path Planning Based on Deep Reinforcement Learning in Communication Denial Environment

Xu, Yahao; Wei, Yiran; Jiang, Keyang; Wang, Di; Deng, Hongbin

doi:10.3390/math11020405

Cited by 13 publications

(12 citation statements)

References 22 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…This can be described as a constraint that ensures that each point in the region of interest is a specific distance from the drone path. UAV must avoid collisions with barriers to comply with the obstacle avoidance limitation [36]. This can be described as a limitation that prevents the UAV's path from crossing any obstacles.…”

Section: Max Iterations 1000mentioning

confidence: 99%

Energy Efficient Path Planning Scheme for Unmanned Aerial Vehicle Using Hybrid Generic Algorithm-Based Q-Learning Optimization

Saeed,

Ali,

Abdelhaq

et al. 2024

IEEE Access

View full text Add to dashboard Cite

Efficient path planning optimization strategies are required to maximize flying time while consuming the least energy. This research offers a novel approach for energy-efficient path planning for Unmanned Aerial Vehicles (UAVs) that combines a hybrid evolutionary algorithm and Q-learning while accounting for the UAV's velocity and distance from obstacles. To overcome the constraints of traditional optimization approaches, the hybrid methodology combines genetic algorithms and Q-learning. The suggested approach optimizes path-planning decisions based on realtime information by considering the UAV's velocity and distance from obstacles. Genetic Algorithm (GA) creates a wide collection of candidate pathways. In contrast, Q-learning uses reinforcement learning to make educated selections based on the UAV's present velocity and proximity to static obstacles. This integration allows the UAV to modify its path dynamically based on its energy requirements and environmental constraints. The main goal is to develop a UAV path planning scheme capable of dealing with obstacle-filled environments to improve energy efficiency and collision avoidance during flight missions. Our experimental results show that the hybrid technique outperforms the classical GA method in terms of energy efficiency by significantly reducing energy consumption while maintaining a suitable collision rate and the best path cost to the desired locations. The analysis results improve the performance of the hybrid GA/QL algorithm by more than 57.14% compared to classical GA.

show abstract

Section: Max Iterations 1000mentioning

confidence: 99%

Energy Efficient Path Planning Scheme for Unmanned Aerial Vehicle Using Hybrid Generic Algorithm-Based Q-Learning Optimization

Saeed,

Ali,

Abdelhaq

et al. 2024

IEEE Access

View full text Add to dashboard Cite

show abstract

“…indicates the randomness of the policy and is calculated as shown in Equation (2). γ is the discount factor, which indicates the length of time in the future to be considered.…”

Section: Sac Reinforcement-learning Algorithm Based On Security Const...mentioning

confidence: 99%

“…Deep reinforcement learning is an algorithm that integrates deep neural networks and reinforcement learning to solve complex decision-making tasks [2]. With the fitting ability of deep neural networks, it solves the mapping from observed features to strategy and value functions, and at the same time, it uses reinforcement-learning algorithms to define the optimization problem and optimization objective and continuously improves the decision-making ability of the agent in the process of interacting with information from the environment.…”

Section: Introductionmentioning

confidence: 99%

“…(1) We use fast path-generation algorithm to control the robot to generate expert trajectories, combine SAC reinforcement learning with imitation learning based on expert trajectories to solve the problem of the poor navigation ability of the agent in the initial state, and improve the training safety and convergence speed of reinforcement learning. (2) Considering that the priority of expert trajectory data is higher than that of agent trajectory data, we improve the priority calculation method on the basis of the TDerror (time-difference error) priority replay technique, which improves the utilization efficiency of the data in the experience replay pool. (3) We introduced an RNN to improve the obstacle-avoidance ability of SAC navigation policy in a dynamic environment.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

A Soft Actor-Critic Deep Reinforcement-Learning-Based Robot Navigation Method Using LiDAR

Liu,

Wang,

Zhao

et al. 2024

Remote Sensing

View full text Add to dashboard Cite

When there are dynamic obstacles in the environment, it is difficult for traditional path-generation algorithms to achieve desired obstacle-avoidance results. To solve this problem, we propose a robot navigation control method based on SAC (Soft Actor-Critic) Deep Reinforcement Learning. Firstly, we use a fast path-generation algorithm to control the robot to generate expert trajectories when the robot encounters danger as well as when it approaches a target, and we combine SAC reinforcement learning with imitation learning based on expert trajectories to improve the safety of training. Then, for the hybrid data consisting of agent data and expert data, we use an improved prioritized experience replay method to improve the learning efficiency of the policies. Finally, we introduce RNN (Recurrent Neural Network) units into the network structure of the SAC Deep Reinforcement-Learning navigation policy to improve the agent’s transfer inference ability in a new environment and obstacle-avoidance ability in dynamic environments. Through simulation and practical experiments, it is fully verified that our method has a higher training efficiency and navigation success rate compared to state-of-the-art reinforcement-learning algorithms, which further enhances the obstacle-avoidance capability of the robot system.

show abstract

“…As one of the DRL algorithms, the Deep Q-Network (DQN) algorithm is a method to approximate the Q-learning function through a neural network. DQN methods have been increasingly applied in the field of path planning, and several brilliant algorithms based on it have been put forward [22][23][24][25]. Yin Cheng et al [26] have developed a concise DRL obstacle-avoidance algorithm that designed a comprehensive reward function for behaviors such as obstacle avoidance, target approach, speed correction, and attitude correction in dynamic environments, using the deep Q-network (DQN) architecture, to overcome the usability issue caused by the complicated control law in the traditional analytic approach.…”

Section: Introductionmentioning

confidence: 99%

A Stealth–Distance Dynamic Weight Deep Q-Network Algorithm for Three-Dimensional Path Planning of Unmanned Aerial Helicopter

Wang

Huang

2023

Aerospace

View full text Add to dashboard Cite

Unmanned aerial helicopters (UAHs) have been widely used recently for reconnaissance operations and other risky missions. Meanwhile, the threats to UAHs have been becoming more and more serious, mainly from radar and flights. It is essential for a UAH to select a safe flight path, as well as proper flying attitudes, to evade detection operations, and the stealth abilities of the UAH can be helpful for this. In this paper, a stealth–distance dynamic weight Deep Q-Network (SDDW-DQN) algorithm is proposed for path planning in a UAH. Additionally, the dynamic weight is applied in the reward function, which can reflect the priorities of target distance and stealth in different flight states. For the path-planning simulation, the dynamic model of UAHs and the guidance model of flight are put forward, and the stealth model of UAHs, including the radar cross-section (RCS) and the infrared radiation (IR) intensity of UAHs, is established. The simulation results show that the SDDW-DQN algorithm can be helpful in the evasion by UAHs of radar detection and flight operations, and the dynamic weight can contribute to better path-planning results.

show abstract

Multiple UAVs Path Planning Based on Deep Reinforcement Learning in Communication Denial Environment

Cited by 13 publications

References 22 publications

Energy Efficient Path Planning Scheme for Unmanned Aerial Vehicle Using Hybrid Generic Algorithm-Based Q-Learning Optimization

Energy Efficient Path Planning Scheme for Unmanned Aerial Vehicle Using Hybrid Generic Algorithm-Based Q-Learning Optimization

A Soft Actor-Critic Deep Reinforcement-Learning-Based Robot Navigation Method Using LiDAR

A Stealth–Distance Dynamic Weight Deep Q-Network Algorithm for Three-Dimensional Path Planning of Unmanned Aerial Helicopter

Contact Info

Product

Resources

About