2023
DOI: 10.3390/math11020405
|View full text |Cite
|
Sign up to set email alerts
|

Multiple UAVs Path Planning Based on Deep Reinforcement Learning in Communication Denial Environment

Abstract: In this paper, we propose a C51-Duel-IP (C51 Dueling DQN with Independent Policy) dynamic destination path-planning algorithm to solve the problem of autonomous navigation and avoidance of multiple Unmanned Aerial Vehicles (UAVs) in the communication denial environment. Our proposed algorithm expresses the Q function output by the Dueling network as a Q distribution, which improves the fitting ability of the Q value. We also extend the single-step temporal differential (TD) to the N-step timing differential, w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(12 citation statements)
references
References 22 publications
0
2
0
Order By: Relevance
“…This can be described as a constraint that ensures that each point in the region of interest is a specific distance from the drone path. UAV must avoid collisions with barriers to comply with the obstacle avoidance limitation [36]. This can be described as a limitation that prevents the UAV's path from crossing any obstacles.…”
Section: Max Iterations 1000mentioning
confidence: 99%
“…This can be described as a constraint that ensures that each point in the region of interest is a specific distance from the drone path. UAV must avoid collisions with barriers to comply with the obstacle avoidance limitation [36]. This can be described as a limitation that prevents the UAV's path from crossing any obstacles.…”
Section: Max Iterations 1000mentioning
confidence: 99%
“…indicates the randomness of the policy and is calculated as shown in Equation (2). γ is the discount factor, which indicates the length of time in the future to be considered.…”
Section: Sac Reinforcement-learning Algorithm Based On Security Const...mentioning
confidence: 99%
“…Deep reinforcement learning is an algorithm that integrates deep neural networks and reinforcement learning to solve complex decision-making tasks [2]. With the fitting ability of deep neural networks, it solves the mapping from observed features to strategy and value functions, and at the same time, it uses reinforcement-learning algorithms to define the optimization problem and optimization objective and continuously improves the decision-making ability of the agent in the process of interacting with information from the environment.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…As one of the DRL algorithms, the Deep Q-Network (DQN) algorithm is a method to approximate the Q-learning function through a neural network. DQN methods have been increasingly applied in the field of path planning, and several brilliant algorithms based on it have been put forward [22][23][24][25]. Yin Cheng et al [26] have developed a concise DRL obstacle-avoidance algorithm that designed a comprehensive reward function for behaviors such as obstacle avoidance, target approach, speed correction, and attitude correction in dynamic environments, using the deep Q-network (DQN) architecture, to overcome the usability issue caused by the complicated control law in the traditional analytic approach.…”
Section: Introductionmentioning
confidence: 99%