2021
DOI: 10.3390/aerospace8090258
|View full text |Cite
|
Sign up to set email alerts
|

Unmanned Aerial Vehicle Pitch Control under Delay Using Deep Reinforcement Learning with Continuous Action in Wind Tunnel Test

Abstract: Nonlinear flight controllers for fixed-wing unmanned aerial vehicles (UAVs) can potentially be developed using deep reinforcement learning. However, there is often a reality gap between the simulation models used to train these controllers and the real world. This study experimentally investigated the application of deep reinforcement learning to the pitch control of a UAV in wind tunnel tests, with a particular focus of investigating the effect of time delays on flight controller performance. Multiple neural … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 18 publications
(22 reference statements)
0
3
0
Order By: Relevance
“…In the future, the proposed controller will be used to conduct hardware-inthe-loop simulation experiments to further verify the stability of the algorithm. However, the gap between the simulation and the real world presents additional challenges, such as the oscillation of the controller [34]. DQN…”
Section: Discussionmentioning
confidence: 99%
“…In the future, the proposed controller will be used to conduct hardware-inthe-loop simulation experiments to further verify the stability of the algorithm. However, the gap between the simulation and the real world presents additional challenges, such as the oscillation of the controller [34]. DQN…”
Section: Discussionmentioning
confidence: 99%
“…then, the disturbance observation error of observer can converge to 0 within a fixed time T 1 , and the convergence time is independent of the initial state. (18) where ε p > 0.…”
Section: Controller Design and Stability Analysismentioning
confidence: 99%
“…A sliding mode control method was discussed in [17], and finite-time convergence of the quadrotor trajectory tracking was proved by using Lyapunov theory. In addition, some learning-based control techniques have been implemented on the UAV, such as reinforcement learning [18] and deep learning [19,20].…”
Section: Introductionmentioning
confidence: 99%