2022 30th Mediterranean Conference on Control and Automation (MED), 2022
DOI: 10.1109/med54222.2022.9837220

A Deep Reinforcement Learning Motion Control Strategy of a Multi-rotor UAV for Payload Transportation with Minimum Swing

Citations: cited by 10 publications (5 citation statements)
References: 18 publications
“…For example, in [12] the chosen discount factor is close to 1. Li et al [57] and Hu and Wang [53] chose a value of 0.9, and Castro et al [66] and Panetsos et al [60] chose a discount factor value of 0.99.…”
Section: Value-function-based Algorithms (mentioning)
confidence: 99%
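
To make the difference between the quoted discount factors concrete, the following is a minimal sketch and is not taken from any of the cited papers. It uses the common rule of thumb that a discount factor gamma weights rewards over roughly 1 / (1 - gamma) steps. The function names `effective_horizon` and `discounted_return` are illustrative assumptions.

```python
from typing import Sequence


def effective_horizon(gamma: float) -> float:
    """Rule-of-thumb horizon 1 / (1 - gamma) for a discount factor in [0, 1)."""
    if not 0.0 <= gamma < 1.0:
        raise ValueError("gamma must satisfy 0 <= gamma < 1")
    return 1.0 / (1.0 - gamma)


def discounted_return(rewards: Sequence[float], gamma: float) -> float:
    """Sum of gamma**t * r_t, accumulated backwards through the episode."""
    total = 0.0
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


if __name__ == "__main__":
    # Compare the two values mentioned in the citation statement above.
    for gamma in (0.9, 0.99):
        print(gamma, effective_horizon(gamma))
```

Under this rule of thumb, 0.9 corresponds to a horizon of about 10 steps and 0.99 to about 100 steps, so the larger value gives noticeably more weight to long-term rewards.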
“…Panetsos et al [60] offer a solution to the payload transportation challenge using a DRL approach. An attitude PID controller is used in the inner loop of the cascaded controller structure, while a position controller in the outer loop is replaced with a TD3-based DRL algorithm.…”
Section: Actor-critic Algorithms (mentioning)
confidence: 99%
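
The cascaded structure described in the statement above can be sketched in one dimension as follows. This is an illustrative assumption and not the authors' implementation: `placeholder_policy` is a hypothetical stand-in for a trained TD3 actor, and the gains, limits, and sign conventions are arbitrary.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class PID:
    """Textbook PID controller, used here as the inner attitude loop."""
    kp: float
    ki: float
    kd: float
    integral: float = 0.0
    prev_error: float = 0.0

    def update(self, error: float, dt: float) -> float:
        self.integral += error * dt
        derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def placeholder_policy(position_error: float) -> float:
    """Hypothetical stand-in for a learned actor: maps a position error to a
    bounded pitch setpoint (radians). A trained network would replace this."""
    return max(-0.3, min(0.3, 0.1 * position_error))


def cascade_step(policy: Callable[[float], float], inner: PID,
                 position_error: float, pitch: float, dt: float) -> float:
    """Outer loop (policy) produces an attitude setpoint; the inner PID
    turns the attitude error into a command."""
    pitch_ref = policy(position_error)
    return inner.update(pitch_ref - pitch, dt)


if __name__ == "__main__":
    pid = PID(kp=2.0, ki=0.0, kd=0.0)
    print(cascade_step(placeholder_policy, pid, position_error=1.0, pitch=0.0, dt=0.01))
```

The point of the split is that the learned component only has to produce setpoints, while the conventional inner loop keeps responsibility for attitude stabilization.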
“…
No connection between UAV and ground: [47, 48, 50, 51, 53, 54, 55, 56, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 71, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 86, 87, 88, 90, 91, 129, 134, 137, 140, 143]
Connection for mechanical purposes during part of the operation: [193] (only during recovery of payload), [119, 131]
Very high operating altitude (>1 km): [147, 148, 175]
Short-term missions (inspection): [133, 163]
…”
Section: Reason For Not Transferring Power Over the Tether Publications (mentioning)
confidence: 99%
“…Model-free reinforcement learning (RL) has shown to have benefits in these situations as its sample-based properties allow for robust trajectory generation even if the model of the system is unknown [7]. Only a few RL-based solutions for the specific case of a RUAV and suspended payload are found in the literature [9,10,11,12,13].…”
Section: Introduction (mentioning)
confidence: 99%
“…The state-of-the-art deep RL algorithm for continuous autonomous control is currently the twin-delayed deep deterministic policy gradient (TD3) [14]. This algorithm has been applied to RUAV navigation [15,16], but, to the best of our knowledge, only one application to a multi-rotor with a suspended payload is available [13]. The TD3 agent is used to replace the position PID controller as a swing-minimizing controller and shows good tracking of a variety of 3D waypoints with significant swing reduction.…”
Section: Introduction (mentioning)
confidence: 99%