2021 6th International Conference on Robotics and Automation Engineering (ICRAE) 2021
DOI: 10.1109/icrae53653.2021.9657767

Deep Reinforcement Learning for Flocking Control of UAVs in Complex Environments

Citations: cited by 6 publications (7 citation statements)
References: 57 publications
“…The ‘Machine learning’ method can be used for aggregation (Trianni et al, 2003), flocking (Baldassarre et al, 2003; Salimi & Pasquier, 2021) and pattern formation (Sharma et al, 2022). It is reasonable to assume that the other behaviours can be obtained with the right fitness function or the correct reward mechanism.…”
Section: Discussion (mentioning)
confidence: 99%
“…Deep learning is used to help the robot find the best policy by representing this policy with a neural network of which the weights are adjusted through time. Flocking (Salimi & Pasquier, 2021) and pattern formation (Sharma et al, 2022) are two examples of collective behaviours that can be obtained with this method and process.…”
Section: Self-organisation Methods (mentioning)
confidence: 99%
“…Policy Gradients DDPG PPO Other [23,24,39,40,59, (QMIX), [38] (DDQN), [93] (DDQN), [94] (DDQN), [95] (DQN), [96] (DQN) [46][47][48][49][50][97][98][99][100][101][102][103][104][105][106], [22,[52][53][54][55][56][57] [86,88,129-140] (TRPO), [141] (TRPO), [81] (TRPO), [142] (TD3), [143] (SAC), [144] (SAC)…”
Section: Q-network (mentioning)
confidence: 99%
“…Ground Robots Manipulators [23,24,38,46,53,56,57,68,76,83,85,91,104,106,110,110,120,123,134,135,138,139,141,142,161,167,171,172,186,205,207,[218][219][220]230,240,269] [ 22,39,49,52,54,55,59,70,71,[73][74][75][77][78][79]…”
Section: Aerial Robots (mentioning)
confidence: 99%