2023
DOI: 10.1016/j.oceaneng.2023.113811
|View full text |Cite
|
Sign up to set email alerts
|

Towards end-to-end formation control for robotic fish via deep reinforcement learning with non-expert imitation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 24 publications
0
3
0
Order By: Relevance
“…Compared to bionic action control tasks and motion control tasks, decision making tasks for bionic underwater robots are more diverse, such as searching [ 69 ], obstacle avoidance [ 115 ], formation control [ 116 , 117 ], and other swarm strategies [ 118 , 119 , 120 ]. The majority of current research on RL-based decision making for bionic underwater robots is conducted in simulation environments.…”
Section: Rl-based Methods In Task Spaces Of Bionic Underwater Robotsmentioning
confidence: 99%
See 2 more Smart Citations
“…Compared to bionic action control tasks and motion control tasks, decision making tasks for bionic underwater robots are more diverse, such as searching [ 69 ], obstacle avoidance [ 115 ], formation control [ 116 , 117 ], and other swarm strategies [ 118 , 119 , 120 ]. The majority of current research on RL-based decision making for bionic underwater robots is conducted in simulation environments.…”
Section: Rl-based Methods In Task Spaces Of Bionic Underwater Robotsmentioning
confidence: 99%
“…For decentralized circle formation control for fish-like robots, a new MARL method based on value decomposition networks (VDN) was proposed [ 16 ], and the cognitive consistency of multi-agents realized by parameter sharing and the centralized training mechanism with decentralized execution is an important factor in the effective formation of control methods. A dueling double DQN (D3QN)-based approach in the leader–follower topology was proposed for end-to-end formation control [ 117 ], and the blindness of agent exploration at the beginning of training was reduced through imitation learning. Similarly, ref.…”
Section: Rl-based Methods In Task Spaces Of Bionic Underwater Robotsmentioning
confidence: 99%
See 1 more Smart Citation
“…Expanding this concept, Ho J and team introduced the Generative Adversarial Imitation Learning (GAIL) algo-rithm [20], adeptly selecting trajectories closely mirroring human demonstrations, thereby curtailing inefficient maneuvers and enhancing the training process's speed. In a similar vein, Sun Y et al ingeniously melded DQN with behavioral cloning to develop the D3QN algorithm [21], markedly diminishing exploration randomness in initial training phases. Furthermore, Peng X B et al devised the deep mimic approach [22], ingeniously segmenting the reward function into an aggregate of imitation-based exponential components, thereby refining the reinforcement learning process.…”
Section: Introductionmentioning
confidence: 99%