2019
DOI: 10.1109/access.2019.2929120
Target Search Control of AUV in Underwater Environment With Deep Reinforcement Learning

Abstract: The autonomous underwater vehicle (AUV) is widely used to search for unknown targets in the complex underwater environment. Due to the unpredictability of the underwater environment, this paper combines the traditional frontier exploration method with deep reinforcement learning (DRL) to enable the AUV to explore the unknown underwater environment autonomously. In this paper, a grid map of the search environment is built by the grid method. The designed asynchronous advantage actor-critic (A3C) network structu…
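The frontier-exploration step mentioned in the abstract can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: the grid encoding (0 = free, 1 = occupied, -1 = unknown) and the function name `find_frontiers` are assumptions. A frontier cell is a known-free cell bordering unexplored space, which the exploration policy can select as a candidate goal.

```python
import numpy as np

# Assumed occupancy-grid convention (not specified in the paper):
# 0 = free, 1 = occupied, -1 = unknown.
FREE, OCCUPIED, UNKNOWN = 0, 1, -1

def find_frontiers(grid):
    """Return (row, col) cells that are free and 4-adjacent to unknown space."""
    frontiers = []
    rows, cols = grid.shape
    for r in range(rows):
        for c in range(cols):
            if grid[r, c] != FREE:
                continue
            for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                nr, nc = r + dr, c + dc
                if 0 <= nr < rows and 0 <= nc < cols and grid[nr, nc] == UNKNOWN:
                    frontiers.append((r, c))
                    break
    return frontiers

grid = np.array([
    [ 0,  0, -1],
    [ 0,  1, -1],
    [ 0,  0,  0],
])
print(find_frontiers(grid))  # → [(0, 1), (2, 2)]
```

In a frontier-plus-DRL scheme of the kind the abstract describes, frontier detection supplies exploration goals on the grid map while the learned policy handles low-level motion toward them.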

Cited by 51 publications (22 citation statements)
References 34 publications
“…At the same time, DRL and dual-stream Q-learning are applied to AUV obstacle avoidance and navigation to further optimize the search path. The simulation results show that the method can effectively control the AUV to explore unknown environments [104].…”
Section: Human-inspired Algorithms
confidence: 96%
“…It can improve the AUV's intelligence level and realize the path planning of the AUV in complex and unknown environments. In recent years, RL methods (such as Q-learning [99,100] and Sarsa [19]) or deep reinforcement learning methods [104] have achieved excellent results in AUV path planning. In 2018, Cao et al used reinforcement learning and Gaussian process regression to solve the path planning with bathymetric aids and modelled the value function as a Gaussian process to minimize the location uncertainty when the AUV reaches the target point [113].…”
Section: Direction C: Intelligent Path Planning Algorithms
confidence: 99%
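The citation above groups tabular RL methods such as Q-learning and Sarsa with DRL for AUV path planning. The core of tabular Q-learning is a single update rule; the sketch below shows that rule in isolation, with hyperparameter values and function names chosen for illustration (it is not Cao et al.'s Gaussian-process approach).

```python
import random

# Illustrative hyperparameters (assumptions, not taken from the cited papers).
ALPHA, GAMMA, EPSILON = 0.1, 0.95, 0.1
ACTIONS = ["N", "S", "E", "W"]

def choose_action(Q, state):
    """Epsilon-greedy action selection over a dict-backed Q table."""
    if random.random() < EPSILON:
        return random.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: Q.get((state, a), 0.0))

def q_update(Q, state, action, reward, next_state):
    """One Q-learning step: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max(Q.get((next_state, a), 0.0) for a in ACTIONS)
    old = Q.get((state, action), 0.0)
    Q[(state, action)] = old + ALPHA * (reward + GAMMA * best_next - old)
    return Q[(state, action)]
```

Sarsa differs only in replacing the `max` over next actions with the Q-value of the action actually taken, which makes it on-policy; DRL methods replace the table with a neural network approximator.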
“…The training process uses the DQL algorithm. We have done a preliminary study on the DRL algorithm for path planning, and its specific working process is shown in [39].…”
Section: Path Planning of Hunting
confidence: 99%
“…They are represented by dots in different colors. The original position of the target is (39,41,11). It's represented by a purple star.…”
Section: A. Target Hunting When AUV Is the Same Velocity As the Target
confidence: 99%