2020
DOI: 10.48550/arxiv.2008.07240
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Model-Reference Reinforcement Learning for Collision-Free Tracking Control of Autonomous Surface Vehicles

Abstract: This paper presents a novel model-reference reinforcement learning algorithm for the intelligent tracking control of uncertain autonomous surface vehicles with collision avoidance. The proposed control algorithm combines a conventional control method with reinforcement learning to enhance control accuracy and intelligence. In the proposed control design, a nominal system is considered for the design of a baseline tracking controller using a conventional control approach. The nominal system also defines the des… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
1

Relationship

3
1

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 56 publications
0
5
0
Order By: Relevance
“…Motivated by the works in [20], [25], [26], [27], we propose to incorporate the theoretical result in Theorem 1 to formulate a constrained optimisation problem, based on SAC [17]. First of all, a Lyapunov candidate needs to be selected at the first instance.…”
Section: Lyapunov-based Reinforcement Learningmentioning
confidence: 99%
“…Motivated by the works in [20], [25], [26], [27], we propose to incorporate the theoretical result in Theorem 1 to formulate a constrained optimisation problem, based on SAC [17]. First of all, a Lyapunov candidate needs to be selected at the first instance.…”
Section: Lyapunov-based Reinforcement Learningmentioning
confidence: 99%
“…The convergence guarantee can be validated by checking the values of Lagrange multipliers. When the Lyapunov constraint in (28) is satisfied, the parameter λ should continuously decrease to zero. In Fig.…”
Section: A Algorithm Convergencementioning
confidence: 99%
“…Only until recently, the asymptotic stability in model-free RL is given for robotic control tasks [26]. In [27], [28], the stability of a system with a combination of a classic baseline controller and a RL controller is proved for autonomous surface vehicles with collisions.…”
Section: Introductionmentioning
confidence: 99%
“…But it is used to guarantee the safety of the agent in the training instead of the stability. To guarantee stability, Zhang et al in Zhang, Dong and Pan (2020), Zhang, Pan and Reppa (2020) respectively proposed basic control based SAC algorithms for ships and multi-agents. Furthermore, Lyapunov-based soft actor-critic algorithms have been proposed for traditional control and estimation design in our previous work, in which the stability has been proved by solely using data (Han, Zhang, Wang, & Pan, 2020;Hu, Wu, & Pan, 2020) and a learned Lyapunov function constraint.…”
Section: Introductionmentioning
confidence: 99%