2021
DOI: 10.1002/rnc.5624
|View full text |Cite
|
Sign up to set email alerts
|

Suboptimal reduced control of unknown nonlinear singularly perturbed systems via reinforcement learning

Abstract: In this paper, a suboptimal reduced control method is proposed for a class of nonlinear singularly perturbed systems (SPSs) with unknown dynamics. By using singular perturbation theory, the original system is reduced to a reduced system, by which a policy iterative method is proposed to solve the corresponding reduced Hamilton–Jacobi–Bellman (HJB) equation with convergence guaranteed. A reinforcement learning (RL) algorithm is proposed to implement the policy iterative method without using any knowledge of the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
0
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 45 publications
0
0
0
Order By: Relevance
“…In this way, the state constraints (2) of the original systems are converted to the tracking error constraints described by (10). Considering Equation (7) and system (1), we have…”
Section: System Transformationmentioning
confidence: 99%
See 1 more Smart Citation
“…In this way, the state constraints (2) of the original systems are converted to the tracking error constraints described by (10). Considering Equation (7) and system (1), we have…”
Section: System Transformationmentioning
confidence: 99%
“…The decentralized control method for multi-agent linear SPSs was provided in [6], where the synchronization problem of networks of linear SPSs was rewritten as stabilization of uncertain linear SPSs to reduce the associated energy and the communication load. The suboptimal control strategy for nonlinear SPSs with known fast-subsystems was investigated in [7], in which it was innovative to the suboptimal performance of the considered systems without having any knowledge of the system. An adaptive dynamic programming scheme is designed for linear SPSs, whose slow-subsystems are unknown [8].…”
Section: Introductionmentioning
confidence: 99%