2021
DOI: 10.48550/arxiv.2105.06577
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Integration of adaptive control and reinforcement learning for real-time control and learning

Abstract: This paper considers the problem of real-time control and learning in dynamic systems subjected to uncertainties. Adaptive approaches are proposed to address the problem, which are combined with methods and tools in Reinforcement Learning (RL) and Machine Learning (ML). Algorithms are proposed in continuous-time that combine adaptive approaches with RL leading to online control policies that guarantee stable behavior in the presence of parametric uncertainties that occur in real-time. Algorithms are proposed i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

1
1
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 54 publications
1
1
0
Order By: Relevance
“…The high-order tuner in [6] is continuous, and is later proved in [5] to result in exponential parameter convergence. The same exponential convergence results have also been proved in [9] using a different approach from [5]. But a discrete-time parameter convergence analysis has been lacking in the literature.…”
Section: Introductionsupporting
confidence: 53%
“…The high-order tuner in [6] is continuous, and is later proved in [5] to result in exponential parameter convergence. The same exponential convergence results have also been proved in [9] using a different approach from [5]. But a discrete-time parameter convergence analysis has been lacking in the literature.…”
Section: Introductionsupporting
confidence: 53%
“…The fact that the two approaches are different suggests that there are ways in which they could be integrated so as to realize their combined advantages. The focus of AC on stability and RL on optimality suggests that one such candidate is a multiloop approach, with an inner loop focused on AC methods that are capable of delivering real-time performance with stability guarantees and an outer loop focused on RL methods that can anticipate an optimal policy when the sim-to-real gap is small (93)(94)(95)(96)(97).…”
Section: Disclosure Statementmentioning
confidence: 99%