2021 American Control Conference (ACC) 2021
DOI: 10.23919/acc50511.2021.9483023
|View full text |Cite
|
Sign up to set email alerts
|

Regret-Optimal Controller for the Full-Information Problem

Abstract: We consider the infinite-horizon, discrete-time fullinformation control problem. Motivated by learning theory, as a criterion for controller design we focus on regret, defined as the difference between the LQR cost of a causal controller (that has only access to past and current disturbances) and the LQR cost of a clairvoyant one (that has also access to future disturbances). In the full-information setting, there is a unique optimal non-causal controller that in terms of LQR cost dominates all other controlle… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
23
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 15 publications
(23 citation statements)
references
References 6 publications
0
23
0
Order By: Relevance
“…The sets of the causal and strcitly causal policies are denoted by Π C and Π S.C. respectively. Moreover, we restrict our attention in both scenarios to linear policies 1 .…”
Section: A the Settingmentioning
confidence: 99%
“…The sets of the causal and strcitly causal policies are denoted by Π C and Π S.C. respectively. Moreover, we restrict our attention in both scenarios to linear policies 1 .…”
Section: A the Settingmentioning
confidence: 99%
“…Motivated by competitive design approaches in online learning [3], two control paradigms have been recently proposed as alternatives to the classical paradigms, namely regret-optimal control [4] and competitive-ratio control [5]. These criteria aim to minimize the difference or the ratio between the costs of a causal controller (to be designed) and the cost of the optimal clairvoyant controller that has access to entire disturbances sequence.…”
Section: Introductionmentioning
confidence: 99%
“…In the competitive-ratio case, we minimize the corresponding worst-case ratio of these two costs. These strategies result in novel controller behavior, e.g., the regret-optimal controller interpolates between H 2 and H ∞ to achieve the best of both worlds [4], and provide superior performance in various control tasks with various disturbance characteristics [4][5][6].…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations