Certainty Equivalence is Efficient for Linear Quadratic Control

Mania, Horia; Tu, Stephen; Recht, Benjamin

doi:10.48550/arxiv.1902.07826

Cited by 13 publications

(47 citation statements)

References 19 publications

(51 reference statements)

“…This way of designing the control policy is also known as the certainty equivalence approach (e.g., [4]). Specifically, the authors in [10,25] provided an online algorithm for the LQR problem with unknown system matrices and showed that the regret of the algorithm is Õ(√N), where N is the number of time steps in the LQR problem and Õ(·) hides logarithmic factors in N. Note that the authors in [1,11,10,25] considered the infinite horizon LQR setting.…”

Section: Related Work (mentioning)

confidence: 99%

“…Specifically, the authors in [10,25] provided an online algorithm for the LQR problem with unknown system matrices and showed that the regret of the algorithm is Õ(√N), where N is the number of time steps in the LQR problem and Õ(·) hides logarithmic factors in N. Note that the authors in [1,11,10,25] considered the infinite horizon LQR setting. We extend the analyses and results in [25] to the finite horizon LQR setting when solving the problem considered in this paper.…”

Section: Related Work (mentioning)

confidence: 99%

“…Note that the authors in [1,11,10,25] considered the infinite horizon LQR setting. We extend the analyses and results in [25] to the finite horizon LQR setting when solving the problem considered in this paper.…”

Section: Related Work (mentioning)

confidence: 99%

“…We analyze the regret of the proposed algorithm and show that the regret is Õ(√T) with high probability, where T is the number of rounds in the problem and Õ(·) hides logarithmic factors in T. When analyzing the regret of our algorithm, we also extend the certainty equivalence approach proposed in [25] for learning LQR over an infinite horizon to the finite-horizon setting.…”

Section: Contributions (mentioning)

confidence: 99%

“…For the controller design, we leverage the certainty equivalence approach, which has been studied for the LQR problem over an infinite horizon with unknown system model (e.g., [11,25]). Specifically, in the certainty equivalence approach, we design a control policy based on estimated system matrices, denoted as Â and B̂.…”

Section: Controller Design Using Certainty Equivalence Approach (mentioning)

confidence: 99%
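The certainty equivalence idea quoted above (estimate the system, then design the controller as if the estimate were exact) can be illustrated with a minimal sketch. It assumes a scalar system x[t+1] = a·x[t] + b·u[t] + w[t] rather than the matrix setting of the cited papers, and the function names (`simulate`, `estimate_ab`, `lqr_gain`), the cost weights q and r, and the noise level are all hypothetical choices made for illustration; this is not the algorithm of any of the papers listed here.

```python
import random


def simulate(a, b, inputs, noise_std=0.1, x0=0.0, seed=0):
    """Roll out x[t+1] = a*x[t] + b*u[t] + w[t] for a scalar system (assumed model)."""
    rng = random.Random(seed)
    xs = [x0]
    for u in inputs:
        xs.append(a * xs[-1] + b * u + rng.gauss(0.0, noise_std))
    return xs


def estimate_ab(xs, us):
    """Ordinary least squares for (a, b) via the 2x2 normal equations."""
    sxx = sum(x * x for x in xs[:-1])
    sxu = sum(x * u for x, u in zip(xs[:-1], us))
    suu = sum(u * u for u in us)
    sxy = sum(x * y for x, y in zip(xs[:-1], xs[1:]))
    suy = sum(u * y for u, y in zip(us, xs[1:]))
    det = sxx * suu - sxu * sxu
    a_hat = (sxy * suu - sxu * suy) / det
    b_hat = (sxx * suy - sxu * sxy) / det
    return a_hat, b_hat


def lqr_gain(a, b, q=1.0, r=1.0, iters=500):
    """Iterate the scalar discrete-time Riccati equation, return k for u = -k*x."""
    p = q
    for _ in range(iters):
        p = q + a * a * p - (a * b * p) ** 2 / (r + b * b * p)
    return a * b * p / (r + b * b * p)


if __name__ == "__main__":
    a_true, b_true = 0.9, 0.5  # hypothetical "unknown" system
    rng = random.Random(1)
    us = [rng.gauss(0.0, 1.0) for _ in range(500)]  # exploratory inputs
    xs = simulate(a_true, b_true, us)
    a_hat, b_hat = estimate_ab(xs, us)
    k_hat = lqr_gain(a_hat, b_hat)  # certainty equivalence: plug in estimates
    print("estimates:", a_hat, b_hat)
    print("closed-loop pole with true system:", a_true - b_true * k_hat)
```

The gain computed from the estimates is then applied to the true system; how much cost this loses relative to the gain computed from the true parameters is the quantity the regret bounds in the statements above are concerned with.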

Online Actuator Selection and Controller Design for Linear Quadratic Regulation with Unknown System Model

Ye, Ming, Liu et al. 2022

Preprint

We study the simultaneous actuator selection and controller design problem for linear quadratic regulation over a finite horizon, when the system matrices are unknown a priori. We propose an online actuator selection algorithm to solve the problem which specifies both a set of actuators to be utilized and the control policy corresponding to the set of selected actuators. Specifically, our algorithm is a model based learning algorithm which maintains an estimate of the system matrices using the system trajectories. The algorithm then leverages an algorithm for the multiarmed bandit problem to determine the set of actuators under an actuator selection budget constraint and also identifies the corresponding control policy that minimizes a quadratic cost based on the estimated system matrices. We show that the proposed online actuator selection algorithm yields a sublinear regret.

Behavioral systems theory in data-driven analysis, signal processing, and control

Markovsky, Dörfler 2021

Annual Reviews in Control

161

Controlling unknown linear dynamics with bounded multiplicative regret

et al. 2022

We consider a simple control problem in which the underlying dynamics depend on a parameter that is unknown and must be learned. We exhibit a control strategy which is optimal to within a multiplicative constant. While most authors find strategies which are successful as the time horizon tends to infinity, our strategy achieves lowest expected cost up to a constant factor for a fixed time horizon.
