This work presents an algorithmic scheme for solving the infinite-time constrained linear quadratic regulation problem. We employ an accelerated version of a popular proximal gradient scheme, commonly known as the Forward-Backward Splitting (FBS), and prove its convergence to the optimal solution in our infinite-dimensional setting. Each iteration of the algorithm requires only finite memory, is computationally cheap, and makes no use of terminal invariant sets; hence, the algorithm can be applied to systems of very large dimensions. The acceleration brings in 'optimal' convergence rates O(1/k 2 ) for function values and O(1/k) for primal iterates and renders the proposed method a practical alternative to model predictive control schemes for setpoint tracking. In addition, for the case when the true system is subject to disturbances or modelling errors, we propose an efficient warm-starting procedure, which significantly reduces the number of iterations when the algorithm is applied in closed-loop. Numerical examples demonstrate the approach.
Index TermsConstrained LQR, Alternating minimization, Operator splitting I. INTRODUCTION An important extension of the famous result of [1] on the closed form solution of the infinitehorizon linear quadratic regulation (LQR) problem is the case where input and state variables are constrained. This problem is computationally significantly more difficult and has been by and large addressed only approximately. A prime example of an approximation scheme is model predictive control (MPC), which approximates the infinite-time constrained problem by a finitetime one. Stability of such MPC controllers is then typically enforced by adding a suitable terminal constraint and a terminal penalty. The inclusion of a terminal constraint limits the feasible region of the MPC controller, and, consequently, the region of attraction of the closedloop system. In practical applications, this problem is typically overcome by simply choosing a "sufficiently" long horizon based on process insight (e.g., dominant time constant). Closed-loop behavior is then analyzed a posteriori, for instance by exhaustive simulation.There have been few results addressing directly the infinite-horizon constrained LQR (CLQR) problem. Among the most well-known efforts are the works