“…In a closely related line of work, Dean et al [2018] provide an O(T 2/3 ) regret bound for robust adaptive LQR control, drawing inspiration from classical methods in system identification and robust adaptive control. It has since been shown that certainty equivalent control, without robustness, can attain the (locally) minimax optimal O( √ T ) regret [Mania et al, 2019, Faradonbeh et al, 2020, Lale et al, 2020a, Jedra and Proutiere, 2021. In particular, by providing nearly matching upper and lower bounds, Simchowitz and Foster [2020] refine this analysis and establish that the optimal rate, without taking system theoretic quantities into account, is R T = Θ( p 2 nT ).…”