“…We employ switching regret for non-convex optimization. More refined analysis may be possible via generalizations such as strongly adaptive or dynamic regret (Daniely et al, 2015;Jun et al, 2017;Zhang et al, 2018;Jacobsen & Cutkosky, 2022;Cutkosky, 2020;Lu et al, 2022;Luo et al, 2022;Zhang et al, 2021;Baby & Wang, 2022;Zhang et al, 2022). Moreover, our analysis assumes perfect tuning of constants (e.g., D, T, K) for simplicity.…”