“…Due to these properties, GPs gained increasing attention in the field of reinforcement learning and system identification. Especially, when safety guarantees are necessary, GPs are favored in reinforcement learning (Berkenkamp et al, 2016a(Berkenkamp et al, ,c, 2017Koller et al, 2018) as well as control (Berkenkamp and Schoellig, 2015;Umlauft et al, 2017;Beckers and Hirche, 2018;Lederer et al, 2020;Umlauft et al, 2018;Helwa et al, 2019). These approaches heavily rely on error bounds of GP regression and are therefore limited by the strict assumptions made in previous works on GP uniform error bounds (Srinivas et al, 2012;Chowdhury and Gopalan, 2017).…”