2021
DOI: 10.1007/s10994-021-06019-1

Bayesian optimization with safety constraints: safe and automatic parameter tuning in robotics

Abstract: Selecting the right tuning parameters for algorithms is a prevalent problem in machine learning that can significantly affect the performance of algorithms. Data-efficient optimization algorithms, such as Bayesian optimization, have been used to automate this process. During experiments on real-world systems such as robotic platforms, these methods can evaluate unsafe parameters that lead to safety-critical system failures and can destroy the system. Recently, a safe Bayesian optimization algorithm, called Safe…
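The safety mechanism the abstract alludes to, restricting evaluations to parameters where a probabilistic model of the safety measure is confident enough, can be sketched with a Gaussian-process surrogate. A minimal illustration (the kernel, the confidence parameter beta, and all function names are assumptions made for this sketch, not the paper's implementation):

```python
import numpy as np

def rbf_kernel(a, b, length_scale=0.2):
    """Squared-exponential kernel between two sets of 1-D points."""
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / length_scale) ** 2)

def gp_posterior(x_train, y_train, x_query, noise=1e-4):
    """GP posterior mean and standard deviation at the query points."""
    K = rbf_kernel(x_train, x_train) + noise * np.eye(len(x_train))
    K_s = rbf_kernel(x_query, x_train)
    mean = K_s @ np.linalg.solve(K, y_train)
    v = np.linalg.solve(K, K_s.T)
    var = 1.0 - np.sum(K_s * v.T, axis=1)
    return mean, np.sqrt(np.maximum(var, 0.0))

def safe_set(x_train, y_train, x_query, threshold, beta=2.0):
    """Candidate parameters whose GP lower confidence bound on the
    safety measure stays above the safety threshold."""
    mean, std = gp_posterior(x_train, y_train, x_query)
    return x_query[mean - beta * std >= threshold]
```

Only parameters in the returned set would be evaluated on the physical system; everything else stays untouched until the model becomes confident about it.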

Cited by 130 publications (121 citation statements)
References 29 publications
“…1) Robot skill parameter inference: Several approaches for the automatic optimization of robot skill parameters have been proposed; they rely on gradient-free optimization techniques such as evolutionary algorithms [7], [8], [9] or Bayesian optimization [10], [11], [12] due to the non-differentiability of most skill libraries and frameworks. Gradient-free approaches require frequent execution of the skills during optimization, which is time-consuming when done on real robot systems, has to be repeated whenever the task objectives change, and often requires good initial parameterizations.…”
Section: Related Work (mentioning, confidence: 99%)
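The gradient-free setting described in the excerpt can be illustrated with the simplest such method, a (1+1) evolution strategy. Each call to `cost` stands in for one physical execution of the skill, which is exactly why these approaches are expensive on a real robot. This is a generic sketch under assumed names, not any of the cited frameworks:

```python
import numpy as np

def evolve(cost, x0, sigma=0.1, iters=200, seed=0):
    """(1+1) evolution strategy: perturb the current parameters and
    keep the mutation only if it lowers the cost."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    fx = cost(x)
    for _ in range(iters):
        candidate = x + sigma * rng.standard_normal(x.shape)
        fc = cost(candidate)
        if fc < fx:  # greedy acceptance: only improvements survive
            x, fx = candidate, fc
    return x, fx
```

Every iteration is one skill execution, so 200 iterations means 200 robot trials; this sample hunger is the drawback the excerpt points out, and it motivates data-efficient alternatives such as Bayesian optimization.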
“…Due to these properties, GPs gained increasing attention in the field of reinforcement learning and system identification. Especially when safety guarantees are necessary, GPs are favored in reinforcement learning (Berkenkamp et al., 2016a,c, 2017; Koller et al., 2018) as well as control (Berkenkamp and Schoellig, 2015; Umlauft et al., 2017; Beckers and Hirche, 2018; Lederer et al., 2020; Umlauft et al., 2018; Helwa et al., 2019). These approaches heavily rely on error bounds of GP regression and are therefore limited by the strict assumptions made in previous works on GP uniform error bounds (Srinivas et al., 2012; Chowdhury and Gopalan, 2017).…”
Section: Introduction (mentioning, confidence: 99%)
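The uniform error bounds the excerpt refers to have the form |f(x) - mu(x)| <= beta * sigma(x) for all x, where beta depends on the assumptions being discussed. A numerical illustration of the resulting mu +/- beta*sigma tube (the kernel, training data, and beta = 3 are arbitrary choices for this sketch, not the constants from the cited bounds):

```python
import numpy as np

def rbf(a, b, length_scale=0.2):
    """Squared-exponential kernel matrix between two sets of 1-D points."""
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / length_scale) ** 2)

def gp_predict(x_train, y_train, x_query, noise=1e-6):
    """Posterior mean and standard deviation of a zero-mean GP."""
    K = rbf(x_train, x_train) + noise * np.eye(len(x_train))
    Ks = rbf(x_query, x_train)
    mu = Ks @ np.linalg.solve(K, y_train)
    var = 1.0 - np.sum(Ks * np.linalg.solve(K, Ks.T).T, axis=1)
    return mu, np.sqrt(np.maximum(var, 0.0))

# High-probability tube mu(x) +/- beta * sigma(x): safety-aware methods
# only act on statements that hold for every x inside this tube, which
# is why they depend so heavily on how beta is justified.
x_train = np.linspace(0.0, 1.0, 8)
y_train = np.sin(2.0 * np.pi * x_train)
x_query = np.linspace(0.0, 1.0, 101)
mu, sigma = gp_predict(x_train, y_train, x_query)
beta = 3.0
lower, upper = mu - beta * sigma, mu + beta * sigma
```

The tube collapses at observed inputs and widens between them; the theoretical question in the cited works is how large beta must be so that the true function stays inside the tube with high probability under weak assumptions.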
“…In this work, we restrict the range of the optimization variables to a limited set where the system is stable and focus on overshoot and set-point tracking errors. Bayesian optimization for controller tuning where stability is guaranteed through safe exploration has been proposed in (Berkenkamp et al., 2016b) and applied to robotic applications (Berkenkamp et al., 2016a) and to process systems (Khosravi et al., 2019a,b). The proposed Bayesian optimization tuning strikes a compromise between the extensive number of trials needed to find the optimal gains (according to a specified performance criterion) and a single trial, as in standard methods, which yields a gain that is sub-optimal with respect to the system's performance but whose stability is ensured over a wide range of operation.…”
Section: Introduction (mentioning, confidence: 99%)
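The restricted-range tuning described in the excerpt can be made concrete with a toy version: Bayesian optimization of a single proportional gain over a grid on which the closed loop is known to be stable. Everything here (the first-order plant, the cost with overshoot penalty, the kernel, and the acquisition rule) is an illustrative assumption, not the cited tuning procedures:

```python
import numpy as np

def step_cost(k, a=1.0, dt=0.05, steps=200, r=1.0):
    """Tracking cost of proportional gain k for the plant x' = -a*x + u
    under u = k*(r - x), plus a penalty on overshoot past the setpoint."""
    x, cost, peak = 0.0, 0.0, 0.0
    for _ in range(steps):
        x += dt * (-a * x + k * (r - x))   # forward-Euler step
        cost += abs(r - x) * dt            # accumulated tracking error
        peak = max(peak, x)
    return cost + 5.0 * max(0.0, peak - r)

def rbf(a, b, length_scale=1.5):
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / length_scale) ** 2)

def bo_tune(cost, grid, n_init=3, iters=10, beta=2.0, seed=0):
    """Minimal Bayesian optimization: GP surrogate plus a lower-confidence-
    bound acquisition, with the search restricted to `grid`, a range of
    gains pre-checked to keep the discretized loop stable."""
    rng = np.random.default_rng(seed)
    X = list(rng.choice(grid, size=n_init, replace=False))
    Y = [cost(x) for x in X]
    for _ in range(iters):
        Xa, Ya = np.array(X), np.array(Y)
        K = rbf(Xa, Xa) + 1e-6 * np.eye(len(Xa))
        Ks = rbf(grid, Xa)
        ym = Ya.mean()                     # center targets for the GP
        mu = ym + Ks @ np.linalg.solve(K, Ya - ym)
        var = 1.0 - np.sum(Ks * np.linalg.solve(K, Ks.T).T, axis=1)
        lcb = mu - beta * np.sqrt(np.maximum(var, 0.0))
        x_next = grid[int(np.argmin(lcb))]  # most promising stable gain
        X.append(x_next)
        Y.append(cost(x_next))
    best = int(np.argmin(Y))
    return X[best], Y[best]
```

Restricting `grid` to a pre-verified stable range plays the role of the "limited set where the system is stable" in the excerpt: every trial is safe to run, and the repeated trials buy a better gain than a single conservative hand-tuned one.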