2016 IEEE International Conference on Robotics and Automation (ICRA)
DOI: 10.1109/icra.2016.7487144

Automatic LQR tuning based on Gaussian process global optimization

Abstract: This paper proposes an automatic controller tuning framework based on linear optimal control combined with Bayesian optimization. With this framework, an initial set of controller gains is automatically improved according to a pre-defined performance objective evaluated from experimental data. The underlying Bayesian optimization algorithm is Entropy Search, which represents the latent objective as a Gaussian process and constructs an explicit belief over the location of the objective minimum. This is…
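
The loop the abstract describes (treat the closed-loop performance cost as a latent function of the controller design parameters, model it with a Gaussian process, and choose the next experiment with a Bayesian-optimization acquisition rule) can be sketched compactly. The following is a minimal illustration, not the authors' implementation: it tunes a single LQR design weight theta on a synthetic double-integrator plant and uses expected improvement in place of Entropy Search, which is considerably more involved. The plant matrices, parameter range, and GP hyperparameters are all illustrative assumptions; the final lines show a Monte Carlo stand-in for the belief over the minimizer's location that Entropy Search maintains explicitly.

```python
# Minimal sketch of GP-based LQR tuning in the spirit of the paper.
# Assumptions (not from the paper): a one-dimensional design parameter
# theta scaling the LQR state-cost matrix, a synthetic double-integrator
# plant, and expected improvement instead of the Entropy Search
# acquisition function used by the authors.
import numpy as np
from scipy.linalg import solve_discrete_are
from scipy.stats import norm

A = np.array([[1.0, 0.1], [0.0, 1.0]])  # double integrator, dt = 0.1
B = np.array([[0.005], [0.1]])

def experiment_cost(theta, T=200):
    """One 'experiment': simulate the closed loop under the LQR gain
    designed with weights parameterized by theta; return the observed cost."""
    Q = 10.0 ** theta * np.eye(2)  # design weight tuned by the optimizer
    R = np.eye(1)
    P = solve_discrete_are(A, B, Q, R)
    K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
    x, J = np.array([1.0, 0.0]), 0.0
    for _ in range(T):
        u = -K @ x
        J += x @ x + float(u @ u)  # fixed performance objective
        x = A @ x + (B @ u).ravel()
    return J / T

def gp_posterior(X, y, Xs, ell=0.5, sf=1.0, sn=1e-3):
    """GP posterior mean and covariance with an RBF kernel
    (hyperparameters fixed here; in practice they are learned)."""
    k = lambda a, b: sf**2 * np.exp(-0.5 * (a[:, None] - b[None, :])**2 / ell**2)
    Kxx = k(X, X) + sn**2 * np.eye(len(X))
    Ks = k(Xs, X)
    mu = Ks @ np.linalg.solve(Kxx, y)
    cov = k(Xs, Xs) - Ks @ np.linalg.solve(Kxx, Ks.T)
    return mu, cov

thetas = np.array([-1.0, 1.0])  # initial experiments
costs = np.array([experiment_cost(t) for t in thetas])
grid = np.linspace(-2.0, 3.0, 200)
for _ in range(10):  # Bayesian optimization loop
    mu, cov = gp_posterior(thetas, costs, grid)
    sd = np.sqrt(np.maximum(np.diag(cov), 1e-12))
    z = (costs.min() - mu) / sd
    ei = (costs.min() - mu) * norm.cdf(z) + sd * norm.pdf(z)  # expected improvement
    t_next = grid[np.argmax(ei)]
    thetas = np.append(thetas, t_next)
    costs = np.append(costs, experiment_cost(t_next))

# Entropy Search additionally maintains an explicit belief p_min over the
# minimizer's location; a Monte Carlo stand-in: sample posterior functions
# on the grid and histogram their argmins.
mu, cov = gp_posterior(thetas, costs, grid)
draws = np.random.default_rng(0).multivariate_normal(
    mu, cov + 1e-9 * np.eye(len(grid)), size=2000)
p_min = np.bincount(draws.argmin(axis=1), minlength=len(grid)) / 2000.0
print("best theta:", thetas[np.argmin(costs)],
      "most probable minimizer:", grid[p_min.argmax()])
```

In this toy setting the loop concentrates p_min around the design weight that best trades off state error against control effort; in the paper's framework the cost is instead evaluated from experimental data on the real system.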

Cited by 115 publications (128 citation statements)
References 22 publications

“…Deisenroth et al [9], for example, developed a Bayesian approach to tune a cart-pole system, whereas Berkenkamp et al [6] proposed to use Bayesian optimization to safely tune robotic controllers for quadrotors. Moreover, Marco et al [16] combined Bayesian optimization with optimal control to tune LQR regulators. In contrast to these approaches, Akrour et al [1] suggested directing the optimization process by using a search distribution; however, the optimization loses expressibility on a global scope since it only optimizes locally.…”
Section: Related Work
confidence: 99%
“…BO with GPs has successfully been used, for example, for gait learning with bipedal walkers [9], quadrupedal robots [10], and cm-scale hexapodal robots [11]. It has also been proposed for automatic feedback controller tuning [12]-[14]. A major strength of GPs is that they allow one to include existing information about the system in the form of a probabilistic prior.…”
Section: Gait Learning Approach
confidence: 99%
“…We formulate the problem of gait learning in soft microrobots as a parametric controller tuning problem. This work builds on the automatic controller tuning methods proposed in [12], where an unknown controller cost function was learned in a data-efficient way using BO with GPs.…”
Section: A. Controller Learning
confidence: 99%
“…BO for controller learning has recently also been suggested in [12], [20], [21], which include successful demonstrations in laboratory experiments. A discrete event controller is optimized for a walking robot in [12], and state-feedback controllers are tuned in [20] for a quadrotor and in [21] for a humanoid robot balancing a pole. Herein, we present results of applying BO for a typical control problem in the automotive industry (throttle valve control) and consider two types of control objectives, different from those in [12], [20], [21].…”
Section: Introduction
confidence: 99%
“…A discrete event controller is optimized for a walking robot in [12], and state-feedback controllers are tuned in [20] for a quadrotor and in [21] for a humanoid robot balancing a pole. Herein, we present results of applying BO for a typical control problem in the automotive industry (throttle valve control) and consider two types of control objectives, different from those in [12], [20], [21]. The proposed controller learning framework, which combines BO with ADRC, is different from the controllers in the mentioned references.…”
Section: Introduction
confidence: 99%