Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

Volpp, Michael; Fröhlich, Lukas P.; Fischer, Kirsten; Doerr, Andreas; Falkner, Stefan; Hutter, Frank; Daniel, Christian

doi:10.48550/arxiv.1904.02642

Cited by 5 publications

(6 citation statements)

References 16 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We find this especially promising for meta-learning, potentially building on LEO (Rusu et al, 2018). Inspired by DCEM, other more powerful sampling-based optimizers could be made differentiable in the same way, potentially optimizers that leverage gradient-based information in the inner optimization steps (Sekhon & Mebane, 1998;Theodorou et al, 2010;Stulp & Sigaud, 2012;Maheswaranathan et al, 2018) or by also learning the hyper-parameters of structured optimizers (Li & Malik, 2016;Volpp et al, 2019;Chen et al, 2017).…”

Section: Discussionmentioning

confidence: 99%

The Differentiable Cross-Entropy Method

Amos,

Yarats

2019

Preprint

View full text Add to dashboard Cite

We study the Cross-Entropy Method (CEM) for the non-convex optimization of a continuous and parameterized objective function and introduce a differentiable variant (DCEM) that enables us to differentiate the output of CEM with respect to the objective function's parameters. In the machine learning setting this brings CEM inside of the end-to-end learning pipeline where this has otherwise been impossible. We show applications in a synthetic energy-based structured prediction task and in non-convex continuous control. In the control setting we show on the simulated cheetah and walker tasks that we can embed their optimal action sequences with DCEM and then use policy optimization to fine-tune components of the controller as a step towards combining model-based and model-free RL.

show abstract

Section: Discussionmentioning

confidence: 99%

The Differentiable Cross-Entropy Method

Amos,

Yarats

2019

Preprint

View full text Add to dashboard Cite

show abstract

“…A possible extension of PROFET would be to consider multi-fidelity benchmarks (Klein et al, 2017a;Kandasamy et al, 2017;Klein et al, 2017c) where cheap, but approximate fidelities of the objective function are available, e. g. learning curves or dataset subsets. Furthermore, since PROFET also provides gradient information it could serve as a training distribution for learning-to-learn approaches (Chen et al, 2017;Volpp et al, 2019).…”

Section: Comparing State-of-the-art Hpo Methodsmentioning

confidence: 99%

Meta-Surrogate Benchmarking for Hyperparameter Optimization

Klein,

Dai,

Hutter

et al. 2019

Preprint

Self Cite

View full text Add to dashboard Cite

Despite the recent progress in hyperparameter optimization (HPO), available benchmarks that resemble real-world scenarios consist of a few and very large problem instances that are expensive to solve. This blocks researchers and practitioners not only from systematically running large-scale comparisons that are needed to draw statistically significant results but also from reproducing experiments that were conducted before. This work proposes a method to alleviate these issues by means of a meta-surrogate model for HPO tasks trained on off-line generated data. The model combines a probabilistic encoder with a multi-task model such that it can generate inexpensive and realistic tasks of the class of problems of interest. We demonstrate that benchmarking HPO methods on samples of the generative model allows us to draw more coherent and statistically significant conclusions that can be reached orders of magnitude faster than using the original tasks. We provide evidence of our findings for various HPO methods on a wide class of problems.Preprint. Under review.

show abstract

“…This may be done by i) updating the predictive mean and variance of the surrogate model learned in the target campaign, 42 or ii) by means of an ensemble of acquisition functions. 43,44 Prior work has argued that the latter option better aggregates information from source campaigns with varying objective value ranges 43 (see ESI † Sec. S.2.A and S.2.B for additional details).…”

Section: Related Workmentioning

confidence: 99%

“…Sets of source measurements are then sampled from these perturbed surfaces. Additional details on the generation (following the procedure proposed by Volpp et al 44 ) of source data are given in ESI † Sec. S. 4.…”

Section: Analytical Benchmarksmentioning

confidence: 99%