2020
DOI: 10.48550/arxiv.2005.01936
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Regret Bounds for Safe Gaussian Process Bandit Optimization

Abstract: Many applications require a learner to make sequential decisions given uncertainty regarding both the system's payoff function and safety constraints. In safety-critical systems, it is paramount that the learner's actions do not violate the safety constraints at any stage of the learning process. In this paper, we study a stochastic bandit optimization problem where the unknown payoff and constraint functions are sampled from Gaussian Processes (GPs) first considered in [Srinivas et al., 2010]. We develop a sa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 12 publications
(20 reference statements)
0
1
0
Order By: Relevance
“…4.3, compounds the problem. Prior works, e.g., in robotics and other areas [12], [13], [64], [65], have proposed Bayesian optimization algorithms with safety constraints. Their main idea lays upon the definition: every t we define a subset of safe controls S t ⊆ X that satisfy the constraints with certainty.…”
Section: Sbp-vran: Safe Bayesian Optimizationmentioning
confidence: 99%
“…4.3, compounds the problem. Prior works, e.g., in robotics and other areas [12], [13], [64], [65], have proposed Bayesian optimization algorithms with safety constraints. Their main idea lays upon the definition: every t we define a subset of safe controls S t ⊆ X that satisfy the constraints with certainty.…”
Section: Sbp-vran: Safe Bayesian Optimizationmentioning
confidence: 99%