2021
DOI: 10.48550/arxiv.2105.12204
Preprint
Safe Value Functions

Abstract: The relationship between safety and optimality in control is not well understood, and they are often seen as important yet conflicting objectives. There is a pressing need to formalize this relationship, especially given the growing prominence of learning-based methods. Indeed, it is common practice in reinforcement learning to simply modify reward functions by penalizing failures, with the penalty treated as a mere heuristic. We rigorously examine this relationship, and formalize the requirements for safe val…


Cited by 3 publications (3 citation statements)
References 26 publications
“…Learning with CBFs: Approaches that use CBFs during learning typically assume that a valid CBF is already given, while we focus on constructing CBFs so that our approach can be viewed as complementary. In [19], it is shown how safe and optimal reward functions can be obtained, and how these are related to CBFs. The authors in [20] use CBFs to learn a provably correct neural network safety guard for kinematic bicycle models.…”
Section: A. Related Work (citation type: mentioning; confidence: 99%)
“…These works typically assume that a CBF is already given, while we in this paper focus on constructing CBFs so that our approach should be viewed as complementary. In [17], it is shown how safe and optimal reward functions can be obtained and how these are related to CBFs. The authors in [18] learn a provably correct neural network safety guard for kinematic bicycle models using CBFs as safety filters.…”
Section: Related Work (citation type: mentioning; confidence: 99%)
“…Note that the RAU module encodes both safety and performance specifications in the reward function. It is shown in [64] that a big enough penalty function can assure that the optimal solution for the original task can be learned safely.…”
Section: B. Risk Assessment Unit and Reward Design Using Preview Infor... (citation type: mentioning; confidence: 99%)
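The last citation statement refers to the common practice, also discussed in the abstract, of penalizing failures in the reward function so that a sufficiently large penalty makes the safe-optimal policy coincide with the penalized-optimal one. A minimal sketch of that idea is below; the function and state names are illustrative assumptions, not taken from the cited paper.

```python
# Sketch of reward modification by failure penalty (hypothetical example).
# The claim in [64]: if the penalty is large enough, optimizing the
# penalized reward also yields an optimal policy for the original task
# that never enters unsafe states.

def penalized_reward(state, task_reward, unsafe_states, penalty=100.0):
    """Task reward, minus a large constant penalty on unsafe states."""
    if state in unsafe_states:
        return task_reward - penalty
    return task_reward

# Usage: with penalty larger than any achievable return difference,
# every unsafe transition is worse than any safe trajectory, so the
# optimizer is steered away from unsafe states.
unsafe = {"cliff"}
print(penalized_reward("goal", 1.0, unsafe))   # 1.0 (safe state, unchanged)
print(penalized_reward("cliff", 1.0, unsafe))  # -99.0 (penalized)
```

How large "large enough" must be is exactly what the cited paper formalizes; a heuristic constant like the one above carries no guarantee on its own.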