2022
DOI: 10.48550/arxiv.2205.14691
Preprint

On the Robustness of Safe Reinforcement Learning under Observational Perturbations

Abstract: Safe reinforcement learning (RL) trains a policy to maximize the task reward while satisfying safety constraints. While prior works focus on performance optimality, we find that the optimal solutions of many safe RL problems are not robust and safe against carefully designed observational perturbations. We formally analyze the unique properties of designing effective state adversarial attackers in the safe RL setting. We show that baseline adversarial attack techniques for standard RL tasks are not always …
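
For context, the constrained-MDP objective that safe RL optimizes, as the abstract describes, can be written as follows. This is the standard textbook formulation; the cost-limit symbol $\kappa$ and the perturbation set $B_\epsilon$ below are notational assumptions, not taken from the paper:

$$\max_{\pi}\; \mathbb{E}_{\tau\sim\pi}\Big[\textstyle\sum_{t}\gamma^{t}\, r(s_t,a_t)\Big]\quad \text{s.t.}\quad \mathbb{E}_{\tau\sim\pi}\Big[\textstyle\sum_{t}\gamma^{t}\, c(s_t,a_t)\Big]\le \kappa.$$

Under an observational attack, the agent acts on a perturbed observation, $a_t \sim \pi(\cdot\mid \nu(s_t))$ with $\nu(s_t)\in B_\epsilon(s_t)$, while the environment transitions and the reward and cost signals are still driven by the true state $s_t$.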

Cited by 6 publications (12 citation statements) · References: 36 publications

“…Similarly, the PPO agent that achieves the best route completion (Comp) score also presents the highest RR and SS scores, meaning that it may run red lights and stop signs most frequently. This observation suggests an inherent contradiction between some safety metrics and functionality metrics, which is also revealed in previous studies [46,38,39].…”
Section: Benchmark Results (supporting)
Confidence: 52%
“…Beyond robotic continuous-control tasks and simulated games, robust RL is also tested in mobile-robot tasks and autonomous-driving scenarios. Liu et al. [108] propose a safe and robust benchmark containing mobile-robot tasks based on the Bullet Safety Gym [60] environments. Jaafra et al. [77] propose testing in the CARLA simulator [41] under different conditions, including traffic density (such as the number of dynamic objects) and visual effects (such as weather and lighting conditions).…”
Section: Application Benchmarks and Resources (mentioning)
Confidence: 99%
“…To better describe the unique properties of a safe RL problem, we provide the feasibility, optimality, and temptation definitions following the previous work [108]. Their figure illustrations for one CMDP are presented in Fig. …”
Section: Problem Formulation of Safe Reinforcement Learning (mentioning)
Confidence: 99%
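
The feasibility, optimality, and temptation notions referenced above admit a compact formalization. The sketch below is consistent with how these terms are commonly used in this line of work; the symbols $J_r$, $J_c$, $\kappa$, and the set names are illustrative assumptions rather than quotations from [108]:

$$\Pi_\kappa = \{\pi : J_c(\pi)\le\kappa\}\ \text{(feasible policies)},\qquad \pi^{\ast} = \arg\max_{\pi\in\Pi_\kappa} J_r(\pi)\ \text{(optimal safe policy)},$$
$$\Pi_T = \{\pi : J_r(\pi) > J_r(\pi^{\ast})\ \text{and}\ J_c(\pi) > \kappa\}\ \text{(tempting policies)},$$

where $J_r(\pi)$ and $J_c(\pi)$ denote the expected discounted return and cost of policy $\pi$. A CMDP is tempting when $\Pi_T$ is non-empty, i.e., when violating the constraint can yield strictly higher reward than the best feasible policy.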