Mark Koren scite author profile

This paper presents a method for testing the decision making systems of autonomous vehicles. Our approach involves perturbing stochastic elements in the vehicle's environment until the vehicle is involved in a collision. Instead of applying direct Monte Carlo sampling to find collision scenarios, we formulate the problem as a Markov decision process and use reinforcement learning algorithms to find the most likely failure scenarios. This paper presents Monte Carlo Tree Search (MCTS) and Deep Reinforcement Learning (DRL) solutions that can scale to large environments. We show that DRL can find more likely failure scenarios than MCTS with fewer calls to the simulator. A simulation scenario involving a vehicle approaching a crosswalk is used to validate the framework. Our proposed approach is very general and can be easily applied to other scenarios given the appropriate models of the vehicle and the environment.

show abstract

Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims

Brundage¹,

Avin²,

Wang³

et al. 2020

Preprint

100

View full text Add to dashboard Cite

Efficient Autonomy Validation in Simulation with Adaptive Stress Testing

Koren

Kochenderfer

2019

View full text Add to dashboard Cite

During the development of autonomous systems such as driverless cars, it is important to characterize the scenarios that are most likely to result in failure. Adaptive Stress Testing (AST) provides a way to search for the mostlikely failure scenario as a Markov decision process (MDP). Our previous work used a deep reinforcement learning (DRL) solver to identify likely failure scenarios. However, the solver's use of a feed-forward neural network with a discretized space of possible initial conditions poses two major problems. First, the system is not treated as a black box, in that it requires analyzing the internal state of the system, which leads to considerable implementation complexities. Second, in order to simulate realistic settings, a new instance of the solver needs to be run for each initial condition. Running a new solver for each initial condition not only significantly increases the computational complexity, but also disregards the underlying relationship between similar initial conditions. We provide a solution to both problems by employing a recurrent neural network that takes a set of initial conditions from a continuous space as input. This approach enables robust and efficient detection of failures because the solution generalizes across the entire space of initial conditions. By simulating an instance where an autonomous car drives while a pedestrian is crossing a road, we demonstrate the solver is now capable of finding solutions for problems that would have previously been intractable.

show abstract

Finding Failures in High-Fidelity Simulation using Adaptive Stress Testing and the Backward Algorithm

Koren¹,

Nassar²,

Kochenderfer³

2021

View full text Add to dashboard Cite

Adaptive Stress Testing without Domain Heuristics using Go-Explore

Koren

Kochenderfer

2020

View full text Add to dashboard Cite

Finding Failures in High-Fidelity Simulation using Adaptive Stress Testing and the Backward Algorithm

Koren¹,

Nassar²,

Kochenderfer³

2021

Preprint

View full text Add to dashboard Cite

Validating the safety of autonomous systems generally requires the use of high-fidelity simulators that adequately capture the variability of real-world scenarios. However, it is generally not feasible to exhaustively search the space of simulation scenarios for failures. Adaptive stress testing (AST) is a method that uses reinforcement learning to find the most likely failure of a system. AST with a deep reinforcement learning solver has been shown to be effective in finding failures across a range of different systems. This approach generally involves running many simulations, which can be very expensive when using a high-fidelity simulator. To improve efficiency, we present a method that first finds failures in a low-fidelity simulator. It then uses the backward algorithm, which trains a deep neural network policy using a single expert demonstration, to adapt the low-fidelity failures to high-fidelity. We have created a series of autonomous vehicle validation case studies that represent some of the ways low-fidelity and highfidelity simulators can differ, such as time discretization. We demonstrate in a variety of case studies that this new AST approach is able to find failures with significantly fewer highfidelity simulation steps than are needed when just running AST directly in high-fidelity. As a proof of concept, we also demonstrate AST on NVIDIA's DriveSim simulator, an industry state-of-the-art high-fidelity simulator for finding failures in autonomous vehicles.

show abstract

Efficient Autonomy Validation in Simulation with Adaptive Stress Testing

Koren¹,

Kochenderfer²

2019

Preprint

View full text Add to dashboard Cite

Adaptive Stress Testing without Domain Heuristics using Go-Explore

Koren¹,

Kochenderfer²

2020

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Mark Koren

Adaptive Stress Testing for Autonomous Vehicles

Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims

Efficient Autonomy Validation in Simulation with Adaptive Stress Testing

Finding Failures in High-Fidelity Simulation using Adaptive Stress Testing and the Backward Algorithm

Adaptive Stress Testing without Domain Heuristics using Go-Explore

Finding Failures in High-Fidelity Simulation using Adaptive Stress Testing and the Backward Algorithm

Efficient Autonomy Validation in Simulation with Adaptive Stress Testing

Adaptive Stress Testing without Domain Heuristics using Go-Explore

Contact Info

Product

Resources

About