2018 IEEE Intelligent Vehicles Symposium (IV) 2018
DOI: 10.1109/ivs.2018.8500400
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive Stress Testing for Autonomous Vehicles

Abstract: This paper presents a method for testing the decision making systems of autonomous vehicles. Our approach involves perturbing stochastic elements in the vehicle's environment until the vehicle is involved in a collision. Instead of applying direct Monte Carlo sampling to find collision scenarios, we formulate the problem as a Markov decision process and use reinforcement learning algorithms to find the most likely failure scenarios. This paper presents Monte Carlo Tree Search (MCTS) and Deep Reinforcement Lear… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
128
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 141 publications
(128 citation statements)
references
References 16 publications
0
128
0
Order By: Relevance
“…This section outlines the problem used in simulation to test AST, the hyper-parameters of the DRL solver, and the reward structure. For bench-marking purposes, we follow the experiment setup-simulation, pedestrian models, and SUT model-proposed in our previous work [10]. The problem has a 5-dimensional state-space and a 6-dimensional action space, and is run for up to 50 time-steps.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…This section outlines the problem used in simulation to test AST, the hyper-parameters of the DRL solver, and the reward structure. For bench-marking purposes, we follow the experiment setup-simulation, pedestrian models, and SUT model-proposed in our previous work [10]. The problem has a 5-dimensional state-space and a 6-dimensional action space, and is run for up to 50 time-steps.…”
Section: Methodsmentioning
confidence: 99%
“…We previously added a new deep reinforcement learning (DRL) solver to AST [10]. The solver is interchangeable with the commonly-used MCTS solver.…”
Section: B Recurrent Deep Reinforcement Learning Solvermentioning
confidence: 99%
See 1 more Smart Citation
“…The constants α and β are set to 10 000 and 1000, respectively, to penalize the algorithm for not finding a collision. Solvers used in our application include Monte Carlo Tree Search (MCTS) [14] and Trust Region Policy Optimization (TRPO) [15] because both have been shown to successfully find failures when combined with AST [8], [11].…”
Section: A Adaptive Stress Testingmentioning
confidence: 99%
“…Koren et al [50] used DRL for adaptive stress testing of autonomous vehicles aimed at finding some problematic selfdriving scenarios which may lead to a collision with a moving pedestrian. Their work is similar to our previous work in [51] that finds the worst sequences of actions to maximize the resource utilization on the SUT.…”
Section: Related Workmentioning
confidence: 99%