2019 IEEE Intelligent Vehicles Symposium (IV) 2019
DOI: 10.1109/ivs.2019.8813791
|View full text |Cite
|
Sign up to set email alerts
|

Addressing Inherent Uncertainty: Risk-Sensitive Behavior Generation for Automated Driving using Distributional Reinforcement Learning

Abstract: For highly automated driving above SAE level 3, behavior generation algorithms must reliably consider the inherent uncertainties of the traffic environment, e.g. arising from the variety of human driving styles. Such uncertainties can generate ambiguous decisions, requiring the algorithm to appropriately balance low-probability hazardous events, e.g. collisions, and high-probability beneficial events, e.g. quickly crossing the intersection. State-of-the-art behavior generation algorithms lack a distributional … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
2

Relationship

2
7

Authors

Journals

citations
Cited by 27 publications
(16 citation statements)
references
References 22 publications
(42 reference statements)
0
16
0
Order By: Relevance
“…The CVaR approach does not require a backup policy and is therefore suitable for environments where such a policy is hard to define, e.g., the Atari-57 benchmark [26]. Bernhard et al first trained a risk-neutral IQN agent, then lowered the CVaR threshold after the training process had been completed, and showed a reduction of collisions in an intersection driving scenario [14]. However, such a procedure does not provide the correct estimate of the return distribution Z τ (s, a) for a risk-averse setting (α < 1) and could lead to arbitrary decisions.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…The CVaR approach does not require a backup policy and is therefore suitable for environments where such a policy is hard to define, e.g., the Atari-57 benchmark [26]. Bernhard et al first trained a risk-neutral IQN agent, then lowered the CVaR threshold after the training process had been completed, and showed a reduction of collisions in an intersection driving scenario [14]. However, such a procedure does not provide the correct estimate of the return distribution Z τ (s, a) for a risk-averse setting (α < 1) and could lead to arbitrary decisions.…”
Section: Discussionmentioning
confidence: 99%
“…However, previous studies do not estimate the aleatoric or the epistemic uncertainty of the decision that the trained agent recommends. One exception is the study by Bernhard et al, where a distributional RL approach is used to create a risk-sensitive decision-making agent [14]. However, the method is not applied in a theoretically consistent way and can therefore cause arbitrary decisions, which is further discussed in Sect.…”
Section: Introductionmentioning
confidence: 99%
“…Existing probabilistic risk definitions often consider collision as harmful events in riskbased planning approaches [9,[17][18][19] using the state probability, the probability of spatial overlap at discrete times [20]. Value-based probabilistic risk definitions arose in finance and are applied to robotics and autonomous driving [21][22][23]. Defining risk based on collisions is infeasible considering the vast amount of samples required to approximate human collision probabilities P col ≈10 −7 .…”
Section: A Risk-metrics and Levelsmentioning
confidence: 99%
“…Robustness against inaccuracies in prediction should be further evaluated, e.g. by advancing risk-aware planning [26].…”
Section: A Behavior Modelingmentioning
confidence: 99%