2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2018
DOI: 10.1109/iros.2018.8593420
|View full text |Cite
|
Sign up to set email alerts
|

Safe Reinforcement Learning on Autonomous Vehicles

Abstract: There have been numerous advances in reinforcement learning, but the typically unconstrained exploration of the learning process prevents the adoption of these methods in many safety critical applications. Recent work in safe reinforcement learning uses idealized models to achieve their guarantees, but these models do not easily accommodate the stochasticity or high-dimensionality of real world systems. We investigate how prediction provides a general and intuitive framework to constraint exploration, and show… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
41
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 56 publications
(53 citation statements)
references
References 25 publications
(25 reference statements)
0
41
0
Order By: Relevance
“…4) Safe (deep) reinforcement learning: Ultimately, research evolved in a novel direction on exploring safe (deep) RL techniques. The scholars in [27] investigated how to use prediction for safe learning. Addressing the challenges of existing RB and RL approaches, a modular decision making algorithm was proposed in [28].…”
Section: Related Workmentioning
confidence: 99%
“…4) Safe (deep) reinforcement learning: Ultimately, research evolved in a novel direction on exploring safe (deep) RL techniques. The scholars in [27] investigated how to use prediction for safe learning. Addressing the challenges of existing RB and RL approaches, a modular decision making algorithm was proposed in [28].…”
Section: Related Workmentioning
confidence: 99%
“…The proposed algorithm combines multimodal pedestrian trajectory forecasting and vehicle kinematic constraints to ensure smooth pedestrian-vehicle interactions, efficient operation, and safety. Deep Q-learning networks (DQNs) [26] are also proposed to learn policies that optimize intelligent vehicle intersection handling. They have two goals in mind.…”
Section: Reinforcement Learning (Rl) Modelmentioning
confidence: 99%
“…Several strategies have been proposed to make solving these problems more efficient. These techniques include making assumptions that restrict branching such as using klevel reasoning [19], assuming simplified models of other agents [20], and using temporarily extended actions to reduce the required depth of a search [21]. There are other works in autonomous driving literature which also use game theoretic strategies, and like our work, they adopt some of these approximations.…”
Section: Davidmentioning
confidence: 99%