2019 IEEE Intelligent Vehicles Symposium (IV)
DOI: 10.1109/ivs.2019.8813803

Safe Reinforcement Learning with Scene Decomposition for Navigating Complex Urban Environments

Abstract: Navigating urban environments represents a complex task for automated vehicles. They must reach their goal safely and efficiently while considering a multitude of traffic participants. We propose a modular decision making algorithm to autonomously navigate intersections, addressing challenges of existing rule-based and reinforcement learning (RL) approaches. We first present a safe RL algorithm relying on a model-checker to ensure safety guarantees. To make the decision strategy robust to perception errors and…
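
As a rough illustration of the safe RL scheme sketched in the abstract, the following minimal Python sketch filters a learned policy's actions through a safety check before execution. The action set, the Q-values, and the is_action_safe stub are illustrative assumptions; the paper's actual model-checker and scene-decomposition machinery are not reproduced here.

def is_action_safe(state, action):
    # Stand-in for the model checker: a real implementation would verify
    # that taking `action` keeps the ego vehicle inside a provably safe set.
    # Here, a purely illustrative rule: only accelerate with a large gap.
    return action != "accelerate" or state["gap_to_crossing_traffic_m"] > 30.0

def select_safe_action(state, q_values,
                       actions=("hard_brake", "brake", "keep_speed", "accelerate")):
    # Pick the highest-value action among those certified safe;
    # fall back to the most conservative action if none pass the check.
    safe = [i for i, a in enumerate(actions) if is_action_safe(state, a)]
    if not safe:
        return "hard_brake"
    return actions[max(safe, key=lambda i: q_values[i])]

# Example: with a 12 m gap, accelerating is vetoed and "keep_speed" is chosen.
print(select_safe_action({"gap_to_crossing_traffic_m": 12.0}, [-1.0, 0.2, 0.8, 1.5]))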

Cited by 67 publications (43 citation statements)
References 16 publications
“…One method to alter the action selection process is to prioritize actions that are estimated to be safer [3]. However, this approach does not prove the nonexistence of unsafe behaviors.…”
Section: A Modification Of Exploration Process
Mentioning confidence: 99%
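
The excerpt above describes biasing exploration toward actions estimated to be safer. A minimal sketch of that idea, with a hypothetical safety_score heuristic that is not taken from the cited work, weights the exploratory action distribution by an estimated safety score; as the excerpt notes, this only shifts probability mass and does not rule out unsafe actions.

import math
import random

def safety_weighted_explore(actions, safety_score, temperature=1.0):
    # Sample an exploratory action with probability proportional to
    # exp(safety / temperature): safer-looking actions are tried more often,
    # but no action is excluded, so unsafe behavior is not provably absent.
    weights = [math.exp(safety_score(a) / temperature) for a in actions]
    return random.choices(actions, weights=weights, k=1)[0]

# Usage with an illustrative scoring function:
scores = {"brake": 1.0, "keep_speed": 0.5, "accelerate": -1.0}
action = safety_weighted_explore(list(scores), safety_score=lambda a: scores[a])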
“…Various approaches have been proposed to increase the safety of RL methods by modifying the optimality criterion [1], [2] or by verifying the exploration process with external guidance [3]–[10]. By modifying the optimality objective, agents behave more cautiously than those trained without a risk measure included in the objective; however, the absence of unsafe behaviors cannot be proven.…”
Section: Introduction
Mentioning confidence: 99%
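
The excerpt above also mentions modifying the optimality criterion. A common way to write this is to maximize the expected return minus a weighted risk cost, J = E[sum_t gamma^t r_t] - lambda * E[sum_t gamma^t c_t]. The sketch below evaluates such an objective for a single trajectory; the cost signal and the weight lambda are illustrative assumptions rather than details of the cited papers.

def risk_sensitive_return(rewards, costs, lam=10.0, gamma=0.99):
    # Discounted return minus a weighted, discounted risk cost for one episode.
    ret = sum((gamma ** t) * r for t, r in enumerate(rewards))
    risk = sum((gamma ** t) * c for t, c in enumerate(costs))
    return ret - lam * risk

# Example: a small reward each step, with one risky event (cost 1) at step 2.
print(risk_sensitive_return([1.0, 1.0, 1.0], [0.0, 0.0, 1.0]))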
“…A different strategy is followed by [35], where the authors defined a custom set of traffic rules based on the environment, the driver, and the road graph. With these rules, an RL driver learns to safely make lane-changing decisions, where the driver's decision making is combined with formal safety verification of the rules to ensure that only safe actions are taken by the driver. A similar approach is considered in [7], where the authors replaced the formal safety verification with a learnable safety belief module as part of the driver's policy.…”
Section: Related Work
Mentioning confidence: 99%
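
In the same spirit as the rule-based verification described above, a learned lane-change decision can be vetoed by a hand-written rule before execution. The time-gap rule and thresholds below are purely illustrative and are not the rule set defined in the cited work.

def safe_time_gap(gap_m, closing_speed_mps, min_gap_s=2.0):
    # Rule: allow a lane change only if the gap to the relevant vehicle in the
    # target lane corresponds to at least `min_gap_s` seconds at the closing speed.
    return gap_m / max(closing_speed_mps, 0.1) >= min_gap_s

def execute(policy_action, gap_m, closing_speed_mps):
    # The RL policy proposes a maneuver; the rule check can override it.
    if policy_action in ("change_left", "change_right") and not safe_time_gap(gap_m, closing_speed_mps):
        return "keep_lane"  # the rule vetoes the learned decision
    return policy_action

# Example: a 10 m gap closing at 8 m/s (1.25 s) blocks the lane change.
print(execute("change_left", gap_m=10.0, closing_speed_mps=8.0))  # -> "keep_lane"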
“…The use of simulations and synthetic data [5] for training has made it possible to assess neural networks' capabilities in many different realistic environments and at different degrees of complexity. Many driving simulators have been designed, from low-level ones that let drivers control their car's hand brake [6] to higher-level ones in which drivers control their car's acceleration and lane changes [7]. Some simulators model the traffic in an urban road network [8], while others model intersection access [9][10][11][12] or roundabout insertion [13].…”
Section: Introduction
Mentioning confidence: 99%
“…RL provides flexibility in the choice of the interaction model. RL has been applied to a variety of driving scenarios, such as lane changing [10] or intersection navigation [11], [12].…”
Section: Introduction
Mentioning confidence: 99%