2017
DOI: 10.48550/arxiv.1705.01196
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Navigating Occluded Intersections with Autonomous Vehicles using Deep Reinforcement Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 13 publications
(18 citation statements)
references
References 0 publications
0
18
0
Order By: Relevance
“…We evaluated CM3 on the problem of learning cooperative policies for negotiating lane changes among multiple autonomous vehicles in the Simulation of Urban Mobility (SUMO) traffic simulator [11]. While previous work have applied reinforcement learning to autonomous driving tasks in simulation [9,12,20], they modeled the problem as a single-agent MDP, in which other vehicles behave according to hand-designed policies without the capacity for strategic response to the learning agent. However, driving in real-world traffic must involve deliberate cooperation 3 among interacting vehicles who have different individual intentions (e.g.…”
Section: Methodsmentioning
confidence: 99%
“…We evaluated CM3 on the problem of learning cooperative policies for negotiating lane changes among multiple autonomous vehicles in the Simulation of Urban Mobility (SUMO) traffic simulator [11]. While previous work have applied reinforcement learning to autonomous driving tasks in simulation [9,12,20], they modeled the problem as a single-agent MDP, in which other vehicles behave according to hand-designed policies without the capacity for strategic response to the learning agent. However, driving in real-world traffic must involve deliberate cooperation 3 among interacting vehicles who have different individual intentions (e.g.…”
Section: Methodsmentioning
confidence: 99%
“…Pac-man and Enduro in Fig. 15) and real-world motion planning tasks [31] [51]. DQN utilizes CNN to approximate Q values (Fig.…”
Section: Nature Deep Q-learning Networkmentioning
confidence: 99%
“…Road users simulation is essential to the development of maneuver decision making modules for automated vehicles. In [12] a system able to enter in an intersection is trained while other vehicles followed a deterministic model called Intelligent Driver Model (IDM, [27]). In [16] a lane change maneuver module is learned using DRL in a scenario where other vehicles follow a simple lane keeping behavior with collision avoidance, while in [14] they are also able to overtake relying on hard-coded rules.…”
Section: Related Workmentioning
confidence: 99%
“…Typical solutions ( [28], [4]) for handling those particular maneuvers consist on rule-based methods which use some notion of the time-to-collision ( [29]), so that they will be executed only if there is enough time in the worst case scenario. These solutions lead to excessively cautious behaviors due to the lack of interpretation of the situation, and suggested the use of machine learning approaches, such as Partially Observable Markov Decision Processes ( [17]) or Deep Learning techniques ( [12]), in order to infer intentions of other drivers. However, training machine learning algorithms of this kind typically requires simulated environments, and so the behavioral simulation of other drivers plays an important role.…”
Section: Introductionmentioning
confidence: 99%