2019
DOI: 10.1049/iet-its.2019.0249
|View full text |Cite
|
Sign up to set email alerts
|

Deep reinforcement‐learning‐based driving policy for autonomous road vehicles

Abstract: In this work we consider the problem of path planning for an autonomous vehicle that moves on a freeway. The most common approaches that are used to address this problem are based on optimal control methods, which make assumptions about the model of the environment and the system dynamics. On the contrary, we propose the development of a driving policy based on reinforcement learning. In this way, the proposed driving policy makes minimal or no assumptions about the environment, since a priori knowledge about … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
3

Relationship

0
10

Authors

Journals

citations
Cited by 43 publications
(21 citation statements)
references
References 48 publications
0
21
0
Order By: Relevance
“…It is not easy to achieve, though. The major issue with using the RL method is the dependence on a reward function, which must be hand-crafted based on engineering experience and has to be applicable to all driving scenarios (125). RL methods might cause undesirable driving behaviors by directly transferring their driving policy learned in non-congestion states.…”
Section: Limitations In Learning Algorithmsmentioning
confidence: 99%
“…It is not easy to achieve, though. The major issue with using the RL method is the dependence on a reward function, which must be hand-crafted based on engineering experience and has to be applicable to all driving scenarios (125). RL methods might cause undesirable driving behaviors by directly transferring their driving policy learned in non-congestion states.…”
Section: Limitations In Learning Algorithmsmentioning
confidence: 99%
“…The algorithm in [12] adopted Recurrent Neural Networks for information integration, and learned an effective driving policy on simulators. The authors in [13] proposed a driving policy that makes little assumption about the environment. The work in [14] developed a realistic translation network to make sim2real possible.…”
Section: Reinforcement Learningmentioning
confidence: 99%
“…Authors in [120] studied car following and lane changing behaviours of autonomous vehicles using DDDP method on VISSIM. Another RL-based autonomous driving policy is described by Makantasis et al [121] using DDQN with prioritized experience replay in mixed autonomy scenarios. Proposed deep RL-based driving policy is compared with DP-based optimal policy in different traffic densities using SUMO.…”
Section: A Autonomous Drivingmentioning
confidence: 99%