2022
DOI: 10.1016/j.trc.2021.103452
|View full text |Cite
|
Sign up to set email alerts
|

Decision making of autonomous vehicles in lane change scenarios: Deep reinforcement learning approaches with risk awareness

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
44
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 134 publications
(58 citation statements)
references
References 51 publications
0
44
0
Order By: Relevance
“…One popular paradigm is the lateral decision making schemes with the deep Qnetwork (DQN) or its variants. A lane change decision-making framework for autonomous vehicles is developed to learn risk sensitive driving policies using risk-awareness prioritized replay DQN in [12]. A lane change decision making method is presented for autonomous vehicles through DQN with safety verification in [19].…”
Section: A Reinforcement Learning Based Lateral Decision Making For A...mentioning
confidence: 99%
See 1 more Smart Citation
“…One popular paradigm is the lateral decision making schemes with the deep Qnetwork (DQN) or its variants. A lane change decision-making framework for autonomous vehicles is developed to learn risk sensitive driving policies using risk-awareness prioritized replay DQN in [12]. A lane change decision making method is presented for autonomous vehicles through DQN with safety verification in [19].…”
Section: A Reinforcement Learning Based Lateral Decision Making For A...mentioning
confidence: 99%
“…While existing RL based decision making methods of autonomous vehicles have achieved many compelling results [10], [11], [12], [13], the real-world driving tasks involve unavoidable measurement errors or sensor noises which may mislead an autonomous vehicle into making suboptimal decisions, even cause catastrophic failures. In light of these risks, autonomous vehicles are required to ensure that their decision making systems can handle the natural observation uncertainties from sensing and perception system, especially adversarial perturbations.…”
Section: Introductionmentioning
confidence: 99%
“…The corresponding reward and penalty functions were designed to curb speeding, encourage high-speed driving, and punish low-speed blocking for these three-speed ranges. In order to make CAVs faster and more stable to explore the optimal speed, we used the exponential function to design a soft reward function [28].…”
Section: Parameters Incentive Punishedmentioning
confidence: 99%
“…Through RL, the network parameters of DL are tuned to continuously optimize their own behavior strategies; its framework is shown in Figure 2. This method has become a new research hotspot in the field of artificial intelligence and has been applied in fields such as robot control [30][31][32], autonomous driving [33], and machine vision [34][35][36][37][38][39][40][41]. In this paper, the vehicle autonomous driving decision problem is modeled with a partially observable Markov decision process (POMDP) [42], and the autonomous driving strategy optimization problem is solved by identifying the optimal driving strategy of the POMDP.…”
Section: Modeling Of the Automatic Driving Strategy Optimization Problemmentioning
confidence: 99%