2020
DOI: 10.1177/0361198120931848
|View full text |Cite
|
Sign up to set email alerts
|

Deep Reinforcement Learning Agent with Varying Actions Strategy for Solving the Eco-Approach and Departure Problem at Signalized Intersections

Abstract: Eco-approach and departure is a complex control problem wherein a driver’s actions are guided over a period of time or distance so as to optimize fuel consumption. Reinforcement learning (RL) is a machine learning paradigm that mimics human learning behavior, in which an agent attempts to solve a given control problem by interacting with the environment and developing an optimal policy. Unlike the methods implemented in previous studies for solving the eco-driving problem, RL does not require prior kn… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 20 publications
(8 citation statements)
references
References 20 publications
(34 reference statements)
0
6
0
Order By: Relevance
“…Shi et al (2018) used traditional (non-deep) Q-learning to develop an efficient driving strategy for approaching signalized intersections. Mousa et al (2020) used deep Q-learning with prioritized experience replay, target networks and double-learning to train an RL agent to approach and depart efficiently at signalized intersections for situations where no other vehicles are interfering. Wang et al (2022a, 2022b) focused on the CAV control problem in mixed traffic flow at signalized intersections with particular considerations of the oscillations induced by human drivers.…”
Section: Connected and Automated Vehicle Trajectory Planningmentioning
confidence: 99%
“…Shi et al (2018) used traditional (non-deep) Q-learning to develop an efficient driving strategy for approaching signalized intersections. Mousa et al (2020) used deep Q-learning with prioritized experience replay, target networks and double-learning to train an RL agent to approach and depart efficiently at signalized intersections for situations where no other vehicles are interfering. Wang et al (2022a, 2022b) focused on the CAV control problem in mixed traffic flow at signalized intersections with particular considerations of the oscillations induced by human drivers.…”
Section: Connected and Automated Vehicle Trajectory Planningmentioning
confidence: 99%
“…Controlling of CV is one representative in this field, as the vehicle can move longitudinally and laterally on the road. While the majority of the existing studies are confined to car following motion (Shi et al, 2018;Mousa et al, 2020;Zhou et al, 2020;Wegener et al, 2021), this paper develops a parameterized action space to naturally describe the control problem with hybrid actions and thereby implement joint optimization of car-following and lane-changing movement. Figure 3 presents an example of parameterized action space.…”
Section: Agent Frameworkmentioning
confidence: 99%
“…As one of the value-based reinforcement learning algorithms, the Q-learning approach cannot control the vehicle in continuous acceleration space, and thereby causes local optimum and uneven trajectory in most cases. The framework developed by Mousa et al (2020) provided insight into the DRL-based eco-driving system, which introduced deep Q network (DQN) to improve the fuel performance of the controlled CV. However, major disadvantage similar to the work of Shi et al (2018) was encountered in their study, namely, the losing efficacy in continuous action space.…”
Section: Introductionmentioning
confidence: 99%
“…It is usually accomplished by maintaining a smooth driving speed and avoiding sudden acceleration as much as possible. The eco-driving concept is conventionally studied for gasoline vehicles ( 37 ) with the intent to attain fuel-saving and emission-reducing goals. However, with the development and deployment of the prevailing electric vehicles (EVs), relative studies were also developed to achieve eco-driving for EVs ( 8, 9 ) and mixed gasoline and electric vehicles ( 10 ).…”
mentioning
confidence: 99%
“…Meanwhile, the action space cannot be very large as they also applied the value-based RL algorithm. Similarly, the method given by Mousa et al still faces the defects of discretized action ( 5 ). The deep deterministic policy gradient (DDPG) approach is widely used to address the above issue with stable performance ( 27 ).…”
mentioning
confidence: 99%