Deep Reinforcement Learning Agent with Varying Actions Strategy for Solving the Eco-Approach and Departure Problem at Signalized Intersections

Mousa, Saleh; Ishak, Sherif; Mousa, Ragab M.; Codjoe, Julius; Elhenawy, Mohammed

doi:10.1177/0361198120931848

Cited by 20 publications

(8 citation statements)

References 20 publications

(34 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Shi et al (2018) used traditional (non-deep) Q-learning to develop an efficient driving strategy for approaching signalized intersections. Mousa et al (2020) used deep Q-learning with prioritized experience replay, target networks and double-learning to train an RL agent to approach and depart efficiently at signalized intersections for situations where no other vehicles are interfering. Wang et al (2022a, 2022b) focused on the CAV control problem in mixed traffic flow at signalized intersections with particular considerations of the oscillations induced by human drivers.…”

Section: Connected and Automated Vehicle Trajectory Planningmentioning

confidence: 99%

Intersection control with connected and automated vehicles: a review

2022

JICV

View full text Add to dashboard Cite

Purpose This paper aims to review the studies on intersection control with connected and automated vehicles (CAVs). Design/methodology/approach The most seminal and recent research in this area is reviewed. This study specifically focuses on two categories: CAV trajectory planning and joint intersection and CAV control. Findings It is found that there is a lack of widely recognized benchmarks in this area, which hinders the validation and demonstration of new studies. Originality/value In this review, the authors focus on the methodological approaches taken to empower intersection control with CAVs. The authors hope the present review could shed light on the state-of-the-art methods, research gaps and future research directions.

show abstract

Section: Connected and Automated Vehicle Trajectory Planningmentioning

confidence: 99%

Intersection control with connected and automated vehicles: a review

2022

JICV

View full text Add to dashboard Cite

show abstract

“…Controlling of CV is one representative in this field, as the vehicle can move longitudinally and laterally on the road. While the majority of the existing studies are confined to car following motion (Shi et al, 2018;Mousa et al, 2020;Zhou et al, 2020;Wegener et al, 2021), this paper develops a parameterized action space to naturally describe the control problem with hybrid actions and thereby implement joint optimization of car-following and lane-changing movement. Figure 3 presents an example of parameterized action space.…”

Section: Agent Frameworkmentioning

confidence: 99%

“…As one of the value-based reinforcement learning algorithms, the Q-learning approach cannot control the vehicle in continuous acceleration space, and thereby causes local optimum and uneven trajectory in most cases. The framework developed by Mousa et al (2020) provided insight into the DRL-based eco-driving system, which introduced deep Q network (DQN) to improve the fuel performance of the controlled CV. However, major disadvantage similar to the work of Shi et al (2018) was encountered in their study, namely, the losing efficacy in continuous action space.…”

Section: Introductionmentioning

confidence: 99%

Eco-driving for Electric Connected Vehicles at Signalized Intersections: A Parameterized Reinforcement Learning approach

Lü¹,

Zhang²,

Li³

2022

Preprint

View full text Add to dashboard Cite

This paper proposes an eco-driving framework for electric connected vehicles (CVs) based on reinforcement learning (RL) to improve vehicle energy efficiency at signalized intersections. The vehicle agent is specified by integrating the model-based car-following policy, lane-changing policy, and the RL policy, to ensure safe operation of a CV. Subsequently, a Markov Decision Process (MDP) is formulated, which enables the vehicle to perform longitudinal control and lateral decisions, jointly optimizing the car-following and lane-changing behaviors of the CVs in the vicinity of intersections. Then, the hybrid action space is parameterized as a hierarchical structure and thereby trains the agents with two-dimensional motion patterns in a dynamic traffic environment. Finally, our proposed methods are evaluated in SUMO software from both a single-vehicle-based perspective and a flow-based perspective. The results show that our strategy can significantly reduce energy consumption by learning proper action schemes without any interruption of other human-driven vehicles (HDVs).

show abstract

“…It is usually accomplished by maintaining a smooth driving speed and avoiding sudden acceleration as much as possible. The eco-driving concept is conventionally studied for gasoline vehicles ( 3–7 ) with the intent to attain fuel-saving and emission-reducing goals. However, with the development and deployment of the prevailing electric vehicles (EVs), relative studies were also developed to achieve eco-driving for EVs ( 8, 9 ) and mixed gasoline and electric vehicles ( 10 ).…”

mentioning

confidence: 99%

“…Meanwhile, the action space cannot be very large as they also applied the value-based RL algorithm. Similarly, the method given by Mousa et al still faces the defects of discretized action ( 5 ). The deep deterministic policy gradient (DDPG) approach is widely used to address the above issue with stable performance ( 27 ).…”

mentioning

confidence: 99%

Navigating Electric Vehicles Along a Signalized Corridor via Reinforcement Learning: Toward Adaptive Eco-Driving Control

Zhang

Jiang

Cui

et al. 2022

Transportation Research Record

View full text Add to dashboard Cite

One problem associated with the operation of electric vehicles (EVs) is the limited battery, which cannot guarantee their endurance. The increasing electricity consumption will also impose a burden on economy and ecology of the vehicles. To achieve energy saving, this paper proposes an adaptive eco-driving method in the environment of signalized corridors. The framework with adaptive and real-time control is implemented by the reinforcement learning technique. First, the operation of EVs in the proximity of intersections is defined as a Markov Decision Process (MDP) to apply the twin delayed deep deterministic policy gradient (TD3) algorithm, to deal with the decision process with continuous action space. Therefore, the speed of the vehicle can be adjusted continuously. Second, safety, traffic mobility, energy consumption, and comfort are all considered by designing a comprehensive reward function for the MDP. Third, the simulation study takes Aoti Street in Nanjing City with several consecutive signalized intersections as the research road network, and the state representation in MDP considers the information from consecutive downstream traffic signals. After the parameter tuning procedure, simulations are carried out for three typical eco-driving scenarios, including free flow, car following, and congestion flow. By comparing with default car-following behavior in the simulation platform SUMO and several state-of-the-art deep reinforcement learning algorithms, the proposed strategy shows a balanced and stable performance.

show abstract

Deep Reinforcement Learning Agent with Varying Actions Strategy for Solving the Eco-Approach and Departure Problem at Signalized Intersections

Cited by 20 publications

References 20 publications

Intersection control with connected and automated vehicles: a review

Intersection control with connected and automated vehicles: a review

Eco-driving for Electric Connected Vehicles at Signalized Intersections: A Parameterized Reinforcement Learning approach

Navigating Electric Vehicles Along a Signalized Corridor via Reinforcement Learning: Toward Adaptive Eco-Driving Control

Contact Info

Product

Resources

About