2008 IEEE International Conference on Networking, Sensing and Control
DOI: 10.1109/icnsc.2008.4525304

Q-Learning for Adaptive Traffic Signal Control Based on Delay Minimization Strategy

Abstract: The goal of this paper is to test the performance of Q-learning for adaptive traffic signal control. For the Q-learning algorithm, the state is the total delay of the intersection, and the action is the change in phase green time. The relationship between the phase green time change and the action space is discussed. The performance of Q-learning and a fixed-cycle signal setting for an isolated intersection is compared. The computational results show that Q-learning for traffic signal control can achieve lower delay for variable traff…
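The control setup described in the abstract (total intersection delay as the state, a change to the phase green time as the action) maps onto a compact tabular Q-learning loop. The sketch below is an illustrative reconstruction under assumed values: the delay bin width, the action set {-5, 0, +5} seconds, and the learning parameters are assumptions for illustration, not values taken from the paper.

```python
import random
from collections import defaultdict

# Assumed, illustrative values; not taken from the paper.
ACTIONS = [-5, 0, +5]            # change to phase green time per decision step (s)
ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.1

Q = defaultdict(float)           # tabular Q-values keyed by (state, action)

def discretize(total_delay, bin_width=30.0):
    """Map the continuous total intersection delay (veh*s) to a discrete state."""
    return int(total_delay // bin_width)

def choose_action(state):
    """Epsilon-greedy selection over the green-time changes."""
    if random.random() < EPSILON:
        return random.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: Q[(state, a)])

def update(state, action, reward, next_state):
    """Standard one-step Q-learning update."""
    best_next = max(Q[(next_state, a)] for a in ACTIONS)
    Q[(state, action)] += ALPHA * (reward + GAMMA * best_next - Q[(state, action)])
```

In a simulation loop the agent would call choose_action on the discretized delay, apply the green-time change, observe the resulting total delay, and pass a reward consistent with the delay-minimization objective (e.g., the negative of the observed delay) to update.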

Cited by 31 publications (22 citation statements)
References 5 publications
“…Moreover, most of these studies have considered a simplified simulation environment (Arel et al., 2010; Camponogara & Kraus, 2003; De Oliveira et al., 2006; Richter et al., 2007), and/or assumed hypothetical traffic flows (Arel et al., 2010; Camponogara & Kraus, 2003; De Oliveira et al., 2006; Richter et al., 2007; Shoufeng et al., 2008; Thorpe, 1997), which do not necessarily mimic reality. This article investigates the effect of the following design parameters to bridge this gap in the literature: (1) learning method (Q-Learning vs. SARSA vs. TD(λ)), (2) traffic state representation (queue length vs. queues and arrivals vs. delay), (3) action selection method (ε-greedy vs. softmax vs. ε-softmax), (4) traffic signal phasing scheme (variable vs. fixed), (5) reward definition (delay vs. cumulative delay vs. balancing queues), and (6) variability of flow arrivals to the intersection (uniform vs. variable arrival rates).…”
Section: RL-based Adaptive Traffic Signal Control: The State of the Art
confidence: 98%
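For reference, the three action-selection rules named in item (3) of the quoted passage can be sketched as follows. The function names and the epsilon/temperature values are illustrative assumptions, not details of the cited article.

```python
import math
import random

def epsilon_greedy(q_values, epsilon=0.1):
    """With probability epsilon explore uniformly; otherwise pick the greedy action."""
    if random.random() < epsilon:
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])

def softmax(q_values, temperature=1.0):
    """Sample an action with probability proportional to exp(Q / temperature)."""
    weights = [math.exp(q / temperature) for q in q_values]
    return random.choices(range(len(q_values)), weights=weights)[0]

def epsilon_softmax(q_values, epsilon=0.1, temperature=1.0):
    """Act greedily with probability 1 - epsilon; otherwise sample via softmax."""
    if random.random() < epsilon:
        return softmax(q_values, temperature)
    return max(range(len(q_values)), key=lambda a: q_values[a])
```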
“…This section focuses on the studies that considered RL for adaptive traffic signal control (Abdulhai, Pringle, & Karakoulas, 2003; Arel, Liu, Urbanik, & Kohls, 2010; Balaji, German, & Srinivasan, 2010; Camponogara & Kraus, 2003; De Oliveira et al., 2006; Richter, Aberdeen, & Yu, 2007; Salkham, Cunningham, Garg, & Cahill, 2008; Shoufeng, Ximin, & Shiqiang, 2008; Thorpe, 1997; Wiering, 2000). Table 1 summarizes and contrasts these studies.…”
Section: RL-based Adaptive Traffic Signal Control: The State of the Art
confidence: 98%
“…Various algorithms have been developed for traffic management in the literature [7][8][9][10][11][12][13][14][15][16][17].…”
Section: Problem Definition and Motivation
confidence: 99%
“…When the state space of the Markov Decision Process is very large or continuous, the computation and memory load become very large and the problem can no longer be solved this way. On the other hand, in the traditional Q-learning algorithm, the Q-value is updated in the form of a table record; the efficiency of this kind of learning is relatively low, which directly influences the performance of the controller [2].…”
Section: Introduction
confidence: 99%
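The "table record" the quoted passage refers to is the standard one-step tabular Q-learning update, with one stored value per (state, action) pair:

```latex
Q(s_t, a_t) \leftarrow Q(s_t, a_t)
  + \alpha \left[ r_{t+1} + \gamma \max_{a'} Q(s_{t+1}, a') - Q(s_t, a_t) \right]
```

Because every visited (state, action) pair needs its own entry and its own sequence of updates, both memory use and learning time grow with the size of the state space, which is the scalability concern raised above.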
“…Sutton proposed a learning algorithm for non-deterministic Markov decision processes [1]. Lu Shoufeng applied tabular Q-learning to dynamically control the traffic signals at an isolated intersection [2]. Wei Wu also developed a coordinated urban traffic signal control approach based on multi-agent reinforcement learning [3].…”
Section: Introduction
confidence: 99%