Proceedings of the 2021 5th International Conference on Intelligent Systems, Metaheuristics &Amp; Swarm Intelligence 2021
DOI: 10.1145/3461598.3461608
|View full text |Cite
|
Sign up to set email alerts
|

Management of Traffic Signals using Deep Reinforcement Learning in Bidirectional Recurrent Neural Network in ITS

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
3

Relationship

2
4

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 5 publications
0
6
0
Order By: Relevance
“…Typically, the agent is trained for numerous episodes to predict the expected cumulative discounted future reward ( Gt) when certain actions are applied to certain states. The primary goal is to identify a policy function πθfalse(atfalse|stfalse),atbold-italicA,stbold-italicS, which maximizes Gt as (2) [9]. Gt=i=tTbold-italicγitRi, where T represents the number of iterations in an episode.…”
Section: Present Workmentioning
confidence: 99%
See 4 more Smart Citations
“…Typically, the agent is trained for numerous episodes to predict the expected cumulative discounted future reward ( Gt) when certain actions are applied to certain states. The primary goal is to identify a policy function πθfalse(atfalse|stfalse),atbold-italicA,stbold-italicS, which maximizes Gt as (2) [9]. Gt=i=tTbold-italicγitRi, where T represents the number of iterations in an episode.…”
Section: Present Workmentioning
confidence: 99%
“…The bidirectional layers in long short‐term memorysmemories (LSTMs) and gated recurrent units (GRUs) allow networks to include both backward and forward state information at each time step, ensuring that they are adequately equipped to handle enormous data [9].…”
Section: Present Workmentioning
confidence: 99%
See 3 more Smart Citations