ICC 2019 - 2019 IEEE International Conference on Communications (ICC) 2019
DOI: 10.1109/icc.2019.8761949
|View full text |Cite
|
Sign up to set email alerts
|

Decentralized Deep Reinforcement Learning for Delay-Power Tradeoff in Vehicular Communications

Abstract: This paper targets at the problem of radio resource management for expected long-term delay-power tradeoff in vehicular communications. At each decision epoch, the road side unit observes the global network state, allocates channels and schedules data packets for all vehicle user equipment-pairs (VUEpairs). The decision-making procedure is modelled as a discretetime Markov decision process (MDP). The technical challenges in solving an optimal control policy originate from highly spatial mobility of vehicles an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 20 publications
0
2
0
Order By: Relevance
“…It should be noted that optimizing AoI is totally different from queuing delay minimization as in our prior work [26]. The queuing delay can unnecessarily increase the age of an information update [27].…”
Section: Introductionmentioning
confidence: 93%
See 1 more Smart Citation
“…It should be noted that optimizing AoI is totally different from queuing delay minimization as in our prior work [26]. The queuing delay can unnecessarily increase the age of an information update [27].…”
Section: Introductionmentioning
confidence: 93%
“…The time horizon is discretized into scheduling slots, with each being of equal time duration τ and indexed by a positive integer j ∈ N + . For each VUE-pair, the vTx always follows at a fixed distance of ℓ to the vRx, which moves according to a Manhattan mobility model [15], [26]. We let y j k,(vT x) = (y j,1 k,(vT x) , y j,2 k,(vT x) ) and y j k,(vRx) = (y j,1 k,(vRx) , y j,2 k,(vRx) ) denote, respectively, the Euclidean coordinates of geographical More specifically, the channel state H j k = ψ · h(y j k,(vT x) , y j k,(vRx) ) ∈ H over the frequency bands experienced by a VUE-pair k ∈ K during each slot j includes a fast fading component 1 ψ and a path loss h(y j k,(vT x) , y j k,(vRx) ) that applies…”
Section: A Network and Channel Modelsmentioning
confidence: 99%