2019
DOI: 10.1109/access.2019.2920662
|View full text |Cite
|
Sign up to set email alerts
|

High-Reliability Multi-Agent Q-Learning-Based Scheduling for D2D Microgrid Communications

Abstract: This paper proposes a multi-agent Q-learning-based resource allocation algorithm that allows long-term evolution (LTE)-enabled device-to-device (D2D) communication agents to generate the orthogonal transmission schedules outside the network coverage. This algorithm reduces packet drop rates (PDR) in distributed D2D communication networks to meet the quality-of-service requirements of the microgrid communications. The data traffic characteristics of three archetypal smart grid applications, namely demand respon… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
8
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(8 citation statements)
references
References 22 publications
0
8
0
Order By: Relevance
“…From the perspective of grid tasks, the optimization model introduces the time constraints in the traditional grid task scheduling algorithm into the Ad Hoc grid energy optimization. By reducing the task completion time MET, to achieve The purpose of energy optimization [5][6]. Therefore, the scheduling model is based on the traditional energy optimization model by adding the factor of time cost, and comprehensively considering the time factor and energy factor to realize the energy optimization of scheduling [6][7].…”
Section: Energy Optimization Modelmentioning
confidence: 99%
“…From the perspective of grid tasks, the optimization model introduces the time constraints in the traditional grid task scheduling algorithm into the Ad Hoc grid energy optimization. By reducing the task completion time MET, to achieve The purpose of energy optimization [5][6]. Therefore, the scheduling model is based on the traditional energy optimization model by adding the factor of time cost, and comprehensively considering the time factor and energy factor to realize the energy optimization of scheduling [6][7].…”
Section: Energy Optimization Modelmentioning
confidence: 99%
“…5, each hidden layer is composed of a fully connected (FC) sublayer and a rectified linear unit (ReLU) sublayer in series. The ReLU sublayer improves calculation speed and prediction accuracy by retaining the positive values and eliminating the (15)…”
Section: Multipath Dnn Trainingmentioning
confidence: 99%
“…Most of the current research on RL methods in wireless resource allocation is based on the action-value function to approximate the optimal action selection [14][15][16]. In [14], the authors present a software-defined satellite-terrestrial network framework, which jointly considered the networking, caching, and computing resources in satelliteterrestrial networks.…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…It has been deeply studied because it has the advantages of simple implementation and strong convergence. For example, in [16] the authors proposed the multi-agent Q-learning algorithm to reduce packet drop rates in D2D communication. Huang et al [17] proposed a distributed Q-learning algorithm in heterogeneous networks to minimise the total transmission power.…”
Section: Introductionmentioning
confidence: 99%