2021
DOI: 10.1109/tpds.2020.3042599
|View full text |Cite
|
Sign up to set email alerts
|

Distributed and Collective Deep Reinforcement Learning for Computation Offloading: A Practical Perspective

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
35
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 87 publications
(39 citation statements)
references
References 28 publications
0
35
0
Order By: Relevance
“…To improve the convergence of the DQN algorithm in an edge computing environment, Xiong et al [30] proposed a DQN-based algorithm combined with multiple replay memories to minimize the execution time of one IoT application. Qiu et al [31] studied the distributed DRL in an edge computing environment with a single edge server to minimize the energy cost of running IoT applications, consisted of independent tasks. To obtain this goal, they combined deep neuro-evolution and policy gradient to improve the convergence results.…”
Section: Edge Computingmentioning
confidence: 99%
See 4 more Smart Citations
“…To improve the convergence of the DQN algorithm in an edge computing environment, Xiong et al [30] proposed a DQN-based algorithm combined with multiple replay memories to minimize the execution time of one IoT application. Qiu et al [31] studied the distributed DRL in an edge computing environment with a single edge server to minimize the energy cost of running IoT applications, consisted of independent tasks. To obtain this goal, they combined deep neuro-evolution and policy gradient to improve the convergence results.…”
Section: Edge Computingmentioning
confidence: 99%
“…Moreover, the weighted cost model can be changed to execution time or energy consumption model by assigning w 1 = 1, w 2 = 0 or w 1 = 0, w 2 = 1, respectively. Since the application placement problem in heterogeneous environments is an NP-hard problem [31], the problem's complexity grows exponentially as the number of heterogeneous servers and/or the number of tasks within an IoT application increases. Thus, the optimal policy of the application placement problem cannot be obtained in polynomial time by iterative approaches.…”
Section: Weighted Cost Modelmentioning
confidence: 99%
See 3 more Smart Citations