2020
DOI: 10.1109/jiot.2020.2988457

Stacked Autoencoder-Based Deep Reinforcement Learning for Online Resource Scheduling in Large-Scale MEC Networks

Abstract: An online resource scheduling framework is proposed for minimizing the sum of weighted task latency for all mobile users by optimizing offloading decisions, transmission power, and resource allocation in the mobile edge computing (MEC) system. Towards this end, a deep reinforcement learning (DRL) method is proposed to obtain an online resource scheduling policy. Firstly, a related and regularized stacked autoencoder (2r-SAE) with unsupervised learning is proposed to perform data compression and representa…

Citations: cited by 44 publications (36 citation statements)
References: 37 publications (40 reference statements)
“…Table III characterizes the objective function values of the DIRS framework under different distributions of IoTDs. To evaluate the influence of the different position distributions of IoTDs, we define the normalized reward rate (NRR), which is the inferred reward divided by the optimal reward [44]. In NRR, the inferred reward in the numerator is calculated from the output of the DIRS framework, and the optimal reward in the denominator is obtained from the particle swarm optimization (PSO) algorithm, which is commonly applied to solve large-scale MINLP problems with high solution quality but low efficiency [44].…”
Section: B. Performance Evaluation for Different Modules of DIRS (mentioning)
confidence: 99%
“…To evaluate the influence of the different position distributions of IoTDs, we define the normalized reward rate (NRR), which is the inferred reward divided by the optimal reward [44]. In NRR, the inferred reward in the numerator is calculated from the output of the DIRS framework, and the optimal reward in the denominator is obtained from the particle swarm optimization (PSO) algorithm, which is commonly applied to solve large-scale MINLP problems with high solution quality but low efficiency [44]. From Table III, one can see that IoTDs with non-uniform distributions (e.g., Gaussian and Lévy distributions) obtain lower objective function values.…”
Section: B. Performance Evaluation for Different Modules of DIRS (mentioning)
confidence: 99%
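
Purely as an illustration of the NRR metric quoted above, here is a minimal Python sketch assuming hypothetical reward values; the PSO-derived optimal reward is taken as a given number rather than actually running PSO.

```python
# Minimal sketch (not from the cited paper): the normalized reward rate (NRR)
# is the inferred reward divided by the PSO-derived optimal reward.
def normalized_reward_rate(inferred_reward: float, optimal_reward: float) -> float:
    """NRR = inferred reward / optimal reward (values here are hypothetical)."""
    if optimal_reward == 0:
        raise ValueError("optimal reward must be non-zero")
    return inferred_reward / optimal_reward

# Hypothetical example: DIRS output vs. a PSO baseline.
print(normalized_reward_rate(inferred_reward=8.7, optimal_reward=9.2))  # ~0.946
```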
“…An auto-encoder algorithm may be applied. For example, H-MEC could benefit from a Stacked Auto-Encoder (SAE) pre-training scheme [12] for automatic feature learning. In particular, SAE can be applied to train an attack detection model on a mix of unlabelled normal/attack samples so that the model identifies patterns of attack and normal data through an auto-encoder scheme; this can in turn improve the accuracy of the attack detection model on unseen and mutated attacks.…”
Section: A. AI-Based Solutions (mentioning)
confidence: 99%
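
As a rough illustration of the SAE pre-training idea in that excerpt, the following is a minimal sketch assuming PyTorch is available; the layer sizes, training loop, and random data are hypothetical, and this is not the 2r-SAE of the cited paper.

```python
# Minimal sketch (assumes PyTorch): greedy layer-wise pre-training of a
# two-layer stacked autoencoder on unlabelled samples. All dimensions and
# data below are hypothetical placeholders.
import torch
import torch.nn as nn

def pretrain_layer(data: torch.Tensor, in_dim: int, hidden_dim: int, epochs: int = 50):
    """Train one autoencoder layer to reconstruct its input; return the encoder."""
    encoder = nn.Sequential(nn.Linear(in_dim, hidden_dim), nn.ReLU())
    decoder = nn.Linear(hidden_dim, in_dim)
    optimizer = torch.optim.Adam(
        list(encoder.parameters()) + list(decoder.parameters()), lr=1e-3)
    loss_fn = nn.MSELoss()
    for _ in range(epochs):
        optimizer.zero_grad()
        loss = loss_fn(decoder(encoder(data)), data)  # reconstruction loss
        loss.backward()
        optimizer.step()
    return encoder

# Hypothetical unlabelled traffic features: 1000 samples, 64 features each.
x = torch.randn(1000, 64)
enc1 = pretrain_layer(x, in_dim=64, hidden_dim=32)                   # first layer
enc2 = pretrain_layer(enc1(x).detach(), in_dim=32, hidden_dim=16)    # stacked second layer
compressed = enc2(enc1(x))  # compressed features for a downstream detection model
```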
“…Simulated annealing (SA) is a probabilistic heuristic search technique based on the annealing process in metallurgy. Thanks to its fast convergence, few parameters, and simplicity, SA has been widely adopted for decision making and optimization in recent years [35]. In this paper, we propose an SA-based gateway selection method, SAGA, to solve P2.1 and obtain a near-optimal gateway selection decision; its details are listed in Algorithm 2.…”
(mentioning)
confidence: 99%
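
To illustrate the general simulated annealing procedure referenced above (not the SAGA algorithm itself, whose details are in Algorithm 2 of the citing paper), here is a minimal Python sketch with a hypothetical gateway-selection cost and neighbour move.

```python
# Minimal sketch (not SAGA from the cited paper): generic simulated annealing
# over a discrete gateway-selection vector. Cost function, neighbour move, and
# cooling schedule are hypothetical placeholders.
import math
import random

def simulated_annealing(cost, initial, neighbour, t0=1.0, t_min=1e-3, alpha=0.95, iters=100):
    """Accept worse candidates with probability exp(-delta / T); cool geometrically."""
    current, best = initial, initial
    t = t0
    while t > t_min:
        for _ in range(iters):
            candidate = neighbour(current)
            delta = cost(candidate) - cost(current)
            if delta < 0 or random.random() < math.exp(-delta / t):
                current = candidate
                if cost(current) < cost(best):
                    best = current
        t *= alpha  # geometric cooling schedule
    return best

# Hypothetical example: assign 10 users to one of 3 gateways, minimizing load imbalance.
def cost(assign):       # variance of per-gateway load (toy objective)
    loads = [assign.count(g) for g in range(3)]
    mean = sum(loads) / 3
    return sum((l - mean) ** 2 for l in loads)

def neighbour(assign):  # reassign one random user to a random gateway
    a = list(assign)
    a[random.randrange(len(a))] = random.randrange(3)
    return tuple(a)

print(simulated_annealing(cost, tuple(random.randrange(3) for _ in range(10)), neighbour))
```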