2021
DOI: 10.1109/twc.2020.3029143
|View full text |Cite
|
Sign up to set email alerts
|

Deep Reinforcement Learning for Delay-Oriented IoT Task Scheduling in SAGIN

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
50
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 173 publications
(50 citation statements)
references
References 37 publications
0
50
0
Order By: Relevance
“…Yang et al dealt with resource allocation and task offloading problems to reduce the overall power consumption in their networks [7]. In another study, Zhou et al included satellite computation/communication as an alternative for UAVs [8]. They modelled their problem as a constrained Markov decision process (MDP) and provided a deep reinforcement learning solution.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Yang et al dealt with resource allocation and task offloading problems to reduce the overall power consumption in their networks [7]. In another study, Zhou et al included satellite computation/communication as an alternative for UAVs [8]. They modelled their problem as a constrained Markov decision process (MDP) and provided a deep reinforcement learning solution.…”
Section: Related Workmentioning
confidence: 99%
“…4) ILP Solver: In addition to the heuristics we explained above, we use GUROBI Solver [11] to find the optimum solution for our multi-objective maximization problem (Eqs. [1][2][3][4][5][6][7][8][9][10][11][12][13][14][15]. Despite the NP-hard property of our problem, that method could be used for only small solution space problems.…”
Section: A Q-learning Approachmentioning
confidence: 99%
“…(1) DRL-based Computation Offloading in UAV-Assisted MEC Networks: Zhou et al [10] formulated the computing task scheduling problem as a constrained Markov decision process (CMDP) and solved it by proposing a novel risk-sensitive DRL method, where the UAV's energy consumption violation is defined as the risk metric. Liu et al [11] modified the vanilla Q-Learning algorithm to maximize the profit of the UAV under the constant cruising path.…”
Section: Related Work and Challengesmentioning
confidence: 99%
“…LEO satellite network provides latitude of 160 km to 2000 km. To minimize end-to-end propagation delay, for better communication, wide coverage area and assured quality of services; many LEO satellite based commercial networks such as 'SpaceX', 'OneWeb', 'LeoSat' and 'Iridium' have come into existence [9,10]. LEO satellite based IoT services possess the potential of long-distance communication in remote areas.…”
Section: Introductionmentioning
confidence: 99%