2020
DOI: 10.1109/comst.2020.2988367
|View full text |Cite
|
Sign up to set email alerts
|

Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges

Abstract: The Internet of Things (IoT) extends the Internet connectivity into billions of IoT devices around the world, which collect and share information to reflect the status of physical world. The Autonomous Control System (ACS), on the other hand, performs control functions on the physical systems without external intervention over an extended period of time. The integration of IoT and ACS results in a new concept -autonomous IoT (AIoT). The sensors collect information on the system status, based on which intellige… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
98
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 214 publications
(119 citation statements)
references
References 165 publications
0
98
0
Order By: Relevance
“…Since DRL problems are mainly based on Markov Decision Process (MDP) framework or its variants (e.g., Partially observable MDP [30], Markov games [17]), we first introduce the background of MDP. Typically, an MDP is defined by a fivetuple (S, A, P, R, γ), where S and A denote the sets of state and action, respectively.…”
Section: A Mdpmentioning
confidence: 99%
See 2 more Smart Citations
“…Since DRL problems are mainly based on Markov Decision Process (MDP) framework or its variants (e.g., Partially observable MDP [30], Markov games [17]), we first introduce the background of MDP. Typically, an MDP is defined by a fivetuple (S, A, P, R, γ), where S and A denote the sets of state and action, respectively.…”
Section: A Mdpmentioning
confidence: 99%
“…In the above research efforts, the proposed DQN-based methods can not deal with DRL problems with continuous actions, e.g., the generation output of Diesel Generators (DG) [30]. To support continuous actions, DDPG-based methods could be adopted.…”
Section: Applications Of Drl In Building Microgridsmentioning
confidence: 99%
See 1 more Smart Citation
“…Reference [116] discussed applications and challenges of DRL in this context and revealed opportunities to use DRL in all three layers of Internet of Things: perception layer (control of the physical system or its components), network layer (control of communications resources) and application layer (control of computation resources). Future considerations should take into account the use of blockchain technology in this context [117].…”
Section: Perspectivesmentioning
confidence: 99%
“…Related works: RL is an online machine learning method which learns an optimal policy through the interactions between the agent (the edge node in our case) and the environment. A comprehensive survey of RL based methods for autonomous IoT networks is presented in [7]. In [8], [9], the authors used RL to find an optimal caching policy for non-transient data (e.g., multimedia files).…”
Section: Introductionmentioning
confidence: 99%