Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges

Lei, Lei; Tan, Yue; Zheng, Kan; Liu, Shiwen; Zhang, Kuan; Shen, Xuemin

doi:10.1109/comst.2020.2988367

Cited by 214 publications

(119 citation statements)

References 165 publications

“…Since DRL problems are mainly based on Markov Decision Process (MDP) framework or its variants (e.g., Partially observable MDP [30], Markov games [17]), we first introduce the background of MDP. Typically, an MDP is defined by a five-tuple (S, A, P, R, γ), where S and A denote the sets of state and action, respectively.…”

Section: A. MDP (mentioning)

confidence: 99%
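
The five-tuple in the statement above can be sketched as a small data structure. This is only an illustrative sketch: the quoted text defines S and A, and the code assumes the standard reading of the remaining symbols (P as transition probabilities, R as a reward function, γ as a discount factor). The names `MDP`, `is_valid`, and `discounted_return` are hypothetical and do not come from the cited paper.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class MDP:
    """Container for the five-tuple (S, A, P, R, gamma); field names are assumptions."""
    states: List[str]                                     # S
    actions: List[str]                                    # A
    transition: Dict[Tuple[str, str], Dict[str, float]]   # P(s' | s, a)
    reward: Callable[[str, str], float]                   # R(s, a)
    gamma: float                                          # discount factor in [0, 1]


def is_valid(mdp: MDP, tol: float = 1e-9) -> bool:
    """Check that every (s, a) pair has a next-state distribution summing to 1."""
    for s in mdp.states:
        for a in mdp.actions:
            dist = mdp.transition.get((s, a))
            if dist is None or abs(sum(dist.values()) - 1.0) > tol:
                return False
    return 0.0 <= mdp.gamma <= 1.0


def discounted_return(rewards: List[float], gamma: float) -> float:
    """Sum of gamma**t * r_t over a finite reward sequence, computed backwards."""
    total = 0.0
    for r in reversed(rewards):
        total = r + gamma * total
    return total


# Toy two-state example with made-up numbers, purely for illustration.
toy = MDP(
    states=["low", "high"],
    actions=["wait", "charge"],
    transition={
        ("low", "wait"): {"low": 1.0},
        ("low", "charge"): {"low": 0.2, "high": 0.8},
        ("high", "wait"): {"low": 0.5, "high": 0.5},
        ("high", "charge"): {"high": 1.0},
    },
    reward=lambda s, a: 1.0 if s == "high" else 0.0,
    gamma=0.5,
)
```

With a discount of 0.5, three consecutive rewards of 1.0 give a return of 1 + 0.5 + 0.25 = 1.75.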

“…In the above research efforts, the proposed DQN-based methods cannot deal with DRL problems with continuous actions, e.g., the generation output of Diesel Generators (DG) [30]. To support continuous actions, DDPG-based methods could be adopted.…”

Section: Applications of DRL in Building Microgrids (mentioning)

confidence: 99%

“…To support continuous actions, DDPG-based methods could be adopted. For example, Lei et al. proposed a FH-DDPG based energy management algorithm for an isolated microgrid to minimize the sum of power generation cost and the power unbalance penalty [30]. Since model-free based DRL algorithms in existing works have low data efficiency, Shuai et al. proposed a model-based DRL algorithm (i.e., MuZero) for the online scheduling of a residential microgrid under uncertainties [45] based on Monte-Carlo tree search (MCTS) strategy with a learned network model.…”

Section: Applications of DRL in Building Microgrids (mentioning)

confidence: 99%

Deep Reinforcement Learning for Smart Home Energy Management

Xie et al. 2020

IEEE Internet Things J.

281

In this paper, we investigate an energy cost minimization problem for a smart home in the absence of a building thermal dynamics model, with the consideration of a comfortable temperature range. Due to model uncertainty, parameter uncertainty (e.g., renewable generation output, nonshiftable power demand, outdoor temperature, and electricity price), and temporally coupled operational constraints, it is very challenging to determine the optimal energy management strategy for scheduling Heating, Ventilation, and Air Conditioning (HVAC) systems and energy storage systems in the smart home. To address the challenge, we first formulate the above problem as a Markov decision process, and then propose an energy management strategy based on Deep Deterministic Policy Gradients (DDPG). It is worth mentioning that the proposed strategy does not require prior knowledge of the uncertain parameters or of the building thermal dynamics model. Simulation results based on real-world traces demonstrate the effectiveness and robustness of the proposed strategy.

“…Reference [116] discussed applications and challenges of DRL in this context and revealed opportunities to use DRL in all three layers of Internet of Things: perception layer (control of the physical system or its components), network layer (control of communications resources) and application layer (control of computation resources). Future considerations should take into account the use of blockchain technology in this context [117].…”

Section: Perspectives (mentioning)

confidence: 99%

(Deep) Reinforcement learning for electric power system control and related problems: A short review and perspectives

Glavić 2019

Annual Reviews in Control

106

This paper reviews existing works on (deep) reinforcement learning considerations in electric power system control. The works are reviewed as they relate to electric power system operating states (normal, preventive, emergency, restorative) and control levels (local, household, microgrid, subsystem, wide-area). Due attention is paid to control-related problems (cyber-security, big data analysis, short-term load forecast, and composite load modelling). Observations from the reviewed literature are drawn and perspectives discussed. In order to make the text compact and as easy as possible to read, the focus is only on works published (or "in press") in journals and books, while conference publications are not included. Exceptions are several works available in open repositories that are likely to become journal publications in the near future. Hopefully this paper could serve as a good source of information for all those interested in solving similar problems.

“…Related works: RL is an online machine learning method which learns an optimal policy through the interactions between the agent (the edge node in our case) and the environment. A comprehensive survey of RL based methods for autonomous IoT networks is presented in [7]. In [8], [9], the authors used RL to find an optimal caching policy for non-transient data (e.g., multimedia files).…”

Section: Introduction (mentioning)

confidence: 99%

Age-Aware Status Update Control for Energy Harvesting IoT Sensors via Reinforcement Learning

Hatami, Jahandideh, Leinonen et al. 2020

2020 IEEE 31st Annual International Symposium on Personal, Indoor and Mobile Radio Communications

We consider an IoT sensing network with multiple users, multiple energy harvesting sensors, and a wireless edge node acting as a gateway between the users and sensors. The users request updates about the value of physical processes, each of which is measured by one sensor. The edge node has a cache storage that stores the most recently received measurements from each sensor. Upon receiving a request, the edge node can either command the corresponding sensor to send a status update, or use the data in the cache. We aim to find the best action of the edge node to minimize the average long-term cost, which trades off the age of information against energy consumption. We propose a practical reinforcement learning approach that finds an optimal policy without knowing the exact battery levels of the sensors. Simulation results show that the proposed method significantly reduces the average cost compared to several baseline methods.
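
The edge node's two choices described in the abstract above (command a fresh update, or serve from the cache) can be illustrated with a per-request cost. This is a hypothetical sketch, not the authors' formulation: the weighted-sum form, the weight `beta`, the age cap `max_age`, the unit energy cost, and the function name `request_cost` are all assumptions made for illustration.

```python
from typing import Tuple


def request_cost(
    age: int,
    command_update: bool,
    beta: float = 0.5,   # assumed weight on age of information
    max_age: int = 10,   # assumed cap used to normalise the age term
) -> Tuple[float, int]:
    """Return (cost, served_age) for one user request.

    Assumptions (not from the paper): a fresh update is served with age 1
    and costs the sensor one unit of energy; a cached value costs no
    sensor energy but is one slot older than before, capped at max_age.
    """
    if command_update:
        served_age = 1
        energy = 1.0
    else:
        served_age = min(age + 1, max_age)
        energy = 0.0
    # Weighted sum of normalised age and energy use.
    cost = beta * served_age / max_age + (1.0 - beta) * energy
    return cost, served_age
```

Under these assumed numbers, serving a cached value of age 4 costs 0.25, while commanding an update costs 0.55, so the cache is preferred until the stored measurement becomes stale enough.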
