2020
DOI: 10.1016/j.automatica.2019.108759
|View full text |Cite
|
Sign up to set email alerts
|

Deep reinforcement learning for wireless sensor scheduling in cyber–physical systems

Abstract: This paper studies how to schedule wireless transmissions from sensors to estimate the states of multiple remote, dynamic processes. Sensors make observations of each of the processes. Information from the different sensors have to be transmitted to a central gateway over a wireless network for monitoring purposes, where typically fewer wireless channels are available than there are processes to be monitored. Such estimation problems routinely occur in large-scale Cyber-Physical Systems, especially when the dy… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
93
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 115 publications
(103 citation statements)
references
References 28 publications
0
93
0
Order By: Relevance
“…In [116], a global routing agent with Q-Learning is proposed for weight adjustment of the state-of-the-art routing algorithm, aiming at achieving a balance between the overall delay and the lifetime of the network. The study in [117] focuses on a DRL-based sensor scheduling problem for allocating wireless channels to sensors for the purposes of remote state estimation of dynamical systems. The algorithm can be run online, and is model-free with respect to the wireless channel parameters.…”
Section: Aiot Network Layer -Iot Communication Networkmentioning
confidence: 99%
“…In [116], a global routing agent with Q-Learning is proposed for weight adjustment of the state-of-the-art routing algorithm, aiming at achieving a balance between the overall delay and the lifetime of the network. The study in [117] focuses on a DRL-based sensor scheduling problem for allocating wireless channels to sensors for the purposes of remote state estimation of dynamical systems. The algorithm can be run online, and is model-free with respect to the wireless channel parameters.…”
Section: Aiot Network Layer -Iot Communication Networkmentioning
confidence: 99%
“…C. The Existence of a Zooming Quantizer Taking (14) and (16) into (1), the plant state at t + n can be written as…”
Section: B Construction Of a Zooming Quantizermentioning
confidence: 99%
“…> 0. This is the case where the term in the parenthesis in (25) is positive and we can directly apply Hoeffding's inequality (Lemma 1) to get the desired bound (19).…”
Section: Theorem 1 (Sample-based Stability Analysis)mentioning
confidence: 99%
“…In contrast our work is focused on collecting data and learning unknown channel models instead of system dynamics. In the context of networked control systems very recent works from the last two years are proposing data-based approaches including deep learning for allocating resources and scheduling [12,15,25,32,40] as well as for controller design [6,35]. A related but broader topic is also the identification of switched systems [29] where the matrix dynamics are also unknown.…”
Section: Introductionmentioning
confidence: 99%