2020
DOI: 10.1109/tsp.2020.2973125
|View full text |Cite
|
Sign up to set email alerts
|

Accelerated Structure-Aware Reinforcement Learning for Delay-Sensitive Energy Harvesting Wireless Sensors

Abstract: We investigate an energy-harvesting wireless sensor transmitting latency-sensitive data over a fading channel. The sensor injects captured data packets into its transmission queue and relies on ambient energy harvested from the environment to transmit them. We aim to find the optimal scheduling policy that decides whether or not to transmit the queue's head-of-line packet at each transmission opportunity such that the expected packet queuing delay is minimized given the available harvested energy. No prior kno… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(2 citation statements)
references
References 43 publications
0
2
0
Order By: Relevance
“…Moreover, by using the policy structures, the complexity of point-to-point network transmission control in [20] was effectively reduced with the tools from graph signal processing employed for large state space. In [21], based on the structural properties, a novel accelerated reinforcement learning (RL) algorithm was formulated for an energy-harvesting wireless sensor with latency-sensitive data. Based on the formulated LP problem in our previous work [16], we also shown a threshold-based structure for the optimal transmission policies.…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, by using the policy structures, the complexity of point-to-point network transmission control in [20] was effectively reduced with the tools from graph signal processing employed for large state space. In [21], based on the structural properties, a novel accelerated reinforcement learning (RL) algorithm was formulated for an energy-harvesting wireless sensor with latency-sensitive data. Based on the formulated LP problem in our previous work [16], we also shown a threshold-based structure for the optimal transmission policies.…”
Section: Introductionmentioning
confidence: 99%
“…In [23], RL is used to develop an optimal control policy based on minimum-delay transmission in a multi-hop energy harvesting wireless sensor network. [24] proposed a novel RL method based on value function approximation to tackle the scheduling problem online.…”
Section: Introductionmentioning
confidence: 99%