2014
DOI: 10.1007/s11276-014-0762-6

Two timescale convergent Q-learning for sleep-scheduling in wireless sensor networks

Abstract: In this paper, we consider an intrusion detection application for Wireless Sensor Networks. We study the problem of scheduling the sleep times of the individual sensors, where the objective is to maximize the network lifetime while keeping the tracking error to a minimum. We formulate this problem as a partially observable Markov decision process (POMDP) with continuous state-action spaces, in a manner similar to Fuemmeler and Veeravalli (IEEE Trans Signal Process 56(5):2091–2101, 2008). …
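As a rough illustration of the setup the abstract describes, the sketch below pairs a fast-timescale TD update of linearly parameterized Q-values with a slower averaged iterate; this is the standard two-timescale construction behind convergence arguments for Q-learning with function approximation, not necessarily the authors' exact algorithm. The feature map, action set, and state fields are hypothetical placeholders.

```python
# Minimal two-timescale Q-learning sketch with linear function approximation.
# All modeling choices (features, actions, state fields) are assumptions.
import numpy as np

N_FEATURES = 6
ACTIONS = [0, 1, 2, 4]            # candidate sleep durations in slots (assumed)
GAMMA = 0.95

theta = np.zeros(N_FEATURES)      # fast iterate: Q-function parameters
theta_bar = np.zeros(N_FEATURES)  # slow iterate: averaged parameters

def features(state, action):
    """Hypothetical feature map phi(s, a) over energy and tracking error."""
    phi = np.zeros(N_FEATURES)
    phi[0] = 1.0                        # bias term
    phi[1] = state["energy"]            # remaining battery fraction
    phi[2] = state["track_err"]         # current tracking-error estimate
    phi[3] = action / max(ACTIONS)      # normalized sleep duration
    phi[4] = phi[1] * phi[3]            # energy x sleep interaction
    phi[5] = phi[2] * phi[3]            # error x sleep interaction
    return phi

def q_value(state, action, w):
    return float(w @ features(state, action))

def two_timescale_update(n, s, a, r, s_next):
    """Process one transition: fast TD step on theta, slow averaging step."""
    global theta, theta_bar
    a_n = 1.0 / (n + 1) ** 0.6          # fast step size
    b_n = 1.0 / (n + 1)                 # slow step size; b_n / a_n -> 0
    best_next = max(q_value(s_next, u, theta) for u in ACTIONS)
    delta = r + GAMMA * best_next - q_value(s, a, theta)   # TD error
    theta = theta + a_n * delta * features(s, a)           # fast update
    theta_bar = theta_bar + b_n * (theta - theta_bar)      # slow update
```

Because b_n decays faster than a_n, the slow iterate sees the fast one as quasi-static, which is what lets the two coupled recursions be analyzed separately.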

Cited by 4 publications (3 citation statements) · References: 29 publications
“…Specifically, the forward propagation algorithms apply the carefully trained weight matrices and bias vectors for carrying out the associated linear and activation operations. By contrast, the backward propagation algorithms, which are widely used in the industrial field, define a so-called loss function for quantifying the difference between the output produced from the training samples and the real output.…”

Accompanying table ([294]–[303]):

Ref.  | Algorithm                | Network             | Application
[294] | reduced-state SARSA      | cellular network    | dynamic channel allocation considering both mobile traffic and call handoffs
[295] | on-policy SARSA          | CR network          | distributed multiagent sensing policy relying on local interactions among SUs
[296] | on-policy SARSA          | MANET               | energy-aware reactive routing protocol for maximizing network lifetime
[297] | on-policy SARSA          | HetNet              | resource management for maximizing resource utilization and guaranteeing QoS
[298] | approximate SARSA        | P2P network         | energy-harvesting-aided power allocation policy for maximizing the throughput
[299] | Q-learning               | WBAN                | power control scheme to mitigate interference and to improve throughput
[300] | Q-learning               | OFDM system         | adaptive modulation and coding not relying on off-line training from the PHY
[301] | Q-learning               | cooperative network | efficient relay selection scheme meeting the symbol error rate requirement
[302] | decentralized Q-learning | CR network          | aggregated interference control without introducing signaling overhead
[303] | convergent Q-learning    | WSN                 | sensors' sleep scheduling scheme for minimizing the tracking error
Section: Deep Learning In Wireless Network
confidence: 99%
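The table in the excerpt above repeatedly contrasts on-policy SARSA with off-policy Q-learning; the sketch below isolates the single line where the two update rules differ. The dictionary-based Q-table and the default step size and discount are illustrative assumptions.

```python
# SARSA (on-policy) vs. Q-learning (off-policy): same TD template,
# different bootstrap target. Q maps (state, action) -> value.
def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.95):
    # On-policy: bootstrap on the action the agent actually takes next.
    target = r + gamma * Q.get((s_next, a_next), 0.0)
    Q[(s, a)] = Q.get((s, a), 0.0) + alpha * (target - Q.get((s, a), 0.0))

def q_learning_update(Q, s, a, r, s_next, actions, alpha=0.1, gamma=0.95):
    # Off-policy: bootstrap on the greedy action, regardless of behavior.
    target = r + gamma * max(Q.get((s_next, u), 0.0) for u in actions)
    Q[(s, a)] = Q.get((s, a), 0.0) + alpha * (target - Q.get((s, a), 0.0))
```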
“…Moreover, owing to the extensive research on machine learning, many studies have upgraded and extended the algorithm so that it can be widely used in WSNs [22]. In [23], the authors proposed a two-timescale Q-learning algorithm with function approximation to alleviate the curse of dimensionality. Although all of the above algorithms alleviate the state-explosion problem, the action-explosion problem must also be solved to obtain a scalable solution.…”
Section: Related Work
confidence: 99%
“…Otherwise, this action will be punished. The corresponding reward or penalty is responsible for adjusting the weight parameters of the deep neural network [23].…”
Section: Scheduling Strategy For MTT-WSNs
confidence: 99%
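To make the reward/penalty mechanism in the excerpt concrete, the sketch below performs one semi-gradient TD step on a tiny NumPy Q-network: the scalar reward (or negative penalty) enters the TD target, and the resulting error is backpropagated to adjust the weights. The two-layer architecture and all dimensions are assumptions for illustration, not the design of [23].

```python
# One semi-gradient TD update of a tiny Q-network; the reward or penalty
# shapes the target, and the TD error drives the weight adjustment.
import numpy as np

rng = np.random.default_rng(1)
W1 = rng.normal(scale=0.1, size=(16, 4))   # hidden weights (assumed shape)
W2 = rng.normal(scale=0.1, size=(3, 16))   # one output Q-value per action

def forward(x):
    h = np.maximum(0.0, W1 @ x)            # ReLU hidden layer
    return W2 @ h, h                       # Q-values and hidden activations

def td_update(x, action, reward, x_next, lr=0.01, gamma=0.95):
    """Gradient step on 0.5 * delta**2; the target is held fixed."""
    global W1, W2
    q, h = forward(x)
    q_next, _ = forward(x_next)
    target = reward + gamma * q_next.max() # reward or penalty enters here
    delta = q[action] - target             # TD error
    grad_W2 = np.zeros_like(W2)
    grad_W2[action] = delta * h            # only the taken action's row
    grad_pre = (delta * W2[action]) * (h > 0)  # backprop through ReLU
    grad_W1 = np.outer(grad_pre, x)
    W2 -= lr * grad_W2
    W1 -= lr * grad_W1
```

A rewarded action pulls its predicted Q-value toward a higher target, while a penalized action pulls it down, which is the weight-adjustment behavior the excerpt describes.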