2020
DOI: 10.1109/access.2020.2987861

Energy Efficient Two-Tier Data Dissemination Based on Q-Learning for Wireless Sensor Networks

Abstract: Green communication for different kinds of wireless networks has recently begun to receive significant research attention. Green communication focuses mainly on the issue of substantially improving energy efficiency. A wireless sensor network (WSN) consists of a large number of randomly and widely deployed sensor nodes, and these nodes are themselves able to communicate wirelessly and to detect and process data. Sensor nodes can thus sense their surrounding environment and transmit related data to a sink via…

Cited by 21 publications (12 citation statements); References 26 publications.

“…3.1 Q-learning algorithm to find obstacle-free paths: Q-learning is a reinforcement learning algorithm. One of its working principles is that the agent (the autonomous robot in our research) learns the best policy to adopt for a given scenario based on its interactions with the environment and the rewards gained [35]. A policy in Q-learning is how the agent, according to its current state, chooses to behave, react, or make decisions at a given time.…”
Section: Safety Response Mechanism Methodology (mentioning)
confidence: 99%
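
The policy described in the quotation above maps the agent's current state to an action. A minimal sketch in Python, assuming a tabular Q function and an epsilon-greedy selection rule (a common choice, not one stated in the quoted work):

```python
import random

def epsilon_greedy(q_table, state, actions, epsilon=0.1):
    """Pick an action for the given state from a tabular Q function."""
    if random.random() < epsilon:
        return random.choice(actions)  # explore: try a random action
    # exploit: take the action with the highest learned Q value
    return max(actions, key=lambda a: q_table.get((state, a), 0.0))

# Example: a robot at grid cell (2, 3) choosing among four moves.
q = {((2, 3), "up"): 0.4, ((2, 3), "left"): 0.9}
print(epsilon_greedy(q, (2, 3), ["up", "down", "left", "right"]))
```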
“…The values in the Q table are computed based on the Bellman equation [8]. Once the Q table is populated after several interactions, the agent uses the information in the Q table to choose the action giving the highest cumulative reward [35,49]. QL can be applied as a dynamic and incremental programming method to find the optimal strategy for a problem in a step-by-step learning mode.…”
Section: Q-learning Algorithm (mentioning)
confidence: 99%
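
The Bellman-based update referenced above, in its standard tabular form (the symbols below follow the textbook convention rather than notation from the quoted work: $\alpha$ is the learning rate, $\gamma$ the discount factor):

$$Q(s_t, a_t) \leftarrow Q(s_t, a_t) + \alpha \Big[ r_{t+1} + \gamma \max_{a} Q(s_{t+1}, a) - Q(s_t, a_t) \Big]$$

Once the table has converged, acting greedily, $a^* = \arg\max_a Q(s, a)$, yields the action with the highest expected cumulative reward.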
“…The RL algorithm utilizes five main elements: the agent, the environment, the reward, the state, and the action. In its learning process, the agent performs several interactions with its environment by taking actions that cause a change of state in the environment and result in a positive or a negative reward (penalty) [35]. Over the years, RL has been the subject of much research in various applications such as chemical reactions [36], resource management [37], traffic-light management [38], autonomous driving [39], dam management [40], surgery [41] and robotics [42].…”
Section: Short Theory and Background Overview (mentioning)
confidence: 99%
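
The five elements named above fit together in a simple interaction loop. A toy sketch (the environment and its reward model are illustrative assumptions, not drawn from any cited work):

```python
import random

class ToyEnvironment:
    """Environment: holds the state and answers actions with rewards."""
    def __init__(self):
        self.state = 0

    def step(self, action):
        self.state += action                       # the action changes the state
        reward = 1.0 if self.state == 3 else -0.1  # small penalty otherwise
        return self.state, reward

env = ToyEnvironment()
for _ in range(5):
    action = random.choice([-1, 1])   # the agent picks an action
    state, reward = env.step(action)  # the environment responds
    print(f"state={state}, reward={reward}")
```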
“…QL algorithms can be used to iteratively adjust MAC protocol parameters according to a defined policy so as to reach a low-energy state [32]. The TDMA-based adaptive task-scheduling method [33] and two-tier data dissemination based on Q-learning (TTDD-QL) [34] are energy-efficient schemes for wireless sensor networks (WSNs). A cooperative energy-efficient model is presented in [35], where clustering, mobile-sink deployment and variable sensing collaboratively improve the network lifetime.…”
Section: Introduction (mentioning)
confidence: 99%
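
To make the first sentence of the quotation concrete, here is a hedged sketch of learning a single MAC parameter (a hypothetical duty-cycle level) toward a low-energy operating point. The parameter set, reward model, and stateless (bandit-style) update are all illustrative simplifications, not the actual TTDD-QL or TDMA scheduling algorithms:

```python
import random

duty_cycles = [0.1, 0.25, 0.5, 1.0]   # candidate MAC duty-cycle levels
q = {d: 0.0 for d in duty_cycles}     # one Q value per candidate
alpha = 0.5                           # learning rate

def reward(duty, traffic=0.3):
    # Penalize energy use (proportional to duty cycle), and penalize
    # heavily when the duty cycle cannot carry the offered traffic.
    return -duty if duty >= traffic else -1.0

for episode in range(200):
    # epsilon-greedy choice over the candidates
    d = random.choice(duty_cycles) if random.random() < 0.2 else max(q, key=q.get)
    q[d] += alpha * (reward(d) - q[d])  # stateless Q update

print("learned duty cycle:", max(q, key=q.get))
```

Under these assumptions the loop settles on the smallest duty cycle that still covers the assumed traffic load (0.5 here), illustrating how a Q-driven policy can trade energy against service.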