2019
DOI: 10.1016/j.neucom.2018.11.090

Obtaining fault tolerance avoidance behavior using deep reinforcement learning

Abstract: In this article, a mapless movement policy for mobile agents, designed specifically to be fault-tolerant, is presented. The policy, learned using deep reinforcement learning, has advantages over the usual mapless policies: it can handle a robot even when some of its sensors are broken. It is an end-to-end policy based on three neuronal models, capable not only of moving the robot and maximizing the coverage of the environment but also of learning the best movement be…

Cited by 9 publications (7 citation statements)
References 28 publications
“…When faced with different hardware conditions, the system may fail. Aznar et al. [41] designed a navigation policy specifically for fault tolerance, whereby the proposed system continues to work normally under sensor failures and shows advantages in robustness, scalability, and practicality. Choi et al. [42] studied the limited Field Of View (FOV) problem.…”
Section: Sensor Robustness
confidence: 99%
“…The goal of Q-learning (QL) is to learn a policy, that is, a mapping telling an agent what action to take under what circumstances. It does not require a model of the environment, and it can handle problems with stochastic transitions and rewards without requiring adaptations [20], [120]. Applications surveyed, by method:
QL: Intrusion prediction [23], IoT representation annotation [24], Data-driven management [25], Data and Feedback validation [26], Visualization and understanding [27], Learning environment detection [28], Fraud detection [29], Prediction of the performance [50], Classification of capability [51], Tolerance related acquisition [52], IoT crime forensics [53], Fraud detection in IoT application [54], IoT decision process and making [55].
LA: Intrusion prediction [30], IoT representation annotation [31], Data-driven management [32], Data and Feedback validation [33], Visualization and understanding [34], Learning environment detection [35], Fraud detection [36], Predicting Software Defects on IoTs [56], Prediction of behavioral changes [57], Signature verification [58], Analysis and decisions [59], Auto-selection of IoT task [60], Traffic incident detection [61], Telecommunication [62], Internet networks [63].
MDP: Intrusion prediction [37], IoT representation annotation [38], Data-driven management [39], Data and Feedback validation [40], Visualization and understanding [41], Learning environment detection [42], Fraud detection [43], Re…”
Section: Q-learning
confidence: 99%
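The update rule the quoted passage paraphrases is standard tabular Q-learning: Q(s,a) ← Q(s,a) + α[r + γ max_a' Q(s',a') − Q(s,a)]. A minimal runnable sketch on a toy chain environment follows; the environment, hyperparameters, and all names are illustrative assumptions, not taken from the cited survey:

```python
import numpy as np

# Toy 5-state chain MDP: move left/right, reward only at the right end.
N_STATES, N_ACTIONS = 5, 2  # actions: 0 = left, 1 = right

def step(state, action):
    """Stochastic transition: the chosen move succeeds 90% of the time."""
    move = 1 if action == 1 else -1
    if np.random.rand() < 0.1:
        move = -move
    next_state = int(np.clip(state + move, 0, N_STATES - 1))
    reward = 1.0 if next_state == N_STATES - 1 else 0.0
    return next_state, reward, next_state == N_STATES - 1

Q = np.zeros((N_STATES, N_ACTIONS))
alpha, gamma, epsilon = 0.1, 0.95, 0.3  # learning rate, discount, exploration

for episode in range(500):
    s, done, steps = 0, False, 0
    while not done and steps < 200:
        # Epsilon-greedy action selection: no model of the environment needed.
        a = np.random.randint(N_ACTIONS) if np.random.rand() < epsilon else int(Q[s].argmax())
        s_next, r, done = step(s, a)
        # Q-learning update: bootstrap from the greedy value of the next state.
        Q[s, a] += alpha * (r + gamma * Q[s_next].max() - Q[s, a])
        s, steps = s_next, steps + 1

print("Greedy policy (0=left, 1=right):", Q.argmax(axis=1))
```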
“…A recent trend in machine learning has been end-to-end learning, which condenses multiple stages of processing for a given task into a single deep neural network (see, e.g., [16], [17]). A similar idea was applied to FTC in [18], in which a model was trained using reinforcement learning to directly handle faults of the ultrasound sensors of a mobile robot in a kinematic obstacle avoidance problem. Although the approach developed in [18] consists of a sequence of deep neural networks with sensor measurements as input and robot actions as output, the robot state is still explicitly estimated as an intermediate variable.…”
Section: Introduction
confidence: 99%
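A common way to make such an end-to-end policy tolerate sensor faults is to randomly mask sensor channels during training, so the network learns to act from degraded inputs. A minimal PyTorch sketch; the network shape, masking rate, and all names are assumptions for illustration, not the architecture of [18]:

```python
import torch
import torch.nn as nn

N_SENSORS, N_ACTIONS = 16, 3  # e.g., ultrasound ranges in; motion commands out

class FaultTolerantPolicy(nn.Module):
    """Maps raw sensor readings (plus a validity mask) directly to action scores."""
    def __init__(self):
        super().__init__()
        # The mask is concatenated so the net can tell "broken" from "reads zero".
        self.net = nn.Sequential(
            nn.Linear(2 * N_SENSORS, 64), nn.ReLU(),
            nn.Linear(64, 64), nn.ReLU(),
            nn.Linear(64, N_ACTIONS),
        )

    def forward(self, readings, mask):
        x = torch.cat([readings * mask, mask], dim=-1)  # zero out faulty channels
        return self.net(x)

policy = FaultTolerantPolicy()
readings = torch.rand(8, N_SENSORS)              # a batch of sensor snapshots
mask = (torch.rand(8, N_SENSORS) > 0.2).float()  # ~20% of sensors "broken"
action_scores = policy(readings, mask)           # optimized with any standard RL loss
```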
“…The stages of fault detection and isolation (FDI) and control are replaced with a single recurrent neural network (RNN) with sensor measurements as input and control variables as output, in order to obtain a faster design process than with classical methods. In contrast to [18], our deep FTC (DFTC) method has no explicit representation of the observed system states, and its training is based on supervised learning rather than reinforcement learning. DFTC only requires (i) the availability of a (non-fault-tolerant) full state feedback control law, which is used as an ideal reference during the training phase, and (ii) the observability of the state vector using only the available non-faulty sensors, for all considered sensor faults.…”
Section: Introduction
confidence: 99%
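The supervised setup this passage describes can be sketched as imitation of a full state feedback law: an RNN sees only (possibly faulty) sensor measurements and is trained to reproduce the control u = -Kx that the reference law would apply. The dimensions, gain K, measurement matrix C, fault rate, and data generation below are illustrative assumptions, not the paper's system:

```python
import torch
import torch.nn as nn

N_SENS, N_STATE, N_CTRL, T = 6, 4, 2, 50  # sensors, states, controls, sequence length

class DFTCNet(nn.Module):
    """RNN from sensor sequences to controls, with no explicit state estimate."""
    def __init__(self):
        super().__init__()
        self.rnn = nn.GRU(N_SENS, 32, batch_first=True)
        self.head = nn.Linear(32, N_CTRL)

    def forward(self, measurements):   # (batch, T, N_SENS)
        h, _ = self.rnn(measurements)
        return self.head(h)            # (batch, T, N_CTRL)

net = DFTCNet()
opt = torch.optim.Adam(net.parameters(), lr=1e-3)
K = torch.randn(N_CTRL, N_STATE)   # reference full-state feedback gain (assumed given)
C = torch.randn(N_SENS, N_STATE)   # measurement matrix: state observable from sensors

for step in range(200):
    x = torch.randn(32, T, N_STATE)                   # simulated state trajectories
    y = x @ C.T + 0.01 * torch.randn(32, T, N_SENS)   # corresponding measurements
    fault = (torch.rand(32, 1, N_SENS) > 0.15).float()  # stuck-at-zero sensor faults
    u_ref = x @ (-K).T                                # ideal reference control u = -Kx
    loss = nn.functional.mse_loss(net(y * fault), u_ref)
    opt.zero_grad(); loss.backward(); opt.step()
```

Under the quoted requirements, the loss can only vanish if the remaining non-faulty sensors keep the state observable, which is exactly condition (ii) above.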