2020
DOI: 10.1109/lra.2019.2961598
|View full text |Cite
|
Sign up to set email alerts
|

When Your Robot Breaks: Active Learning During Plant Failure

Abstract: Detecting and adapting to catastrophic failures in robotic systems requires a robot to learn its new dynamics quickly and safely to best accomplish its goals. To address this challenging problem, we propose probabilistically-safe, online learning techniques to infer the altered dynamics of a robot at the moment a failure (e.g., physical damage) occurs. We combine model predictive control and active learning within a chance-constrained optimization framework to safely and efficiently learn the new plant model o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 25 publications
0
5
0
Order By: Relevance
“…average. In the high-dimensional domain, Fig 5a depicts the advantage of our method as a function of the number of the number of actions taken to learn the system's dynamical model; we achieve a 49% and 46% improvement over [24] and [25], respectively and a 46% improvement over [27] (Fig. 5b).…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…average. In the high-dimensional domain, Fig 5a depicts the advantage of our method as a function of the number of the number of actions taken to learn the system's dynamical model; we achieve a 49% and 46% improvement over [24] and [25], respectively and a 46% improvement over [27] (Fig. 5b).…”
Section: Resultsmentioning
confidence: 99%
“…• Maximizing Diversity [25] -This acquisition function selects actions which maximize the difference between previously seen states and actions. • Bayesian Optimization (BaO) [2] -This algorithm was developed for the ADMETS domain (Section 4.1) and is based upon a Gaussian Process model.…”
Section: Baseline Comparisonsmentioning
confidence: 99%
See 1 more Smart Citation
“…During the online stage, the UAV with an untrained fault was tasked to follow a trajectory with a speed value not included in V. At the beginning of its operation, the UAV adapted the meta-trained network using K = 50 initial data points. The adapted network was used to make predictions and to update the reference trajectory according to (9). During the experiments, we did not apply the trajectory update on the z axis as the faults considered did not cause z deviations.…”
Section: Methodsmentioning
confidence: 99%
“…In addition to control approaches, machine learning techniques have been also widely used to improve the performance of UAVs under actuator failures or disturbances. In [9], the authors use MPC with active learning to learn the new model of the robot under failure and to provide necessary inputs. Reinforcement Learning (RL) techniques are also utilized to adjust the actuator control commands to compensate for component faults [10], [11].…”
Section: Related Workmentioning
confidence: 99%