2021
DOI: 10.48550/arxiv.2108.06266
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Safe Learning in Robotics: From Learning-Based Control to Safe Reinforcement Learning

Abstract: The last half-decade has seen a steep rise in the number of contributions on safe learning methods for real-world robotic deployments from both the control and reinforcement learning communities. This article provides a concise but holistic review of the recent advances made in using machine learning to achieve safe decision making under uncertainties, with a focus on unifying the language and frameworks used in control theory and reinforcement learning research. Our review includes: learning-based control app… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
4

Relationship

0
10

Authors

Journals

citations
Cited by 10 publications
(14 citation statements)
references
References 96 publications
(179 reference statements)
0
14
0
Order By: Relevance
“…Embedding such structure would facilitate faster training and potentially better generalization. Another exciting future direction is to augment our approach with methods from safe reinforcement learning [56] to provide generalization guarantees on the success of safety-critical systems while ensuring safety during the training process itself (in contrast to only providing guarantees for deployment, as we do here). Finally, we are working towards hardware experiments of our approach on a drone navigating an obstacle field under wind disturbances.…”
Section: Discussionmentioning
confidence: 99%
“…Embedding such structure would facilitate faster training and potentially better generalization. Another exciting future direction is to augment our approach with methods from safe reinforcement learning [56] to provide generalization guarantees on the success of safety-critical systems while ensuring safety during the training process itself (in contrast to only providing guarantees for deployment, as we do here). Finally, we are working towards hardware experiments of our approach on a drone navigating an obstacle field under wind disturbances.…”
Section: Discussionmentioning
confidence: 99%
“…Safety. We refer to [81] for a detailed review of learning-based control. Essentially, safety constraints can be embedded in Problem 2 in three ways, which in turn correspond to different safety levels [81].…”
Section: Commentsmentioning
confidence: 99%
“…Safe RL Our work is broadly related to the safe reinforcement learning and control literature; we refer interested readers to (Garcıa and Fernández 2015;Brunke et al 2021) for surveys on this topic. A popular class of approaches incorporates Lagrangian constraint regularization into the policy updates in policy-gradient algorithms (Achiam et al 2017;Ray, Achiam, and Amodei 2019;Tessler, Mankowitz, and Mannor 2018;Dalal et al 2018;Cheng et al 2019;Zhang, Vuong, and Ross 2020;Chow et al 2019).…”
Section: Related Workmentioning
confidence: 99%