Jakob Thumm scite author profile

Ensuring safety of reinforcement learning (RL) algorithms is crucial for many real-world tasks. However, vanilla RL does not guarantee safety for an agent. In recent years, several methods have been proposed to provide safety guarantees for RL. To the best of our knowledge, there is no comprehensive comparison of these provably safe RL methods. We therefore introduce a categorization for existing provably safe RL methods, and present the theoretical foundations for both continuous and discrete action spaces. Additionally, we evaluate provably safe RL on an inverted pendulum. In the experiments, it is shown that indeed only provably safe RL methods guarantee safety.

show abstract

Characterization of Road Condition with Data Mining Based on Measured Kinematic Vehicle Parameters

Masino

Thumm

Levasseur

et al. 2018

Journal of Advanced Transportation

View full text Add to dashboard Cite

This work aims at classifying the road condition with data mining methods using simple acceleration sensors and gyroscopes installed in vehicles. Two classifiers are developed with a support vector machine (SVM) to distinguish between different types of road surfaces, such as asphalt and concrete, and obstacles, such as potholes or railway crossings. From the sensor signals, frequency-based features are extracted, evaluated automatically with MANOVA. The selected features and their meaning to predict the classes are discussed. The best features are used for designing the classifiers. Finally, the methods, which are developed and applied in this work, are implemented in a Matlab toolbox with a graphical user interface. The toolbox visualizes the classification results on maps, thus enabling manual verification of the results. The accuracy of the cross-validation of classifying obstacles yields 81.0% on average and of classifying road material 96.1% on average. The results are discussed on a comprehensive exemplary data set.

show abstract

Reducing Safety Interventions in Provably Safe Reinforcement Learning

Thumm¹,

Pelat²,

Althoff³

2023

Preprint

View full text Add to dashboard Cite

Deep Reinforcement Learning (RL) has shown promise in addressing complex robotic challenges. In real-world applications, RL is often accompanied by failsafe controllers as a last resort to avoid catastrophic events. While necessary for safety, these interventions can result in undesirable behaviors, such as abrupt braking or aggressive steering. This paper proposes two safety intervention reduction methods: action replacement and projection, which change the agent's action if it leads to an unsafe state. These approaches are compared to the state-of-the-art constrained RL on the OpenAI safety gym benchmark and a human-robot collaboration task. Our study demonstrates that the combination of our method with provably safe RL leads to high-performing policies with zero safety violations and a low number of failsafe interventions. Our versatile method can be applied to a wide range of realworld robotics tasks, while effectively improving safety without sacrificing task performance.1 https://youtu.be/dIvhyV5z8bM

show abstract

Vehicle Learner Toolbox For Road Condition Estimation, Matlab Files

Thumm¹,

Masino²

2018

View full text Add to dashboard Cite

Provably Safe Deep Reinforcement Learning for Robotic Manipulation in Human Environments

Thumm¹,

Althoff²

2022

Preprint

View full text Add to dashboard Cite

Parking events derived from trip data from MiD2008

Huber¹,

Höffer²,

Thumm³

et al. 2019

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Jakob Thumm

Learning from the crowd: Road infrastructure monitoring system

Multiple vehicle fusion for a robust road condition estimation based on vehicle sensors and data mining

Provably Safe Reinforcement Learning: A Theoretical and Experimental Comparison

Characterization of Road Condition with Data Mining Based on Measured Kinematic Vehicle Parameters

Reducing Safety Interventions in Provably Safe Reinforcement Learning

Vehicle Learner Toolbox For Road Condition Estimation, Matlab Files

Provably Safe Deep Reinforcement Learning for Robotic Manipulation in Human Environments

Parking events derived from trip data from MiD2008

Contact Info

Product

Resources

About