Safe- to-Explore State Spaces: Ensuring Safe Exploration in Policy Search with Hierarchical Task Optimization

Lundell, Jens; Krug, Robert; Schaffernicht, Erik; Stoyanov, Todor; Kyrki, Ville

doi:10.1109/humanoids.2018.8624948

Cited by 3 publications

(5 citation statements)

References 27 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The prior work constrains the agent to explore in Safe-To-Explore-State-Spaces (STESS) [4], which decomposes a robotic skill into prioritized elemental tasks and a normalized Radial Basis Function (RBF) [4] network is used to represent the learning policy. We continue with STESS framework in this paper and further construct several phases for different period constraints.…”

Section: Related Workmentioning

confidence: 99%

“…We take advantage of STESS [4] to enable safe exploration of lower ranked RL task in the null space of higher ranked tasks.All tasks are solved in the acceleration space and the objective function can be formulated as…”

Section: B Reinforcement Learning In Null Spacementioning

confidence: 99%

“…Reinforcement learning (RL) involves performing a number of exploratory actions, often with a degree of randomness, which can lead to damage of the robot or its environment. This problem has been previously addressed by learning in simulation [1], [2], [3], safety exploration [4], [5], imitation learning [6], [7] and learning from demonstration (LfD) [8], [9]. However, some of these solutions do not guarantee safety, while others face difficulties when transferring from simulated to real environments.…”

Section: Introductionmentioning

confidence: 99%

“…We leverage a hierarchical framework that can guarantee constraint satisfaction in the least-square sense during the robot's trial-and-error exploration, where constraints are used to describe safety conditions. We base our work on Safe-To-Explore State Space (STESS) [4], which restricts the robotic operational space to a collision-free space. This is accomplished by decomposing a robotic skill (e. g., putting a book into a cabinet) into different prioritized tasks.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Null Space Based Efficient Reinforcement Learning with Hierarchical Safety Constraints

Yang

Stork

Stoyanov

2021

2021 European Conference on Mobile Robots (ECMR)

Self Cite

View full text Add to dashboard Cite

Section: Related Workmentioning

confidence: 99%

Section: B Reinforcement Learning In Null Spacementioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Null Space Based Efficient Reinforcement Learning with Hierarchical Safety Constraints

Yang

Stork

Stoyanov

2021

2021 European Conference on Mobile Robots (ECMR)

Self Cite

View full text Add to dashboard Cite

“…The motion planning and control of the robot are completed in MoveIt, an open-source project in ROS. In this case, we modified an open-source unified robot description format (URDF) model of YuMi provided by Lundell et al [35]. Because YuMi's manipulators and grippers are controlled independently based on different IP addresses, we divided the whole robot into four motion planning groups after URDF remodeling: left arm, right arm, left hand and right hand.…”

Section: Robot-control Subsystem 1) Basic Configurationmentioning

confidence: 99%

Teleoperation of Collaborative Robot for Remote Dementia Care in Home Environments

Yang

Zhou

et al. 2020

IEEE J. Transl. Eng. Health Med.

View full text Add to dashboard Cite

As a senile chronic, progressive and currently incurable disease, dementia has an enormous impact on society and life quality of the elderly. The development of teleoperation technology has changed the traditional way of care delivery and brought a variety of novel applications for dementia care. In this paper, a telerobotic system is presented which gives the caregivers the capability of assisting dementia elderly remotely. The proposed system is composed of a dual-arm collaborative robot (YuMi) and a wearable motion capture device. The communication architecture is achieved by the robot operation system (ROS). The position-orientation data of the operator's hand are obtained and used to control the YuMi robot. Besides, a path-constrained mapping method is designed for motion trajectory tracking between the robot and the operator in the progress of teleoperation. Meanwhile, corresponding experiments are conducted to verify the performance of the trajectory tracking using the path-constrained mapping method. Results show that the position tracking deviation between the trajectory of the operator and the robot measured by dynamic time warping distance is 1.05 mm at the sampling frequency of 7.5 Hz. Moreover, the practicability of the proposed system was verified by teleoperating the YuMi robot to pick up a medicine bottle and further demonstrated by assisting an elderly woman in picking up a cup remotely. The proposed telerobotic system has potential utility for improving the life quality of dementia elderly and the care effect of their caregivers.

show abstract

Augmentation Enables One-Shot Generalization in Learning from Demonstration for Contact-Rich Manipulation

Li,

Baum,

Brock

2023

2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

View full text Add to dashboard Cite

Safe- to-Explore State Spaces: Ensuring Safe Exploration in Policy Search with Hierarchical Task Optimization

Cited by 3 publications

References 27 publications

Null Space Based Efficient Reinforcement Learning with Hierarchical Safety Constraints

Null Space Based Efficient Reinforcement Learning with Hierarchical Safety Constraints

Teleoperation of Collaborative Robot for Remote Dementia Care in Home Environments

Augmentation Enables One-Shot Generalization in Learning from Demonstration for Contact-Rich Manipulation

Contact Info

Product

Resources

About