This paper exploits Lipschitz continuity properties of Markov Decision Processes to safely speed up policy-gradient algorithms. Starting from assumptions about the Lipschitz continuity of the state-transition model, the reward function, and the policies considered in the learning process, we show that both the expected return of a policy and its gradient are Lipschitz continuous w.r.t. policy parameters. By leveraging these properties, we define policy-parameter updates that guarantee a performance improvement at each iteration. The proposed methods are empirically evaluated and compared to related approaches on different configurations of three popular control scenarios: the linear quadratic regulator, the mass-spring-damper system, and ship-steering control.
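The guaranteed-improvement idea described in this abstract can be illustrated with a standard argument (a sketch, not the paper's exact method): if the gradient of the expected return is Lipschitz continuous with a known constant L, a quadratic lower bound on the return holds, and maximizing that bound yields a step size that provably cannot decrease performance. The constant and gradient values below are illustrative.

```python
# Sketch, assuming a scalar parameter and a known Lipschitz constant L of
# the policy gradient. The quadratic lower bound
#   J(theta + d) >= J(theta) + g*d - (L/2)*d**2
# is maximized at d* = g / L, guaranteeing a gain of at least g**2 / (2*L).

def safe_step(grad, lipschitz_const):
    """Step size maximizing the guaranteed-improvement lower bound."""
    return grad / lipschitz_const

def improvement_lower_bound(grad, step, lipschitz_const):
    """Guaranteed performance gain for a given parameter update."""
    return grad * step - 0.5 * lipschitz_const * step ** 2

g, L = 2.0, 4.0          # illustrative gradient estimate and constant
d_star = safe_step(g, L)                       # 0.5
gain = improvement_lower_bound(g, d_star, L)   # g**2 / (2*L) = 0.5
```

Any step shorter or longer than `d_star` yields a strictly smaller guaranteed gain, which is the sense in which the update is "safe" while still being as large as the bound allows.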
The present paper addresses the issues involved in developing walk-through programming techniques (i.e., manual guidance of the robot) in an industrial scenario. First, an exact formulation of the dynamics of the tool that the human should feel when interacting with the robot is presented. Then, the paper discusses how to implement such dynamics on an industrial robot equipped with an open robot control system and a wrist force/torque sensor, as well as the safety issues related to walk-through programming. In particular, two strategies that use admittance control to constrain the robot motion are presented. The first slows the robot down when the velocity of the tool centre point exceeds a specified safety limit; the second limits the robot workspace by means of virtual safety surfaces. Experimental results on a COMAU Smart Six robot are presented, showing the performance of the walk-through programming system endowed with the two proposed safety strategies.
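The first safety strategy above can be sketched in a few lines (an illustrative sketch, not the paper's controller): when the norm of the tool-centre-point velocity produced by the admittance controller exceeds a limit, the commanded velocity is scaled down uniformly so that its direction is preserved. The limit `v_max` is a hypothetical value, not one from the paper.

```python
import math

# Illustrative velocity-limiting strategy for walk-through programming:
# scale the admittance-generated TCP velocity whenever its norm exceeds
# a safety limit, keeping the direction of motion unchanged.
def limit_tcp_velocity(v, v_max):
    speed = math.sqrt(sum(c * c for c in v))
    if speed <= v_max or speed == 0.0:
        return list(v)            # already safe: pass through unchanged
    scale = v_max / speed         # shrink factor so the norm equals v_max
    return [c * scale for c in v]

v_cmd = limit_tcp_velocity([3.0, 4.0, 0.0], 2.5)  # norm 5.0 -> scaled to 2.5
```

Uniform scaling is the natural choice here because it saturates the speed without bending the operator's intended direction of motion.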
This paper proposes a manipulability optimization control of a 7-DoF robot manipulator for Robot-Assisted Minimally Invasive Surgery (RAMIS) that simultaneously guarantees a Remote Center of Motion (RCM). The first degree of redundancy of the manipulator is used to achieve the RCM constraint; the second is used for manipulability optimization. A hierarchical operational space formulation is introduced to integrate all the control components, including a Cartesian compliance control for the main surgical task, a first null-space controller for the RCM constraint, and a second null-space controller for manipulability optimization. Experiments with virtual surgical tasks, in an augmented reality environment, were performed to validate the proposed control strategy on the KUKA LWR4+. The results demonstrate that end-effector accuracy and the RCM constraint can be guaranteed while improving the manipulability of the surgical tip.
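The hierarchical null-space structure described above follows the standard operational-space technique of projecting a lower-priority velocity through the null space of the higher-priority task's Jacobian, so the secondary task cannot disturb the primary one. The sketch below uses toy matrices, not the KUKA LWR4+ kinematics.

```python
import numpy as np

# Two-level task hierarchy via null-space projection (standard technique;
# a sketch with toy values, not the paper's full three-level controller).
def hierarchical_velocities(J1, dx1, dq2):
    """Joint velocities tracking dx1 exactly, with dq2 acting only
    in the null space of the primary Jacobian J1."""
    J1_pinv = np.linalg.pinv(J1)
    N1 = np.eye(J1.shape[1]) - J1_pinv @ J1   # null-space projector of J1
    return J1_pinv @ dx1 + N1 @ dq2

J1 = np.array([[1.0, 0.0, 0.0],
               [0.0, 1.0, 0.0]])   # toy 2x3 primary-task Jacobian
dx1 = np.array([0.1, -0.2])        # primary task velocity (e.g. RCM)
dq2 = np.array([0.0, 0.0, 0.5])    # secondary velocity (e.g. manipulability)
dq = hierarchical_velocities(J1, dx1, dq2)
# J1 @ dq reproduces dx1 exactly: the secondary task leaves the primary
# task untouched.
```

Chaining a third level (as in the paper's formulation) amounts to projecting it through the combined null space of the first two tasks.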
Dynamic Movement Primitives (DMPs) are a common method for learning a control policy for a task from demonstration. This control policy consists of differential equations that can generate a smooth trajectory to a new goal point. However, DMPs have only a limited ability to generalize a demonstration to new environments and to solve problems such as obstacle avoidance. Moreover, standard DMP learning does not cope with the noise inherent in human demonstrations. Here, we propose an approach for robot learning from demonstration that can generalize noisy task demonstrations to a new goal point and to an environment with obstacles. This strategy results in a control policy that incorporates different types of learning from demonstration, which correspond to different types of observational learning as outlined in developmental psychology.
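The "differential equations that can generate a smooth trajectory to a new goal point" mentioned above are, in the standard DMP formulation, a critically damped spring-damper system plus a learned forcing term. The sketch below integrates only the spring-damper part (forcing term omitted for brevity), which is what guarantees smooth convergence to the goal; the gains are conventional illustrative values.

```python
# Minimal discrete DMP transformation system (standard formulation, with
# the learned forcing term omitted for brevity):
#   tau * z_dot = alpha_z * (beta_z * (g - y) - z)
#   tau * y_dot = z
# With beta_z = alpha_z / 4 the system is critically damped and converges
# smoothly to the goal g without oscillation. Integrated with forward Euler.
def dmp_rollout(y0, g, alpha_z=25.0, beta_z=25.0 / 4.0, tau=1.0,
                dt=0.001, steps=2000):
    y, z = y0, 0.0
    for _ in range(steps):
        z_dot = alpha_z * (beta_z * (g - y) - z) / tau
        y_dot = z / tau
        z += z_dot * dt
        y += y_dot * dt
    return y

final = dmp_rollout(0.0, 1.0)  # converges close to the goal 1.0
```

Learning from demonstration then amounts to fitting the forcing term so the system reproduces the demonstrated trajectory shape while this spring-damper backbone still pulls it to whatever new goal is supplied.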