"Good Robot!": Efficient Reinforcement Learning for Multi-Step Visual Tasks with Sim to Real Transfer

Hundt, Andrew; Killeen, Benjamin; Greene, Nicholas; Wu, Hongtao; Kwon, Heeyeon; Paxton, Chris; Hager, Gregory D.

doi:10.48550/arxiv.1909.11730

Cited by 3 publications

(3 citation statements)

References 19 publications

(38 reference statements)

“…There are two classes of DA: instance-based and feature-based. DA is utilized in (Bousmalis et al 2018;Yan et al 2017;Hundt et al 2020) research to optimize robot grasping. Research has shown that the DA method allows the model to learn a mapping from source to target domain (Tobin et al 2017).…”

Section: Related Work (mentioning)

confidence: 99%

Evaluation of Techniques for Sim2Real Reinforcement Learning

Ranaweera

Mahmoud

2023

FLAIRS

Reinforcement learning (RL) has demonstrated promising results in transferring learned policies from simulation to real-world environments. However, inconsistencies and discrepancies between the two environments cause negative transfer. This phenomenon is commonly known as the “reality gap.” The reality gap prevents learned policies from generalizing to the physical environment. This paper aims to evaluate techniques to improve sim2real learning and bridge the reality gap using RL. For this research, a 3-DOF Stewart platform was built virtually and physically. The goal of the platform was to guide a marble towards the center of the platform and balance it there. A custom API was created to induce noise, manipulate in-game physics, dynamics, and lighting conditions, and perform domain randomization to improve generalization. Two RL algorithms, Q-Learning and Actor-Critic, were implemented to train the agent and to evaluate performance in bridging the reality gap. This paper outlines the techniques used to create noise and domain randomization and to perform training, along with results and observations. Overall, the obtained results show the effectiveness of domain randomization and of inducing noise during the agent's learning process. Additionally, the findings provide valuable insights into implementing sim2real RL algorithms to bridge the reality gap.
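The two techniques named in this abstract, domain randomization and noise injection, can be illustrated with a minimal sketch. It is not the cited authors' API: the parameter names (`friction`, `marble_mass`, `light_intensity`), their ranges, and the helper functions are all hypothetical and chosen only for illustration.

```python
import random
from dataclasses import dataclass


@dataclass
class SimParams:
    """Hypothetical simulator settings that are re-sampled every episode."""
    friction: float
    marble_mass: float
    light_intensity: float


# Assumed randomization ranges (illustrative values, not taken from the paper).
RANGES = {
    "friction": (0.2, 0.8),
    "marble_mass": (0.005, 0.02),
    "light_intensity": (0.5, 1.5),
}


def sample_params(rng: random.Random) -> SimParams:
    """Domain randomization: draw each parameter uniformly from its range."""
    return SimParams(**{name: rng.uniform(lo, hi) for name, (lo, hi) in RANGES.items()})


def noisy_observation(position, rng: random.Random, sigma: float = 0.01):
    """Noise injection: add zero-mean Gaussian noise to an observed (x, y) position."""
    return tuple(p + rng.gauss(0.0, sigma) for p in position)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so runs are repeatable
    for episode in range(3):
        params = sample_params(rng)                      # new physics each episode
        obs = noisy_observation((0.10, -0.05), rng)      # perturbed sensor reading
        print(episode, params, obs)
```

In a training loop, the sampled parameters would be applied to the simulator before each episode, so that the policy never sees exactly the same dynamics twice.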

“…The benefit of those models is their ability to project a goal image and their current observation into their feature space and compute a path towards the target feature for visual servoing (Watter et al, 2015;Byravan et al, 2018), reaching and pushing (Srinivas et al, 2018;Yu et al, 2019) with gradient-based optimisation methods. Visuomotor controllers trained in the reinforcement learning paradigm typically model the distance to a desired, visually specified goal via reward functions which can be either shaped explicitly based on expert domain knowledge (Hundt et al, 2019) or implicitly learned from user feedback about task success (Singh et al, 2019). Our approach of using dynamic images for goal distance estimation sets itself apart from these methods as it uses dynamic images as an efficient, non-parametric conditioning scheme.…”

Section: Related Work (mentioning)

confidence: 99%

Goal-Conditioned End-to-End Visuomotor Control for Versatile Skill Primitives

Groth

Hung

Vedaldi

et al.

2021

2021 IEEE International Conference on Robotics and Automation (ICRA)

Visuomotor control (VMC) is an effective means of achieving basic manipulation tasks such as pushing or pick-and-place from raw images. Conditioning VMC on desired goal states is a promising way of achieving versatile skill primitives. However, common conditioning schemes either rely on task-specific fine tuning (e.g. using meta-learning) or on sampling approaches using a forward model of scene dynamics, i.e. model-predictive control, leaving deployability and planning horizon severely limited. In this paper we propose a conditioning scheme which avoids these pitfalls by learning the controller and its conditioning in an end-to-end manner. Our model predicts complex action sequences based directly on a dynamic image representation of the robot motion and the distance to a given target observation. In contrast to related works, this enables our approach to efficiently perform complex pushing and pick-and-place tasks from raw image observations without predefined control primitives. We report significant improvements in task success over a representative model-predictive controller and also demonstrate our model's generalisation capabilities in challenging, unseen tasks handling unfamiliar objects.

“…PyBullet and MuJoCo, on the other hand, present wider integration with DL and RL libraries and gym environments. In those cases where system identification for one-shot transfer is the objective, researchers have often built or customized specific simulations that meet problem-specific requirements and constraints [32], [36], [41].…”

Section: F Simulation Environments (mentioning)

confidence: 99%

Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey

Zhao

Queralta

Westerlund

2020

Preprint

Deep reinforcement learning has recently seen huge success across multiple areas in the robotics domain. Owing to the limitations of gathering real-world data, i.e., sample inefficiency and the cost of collecting it, simulation environments are utilized for training the different agents. This not only aids in providing a potentially infinite data source, but also alleviates safety concerns with real robots. Nonetheless, the gap between the simulated and real worlds degrades the performance of the policies once the models are transferred into real robots. Multiple research efforts are therefore now being directed towards closing this sim-to-real gap and accomplishing more efficient policy transfer. Recent years have seen the emergence of multiple methods applicable to different domains, but there is a lack, to the best of our knowledge, of a comprehensive review summarizing and putting into context the different methods. In this survey paper, we cover the fundamental background behind sim-to-real transfer in deep reinforcement learning and overview the main methods being utilized at the moment: domain randomization, domain adaptation, imitation learning, meta-learning and knowledge distillation. We categorize some of the most relevant recent works, and outline the main application scenarios. Finally, we discuss the main opportunities and challenges of the different approaches and point to the most promising directions.
