Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection

Levine, Sergey; Pástor, Peter; Krizhevsky, Alex; Ibarz, Julian; Quillen, Deirdre

doi:10.1177/0278364917710318

Cited by 1,595 publications

(1,265 citation statements)

References 53 publications

Supporting

Mentioning

1,201

Contrasting

Unclassified

Order By: Relevance

“…Our approach requires relatively fewer training data for learning, as compared with other CNN based motor learning approaches [7], [8], [18]. Applying RL for learning a motor skill can require a lot of trials [8].…”

Section: Discussionmentioning

confidence: 99%

“…Since we want to use camera images, if a human is always present in the images during demonstrations, a CNN can learn human specific feature. Now during motion reproduction, if the human is not present in the image, then it can result in a failure of the task during reproduction phase [7], [8], [18]. Alternatively a human can provide teleoperated demonstrations as in [11], [18].…”

Section: A Deep-dmpmentioning

confidence: 99%

“…We select 45 motions (21600 data-points) for the training set while the remaining 5 motions (2400 data-points) are used for validating the learned model. This is lower than 800,000 grasp attempts in [7], at least 156 execution trails in [8] and 24, 500 and 3500 data-points in training and validation sets respectively in [18]. RGB images were captured with a Kinect Xbox 360 camera.…”

Section: A Deep-dmpmentioning

confidence: 99%

“…The use of such dedicated systems limits the use of these approaches for real world scenarios. Levine et al have shown that Convolutional Neural Networks (CNNs) can be used for generating motor actions, by extracting useful features directly from camera images [8]. Their experiment consists of learning robotic grasping from monocular images by using RL.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Learning deep movement primitives using convolutional neural networks

Pervez

Mao

Lee

2017

2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids)

View full text Add to dashboard Cite

Abstract-Dynamic Movement Primitives (DMPs) are widely used for encoding motion data. Task parameterized DMP (TP-DMP) can adapt a learned skill to different situations. Mostly a customized vision system is used to extract task specific variables. This limits the use of such systems to real world scenarios. This paper proposes a method for combining the DMP with a Convolutional Neural Network (CNN). Our approach preserves the generalization properties associated with a DMP, while the CNN learns the task specific features from the camera images. This eliminates the need to extract the task parameters, by directly utilizing the camera image during the motion reproduction. The performance of the developed approach is demonstrated through a trash cleaning task, executed with a real robot. We also show that by using the data augmentation, the learned sweeping skill can be generalized for arbitrary objects. The experiments show the robustness of our approach for several different settings.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: A Deep-dmpmentioning

confidence: 99%

Section: A Deep-dmpmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Learning deep movement primitives using convolutional neural networks

Pervez

Mao

Lee

2017

2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids)

View full text Add to dashboard Cite

show abstract

“…While the results are impressive, these methods usually require extensive amount of experimental data [17,25] or relatively restrictive settings [16]. It is unclear whether these method would work directly on more dynamic motor skills in the real-world, such as locomotion.…”

Section: Related Work a Deep Reinforcement Learningmentioning

confidence: 99%

Preparing for the Unknown: Learning a Universal Policy with Online System Identification

Yu¹,

Tan²,

Liu³

et al. 2017

Robotics: Science and Systems XIII

177

148

View full text Add to dashboard Cite

Abstract-We present a new method of learning control policies that successfully operate under unknown dynamic models. We create such policies by leveraging a large number of training examples that are generated using a physical simulator. Our system is made of two components: a Universal Policy (UP) and a function for Online System Identification (OSI). We describe our control policy as universal because it is trained over a wide array of dynamic models. These variations in the dynamic model may include differences in mass and inertia of the robots components, variable friction coefficients, or unknown mass of an object to be manipulated. By training the Universal Policy with this variation, the control policy is prepared for a wider array of possible conditions when executed in an unknown environment. The second part of our system uses the recent state and action history of the system to predict the dynamics model parameters µ. The value of µ from the Online System Identification is then provided as input to the control policy (along with the system state). Together, UP-OSI is a robust control policy that can be used across a wide range of dynamic models, and that is also responsive to sudden changes in the environment. We have evaluated the performance of this system on a variety of tasks, including the problem of cart-pole swing-up, the double inverted pendulum, locomotion of a hopper, and block-throwing of a manipulator. UP-OSI is effective at these tasks across a wide range of dynamic models. Moreover, when tested with dynamic models outside of the training range, UP-OSI outperforms the Universal Policy alone, even when UP is given the actual value of the model dynamics. In addition to the benefits of creating more robust controllers, UP-OSI also holds out promise of narrowing the Reality Gap between simulated and real physical systems.

show abstract

Applications of Artificial Intelligence, ML, and DL

Yarali¹

2021

Intelligent Connectivity

View full text Add to dashboard Cite

Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection

Cited by 1,595 publications

References 53 publications

Learning deep movement primitives using convolutional neural networks

Learning deep movement primitives using convolutional neural networks

Preparing for the Unknown: Learning a Universal Policy with Online System Identification

Applications of Artificial Intelligence, ML, and DL

Contact Info

Product

Resources

About