2022
DOI: 10.48550/arxiv.2205.06175
Preprint

A Generalist Agent

Abstract: Inspired by progress in large-scale language modeling, we apply a similar approach towards building a single generalist agent beyond the realm of text outputs. The agent, which we refer to as Gato, works as a multi-modal, multi-task, multi-embodiment generalist policy. The same network with the same weights can play Atari, caption images, chat, stack blocks with a real robot arm and much more, deciding based on its context whether to output text, joint torques, button presses, or other tokens. In this report w…
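
The core idea in the abstract, serializing text, observations, and actions into one token stream so a single network with fixed weights can handle all of them, can be made concrete with a small sketch. The bin count, vocabulary offsets, and helper names below (NUM_BINS, TEXT_OFFSET, ACTION_OFFSET, discretize, tokenize_timestep) are illustrative assumptions, not the tokenization scheme reported in the paper.

```python
# Illustrative sketch only (not the paper's exact scheme): flatten text tokens,
# discretized continuous observations, and a discrete action into one integer
# sequence that a single autoregressive model could be trained on.

NUM_BINS = 256        # assumed number of bins for continuous values
TEXT_OFFSET = 1000    # assumed vocabulary layout: text ids start at 1000
ACTION_OFFSET = 2000  # assumed layout: discrete action ids start at 2000

def discretize(value, low=-1.0, high=1.0, bins=NUM_BINS):
    """Map a continuous value (e.g. a joint torque) to an integer bin."""
    value = max(low, min(high, value))
    return int((value - low) / (high - low) * (bins - 1))

def tokenize_timestep(text_ids, proprioception, action_id):
    """Flatten one timestep: text first, then discretized observations, then the action."""
    tokens = [TEXT_OFFSET + t for t in text_ids]
    tokens += [discretize(x) for x in proprioception]
    tokens.append(ACTION_OFFSET + action_id)
    return tokens

# Example: a short instruction, three joint readings, and a button press,
# all ending up in one flat token stream for one sequence model.
sequence = tokenize_timestep(text_ids=[7, 42], proprioception=[0.1, -0.5, 0.9], action_id=3)
print(sequence)
```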

Cited by 67 publications (91 citation statements)
References 65 publications

“…To more objectively evaluate the performance of the policies obtained with various hyperparameters and on different data sets, a series of quantitative metrics are proposed in this paper. In general, it can be said that, according to these, recurrent networks operating on complete state histories outperform simple deep neural networks operating in a Markovian regime, which is perhaps unsurprising given the recent successes achieved in applying general-purpose sequence-to-sequence learning methods to the imitation learning domain [Reed et al., 2022]. As our model precision comparisons rely on indirect estimates rather than empirical data, it is hard to make a direct comparison with state-of-the-art performers in throwing tasks such as [Zeng et al., 2020], but it should be noted that they use learning at a higher level (outputting parameters that describe each throw) and do not use model outputs to generate Cartesian motion plans directly as proposed here.…”
Section: Discussion
confidence: 99%
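
The contrast drawn in the statement above, a policy conditioned on the full state history versus one that sees only the current state, can be sketched minimally. The functions, the decay constant, and the toy observations below are assumptions for illustration and do not correspond to the cited architectures.

```python
# Toy contrast (illustrative assumptions only): a memoryless "Markovian" policy
# versus a recurrent-style policy whose hidden state summarizes the history.

def markovian_policy(obs):
    # Decision depends on the current observation alone.
    return 1.0 if obs > 0.0 else -1.0

def recurrent_policy(obs, hidden, decay=0.9):
    # Hidden state is an exponential summary of everything seen so far,
    # standing in for an RNN's learned state update.
    hidden = decay * hidden + (1.0 - decay) * obs
    action = 1.0 if hidden > 0.0 else -1.0
    return action, hidden

observations = [0.6, 0.5, -0.1, -0.05]  # a brief dip that the history smooths over
h = 0.0
for obs in observations:
    a_markov = markovian_policy(obs)
    a_recurrent, h = recurrent_policy(obs, h)
    print(f"obs={obs:+.2f}  markovian={a_markov:+.0f}  recurrent={a_recurrent:+.0f}")
```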
“…Recurrent neural networks show up in earlier scientific literature periodically, such as in predicting a time-series of robot end-effector loads in an assembly task [Scherzinger et al., 2019] and learning latent action plans from large, uncategorized play data sets [Lynch et al., 2020]. But current state-of-the-art performance across a wide variety of sequence prediction tasks, among them imitation learning in a robotics context, is given by combining a large, universal transformer model with embedding schemes specific to various data modalities [Reed et al., 2022]. These results strongly suggest that structuring one's approach to be compatible with general-purpose sequence predictor algorithms is preferable for ensuring its longevity.…”
Section: Related Work
confidence: 99%
“…Few-shot learners: The primary approach today for achieving successful few-shot learning models is to pretrain them on huge, relevant, and diverse datasets and then fine-tune them for the new tasks (Brown et al., 2020; Reed et al., 2022). The problem with this approach is that the pretrained models become specific to the datasets they were trained on (Li et al., 2017; Nalisnick et al., 2019; Yin et al., 2020; Rajendran et al., 2020).…”
Section: Relation To Other Work
confidence: 99%
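
The pretrain-then-fine-tune recipe mentioned in that statement can also be sketched in miniature. The synthetic tasks, the sgd_fit helper, and the learning rates below are assumptions chosen for brevity, not the setup of the cited works.

```python
import random

# Minimal sketch (illustrative assumptions only): fit a scalar linear model
# y = w*x + b on a large "pretraining" task, then adapt it with a few gradient
# steps on a small, related "few-shot" task.

def sgd_fit(data, w, b, lr, epochs):
    for _ in range(epochs):
        for x, y in data:
            err = (w * x + b) - y
            w -= lr * err * x
            b -= lr * err
    return w, b

random.seed(0)
xs = [random.uniform(-1, 1) for _ in range(1000)]
pretrain = [(x, 2 * x + 1 + random.gauss(0, 0.1)) for x in xs]   # y = 2x + 1, noisy
fewshot = [(x, 2 * x + 3) for x in (-0.5, 0.0, 0.5)]             # same slope, shifted intercept

w, b = sgd_fit(pretrain, w=0.0, b=0.0, lr=0.05, epochs=3)        # pretraining
w_ft, b_ft = sgd_fit(fewshot, w, b, lr=0.1, epochs=50)           # few-shot fine-tuning
print(f"pretrained: w={w:.2f} b={b:.2f}   fine-tuned: w={w_ft:.2f} b={b_ft:.2f}")
```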