Jonas Rothfuss scite author profile

Jonas Rothfuss

5 Publications

142 Citation Statements Received

84 Citation Statements Given

Affiliations

ETH Zurich, Karlsruhe Institute of Technology, École Polytechnique Fédérale de Lausanne

Publications

ProMP: Proximal Meta-Policy Search

Rothfuss, Lee, Clavera et al. 2018

Preprint

Credit assignment in Meta-reinforcement learning (Meta-RL) is still poorly understood. Existing methods either neglect credit assignment to pre-adaptation behavior or implement it naively. This leads to poor sample-efficiency during meta-training as well as ineffective task identification strategies. This paper provides a theoretical analysis of credit assignment in gradient-based Meta-RL. Building on the gained insights, we develop a novel meta-learning algorithm that overcomes both the issue of poor credit assignment and previous difficulties in estimating meta-policy gradients. By controlling the statistical distance of both pre-adaptation and adapted policies during meta-policy search, the proposed algorithm enables efficient and stable meta-learning. Our approach leads to superior pre-adaptation policy behavior and consistently outperforms previous Meta-RL algorithms in sample-efficiency, wall-clock time, and asymptotic performance.

PACOH: Bayes-Optimal Meta-Learning with PAC-Guarantees

Rothfuss, Fortuin, Josifoski et al. 2020

Preprint

Deep Episodic Memory: Encoding, Recalling, and Predicting Episodic Experiences for Robot Action Execution

Rothfuss, Ferreira, Aksoy et al. 2018

IEEE Robot. Autom. Lett.

We present a novel deep neural network architecture for representing robot experiences in an episodic-like memory which facilitates encoding, recalling, and predicting action experiences. Our proposed unsupervised deep episodic memory model 1) encodes observed actions in a latent vector space and, based on this latent encoding, 2) infers most similar episodes previously experienced, 3) reconstructs original episodes, and 4) predicts future frames in an end-to-end fashion. Results show that conceptually similar actions are mapped into the same region of the latent vector space. Based on these results, we introduce an action matching and retrieval mechanism, benchmark its performance on two large-scale action datasets, 20BN-something-something and ActivityNet, and evaluate its generalization capability in a real-world scenario on a humanoid robot.

DiBS: Differentiable Bayesian Structure Learning

Lorch, Rothfuss, Schölkopf et al. 2021

Preprint

Bayesian structure learning allows inferring Bayesian network structure from data while reasoning about the epistemic uncertainty, a key element towards enabling active causal discovery and designing interventions in real-world systems. In this work, we propose a general, fully differentiable framework for Bayesian structure learning (DiBS) that operates in the continuous space of a latent probabilistic graph representation. Building on recent advances in variational inference, we use DiBS to devise an efficient method for approximating posteriors over structural models. Contrary to existing work, DiBS is agnostic to the form of the local conditional distributions and allows for joint posterior inference of both the graph structure and the conditional distribution parameters. This makes our method directly applicable to posterior inference of nonstandard Bayesian network models, e.g., with nonlinear dependencies encoded by neural networks. In evaluations on simulated and real-world data, DiBS significantly outperforms related approaches to joint posterior inference.

Robustness to Pruning Predicts Generalization in Deep Neural Networks

Kuhn, Lyle, Gomez et al. 2021

Preprint
