Daniel Geng scite author profile

Daniel Geng

4 Publications

24 Citation Statements Received

164 Citation Statements Given

Affiliations

Georgia Institute of Technology

Publications

SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments

Berseth, Geng, Devin et al. 2019 (Preprint)

All living organisms struggle against the forces of nature to carve out a maintainable niche. We propose that such a search for order amidst chaos might offer a unifying principle for the emergence of useful behaviors in artificial agents. We formalize this idea into an unsupervised reinforcement learning method called Surprise Minimizing RL (SMiRL). SMiRL alternates between learning a density model to evaluate the surprise of a stimulus, and improving the policy to seek more predictable stimuli. This process maximizes a lower bound on the negative entropy of the states, which can be seen as maximizing the agent's ability to maintain order in the environment. The policy seeks out stable and repeatable situations that counteract the environment's prevailing sources of entropy. This might include avoiding other hostile agents, or finding a stable, balanced pose for a bipedal robot in the face of disturbance forces. We demonstrate that our surprise minimizing agents can successfully play Tetris, Doom, control a humanoid to avoid falls, and navigate to escape enemies in a maze without any task-specific reward supervision. We further show that SMiRL can be used together with standard task rewards to accelerate reward-driven learning.
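The density-model half of the loop described in this abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes a running diagonal Gaussian as the density model and uses the log-likelihood of a state as the reward, and the names `DiagonalGaussian`, `surprise_reward`, and `min_var` are hypothetical. The policy-improvement step is left out.

```python
import math


class DiagonalGaussian:
    """Running diagonal Gaussian over visited states (illustrative stand-in
    for the density model; the model choice is an assumption)."""

    def __init__(self, dim, min_var=1e-2):
        self.n = 0
        self.mean = [0.0] * dim
        self.m2 = [0.0] * dim      # running sum of squared deviations
        self.min_var = min_var     # variance floor, keeps log_prob finite

    def update(self, state):
        # Welford's online update of per-dimension mean and variance.
        self.n += 1
        for i, x in enumerate(state):
            delta = x - self.mean[i]
            self.mean[i] += delta / self.n
            self.m2[i] += delta * (x - self.mean[i])

    def log_prob(self, state):
        # Log-density of `state`; unit variance is used before any data is seen.
        total = 0.0
        for i, x in enumerate(state):
            var = max(self.m2[i] / self.n, self.min_var) if self.n > 0 else 1.0
            total += -0.5 * (math.log(2 * math.pi * var)
                             + (x - self.mean[i]) ** 2 / var)
        return total


def surprise_reward(model, state):
    """Reward a state by how predictable it is, then fit the model to it."""
    reward = model.log_prob(state)   # higher means less surprising
    model.update(state)
    return reward
```

Under these assumptions, a state the agent has visited many times earns a higher reward than one far from its experience, so any reinforcement learning algorithm trained on `surprise_reward` is pushed toward familiar, repeatable states.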

Plan Arithmetic: Compositional Plan Vectors for Multi-Task Control

Devin, Geng, Abbeel et al. 2019 (Preprint)

Comparing Correspondences: Video Prediction with Correspondence-wise Losses

Geng, Owens 2021 (Preprint)

Today's image prediction methods struggle to change the locations of objects in a scene, producing blurry images that average over the many positions they might occupy. In this paper, we propose a simple change to existing image similarity metrics that makes them more robust to positional errors: we match the images using optical flow, then measure the visual similarity of corresponding pixels. This change leads to crisper and more perceptually accurate predictions, and can be used with any image prediction network. We apply our method to predicting future frames of a video, where it obtains strong performance with simple, off-the-shelf architectures.
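A toy one-dimensional sketch can show the idea of matching before comparing. It is not the paper's method: a brute-force search over a single global integer shift stands in for optical flow, signals are plain lists instead of images, and the names `l1`, `warp`, `correspondence_wise_l1`, and `max_shift` are hypothetical.

```python
def l1(a, b):
    """Mean absolute difference between two equal-length signals."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def warp(signal, shift):
    """Resample `signal` at index i + shift, clamping at the borders."""
    n = len(signal)
    return [signal[min(max(i + shift, 0), n - 1)] for i in range(n)]


def correspondence_wise_l1(pred, target, max_shift=2):
    """Align `pred` to `target` first, then measure L1 on the aligned pair.

    The alignment is one global shift found by exhaustive search, a crude
    stand-in for a per-pixel optical flow field.
    """
    shifts = range(-max_shift, max_shift + 1)
    best = min(shifts, key=lambda s: l1(warp(pred, s), target))
    return l1(warp(pred, best), target)


target = [0, 0, 1, 0, 0, 0]
pred = [0, 0, 0, 1, 0, 0]   # right content, one position off

plain = l1(pred, target)
matched = correspondence_wise_l1(pred, target)
```

Here `plain` penalizes the prediction at both the true and the predicted position, while `matched` compares corresponding samples after alignment and so does not punish the positional error. That is the property the abstract attributes to correspondence-wise losses.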

Energy Minimization in Overtaking for Autonomous Vehicles in a Bidirectional Environment

Chio, Geng, Han et al. 2023
