“…Several prior works [50,56,59,44] maximize MI objectives that closely resemble the forward information objective we introduce in Section 4, while others optimize related objectives by learning latent forward dynamics models [69,33,73,26,39]. Multi-step inverse models, closely related to the inverse information objective (Section 4), have been used to learn control-centric representations [70,23]. Single-step inverse models have been deployed as regularizers for forward models [72,2] and as an auxiliary loss for policy-gradient RL [57,52].…”