2022 · Preprint
DOI: 10.48550/arxiv.2205.04667

Variational Inference MPC using Normalizing Flows and Out-of-Distribution Projection

Abstract: We propose a Model Predictive Control (MPC) method for collision-free navigation that uses amortized variational inference to approximate the distribution of optimal control sequences by training a normalizing flow conditioned on the start, goal and environment. This representation allows us to learn a distribution that accounts for both the dynamics of the robot and complex obstacle geometries. We can then sample from this distribution to produce control sequences which are likely to be both goal-directed and…
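
The abstract only sketches the method at a high level. As a rough illustration of the core idea, sampling candidate control sequences from a normalizing flow conditioned on the planning context, here is a minimal PyTorch sketch; every name in it (ConditionalAffineCoupling, ControlSequenceFlow, the dimensions, the context encoding) is an illustrative assumption, not the authors' implementation.

```python
# Hypothetical sketch: sampling control sequences from a conditional
# normalizing flow, in the spirit of the abstract. Not the authors' code.
import torch
import torch.nn as nn


class ConditionalAffineCoupling(nn.Module):
    """One RealNVP-style coupling layer whose scale/shift networks also
    see the conditioning context (start, goal, environment encoding)."""

    def __init__(self, dim, context_dim, hidden=128):
        super().__init__()
        self.half = dim // 2
        in_dim = self.half + context_dim
        out_dim = dim - self.half
        self.scale = nn.Sequential(
            nn.Linear(in_dim, hidden), nn.ReLU(), nn.Linear(hidden, out_dim), nn.Tanh()
        )
        self.shift = nn.Sequential(
            nn.Linear(in_dim, hidden), nn.ReLU(), nn.Linear(hidden, out_dim)
        )

    def forward(self, z, context):
        z1, z2 = z[:, : self.half], z[:, self.half :]
        h = torch.cat([z1, context], dim=-1)
        z2 = z2 * torch.exp(self.scale(h)) + self.shift(h)
        # Flip the halves so successive layers transform different coordinates.
        return torch.cat([z2, z1], dim=-1)


class ControlSequenceFlow(nn.Module):
    """Maps Gaussian noise to a flattened control sequence u_{0:H-1},
    conditioned on a context vector."""

    def __init__(self, horizon, control_dim, context_dim, n_layers=6):
        super().__init__()
        self.horizon, self.control_dim = horizon, control_dim
        self.dim = horizon * control_dim
        self.layers = nn.ModuleList(
            ConditionalAffineCoupling(self.dim, context_dim) for _ in range(n_layers)
        )

    def sample(self, context, n_samples):
        z = torch.randn(n_samples, self.dim)
        ctx = context.expand(n_samples, -1)
        for layer in self.layers:
            z = layer(z, ctx)
        return z.view(n_samples, self.horizon, self.control_dim)


# Usage sketch: draw candidate control sequences for one MPC step, then
# score them with the task cost (dynamics rollout plus collision checking,
# both omitted here) and execute the first action of the best sequence.
flow = ControlSequenceFlow(horizon=20, control_dim=2, context_dim=32)
context = torch.randn(1, 32)  # stand-in for [start; goal; env encoding]
candidates = flow.sample(context, n_samples=64)  # shape (64, 20, 2)
```

Conditioning every coupling layer on the same context vector is one common way to amortize a flow across start/goal/environment instances, which matches the abstract's description of a distribution trained per-context rather than per-problem.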

Cited by 3 publications (3 citation statements) · References 21 publications
“…On the other hand, modern learning techniques such as neural networks can learn an embedding of a larger set of parameters that maps to the solutions [16]. One advantage of using neural network methods is the capability to generalize to out-of-distribution situations that are not included in the training set [29].…”
Section: Related Work, A. Parametric Programming · Citation type: mentioning · Confidence: 99%
“…Learning a prior distribution of actions and subgoals has been used to speed up MPC and accomplish complex tasks. Power and Berenson [25] leverage normalizing flow for modeling the action distributions. Wang and Ba [26] use a policy network to initialize the action sequences for MPC.…”
Section: MPC With a Learned Prior · Citation type: mentioning · Confidence: 99%
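
The quote above describes warm-starting MPC from a learned prior over actions. A hypothetical sketch of that pattern, in the same spirit: a learned policy proposes a nominal action sequence, and a sampling-based MPC (plain random shooting here) perturbs it instead of searching from scratch. The names `policy`, `dynamics`, and `cost` are stand-ins, not any cited paper's API.

```python
# Hypothetical warm-started sampling-based MPC step; not from any cited paper.
import torch


def mpc_step(policy, dynamics, cost, state, horizon=15, n_samples=256, noise=0.1):
    # Roll the learned policy forward to get a nominal action sequence.
    nominal, s = [], state
    for _ in range(horizon):
        a = policy(s)
        nominal.append(a)
        s = dynamics(s, a)
    nominal = torch.stack(nominal)  # (horizon, action_dim)

    # Sample perturbations around the nominal sequence and score each rollout.
    samples = nominal + noise * torch.randn(n_samples, *nominal.shape)
    costs = torch.zeros(n_samples)
    for i in range(n_samples):
        s = state
        for t in range(horizon):
            costs[i] += cost(s, samples[i, t])
            s = dynamics(s, samples[i, t])

    # Receding horizon: execute only the first action of the best sequence.
    return samples[costs.argmin(), 0]
```

Centering the sampling distribution on a learned proposal is what "speeds up" MPC here: far fewer samples are wasted on action sequences the prior already rules out.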
“…Of particular relevance to our framework are methods that combine principled control strategies with learned components in a hierarchical way. Examples include using LQR control in the inner problem with learnable cost and dynamics (Tamar et al., 2017; Amos et al., 2018; Agrawal et al., 2019b), learning sampling distributions in planning and control (Ichter et al., 2018; Power & Berenson, 2022; Amos & Yarats, 2020), or learning optimization strategies or goals for optimization-based control (Sacks & Boots, 2022; Xiao et al., 2022; Metz et al., 2019; 2022; Lew et al., 2022).…”
Section: Related Work · Citation type: mentioning · Confidence: 99%