Information-Theoretic Model Predictive Control: Theory and Applications to Autonomous Driving

Williams, Grady; Drews, Paul; Goldfain, Brian; Rehg, James M.; Theodorou, Evangelos A.

doi:10.1109/tro.2018.2865891

Cited by 192 publications

(187 citation statements)

References 42 publications

Supporting

Mentioning

182

Contrasting


“…The iterative path integral methods [2]-[5] are optimization methods for the stochastic optimal control problem. These methods assume that the system noise t is zero-mean Gaussian, t ∼ N(0, Σ), with a covariance matrix Σ ∈ R^(m×m), and suppose a trajectory cost function S(τ) as the sum of arbitrary state-cost and quadratic control-cost over the time-horizon:…”

Section: B Iterative Path Integral Methods (mentioning)

confidence: 99%
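The setup in the statement above (zero-mean Gaussian noise with covariance Σ, and a trajectory cost S(τ) that sums an arbitrary state cost and a quadratic control cost) can be illustrated with a minimal Python sketch. This is a simplified illustration under stated assumptions, not the exact update from the cited papers: the names `trajectory_cost`, `path_integral_update`, `dynamics`, `state_cost`, the control-cost matrix `R`, and the temperature `lam` are all hypothetical, and the weighting is a plain exponentiated-cost average over sampled perturbations.

```python
import numpy as np

def trajectory_cost(states, controls, state_cost, R):
    """S(tau) = sum_t state_cost(x_t) + 0.5 * sum_t u_t^T R u_t (assumed form)."""
    q = sum(state_cost(x) for x in states)
    quad = 0.5 * np.einsum("ti,ij,tj->", controls, R, controls)
    return q + quad

def path_integral_update(u, dynamics, state_cost, x0, Sigma, R,
                         lam=1.0, n_samples=256, rng=None):
    """One sampling-based update of a nominal control sequence u (shape T x m).

    Hypothetical sketch: perturb u with Gaussian noise, roll out each sample,
    and average the perturbations with weights proportional to exp(-S / lam).
    """
    rng = np.random.default_rng() if rng is None else rng
    T, m = u.shape
    # eps has shape (n_samples, T, m); each entry is drawn from N(0, Sigma).
    eps = rng.multivariate_normal(np.zeros(m), Sigma, size=(n_samples, T))
    costs = np.empty(n_samples)
    for k in range(n_samples):
        x = x0
        states = []
        v = u + eps[k]  # perturbed control sequence for this rollout
        for t in range(T):
            x = dynamics(x, v[t])
            states.append(x)
        costs[k] = trajectory_cost(states, v, state_cost, R)
    # Subtract the minimum cost before exponentiating for numerical stability.
    w = np.exp(-(costs - costs.min()) / lam)
    w /= w.sum()
    return u + np.einsum("k,ktm->tm", w, eps)
```

Because the dynamics and state cost are passed in as arbitrary callables, the sketch never requires them to be linear or quadratic; only the control cost is assumed quadratic.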

“…This section briefly reviews the formulation of stochastic optimal control problem, Williams's iterative path integral methods [2]-[5], and PI-Net [6].…”

Section: Preliminaries (mentioning)

confidence: 99%

“…This approach solves the intrinsic problem of primitive path integral methods that require almost infinite samples for optimal solutions. Moreover, Williams derived different iterative methods [4], [5], which eliminate the affine dynamics constraints on the original path integral framework. This paper focuses on this type of iterative path integral methods.…”

Section: Introduction (mentioning)

confidence: 99%

“…First, as it is not necessary to approximate dynamics and cost models with linear and quadratic forms, nonlinear system dynamics and cost functions can be naturally employed. In [2], [5], highly nonlinear car dynamics have been employed to successfully control a miniature real vehicle that functions autonomously and with aggressive drifting. Dynamics can also be represented as trainable models, i.e., neural networks, thus allowing model-based reinforcement learning tasks to be solved.…”

Section: Introduction (mentioning)

confidence: 99%


Acceleration of Gradient-Based Path Integral Method for Efficient Optimal and Inverse Optimal Control

Okada

Taniguchi

2018

2018 IEEE International Conference on Robotics and Automation (ICRA)


This paper deals with a new accelerated path integral method, which iteratively searches optimal controls with a small number of iterations. This study is based on the recent observations that a path integral method for reinforcement learning can be interpreted as gradient descent. This observation also applies to an iterative path integral method for optimal control, which sets a convincing argument for utilizing various optimization methods for gradient descent, such as momentum-based acceleration, step-size adaptation and their combination. We introduce these types of methods to the path integral and demonstrate that momentum-based methods, like Nesterov Accelerated Gradient and Adam, can significantly improve the convergence rate to search for optimal controls in simulated control systems. We also demonstrate that the accelerated path integral could improve the performance on model predictive control for various vehicle navigation tasks. Finally, we represent this accelerated path integral method as a recurrent network, which is the accelerated version of the previously proposed path integral networks (PI-Net). We can train the accelerated PI-Net more efficiently for inverse optimal control with less RAM than the original PI-Net.
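The idea in this abstract, that an iterative path integral update can be read as a gradient-descent step and therefore combined with momentum, can be sketched as follows. This is a hypothetical illustration, not the authors' algorithm: the abstract names Nesterov Accelerated Gradient and Adam, whereas the sketch uses plain heavy-ball momentum for brevity, and the names `momentum_step`, `delta`, `beta`, and `step` are assumptions. Here `delta` stands for the cost-weighted average perturbation produced by one sampling pass, treated as the descent direction.

```python
import numpy as np

def momentum_step(u, velocity, delta, beta=0.9, step=1.0):
    """Apply one heavy-ball momentum update to a control sequence.

    u        : current control sequence, shape (T, m)
    velocity : running momentum term, same shape as u
    delta    : update direction from one sampling pass (assumed given)
    beta     : momentum coefficient; beta = 0 recovers the plain update
    step     : step size applied to delta
    """
    velocity = beta * velocity + step * delta
    return u + velocity, velocity

# Usage: repeated steps with a constant direction accumulate velocity.
u = np.zeros((3, 1))
vel = np.zeros_like(u)
direction = np.ones_like(u)
for _ in range(2):
    u, vel = momentum_step(u, vel, direction, beta=0.5)
```

With `beta = 0` the function reduces to the unaccelerated update `u + step * delta`, which makes it easy to compare the two variants on the same problem.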


“…DMD-MPC is based on a first-order online learning algorithm called dynamic mirror descent (DMD) [14], a generalization of mirror descent [4] for dynamic comparators. We show that several existing MPC algorithms [31,32] are special cases of DMD-MPC, given specific choices of step sizes, loss functions, and regularization. Furthermore, we demonstrate how new MPC algorithms can be derived systematically from DMD-MPC with only mild assumptions on the regularity of the cost function.…”

Section: Introduction (mentioning)

confidence: 99%

An Online Learning Approach to Model Predictive Control

Wagener, Cheng, Sacks et al.

2019

Robotics: Science and Systems XV


Model predictive control (MPC) is a powerful technique for solving dynamic control tasks. In this paper, we show that there exists a close connection between MPC and online learning, an abstract theoretical framework for analyzing online decision making in the optimization literature. This new perspective provides a foundation for leveraging powerful online learning algorithms to design MPC algorithms. Specifically, we propose a new algorithm based on dynamic mirror descent (DMD), an online learning algorithm that is designed for non-stationary setups. Our algorithm, Dynamic Mirror Descent Model Predictive Control (DMD-MPC), represents a general family of MPC algorithms that includes many existing techniques as special instances. DMD-MPC also provides a fresh perspective on previous heuristics used in MPC and suggests a principled way to design new MPC algorithms. In the experimental section of this paper, we demonstrate the flexibility of DMD-MPC, presenting a set of new MPC algorithms on a simple simulated cartpole and a simulated and real-world aggressive driving task. Videos of the real-world experiments can be found at https://youtu.be/vZST3v0_S9w and https://youtu.be/MhuqiHo2t98.


Control of Bioprocess

2020

Control in Bioprocessing

