2018
DOI: 10.1109/tro.2018.2865891
|View full text |Cite
|
Sign up to set email alerts
|

Information-Theoretic Model Predictive Control: Theory and Applications to Autonomous Driving

Abstract: We present an information theoretic approach to stochastic optimal control problems that can be used to derive general sampling based optimization schemes. This new mathematical method is used to develop a sampling based model predictive control algorithm. We apply this information theoretic model predictive control (IT-MPC) scheme to the task of aggressive autonomous driving around a dirt test track, and compare its performance to a model predictive control version of the cross-entropy method.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
182
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 192 publications
(187 citation statements)
references
References 42 publications
0
182
0
Order By: Relevance
“…The iterative path integral methods [2]- [5] are optimization methods for the stochastic optimal control problem. These methods assume that the system noise t is zeromean Gaussian t ∼ N (0, Σ) with a covariance matrix Σ ∈ R m×m , and suppose a trajectory cost function S(τ ) as the sum of arbitrary state-cost and quadratic control-cost over time time-horizon:…”
Section: B Iterative Path Integral Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…The iterative path integral methods [2]- [5] are optimization methods for the stochastic optimal control problem. These methods assume that the system noise t is zeromean Gaussian t ∼ N (0, Σ) with a covariance matrix Σ ∈ R m×m , and suppose a trajectory cost function S(τ ) as the sum of arbitrary state-cost and quadratic control-cost over time time-horizon:…”
Section: B Iterative Path Integral Methodsmentioning
confidence: 99%
“…This section briefly reviews the formulation of stochastic optimal control problem, Williams's iterative path integral methods [2]- [5], and PI-Net [6].…”
Section: Preliminariesmentioning
confidence: 99%
See 2 more Smart Citations
“…DMD-MPC is based on a first-order online learning algorithm called dynamic mirror descent (DMD) [14], a generalization of mirror descent [4] for dynamic comparators. We show that several existing MPC algorithms [31,32] are special cases of DMD-MPC, given specific choices of step sizes, loss functions, and regularization. Furthermore, we demonstrate how new MPC algorithms can be derived systematically from DMD-MPC with only mild assumptions on the regularity of the cost function.…”
Section: Introductionmentioning
confidence: 99%