2016
DOI: 10.1007/s10955-016-1446-7
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive Importance Sampling for Control and Inference

Abstract: Path integral (PI) control problems are a restricted class of non-linear control problems that can be solved formally as a Feynman-Kac PI and can be estimated using Monte Carlo sampling. In this contribution we review PI control theory in the finite horizon case. We subsequently focus on the problem how to compute and represent control solutions. We review the most commonly used methods in robotics and control. Within the PI theory, the question of how to compute becomes the question of importance sampling. Ef… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
95
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 85 publications
(99 citation statements)
references
References 46 publications
1
95
0
Order By: Relevance
“…[24] uses path sampling techniques to harvest nonequilibrium protocols in proportion to their average dissipation. Trajectory space Monte Carlo techniques have also been developed for use in stochastic optimal control theory to iteratively refine importance sampling distributions [25,26,32], exploiting the connection between importance sampling and optimal control [33,34]. With a bias that favors low dissipation, Gingrich et al [24] explore an ensemble of low dissipation protocols and show that there is a large number of protocols with a dissipation near the minimum achievable value.…”
Section: Introductionmentioning
confidence: 99%
“…[24] uses path sampling techniques to harvest nonequilibrium protocols in proportion to their average dissipation. Trajectory space Monte Carlo techniques have also been developed for use in stochastic optimal control theory to iteratively refine importance sampling distributions [25,26,32], exploiting the connection between importance sampling and optimal control [33,34]. With a bias that favors low dissipation, Gingrich et al [24] explore an ensemble of low dissipation protocols and show that there is a large number of protocols with a dissipation near the minimum achievable value.…”
Section: Introductionmentioning
confidence: 99%
“…Note that all the estimations are unbiased [7,6]. Especially, it is proven that the variance of estimation decreases as u in becomes closer to the real optimal control u * [6].…”
Section: Change Of Measure (Importance Sampling)mentioning
confidence: 87%
“…In [6], path-integral formula is utilized to construct a state-dependent feedback controller and theoretical analysis on how sampling strategies affect the estimation results is presented. In [7], the cross entropy method was applied to build an efficient importance sampler that reduces estimation variance. In [8], the rapidly-exploring random tree (RRT) algorithm was used to help the importance sampler to pick valuable samples.…”
Section: Introductionmentioning
confidence: 99%
“…It turns out, that the gradient with respect to θ can be estimated iteratively by a Monte Carlo method using adaptive importance sampling. [28,29]…”
Section: Variational Path Inferencementioning
confidence: 99%