2005
DOI: 10.1088/1742-5468/2005/11/p11011
Path integrals and symmetry breaking for optimal control theory

Abstract: This paper considers linear-quadratic control of a non-linear dynamical system subject to arbitrary cost. I show that for this class of stochastic control problems the non-linear Hamilton-Jacobi-Bellman equation can be transformed into a linear equation. The transformation is similar to the transformation used to relate the classical Hamilton-Jacobi equation to the Schrödinger equation. As a result of the linearity, the usual backward computation can be replaced by a forward diffusion process that can be comp…
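The linearization the abstract refers to can be sketched as follows (a standard log-transform argument in the style of Kappen's path-integral control work; the notation below is my own summary, not quoted from this paper). Take dynamics dx = (b(x,t) + u)dt + dξ with noise variance ν, quadratic control cost ½u⊤Ru, and state cost V(x,t):

```latex
% HJB with quadratic control cost; minimizing over u gives u^* = -R^{-1}\nabla J:
-\partial_t J = -\tfrac{1}{2}(\nabla J)^\top R^{-1} \nabla J + V + b^\top \nabla J
              + \tfrac{\nu}{2}\,\nabla^2 J .
% The log transform \psi = e^{-J/\lambda}, with \lambda R^{-1} = \nu, cancels the
% quadratic term and yields a linear, Schrodinger-like backward equation:
\partial_t \psi = \Big( \tfrac{V}{\lambda} - b^\top \nabla
              - \tfrac{\nu}{2}\,\nabla^2 \Big) \psi .
```

Because the equation for ψ is linear, it admits a Feynman-Kac representation, which is what allows the backward computation to be replaced by a forward diffusion.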

Cited by 238 publications (282 citation statements)
References 16 publications
“…A brief version is provided for the reader's convenience; for the technical details we refer to, e.g., [36]. Let L^u = Δ + (u − ∇V(x)) · ∇ denote the infinitesimal generator of (6). Here the superscript indicates the explicit dependence on the control variable.…”
Section: (10)
mentioning confidence: 99%
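The generator in the quoted statement follows from Itô's formula; a brief sketch, assuming (as the unit-coefficient Laplacian suggests) dynamics of the form dX = (u − ∇V(X))dt + √2 dW:

```latex
% Ito's formula for dX_t = (u - \nabla V(X_t))\,dt + \sqrt{2}\,dW_t:
df(X_t) = \big[\, (u - \nabla V(X_t))\cdot\nabla f(X_t) + \Delta f(X_t) \,\big]\,dt
        + \sqrt{2}\,\nabla f(X_t)\cdot dW_t ,
% so the infinitesimal generator acting on smooth f is
L^u f = \Delta f + (u - \nabla V(x))\cdot\nabla f .
```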
“…Here the superscript indicates the explicit dependence on the control variable. We have to show that the solutions to (10) yield optimal controls that maximize (8)-(9) subject to (6). Now choose a…”
Section: (10)
mentioning confidence: 99%
“…Additionally, we require both W and R to be positive definite and bounded everywhere on Ω, but otherwise impose no restrictions on them. Contrary to the assumptions in previous work [9,10,14] and the work of Kappen [7] and Broek et al [12], they are no longer required to be related as inverses of each other. As formulated, the control u and the noise w enter the state equation via the same matrix G. However, the problem can easily be reformulated such that the control and noise enter via different matrices as long as they have the same column space [14].…”
Section: Problem Formulation
mentioning confidence: 82%
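The last remark in the quoted passage, about control and noise entering via different matrices with a common column space, can be made concrete as follows (my own illustration of the reformulation, not taken verbatim from [14]):

```latex
% Suppose dx = f(x)\,dt + B\,u\,dt + C\,dw with
% \mathrm{Col}(B) = \mathrm{Col}(C) = \mathrm{Col}(G).
% Write B = G M_B and C = G M_C with M_B, M_C invertible, and substitute
% \tilde u = M_B u, \; d\tilde w = M_C\,dw :
dx = f(x)\,dt + G\,\big( \tilde u\,dt + d\tilde w \big),
% with the control-cost weight transformed accordingly:
\tfrac{1}{2}\, u^\top R\, u \;=\; \tfrac{1}{2}\, \tilde u^\top
  \big( M_B^{-\top} R\, M_B^{-1} \big)\, \tilde u .
```

The noise covariance is likewise rescaled by M_C, so the reformulated problem has the common-matrix form assumed in the quoted formulation.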
“…In recent years, general reinforcement learning has yielded three kinds of policy search approaches that have translated particularly well into the domain of robotics: (i) policy gradients approaches based on likelihood-ratio estimation [Sutton et al, 1999], (ii) policy updates inspired by expectation-maximization [Toussaint et al, 2010], and (iii) the path integral methods [Kappen, 2005]. Likelihood-ratio policy gradient methods rely on perturbing the motor command instead of comparing in policy space.…”
Section: Policy Search
mentioning confidence: 99%
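As a minimal illustration of the path-integral method referenced above, here is a toy 1-D Monte Carlo controller (my own sketch under simplifying assumptions, not code from any of the cited works): uncontrolled rollouts are reweighted by exp(−S/λ), and the optimal control is the weighted average of the first noise increment. For dynamics dx = u dt + dξ with unit noise and cost ∫½u²dt + ½x_T², the exact LQ answer at t = 0 is u = −x₀/(1 + T).

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy 1-D path-integral control estimate. Uncontrolled dynamics: dx = dxi,
# noise variance nu; the running state cost is zero, and the path cost is
# purely terminal: phi(x_T) = 0.5 * x_T**2.
nu, T, n_steps, n_samples = 1.0, 1.0, 50, 20_000
dt = T / n_steps
lam = nu  # lambda = nu * R, with control-cost weight R = 1


def pi_control(x0):
    """Monte Carlo estimate of the optimal control u(x0) at t = 0."""
    # Sample uncontrolled rollouts: Gaussian noise increments per step.
    dxi = rng.normal(0.0, np.sqrt(nu * dt), size=(n_samples, n_steps))
    x_T = x0 + dxi.sum(axis=1)
    S = 0.5 * x_T**2                  # path cost (terminal only here)
    w = np.exp(-(S - S.min()) / lam)  # importance weights (shifted for stability)
    # Optimal control = weighted mean of the first noise increment, per unit time.
    return float((w * dxi[:, 0]).sum() / (w.sum() * dt))


u0 = pi_control(x0=2.0)  # exact LQ answer: -x0 / (1 + T) = -1.0
```

Subtracting the minimum cost before exponentiating leaves the weighted average unchanged but avoids underflow; in higher dimensions or with longer horizons the weights degenerate, which is the importance-sampling difficulty that much of the follow-up literature addresses.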