Robotics: Science and Systems XI 2015
DOI: 10.15607/rss.2015.xi.029
|View full text |Cite
|
Sign up to set email alerts
|

Robust Trajectory Optimization: A Cooperative Stochastic Game Theoretic Approach

Abstract: Abstract-We present a novel trajectory optimization framework to address the issue of robustness, scalability and efficiency in optimal control and reinforcement learning. Based on prior work in Cooperative Stochastic Differential Game (CSDG) theory, our method performs local trajectory optimization using cooperative controllers. The resulting framework is called Cooperative Game-Differential Dynamic Programming (CG-DDP). Compared to related methods, CG-DDP exhibits improved performance in terms of robustness … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(17 citation statements)
references
References 28 publications
0
17
0
Order By: Relevance
“…Iterative methods via linearization have been widely used in real-time OCP (Pan et al, 2015;Tassa et al, 2014). We adopt a similar methodology for its computational efficiency and algorithmic connection to existing training methods (shown later).…”
Section: Iterative Update Via Linearizationmentioning
confidence: 99%
See 1 more Smart Citation
“…Iterative methods via linearization have been widely used in real-time OCP (Pan et al, 2015;Tassa et al, 2014). We adopt a similar methodology for its computational efficiency and algorithmic connection to existing training methods (shown later).…”
Section: Iterative Update Via Linearizationmentioning
confidence: 99%
“…In other words, we can divide the layer's weight (or player's action) into multiple parts, so that the MPDG framework remains applicable. Interestingly, the transformation of this kind resembles game-theoretic robust optimal control (Pan et al, 2015;Sun et al, 2018), where the controller (or player in our context) models external disturbances with fictitious agents, in order to enhance the robustness or convergence of the optimization process.…”
Section: Game-theoretic Applicationsmentioning
confidence: 99%
“…Several robust variants of differential dynamic programming [11] have been proposed for solving worst-case minimax problems [27], risk-sensitive optimizations for stochastic systems [6], and cooperative stochastic games [29]. Like the algorithm proposed in this paper, these methods consider system responses under linear feedback, but they use different robustness metrics and lack the ability to explicitly incorporate bounds on disturbances.…”
Section: Related Workmentioning
confidence: 99%
“…This paper builds on previous work on robust motion planning based on direct trajectory optimization [26,3] and differential dynamic programming (DDP) [27,6,29]. Robust motion planning algorithms often differ in the precise notion of robustness that they seek to optimize.…”
Section: Introductionmentioning
confidence: 99%
“…The MFG case is further complicated due to the fully coupled nature of the HJB-FP system ( [7], [8], [9]). The first [10] and second ( [11], [12]) order forward-backward SDE (FBSDE) [1] framework has been applied to obtain algorithms for optimal control of dynamics with nonlinear drift and state multiplicative noise, but not in the case of control multiplicative Gaussian or the general case of non-Gaussian excitation [13].…”
Section: Introductionmentioning
confidence: 99%