2008
DOI: 10.1613/jair.2473
|View full text |Cite
|
Sign up to set email alerts
|

Graphical Model Inference in Optimal Control of Stochastic Multi-Agent Systems

Abstract: In this article we consider the issue of optimal control in collaborative multi-agent systems with stochastic dynamics. The agents have a joint task in which they have to reach a number of target states. The dynamics of the agents contains additive control and additive noise, and the autonomous part factorizes over the agents. Full observation of the global state is assumed. The goal is to minimize the accumulated joint cost, which consists of integrated instantaneous costs and a joint end cost. The joint end … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
50
0

Year Published

2010
2010
2022
2022

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 75 publications
(51 citation statements)
references
References 41 publications
(36 reference statements)
0
50
0
Order By: Relevance
“…In this work, the role of noise in symmetry breaking phenomena was investigated in the context of stochastic optimal control. In [22], the path integral formalism is extended for stochastic optimal control of multi-agent systems, which is not unlike our multi DOF control systems.…”
Section: Related Workmentioning
confidence: 99%
“…In this work, the role of noise in symmetry breaking phenomena was investigated in the context of stochastic optimal control. In [22], the path integral formalism is extended for stochastic optimal control of multi-agent systems, which is not unlike our multi DOF control systems.…”
Section: Related Workmentioning
confidence: 99%
“…In this paper, we present a probabilistic reinforcement learning approach, which is derived from the framework of stochastic optimal control and path integrals, based on the original work of [14]. As will be detailed in the sections below, this approach makes an appealing theoretical connection between value function approximation using the stochastic HJB equations and direct policy learning by approximating a path integral, i.e., by solving a statistical inference problem from sample roll-outs.…”
Section: Introductionmentioning
confidence: 99%
“…It is not hard to show [25] that the optimal control (17) may be related to the expected value of the first discretized increment x 1 of the trajectory x N as 1…”
Section: Path Integral Constructionmentioning
confidence: 99%
“…Related works incorporating the PI framework for multiagent systems designed control for systems in which agents cooperatively compute their control from a marginalization of the joint probability distribution of the group's system trajectory [25,26]. However, to the best of our knowledge, this paper contains the first use of the PI framework for a robotic team without explicit inter-agent communication.…”
Section: Introductionmentioning
confidence: 99%