Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence 2017
DOI: 10.24963/ijcai.2017/638
|View full text |Cite
|
Sign up to set email alerts
|

COG-DICE: An Algorithm for Solving Continuous-Observation Dec-POMDPs

Abstract: The decentralized partially observable Markov decision process (Dec-POMDP) is a powerful model for representing multi-agent problems with decentralized behavior. Unfortunately, current Dec-POMDP solution methods cannot solve problems with continuous observations, which are common in many real-world domains. To that end, we present a framework for representing and generating Dec-POMDP policies that explicitly include continuous observations. We apply our algorithm to a novel tagging problem and an extended vers… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2018
2018
2018
2018

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 7 publications
(3 reference statements)
0
1
0
Order By: Relevance
“…Specifically, our solution is model-free where each agent's policy is optimized iteratively with the local information collected in several trials by that agent. In more detail, we optimize each agent's policy using a variation of the Cross-Entropy (CE) method (Oliehoek, Kooij, and Vlassis 2008;Omidshafiei et al 2016;Clark-Turner and Amato 2017). Like most of the existing algorithms for Dec-POMDPs, privacy issues are not concerned in the vanilla CE method (Oliehoek, Kooij, and Vlassis 2008).…”
Section: Introductionmentioning
confidence: 99%
“…Specifically, our solution is model-free where each agent's policy is optimized iteratively with the local information collected in several trials by that agent. In more detail, we optimize each agent's policy using a variation of the Cross-Entropy (CE) method (Oliehoek, Kooij, and Vlassis 2008;Omidshafiei et al 2016;Clark-Turner and Amato 2017). Like most of the existing algorithms for Dec-POMDPs, privacy issues are not concerned in the vanilla CE method (Oliehoek, Kooij, and Vlassis 2008).…”
Section: Introductionmentioning
confidence: 99%