COG-DICE: An Algorithm for Solving Continuous-Observation Dec-POMDPs

Clark-Turner, Madison; Amato, Christopher

doi:10.24963/ijcai.2017/638

Cited by 1 publication

(1 citation statement)

References 7 publications

(3 reference statements)

“…Specifically, our solution is model-free where each agent's policy is optimized iteratively with the local information collected in several trials by that agent. In more detail, we optimize each agent's policy using a variation of the Cross-Entropy (CE) method (Oliehoek, Kooij, and Vlassis 2008; Omidshafiei et al. 2016; Clark-Turner and Amato 2017). Like most of the existing algorithms for Dec-POMDPs, privacy issues are not concerned in the vanilla CE method (Oliehoek, Kooij, and Vlassis 2008).…”

Section: Introduction (mentioning)

confidence: 99%

Privacy-Preserving Policy Iteration for Decentralized POMDPs

Zilberstein

Chen

2018

AAAI

We propose the first privacy-preserving approach to address the privacy issues that arise in multi-agent planning problems modeled as a Dec-POMDP. Our solution is a distributed message-passing algorithm based on trials, where the agents' policies are optimized using the cross-entropy method. In our algorithm, the agents' private information is protected using a public-key homomorphic cryptosystem. We prove the correctness of our algorithm and analyze its complexity in terms of message passing and encryption/decryption operations. Furthermore, we analyze several privacy aspects of our algorithm and show that it can preserve the agent privacy of non-neighbors, model privacy, and decision privacy. Our experimental results on several common Dec-POMDP benchmark problems confirm the effectiveness of our approach.
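The abstract names the cross-entropy method as the optimizer for the agents' policies. As a rough illustration of that general technique only, the sketch below runs a plain, single-process cross-entropy search over a small table of discrete choices. It is not the paper's distributed, encrypted algorithm: there is no message passing and no cryptosystem, and the names `cross_entropy_search` and `count_matches`, the toy target, and all parameter values are assumptions made for this example.

```python
import random


def cross_entropy_search(score, n_slots, n_actions, iters=30,
                         samples=100, elite=10, alpha=0.7, seed=0):
    """Generic cross-entropy search over one discrete choice per slot.

    Hypothetical helper for illustration; not taken from the cited papers.
    """
    rng = random.Random(seed)
    # probs[s][a]: probability of picking action a in slot s (starts uniform).
    probs = [[1.0 / n_actions] * n_actions for _ in range(n_slots)]
    best, best_val = None, float("-inf")
    for _ in range(iters):
        # 1. Sample candidate assignments from the current distribution.
        batch = []
        for _ in range(samples):
            cand = [rng.choices(range(n_actions), weights=probs[s])[0]
                    for s in range(n_slots)]
            batch.append((score(cand), cand))
        # 2. Keep the highest-scoring (elite) candidates.
        batch.sort(key=lambda pair: pair[0], reverse=True)
        top = batch[:elite]
        if top[0][0] > best_val:
            best_val, best = top[0]
        # 3. Move the distribution toward the elite frequencies (smoothed by alpha).
        for s in range(n_slots):
            counts = [0] * n_actions
            for _, cand in top:
                counts[cand[s]] += 1
            for a in range(n_actions):
                probs[s][a] = (1 - alpha) * probs[s][a] + alpha * counts[a] / elite
    return best, best_val, probs


# Toy objective (an assumption): reward is the number of slots matching a fixed target.
TARGET = [2, 0, 1, 1]


def count_matches(candidate):
    return sum(1 for c, t in zip(candidate, TARGET) if c == t)


best, best_val, probs = cross_entropy_search(count_matches, n_slots=4, n_actions=3)
print(best, best_val)
```

In a Dec-POMDP setting the slots would correspond to entries of each agent's policy and the score would come from simulated trials; the sketch keeps only the sample, select, and update loop that the method is built on.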
