2020
DOI: 10.48550/arxiv.2006.06555
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Multi-Agent Reinforcement Learning in Stochastic Networked Systems

Abstract: We study distributed reinforcement learning (RL) for a network of agents. The objective is to find localized policies that maximize the (discounted) global reward. In general, scalability is a challenge in this setting because the size of the global state/action space can be exponential in the number of agents. Scalable algorithms are only known in cases where dependencies are local, e.g., between neighbors. In this work, we propose a Scalable Actor Critic framework that applies in settings where the dependenc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(12 citation statements)
references
References 27 publications
(91 reference statements)
0
12
0
Order By: Relevance
“…Remark 10 The works in [Qu and Li, 2019, Qu et al, 2020a, Lin et al, 2020, Qu et al, 2020b are closely related to our contribution, as they also use decay of correlation assumptions to provably avoid the curse of dimensionality in MARL. Our contribution differs from these works in the following main ways:…”
Section: Decentralized Npgmentioning
confidence: 54%
See 3 more Smart Citations
“…Remark 10 The works in [Qu and Li, 2019, Qu et al, 2020a, Lin et al, 2020, Qu et al, 2020b are closely related to our contribution, as they also use decay of correlation assumptions to provably avoid the curse of dimensionality in MARL. Our contribution differs from these works in the following main ways:…”
Section: Decentralized Npgmentioning
confidence: 54%
“…To take advantage of the local structure of the network, Lin et al [2020] define a property regarding the dependence of Q π k (s, a) on the neighbors of k.…”
Section: Exponential Decaymentioning
confidence: 99%
See 2 more Smart Citations
“…We further focus on the case where the communications network is a structural component of the problem setting, as in (Lowe et al, 2017;Zhang et al, 2018). However, a separate but related body of works estimate the communications architecture when agents' behavior is fixed using graph neural networks (Ahilan & Dayan, 2020;Bachrach et al, 2020;Eccles et al, 2019) or statistical tests for correlation between agents' local utilities (Lin et al, 2020;Qu et al, 2020a).…”
Section: Additional Contextmentioning
confidence: 99%