2020
DOI: 10.48550/arxiv.2004.00470
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Counterfactual Multi-Agent Reinforcement Learning with Graph Convolution Communication

Jianyu Su,
Stephen Adams,
Peter A. Beling

Abstract: We consider a fully cooperative multi-agent system where agents cooperate to maximize a system's utility in a partial-observable environment. We propose that multi-agent systems must have the ability to (1) communicate and understand the inter-plays between agents and (2) correctly distribute rewards based on an individual agent's contribution. In contrast, most work in this setting considers only one of the above abilities. In this study, we develop an architecture that allows for communication among agents a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 7 publications
(13 citation statements)
references
References 13 publications
0
13
0
Order By: Relevance
“…This is the reason why they did not implement a RL-based approach, but instead a semi-supervised learning algorithm, therefore guiding the learning process with explicit labeled data. In fact, so far the very few works that combine GNNs with a MARL framework [51], [52] are theoretical papers from the ML community, and none of them apply to the field of networking.…”
Section: Related Workmentioning
confidence: 99%
“…This is the reason why they did not implement a RL-based approach, but instead a semi-supervised learning algorithm, therefore guiding the learning process with explicit labeled data. In fact, so far the very few works that combine GNNs with a MARL framework [51], [52] are theoretical papers from the ML community, and none of them apply to the field of networking.…”
Section: Related Workmentioning
confidence: 99%
“…In summary, GCN exquisitely designs a structure to extract graph embedding. In MARL, there are some works (Ryu, Shin, and Park 2020;Jiang et al 2018;Su, Adams, and Beling 2020;Mao et al 2020) using GCN to encode the observations of agents to obtain a richer representation to help make simultaneous decisions.…”
Section: Graph Convolutional Networkmentioning
confidence: 99%
“…QTRAN (Son et al 2019) is a generalized factorization method that can be applied to environments that are free from structural constraints. Other works, such as CommNet (Foerster et al 2016), TarMAC (Das et al 2019), ATOC (Jiang and Lu 2018), MAAC (Iqbal and Sha 2019), CCOMA (Su, Adams, and Beling 2020) and BiCNet (Peng et al 2017) exploit interagent communication.…”
Section: Related Workmentioning
confidence: 99%