2019
DOI: 10.48550/arxiv.1912.05304
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning Agent Communication under Limited Bandwidth by Message Pruning

Abstract: Communication is a crucial factor for the big multi-agent world to stay organized and productive. Recently, Deep Reinforcement Learning (DRL) has been applied to learn the communication strategy and the control policy for multiple agents. However, the practical limited bandwidth in multiagent communication has been largely ignored by the existing DRL methods. Specifically, many methods keep sending messages incessantly, which consumes too much bandwidth. As a result, they are inapplicable to multi-agent system… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(11 citation statements)
references
References 9 publications
(5 reference statements)
0
11
0
Order By: Relevance
“…A predator only has a local view around itself, so they have to cooperate to capture preys and are required to avoid collision with other predators. Our baselines for limited bandwidth are (1) Gated-ACML (Mao et al 2019), (2) SchedNet , and (3) Message-Dropout (Kilinc and Montana 2018). In addition, A3C2 (Simões, Lau, and Reis 2020) is introduced as a full-communication version.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…A predator only has a local view around itself, so they have to cooperate to capture preys and are required to avoid collision with other predators. Our baselines for limited bandwidth are (1) Gated-ACML (Mao et al 2019), (2) SchedNet , and (3) Message-Dropout (Kilinc and Montana 2018). In addition, A3C2 (Simões, Lau, and Reis 2020) is introduced as a full-communication version.…”
Section: Methodsmentioning
confidence: 99%
“…IC3Net (Singh, Jain, and Sukhbaatar 2018) extends the work of CommNet (Sukhbaatar, Fergus et al 2016) by means of Long Short-Term Memory (LSTM) and the gating mechanism. Gated-ACML (Mao et al 2019) and ATOC (Jiang and Lu 2018) both evaluate the importance of communication by comparing the Q-difference between sending messages and not. If the difference is greater than a threshold, agents consider the message is valuable and choose to communicate.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations