Emergent Cooperation from Mutual Acknowledgment Exchange in Multi-Agent Reinforcement Learning

Phan, Thomy; Sommer, Felix; Ritz, Fabian; Altmann, Philipp; Nüßlein, Jonas; Kölle, Michael; Belzner, Lenz; Linnhoff‐Popien, Claudia

doi:10.21203/rs.3.rs-2315844/v1

Cited by 2 publications

(1 citation statement)

References 19 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…LILAC [9] learns a leader to assign roles. Another line of work such as [38,39,28,13], divides the agents into some groups that carry out similar sub-tasks with a specific policy or value function. In our work, we learn more stable and distinguishable group embeddings and further consider the integration of team-level strategy and individual-level decision.…”

Section: Related Workmentioning

confidence: 99%

Learning to Collaborate by Grouping: A Consensus-Oriented Strategy for Multi-Agent Reinforcement Learning

Ruan,

Hao,

et al. 2023

Frontiers in Artificial Intelligence and Applications

View full text Add to dashboard Cite

Multi-agent systems require effective coordination between groups and individuals to achieve common goals. However, current multi-agent reinforcement learning (MARL) methods primarily focus on improving individual policies and do not adequately address group-level policies, which leads to weak cooperation. To address this issue, we propose a novel Consensus-oriented Strategy (CoS) that emphasizes group and individual policies simultaneously. Specifically, CoS comprises two main components: (a) the vector quantized group consensus module, which extracts discrete latent embeddings that represent the stable and discriminative group consensus, and (b) the group consensus-oriented strategy, which integrates the group policy using a hypernet and the individual policies using the group consensus, thereby promoting coordination at both the group and individual levels. Through empirical experiments on cooperative navigation tasks with both discrete and continuous spaces, as well as google research football, we demonstrate that CoS outperforms state-of-the-art MARL algorithms and achieves better collaboration, thus providing a promising solution for achieving effective coordination in multi-agent systems.

show abstract

Section: Related Workmentioning

confidence: 99%

Learning to Collaborate by Grouping: A Consensus-Oriented Strategy for Multi-Agent Reinforcement Learning

Ruan,

Hao,

et al. 2023

Frontiers in Artificial Intelligence and Applications

View full text Add to dashboard Cite

show abstract

Emergent cooperation from mutual acknowledgment exchange in multi-agent reinforcement learning

Phan,

Sommer,

Ritz

et al. 2024

Auton Agent Multi-Agent Syst

View full text Add to dashboard Cite

Peer incentivization (PI) is a recent approach where all agents learn to reward or penalize each other in a distributed fashion, which often leads to emergent cooperation. Current PI mechanisms implicitly assume a flawless communication channel in order to exchange rewards. These rewards are directly incorporated into the learning process without any chance to respond with feedback. Furthermore, most PI approaches rely on global information, which limits scalability and applicability to real-world scenarios where only local information is accessible. In this paper, we propose Mutual Acknowledgment Token Exchange (MATE), a PI approach defined by a two-phase communication protocol to exchange acknowledgment tokens as incentives to shape individual rewards mutually. All agents condition their token transmissions on the locally estimated quality of their own situations based on environmental rewards and received tokens. MATE is completely decentralized and only requires local communication and information. We evaluate MATE in three social dilemma domains. Our results show that MATE is able to achieve and maintain significantly higher levels of cooperation than previous PI approaches. In addition, we evaluate the robustness of MATE in more realistic scenarios, where agents can deviate from the protocol and communication failures can occur. We also evaluate the sensitivity of MATE w.r.t. the choice of token values.

show abstract

Emergent Cooperation from Mutual Acknowledgment Exchange in Multi-Agent Reinforcement Learning

Cited by 2 publications

References 19 publications

Learning to Collaborate by Grouping: A Consensus-Oriented Strategy for Multi-Agent Reinforcement Learning

Learning to Collaborate by Grouping: A Consensus-Oriented Strategy for Multi-Agent Reinforcement Learning

Emergent cooperation from mutual acknowledgment exchange in multi-agent reinforcement learning

Contact Info

Product

Resources

About