2022
DOI: 10.21203/rs.3.rs-2315844/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Emergent Cooperation from Mutual Acknowledgment Exchange in Multi-Agent Reinforcement Learning

Abstract: Peer incentivization (PI) is a recent approach, where all agents learn to reward or to penalize each other in a distributed fashion which often leads to emergent cooperation. Current PI mechanisms implicitly assume a flawless communication channel in order to exchange rewards. These rewards are directly integrated into the learning process without any chance to respond with feedback. Furthermore, most PI approaches rely on global information which limits scalability and applicability to real-world scenarios, w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 19 publications
0
0
0
Order By: Relevance
“…LILAC [9] learns a leader to assign roles. Another line of work such as [38,39,28,13], divides the agents into some groups that carry out similar sub-tasks with a specific policy or value function. In our work, we learn more stable and distinguishable group embeddings and further consider the integration of team-level strategy and individual-level decision.…”
Section: Related Workmentioning
confidence: 99%
“…LILAC [9] learns a leader to assign roles. Another line of work such as [38,39,28,13], divides the agents into some groups that carry out similar sub-tasks with a specific policy or value function. In our work, we learn more stable and distinguishable group embeddings and further consider the integration of team-level strategy and individual-level decision.…”
Section: Related Workmentioning
confidence: 99%