The 2021 Conference on Artificial Life 2021
DOI: 10.1162/isal_a_00399
|View full text |Cite
|
Sign up to set email alerts
|

A Sustainable Ecosystem through Emergent Cooperation in Multi-Agent Reinforcement Learning

Abstract: This paper considers sustainable and cooperative behavior in multi-agent systems. In the proposed predator-prey simulation, multiple selfish predators can learn to act sustainably by maintaining a herd of reproducing prey and further hunt cooperatively for long term benefit. Since the predators face starvation pressure, the scenario can also turn in a tragedy of the commons if selfish individuals decide to greedily hunt down the prey population before their conspecifics do, ultimately leading to extinction of … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 14 publications
0
3
0
Order By: Relevance
“…Ideally, this adaptation should be autonomous, i.e., the change of parameters in data exchange should be decided without even knowing the application or the agents' backgrounds. This objective fits well in the field of emergent communications that studies which circumstances lead to communication as an instrumental strategy when multiple learning agents are rewarded for completing specific tasks [14]. Knowledge distillation -With advancements in federated learning techniques, knowledge can be aggregated as individually trained models are integrated.…”
Section: B Emergent Knowledge Accumulationmentioning
confidence: 79%
“…Ideally, this adaptation should be autonomous, i.e., the change of parameters in data exchange should be decided without even knowing the application or the agents' backgrounds. This objective fits well in the field of emergent communications that studies which circumstances lead to communication as an instrumental strategy when multiple learning agents are rewarded for completing specific tasks [14]. Knowledge distillation -With advancements in federated learning techniques, knowledge can be aggregated as individually trained models are integrated.…”
Section: B Emergent Knowledge Accumulationmentioning
confidence: 79%
“…( [44,26,6,49]. Different studies have been conducted on various complex SSDs, where interesting phenomena like group hunting, attacking and dodging, or flocking have been observed [23,31,15,36]. Independent MARL like naive learning has been widely used in most studies to model agents with individual rationality [44,13].…”
Section: Policy Gradient Reinforcement Learningmentioning
confidence: 99%
“…Multi-agent reinforcement learning (MARL) has become popular to model individually rational agents in SDs and SSDs to examine emergent behavior [6,23,31,15,36]. The goal of each agent is defined by an individual reward function.…”
Section: Introductionmentioning
confidence: 99%