2020
DOI: 10.48550/arxiv.2006.11438
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning

Abstract: Multi-agent reinforcement learning (MARL) requires coordination to efficiently solve certain tasks. Fully centralized control is often infeasible in such domains due to the size of joint action spaces. Coordination graph based formalization allows reasoning about the joint action based on the structure of interactions. However, they often require domain expertise in their design. This paper introduces the deep implicit coordination graph (DICG) architecture for such scenarios. DICG consists of a module for inf… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(10 citation statements)
references
References 37 publications
1
6
0
Order By: Relevance
“…Coordinating graph formulation is one of the methods for determining the joint action between agents based on the structure of interactions. In [55], Deep Implicit Coordination Graph (DICG) architecture is introduced, which includes two modules, one for obtaining the dynamic coordination graph structure and the other for learning the implicit reasoning about common actions or values. DICG uses the actor-critic structure to improve coordination for multi-agent situations.…”
Section: Interaction Methods Between Multi-agents With Gnn Architecturementioning
confidence: 99%
See 1 more Smart Citation
“…Coordinating graph formulation is one of the methods for determining the joint action between agents based on the structure of interactions. In [55], Deep Implicit Coordination Graph (DICG) architecture is introduced, which includes two modules, one for obtaining the dynamic coordination graph structure and the other for learning the implicit reasoning about common actions or values. DICG uses the actor-critic structure to improve coordination for multi-agent situations.…”
Section: Interaction Methods Between Multi-agents With Gnn Architecturementioning
confidence: 99%
“…In this approach, the agents have access to the state and complete information during the training step, but in some environments, the learned policy must be applied in a decentralized manner, and the agents cannot access the full state in the execution phase. In this method, the purpose of each agent is to perform actions that maximize their utility function (joint value function), but such decentralization can result in sub-optimal actions [55].…”
Section: Different Methods For Computing Value Function In Marlmentioning
confidence: 99%
“…Coordination graphs are classical technique for planning in multi-agent systems (Guestrin et al, 2001;2002b). They are combined with multi-agent deep reinforcement learning by recent work (Castellini et al, 2019;Böhmer et al, 2020;Li et al, 2020;Wang et al, 2021b). Joint action selection on coordination graphs can be modeled as a decentralized constraint optimization problem (DCOP), and previous methods compute approximate solutions by message passing among agents (Pearl, 1988).…”
Section: Related Workmentioning
confidence: 99%
“…To define the edge weights of the graph G, i.e., {w uv } (u,v)∈E , we use an attention mechanism similar to the one proposed in [34]. In particular, the agent observations at each time step are first encoded using a shared encoder mechanism φ : Z → R F to an F -dimensional embedding.…”
Section: Attention-based Edge Weightsmentioning
confidence: 99%
“…We use the exponential linear unit (ELU) as the non-linearity and set F = 128. Moreover, for the mixing GNN, we use a graph convolutional network (GCN) [12] as also used in [34]. Since the attention mechanism described in Section 4.1 leads to an effective outgoing degree of one for each graph node, the combining operation in (5) can be simplified as where σ(•) denotes a non-linearity, and…”
Section: Map Namementioning
confidence: 99%