2023
DOI: 10.48550/arxiv.2302.10058
Preprint

Differentiable Arbitrating in Zero-sum Markov Games

Abstract: We initiate the study of how to perturb the reward in a zero-sum Markov game with two players to induce a desirable Nash equilibrium, namely arbitrating. Such a problem admits a bi-level optimization formulation. The lower level requires solving the Nash equilibrium under a given reward function, which makes the overall problem challenging to optimize in an end-to-end way. We propose a backpropagation scheme that differentiates through the Nash equilibrium, which provides the gradient feedback for the upper le…
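
To make the bi-level structure concrete, here is a minimal sketch on a toy problem. It is not the paper's algorithm. It assumes a single-state game (a 2x2 zero-sum matrix game) whose mixed Nash equilibrium has a closed form, so the lower level can be differentiated analytically rather than by a general backpropagation scheme. The function names, the squared-error upper-level loss, and the choice to perturb only one payoff entry are all hypothetical choices made for illustration.

```python
def mixed_nash_2x2(A):
    """Lower level: closed-form mixed Nash equilibrium of a 2x2 zero-sum game.

    A[i][j] is the payoff to the row player (the maximizer). Assumes the game
    has no pure saddle point, so the equilibrium is fully mixed.
    Returns (p, q, value): p is the probability that the row player picks
    row 0, q is the probability that the column player picks column 0.
    """
    (a, b), (c, d) = A
    denom = a - b - c + d
    p = (d - c) / denom
    q = (d - b) / denom
    value = (a * d - b * c) / denom
    return p, q, value


def dp_dtheta(A):
    """Derivative of p with respect to a perturbation added to A[0][0]."""
    (a, b), (c, d) = A
    denom = a - b - c + d
    return -(d - c) / denom ** 2


def perturb(base, theta):
    """Add the scalar reward perturbation theta to entry (0, 0) of the game."""
    return [[base[0][0] + theta, base[0][1]], list(base[1])]


def arbitrate(base, target_p, lr=20.0, steps=200):
    """Upper level: gradient descent on (p(theta) - target_p)^2 over theta."""
    theta = 0.0
    for _ in range(steps):
        A = perturb(base, theta)
        p, _, _ = mixed_nash_2x2(A)
        # Chain rule: d(loss)/d(theta) = 2 * (p - target) * dp/dtheta
        grad = 2.0 * (p - target_p) * dp_dtheta(A)
        theta -= lr * grad
    return theta


# Matching pennies: the unperturbed equilibrium has p = 0.5.
base_game = [[1.0, -1.0], [-1.0, 1.0]]
theta_star = arbitrate(base_game, target_p=0.4)
p_star, q_star, v_star = mixed_nash_2x2(perturb(base_game, theta_star))
print(theta_star, p_star)
```

In this toy case the equilibrium map is p(theta) = 2 / (4 + theta), so the perturbation that induces p = 0.4 can be checked by hand. A Markov game with several states has no such closed form, which is where differentiating through an equilibrium solver becomes necessary.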

Cited by 0 publications
References 58 publications