2013
DOI: 10.1142/9789814390705_0002
|View full text |Cite
|
Sign up to set email alerts
|

A Simple Adaptive Procedure Leading to Correlated Equilibrium

Abstract: We propose a new and simple adaptive procedure for playing a game: ''regret-matching.'' In this procedure, players may depart from their current play with probabilities that are proportional to measures of regret for not having used other strategies in the past. It is shown that our adaptive procedure guarantees that, with probability one, the empirical distributions of play converge to the set of correlated equilibria of the game.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
285
0

Year Published

2013
2013
2020
2020

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 135 publications
(287 citation statements)
references
References 11 publications
2
285
0
Order By: Relevance
“…The third algorithm in this class is regret matching [23], which we include for completeness (since it is commonly applied) and as a benchmark. Let:…”
Section: Definition 44 (Fading-memory Regret-based Dynamics With Inementioning
confidence: 99%
See 1 more Smart Citation
“…The third algorithm in this class is regret matching [23], which we include for completeness (since it is commonly applied) and as a benchmark. Let:…”
Section: Definition 44 (Fading-memory Regret-based Dynamics With Inementioning
confidence: 99%
“…This problem is addressed by the literature on learning in games; the dynamics of learning processes in repeated games is a well investigated branch of game theory (see [6], for example). In particular, the results that are relevant to this work are the guaranteed convergence to Nash equilibrium in potential games of a variety of action adaptation processes, including finite-memory better reply processes [8], adaptive play [9], joint-strategy fictitious play [10], fading-memory regret monitoring [11], and generalised weakened fictitious play [7]; we also include in our investigation regret-matching [23], which converges to the set of correlated equilibria. Thus, a decentralised solution to an optimisation problem can be found by, first, constructing a potential game from the optimisation problem, and then using one of these algorithms to compute an equilibrium.…”
Section: Introductionmentioning
confidence: 99%
“…If all players use regret matching, then the empirical frequency of the joint actions converges almost surely to the set of coarse correlated equilibria, a generalization of Nash equilibria, in any game [18]. We prove that if all players use JSFP with inertia, then the action profile converges almost surely to a pure Nash equilibrium, albeit in the special glass of generalized ordinal potential games.…”
Section: F Relationship Between Regret Matching and Jsfpmentioning
confidence: 90%
“…It turns out that JSFP is strongly related to the learning algorithm regret matching, from [18], in which players choose their actions based on their regret for not choosing particular actions in the past steps.…”
Section: F Relationship Between Regret Matching and Jsfpmentioning
confidence: 99%
See 1 more Smart Citation