2019
DOI: 10.1613/jair.1.11370
|View full text |Cite
|
Sign up to set email alerts
|

Revisiting CFR+ and Alternating Updates

Abstract: The CFR + algorithm for solving imperfect information games is a variant of the popular CFR algorithm, with faster empirical performance on a range of problems. It was introduced with a theoretical upper bound on solution error, but subsequent work showed an error in one step of the proof. We provide updated proofs to recover the original bound.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 20 publications
(10 citation statements)
references
References 4 publications
(5 reference statements)
0
10
0
Order By: Relevance
“…A well-known example is CFR by Zinkevich et al (2007) based on the regret-matching algorithm (Hart and Mas-Colell, 2000;Gordon, 2007). There exist many other variants of it, such as CFR+ (Tammelin, 2014;Burch et al, 2019), see also Farina et al (2019Farina et al ( , 2021a. These algorithms however only enjoy a (known) guarantee of convergence of order O((X √ A + Y √ B)/ √ T ).…”
Section: Introductionmentioning
confidence: 99%
“…A well-known example is CFR by Zinkevich et al (2007) based on the regret-matching algorithm (Hart and Mas-Colell, 2000;Gordon, 2007). There exist many other variants of it, such as CFR+ (Tammelin, 2014;Burch et al, 2019), see also Farina et al (2019Farina et al ( , 2021a. These algorithms however only enjoy a (known) guarantee of convergence of order O((X √ A + Y √ B)/ √ T ).…”
Section: Introductionmentioning
confidence: 99%
“…A number of CFR variants have been proposed since the pioneering work Zinkevich et al (2008) for improving computational efficiency. For example, Lanctot et al (2009); Burch et al (2012); Gibson et al (2012); Lis ỳ et al (2015); Schmid et al (2019) combine CFR with Monte-Carlo sampling; Waugh et al (2015); Morrill (2016); propose to estimate the counterfactual value functions via regression; Brown and Sandholm (2015); ; improve efficiency by pruning suboptimal paths in the game tree; Tammelin (2014); Tammelin et al (2015); Burch et al (2019) analyze the performance of a modification named CFR + , and Zhou et al (2018) proposes lazy updates with a near-optimal regret upper bound.…”
Section: Policy-based Methodsmentioning
confidence: 99%
“…Finally, to show a linear convergence rate, we need to show the counterpart of Eq. (11), which is again more involved compared to the normal-form game case. Indeed, we are only able to do so for DOMWU by making use of its closed-form update described in Lemma 2.…”
Section: Analysis Of Theorem 6 and Theoremmentioning
confidence: 99%
“…However, due to their ergodic convergence guarantee, theoretical convergence rates of regretminimization algorithms are typically limited to O(1/ √ T ) or O(1/T ) for T rounds, and this is also the case in practice [5,11]. In contrast, it is known that linear convergence rates are achievable for certain other first-order algorithms [44,22].…”
Section: Introductionmentioning
confidence: 99%