2019
DOI: 10.1609/aaai.v33i01.33011861
|View full text |Cite
|
Sign up to set email alerts
|

Solving Large Extensive-Form Games with Strategy Constraints

Abstract: Extensive-form games are a common model for multiagent interactions with imperfect information. In two-player zerosum games, the typical solution concept is a Nash equilibrium over the unconstrained strategy set for each player. In many situations, however, we would like to constrain the set of possible strategies. For example, constraints are a natural way to model limited resources, risk mitigation, safety, consistency with past observations of behavior, or other secondary objectives for an agent. In small g… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 11 publications
0
6
0
Order By: Relevance
“…As an application, we show that the correctness and convergence rate of the CFR algorithm can be proven easily through our calculus. We also show that the recent Constrained CFR algorithm (Davis et al, 2019) can be constructed via our framework. Our framework enables the construction of two algorithms for that problem.…”
Section: Introductionmentioning
confidence: 93%
See 2 more Smart Citations
“…As an application, we show that the correctness and convergence rate of the CFR algorithm can be proven easily through our calculus. We also show that the recent Constrained CFR algorithm (Davis et al, 2019) can be constructed via our framework. Our framework enables the construction of two algorithms for that problem.…”
Section: Introductionmentioning
confidence: 93%
“…Farina et al (2019) study opponent exploitation where the goal is to compute a best response, subject to a penalty for moving away from a precomputed Nash equilibrium strategy; this is captured by having d 1 or d 2 include a penalty term that penalizes distance from the Nash equilibrium strategy. and Kroer et al (2017) study constraints on individual decision points, and Davis et al (2019) study additional constraints on the overall EFG polytopes X , Y. Regret minimization in those settings requires regret minimizers that can operate on more general domains X , Y than the sequence form.…”
Section: Connection To Convex-concave Saddle-pointmentioning
confidence: 99%
See 1 more Smart Citation
“…Related work also includes approaches for opponents which can change strategy over time (Powers & Shoham, 2005) and 3-player games (Ganzfried et al, 2018). Recent work introduces several forms of counterfactual regret minimization (Farina et al, 2019;Davis et al, 2019) and deep reinforcement learning (Brown et al, 2020) to find Nash equilibria. However, it has been shown that computing optimal strategies when playing a limited lookahead opponent in an imperfect-information game is NP-hard in all but the most restricted cases (Kroer & Sandholm, 2020).…”
Section: Opponent Modelingmentioning
confidence: 99%
“…These methods exploit the hierarchical structure of the sequential strategy spaces of the players to construct a regret minimizer that recursively minimizes regret locally at each decision point in the game tree. This has inspired regret-based algorithms for other solution concepts in game theory, such as extensive-form perfect equilibria (Farina et al, 2017), Nash equilibrium with strategy constraints (Farina et al, 2017;b;Davis et al, 2019), and quantal-response equilibrium (Farina et al, 2019a).…”
Section: Introductionmentioning
confidence: 99%