2019
DOI: 10.1609/aaai.v33i01.33011222
|View full text |Cite
|
Sign up to set email alerts
|

On Strength Adjustment for MCTS-Based Programs

Abstract: This paper proposes an approach to strength adjustment for MCTS-based game-playing programs. In this approach, we use a softmax policy with a strength index z to choose moves. Most importantly, we filter low quality moves by excluding those that have a lower simulation count than a pre-defined threshold ratio of the maximum simulation count. We perform a theoretical analysis, reaching the result that the adjusted policy is guaranteed to choose moves exceeding a lower bound in strength by using a threshold rati… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(1 citation statement)
references
References 13 publications
(14 reference statements)
0
1
0
Order By: Relevance
“…Other work has used this model to estimate λ rationality parameters for human agents in a discrete information-gathering game (Ling et al, 2019) or to vary the difficulty level of AI agents in games (Wu et al, 2019). One of our proposed methods (see Section 6) uses this softmax model internally, producing an estimate of the λ-rationality parameter for the observed agent.…”
Section: Bounded Rationalitymentioning
confidence: 99%
“…Other work has used this model to estimate λ rationality parameters for human agents in a discrete information-gathering game (Ling et al, 2019) or to vary the difficulty level of AI agents in games (Wu et al, 2019). One of our proposed methods (see Section 6) uses this softmax model internally, producing an estimate of the λ-rationality parameter for the observed agent.…”
Section: Bounded Rationalitymentioning
confidence: 99%