On Strength Adjustment for MCTS-Based Programs

Wu, I‐Chen; Wu, Ti-Rong; Liu, An-Jen; Guei, Hung; Wei, Ting-Han

doi:10.1609/aaai.v33i01.33011222

Cited by 9 publications

(1 citation statement)

References 13 publications

(14 reference statements)


“…Other work has used this model to estimate λ rationality parameters for human agents in a discrete information-gathering game (Ling et al, 2019) or to vary the difficulty level of AI agents in games (Wu et al, 2019). One of our proposed methods (see Section 6) uses this softmax model internally, producing an estimate of the λ-rationality parameter for the observed agent.…”

Section: Bounded Rationality (mentioning)

confidence: 99%
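
The softmax model mentioned in the citation statement above can be illustrated with a minimal sketch. It assumes the common form in which an agent picks action a with probability proportional to exp(λ · value(a)), so λ = 0 gives uniformly random play and large λ approaches always choosing the best action. The exact formulation used in the cited papers is not given here; the function names, the grid of candidate λ values, and the observation format are hypothetical choices made for illustration only.

```python
import math


def softmax_policy(values, lam):
    """Return choice probabilities P(a) proportional to exp(lam * values[a])."""
    scaled = [lam * v for v in values]
    m = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(s - m) for s in scaled]
    total = sum(weights)
    return [w / total for w in weights]


def log_choice_probability(values, lam, chosen):
    """Log-probability of the chosen action, computed without taking log(0)."""
    scaled = [lam * v for v in values]
    m = max(scaled)
    log_total = m + math.log(sum(math.exp(s - m) for s in scaled))
    return scaled[chosen] - log_total


def estimate_lambda(observations, candidates):
    """Pick the candidate lambda with the highest likelihood (simple grid search).

    observations: list of (values, chosen_index) pairs (assumed format).
    candidates: iterable of lambda values to compare (assumed grid).
    """
    def log_likelihood(lam):
        return sum(log_choice_probability(v, lam, c) for v, c in observations)

    return max(candidates, key=log_likelihood)
```

Under these assumptions, an agent that always picks the higher-valued action is assigned the largest λ on the grid, while an agent that picks each of two actions equally often is assigned λ = 0. A grid search is used only to keep the sketch short; a Bayesian estimate or a continuous optimizer could replace it.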

Estimating Agent Skill in Continuous Action Domains

Archibald, Nieves-Rivera. 2024. JAIR.


Actions in most real-world continuous domains cannot be executed exactly. An agent’s performance in these domains is influenced by two critical factors: the ability to select effective actions (decision-making skill), and how precisely it can execute those selected actions (execution skill). This article addresses the problem of estimating the execution and decision-making skill of an agent, given observations. Several execution skill estimation methods are presented, each of which utilizes different information from the observations and makes assumptions about the agent’s decision-making ability. A final novel method forgoes these assumptions about decision-making and instead estimates the execution and decision-making skills simultaneously under a single Bayesian framework. Experimental results in several domains evaluate the accuracy of the estimators, focusing especially on how robust they are as agents and their decision-making methods are varied. These results demonstrate that reasoning about both types of skill together significantly improves the robustness and accuracy of execution skill estimation. A case study is presented using the proposed methods to estimate the skill of Major League Baseball pitchers, demonstrating how these methods can be applied to real-world data sources.

