2014 IEEE Conference on Computational Intelligence and Games 2014
DOI: 10.1109/cig.2014.6932903
|View full text |Cite
|
Sign up to set email alerts
|

Monte Carlo Tree Search with heuristic evaluations using implicit minimax backups

Abstract: Monte Carlo Tree Search (MCTS) has improved the performance of game engines in domains such as Go, Hex, and general game playing. MCTS has been shown to outperform classic αβ search in games where good heuristic evaluations are difficult to obtain. In recent years, combining ideas from traditional minimax search in MCTS has been shown to be advantageous in some domains, such as Lines of Action, Amazons, and Breakthrough. In this paper, we propose a new way to use heuristic evaluations to guide the MCTS search … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2014
2014
2018
2018

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 27 publications
(15 citation statements)
references
References 28 publications
0
15
0
Order By: Relevance
“…by Ramanujan and Selman (2011), where the algorithm U CT M AX H replaces MCTS rollouts with heuristic evaluations and classic averaging MCTS backups with minimaxing backups. In implicit minimax backups (Lanctot et al, 2014), both minimaxing backups of heuristic evaluations and averaging backups of rollout returns are managed simultaneously. On the other hand, one can nest minimax searches into MCTS searches.…”
Section: Related Workmentioning
confidence: 99%
“…by Ramanujan and Selman (2011), where the algorithm U CT M AX H replaces MCTS rollouts with heuristic evaluations and classic averaging MCTS backups with minimaxing backups. In implicit minimax backups (Lanctot et al, 2014), both minimaxing backups of heuristic evaluations and averaging backups of rollout returns are managed simultaneously. On the other hand, one can nest minimax searches into MCTS searches.…”
Section: Related Workmentioning
confidence: 99%
“…In [29], the researchers proposed implicit minimax backups which incorporated into MCTS the minimax scores based on heuristic evaluations of positions. The minimax scores were used together with the estimator Q i to guide the selections of MCTS.…”
Section: Previous Workmentioning
confidence: 99%
“…One issue to discuss is the minimax scores for chance nodes, which were not mentioned in [29]. An intuitive way is to use the probability distribution for unrevealed pieces, when calculating the expected values for chance nodes.…”
Section: Our Workmentioning
confidence: 99%
“…Similar techniques have been shown to significantly improve [24] the quality of MCTS players in games like Breakout and LOA [35]. Lanctot et al [24], introduced adding evaluation function values to nodes in the MCTS tree. These values are updated during the backup phase, by considering the minimax values of a node's children.…”
Section: Related Workmentioning
confidence: 99%
“…Various studies have considered approaches to dealing with stochasticity and uncertain information in MCTS [5,3,31,8,24].…”
Section: Mcts For Stochastic Domainsmentioning
confidence: 99%