2007
DOI: 10.1007/978-3-540-75538-8_7
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search

Abstract: Abstract. Monte-Carlo evaluation consists in estimating a position by averaging the outcome of several random continuations, and can serve as an evaluation function at the leaves of a min-max tree. This paper presents a new framework to combine tree search with Monte-Carlo evaluation, that does not separate between a min-max phase and a MonteCarlo phase. Instead of backing-up the min-max value close to the root, and the average value at some depth, a more general backup operator is defined that progressively c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

5
770
0
6

Year Published

2010
2010
2023
2023

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 889 publications
(813 citation statements)
references
References 18 publications
5
770
0
6
Order By: Relevance
“…First, as well as for humans, all time scales of learning are important: offline knowledge (strategic rules and patterns) as in [7,5]; online information (i.e. analysis of a sequence by mental simulations) [10]; transient information (extrapolation as a guide for exploration).…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…First, as well as for humans, all time scales of learning are important: offline knowledge (strategic rules and patterns) as in [7,5]; online information (i.e. analysis of a sequence by mental simulations) [10]; transient information (extrapolation as a guide for exploration).…”
Section: Resultsmentioning
confidence: 99%
“…Monte-Carlo Tree Search (MCTS [5,7,11]) is a recent tool for difficult planning tasks. Impressive results have already been produced in the case of the game of Go [7,10].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…We assume in this section that the reader is familiar with the Monte-Carlo Tree Search (MCTS) and Upper Confidence Tree (UCT) literature [4,7,9]. We here focus on the experimental application of MCTS to acyclic GSA games.…”
Section: Upper Confidence Trees For Games With Simultaneous Actionsmentioning
confidence: 99%
“…2) has 11 millions of registered users. It's a Card Game, related to games like Pokemon or Magic, with Partial Observability, a small number of turns leading to fast games (often less than a minute) 4 . First, each player chooses a deck, which contains four cards (see Fig.…”
Section: Games Against Humansmentioning
confidence: 99%