Proceedings of the 9th International Conference on Agents and Artificial Intelligence 2017
DOI: 10.5220/0006205806480652
|View full text |Cite
|
Sign up to set email alerts
|

An Analysis of Virtual Loss in Parallel MCTS

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 4 publications
0
7
0
Order By: Relevance
“…In our future research we will look into additional techniques, promising a speed-up which scales better with the number of threads [20], as the standard techniques for parallelization do not address the trade of between exploration and exploitation well enough [19], [20], [25].…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In our future research we will look into additional techniques, promising a speed-up which scales better with the number of threads [20], as the standard techniques for parallelization do not address the trade of between exploration and exploitation well enough [19], [20], [25].…”
Section: Discussionmentioning
confidence: 99%
“…In addition, measures such as virtual loss [6] improve the effectiveness of tree parallelization, by increasing the likelihood that multiple threads traverse different paths. While virtual loss has been an adopted concept [2], more recent studies suggest that the increase in effectiveness is merely a trade-off with time efficiency [19].…”
Section: B Parallelizationmentioning
confidence: 99%
“…This could be observed from our experiments in Section 5. In contrast, although the virtual loss used in TreeP could encourage exploration diversity, this hard additive penalty could cause exploitatin failure: workers will be less likely to co-simulating the same node even when they are certain that it is optimal (Mirsoleimani et al, 2017). RootP tries to avoid these issues by letting workers perform an independent tree search.…”
Section: The Benefits Of Watching Unobserved Samplesmentioning
confidence: 99%
“…Additionally, architecture side improvements such as using pipeline (Mirsoleimani et al, 2018b) or lock-free structure (Mirsoleimani et al, 2018a) speedup the algorithm significantly. However, though being able to increase diversity, virtual loss degrades the performance under even four workers (Mirsoleimani et al, 2017;Bourki et al, 2010).…”
Section: Related Workmentioning
confidence: 99%
“…Despite their extensive usage, the performance of parallel MCTS algorithms [12] is not systematically understood from a theoretical perspective. There are empirical studies on the advantages (e.g., [12,13,14]) and disadvantages (e.g., [15,16,17]) of existing approaches. However, they are mainly algorithm-specific analysis, which provides less systematic design principles on effective MCTS parallelization.…”
Section: Introductionmentioning
confidence: 99%