2019 IEEE Symposium Series on Computational Intelligence (SSCI) 2019
DOI: 10.1109/ssci44817.2019.9003020
|View full text |Cite
|
Sign up to set email alerts
|

UCT-ADP Progressive Bias Algorithm for Solving Gomoku

Abstract: We combine Adaptive Dynamic Programming (ADP), a reinforcement learning method and UCB applied to trees (UCT) algorithm with a more powerful heuristic function based on Progressive Bias method and two pruning strategies for a traditional board game Gomoku. For the Adaptive Dynamic Programming part, we train a shallow forward neural network to give a quick evaluation of Gomoku board situations. UCT is a general approach in MCTS as a tree policy. Our framework use UCT to balance the exploration and exploitation … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 11 publications
(14 reference statements)
0
3
0
Order By: Relevance
“…4. Select excellent individuals from selected population According to the ELO scoring standard commonly used in international competitions [24], the eight individuals of the 500th generation were finally selected, and the optimized chess shape parameters are obtained.…”
Section: Optimization Of Chess Shape Parametersmentioning
confidence: 99%
“…4. Select excellent individuals from selected population According to the ELO scoring standard commonly used in international competitions [24], the eight individuals of the 500th generation were finally selected, and the optimized chess shape parameters are obtained.…”
Section: Optimization Of Chess Shape Parametersmentioning
confidence: 99%
“…When Victoria first moved a stone, the algorithm yielded a winning result without fail. Cao et al presented a Gomoku AI model using an algorithm that combined the upper confidence bounds that were applied to the trees (UCT) [12] and adaptive dynamic programming (ADP) [13,14]. This algorithm could solve the search depth defect more accurately and efficiently than the case when only a single UCT was used, thereby improving the performance of Gomoku AI.…”
Section: Related Workmentioning
confidence: 99%
“…If Victoria moves a stone first, it always leads to a win. Cao et al used an algorithm combining Upper Confidence Bounds applied to Trees (UCT) [18] and Adaptive Dynamic Programming (ADP) [19] and introduced a Gomoku AI model that could solve the problem of search depth defects more accurately than using a single UCT [20].…”
Section: Related Workmentioning
confidence: 99%