2019
DOI: 10.1109/access.2019.2937943
|View full text |Cite
|
Sign up to set email alerts
|

A Middle Game Search Algorithm Applicable to Low-Cost Personal Computer for Go

Abstract: Go Artificial Intellects(AIs) using deep reinforcement learning and neural networks have achieved superhuman performance, but they rely on powerful computing resources. They are not applicable to low-cost personal computer(PC). In our life, most entertainment programs of Go run on the general PC. A human Go master consider different strategies for different stages, especially for the middle stage that has a significant impact on winning or losing. To study arguably a more humanlike approach that is applicable … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 21 publications
(27 reference statements)
0
3
0
Order By: Relevance
“…Deep reinforcement learning (DRL) is a branch of Reinforcement Learning that introduces deep neural networks to deal with high dimensional and complex state spaces to better solve real-world problems. DRL has demonstrated its powerful ability to learn and optimize decisions in several domains, such as problems like AlphaGo Zero [62] and Atari [63] . In addition, it is able to quickly find the best solution in discrete decision spaces with the advantages of speed and generalization, so it has a significant advantage in combinatorial optimization problems [64][65][66][67] .…”
Section: Reinforcement Learning Algorithmmentioning
confidence: 99%
“…Deep reinforcement learning (DRL) is a branch of Reinforcement Learning that introduces deep neural networks to deal with high dimensional and complex state spaces to better solve real-world problems. DRL has demonstrated its powerful ability to learn and optimize decisions in several domains, such as problems like AlphaGo Zero [62] and Atari [63] . In addition, it is able to quickly find the best solution in discrete decision spaces with the advantages of speed and generalization, so it has a significant advantage in combinatorial optimization problems [64][65][66][67] .…”
Section: Reinforcement Learning Algorithmmentioning
confidence: 99%
“…Go games are typically divided into three stages: the opening (fuseki ), middle game (chuban), and endgame. One of the authors of Li et al (2019) analysed 500 Go games and extracted the move numbers at which the middle game and the endgame begin. Both distributions were found to be normal.…”
Section: Impact Of Game Stagementioning
confidence: 99%
“…Go games are typically divided into three stages: the opening (fuseki ), middle game (chuban), and endgame. One of the authors of Li et al (2019) analysed 500 Go games and extracted the move numbers at which the middle game and the endgame begin. Both distributions were found to be normal.…”
Section: Impact Of Game Stagementioning
confidence: 99%