GomokuNet: A Novel UNet-style Network for Gomoku Zero Learning via Exploiting Positional Information and Multiscale Features Yifan Gao Lezhou Wu Haoyue Li Northeastern University Northeastern University Northeastern University Shenyang, China Shenyang, China Shenyang, China
[email protected] [email protected] [email protected] Abstract—Since the tremendous success of the AlphaGo family way to predict the move of Gomoku with CNN. Other works (AlphaGo, AlphaGo Zero, and Alpha Zero), zero learning has [8]–[10] look into the neural network structures of Gomoku become the baseline method for many board games, such as trained by the zero learning method in the absence of expert Gomoku and Chess. Recent works on zero learning have demon- strated that improving the performance of neural networks used knowledge. However, the neural network structures in the in zero learning programs remains nontrivial and challenging. former researches are relatively simple and cannot be used Considering both positional information and multiscale features, to train a powerful Gomoku AI model. this paper presents a novel positional attention-based UNet- Many improvements on CNN have been employed in the style model (GomokuNet) for Gomoku AI. An encoder-decoder game of Go. Residual networks [11] used by AlphaGo con- architecture is adopted as the backbone network to guarantee the fusion of multiscale features. Positional information modules tributed the first significant performance improvement. Com- are incorporated into our model in order to further capture the pared with the typical CNN, AlphaGo Zero’s performance location information of the board. Quantitative results obtained increased by 600 Elo with the help of the residual networks.