FML-Based Prediction Agent and Its Application to Game of Go


Published as a conference paper at IFSA-SCIS 2017

Chang-Shing Lee, Chia-Hsiu Kao, Mei-Hui Wang, Sheng-Chi Yang
Dept. of Computer Science and Information Engineering, and Knowledge Application and Web Service Research Center, National University of Tainan, Tainan, Taiwan
[email protected]

Yusuke Nojima, Ryosuke Saga
Dept. of Electrical Engineering and Information Science, Osaka Prefecture University, Osaka, Japan
[email protected]

Nan Shuo, Naoyuki Kubota
Dept. of System Design, Tokyo Metropolitan University, Tokyo, Japan
[email protected]

Abstract—In this paper, we present a robotic prediction agent including a darkforest Go engine, a fuzzy markup language (FML) assessment engine, an FML-based decision support engine, and a robot engine for the game of Go application. The knowledge base and rule base of the FML assessment engine are constructed by referring to the information from the darkforest Go engine located at NUTN and OPU, for example, the number of MCTS simulations and the winning rate prediction. The proposed robotic prediction agent first retrieves the database of the Go competition website, and then the FML assessment engine infers the winning possibility based on the information generated by the darkforest Go engine. The FML-based decision support engine computes the winning possibility based on the partial game situation inferred by the FML assessment engine. Finally, the robot engine combines with the human-friendly robot partner PALRO, produced by FUJISOFT Incorporated, to report the game situation to human Go players. Experimental results show that the FML-based prediction agent works effectively.

Keywords—fuzzy markup language; prediction agent; decision support engine; robot engine; darkforest Go engine

I. INTRODUCTION

The game of Go originated in China. There are 361 positions to play on a 19×19 board [2]. Normally, the weaker player plays Black and starts the game. Stones are placed alternately by the two players, Black and White, on the points where the lines cross [3]. Black is also allowed to start with some handicap stones placed on the empty board when the difference in strength between the players is large [3]. Moreover, the Go rules cover the ko rule, life and death, suicide, and the scoring method [3]. In the end, the player who controls the most territory wins the game. Amateur Go players are ranked as Kyu (1K is the highest) and Dan (1D to 7D, with 1D the lowest). Professional Go players are ranked from 1P to 9P, with 9P the highest level [19]. As of Feb. 2017, the strongest professional Go player in the world is Jie Ke from China [20].

Competing with top human players in the ancient game of Go has been a long-term goal of artificial intelligence [1]. Go's high branching factor makes traditional search techniques ineffective, even on cutting-edge hardware [1]. Additionally, the evaluation of a Go position can change drastically with a single stone [1]. Combining a Deep Convolutional Neural Network (DCNN) with Monte Carlo Tree Search (MCTS), Google DeepMind's AlphaGo has sent shockwaves throughout Asia and the world since the Challenge Match with Lee Sedol in 2016 [3, 5, 6, 19]. Moreover, the new prototype version of AlphaGo, playing as Master and Magister on the online servers Tygem and FoxGo, defeated more than 50 of the top Go players in the world in Dec. 2016 and Jan. 2017 [7].

Many real-world applications involve a high level of uncertainty, and much research has demonstrated the good performance of fuzzy sets in such settings. Adopted as an IEEE Standard in May 2016, fuzzy markup language (FML) provides designers of intelligent decision-making systems with a unified and high-level methodology for describing a system's behavior by means of rules based on human domain knowledge [10, 11]. The main advantage of FML is that other researchers can easily understand and extend the implemented programs: FML offers understandability, extendibility, and compatibility of implemented programs, as well as programming efficiency [10]. There are considerable research applications based on FML and genetic FML (GFML), such as the game of Go [12, 13] and diet [14, 15]. Additionally, Acampora et al. [16] proposed FML Script to equip FML with evolving capabilities through a scripting-language approach, and Akhavan et al. [17] used FML-based specifications to validate and implement fuzzy models.
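To make the flavor of such an FML knowledge base and rule base concrete, the following minimal Python sketch shows the kind of fuzzy inference an FML assessment engine can encode over the two darkforest outputs named above (the number of MCTS simulations and the predicted winning rate). The membership functions, rule table, and defuzzification choice are illustrative assumptions, not the paper's actual knowledge base.

```python
# A minimal sketch of the fuzzy inference an FML rule base might encode.
# The input variables follow the paper (darkforest's win rate and number of
# MCTS simulations); the fuzzy sets and rules below are illustrative guesses.

def tri(x, a, b, c):
    """Triangular membership function rising from a, peaking at b, falling to c."""
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x <= b else (c - x) / (c - b)

# Assumed fuzzy sets for darkforest's predicted win rate (0..1).
win_rate_sets = {
    "low":    lambda x: tri(x, -0.01, 0.0, 0.5),
    "medium": lambda x: tri(x, 0.25, 0.5, 0.75),
    "high":   lambda x: tri(x, 0.5, 1.0, 1.01),
}

# Assumed fuzzy sets for the number of MCTS simulations behind the prediction.
sim_sets = {
    "few":  lambda x: tri(x, -1, 0, 1500),
    "many": lambda x: tri(x, 1000, 3000, 3001),
}

# Illustrative rules: (win-rate term, simulations term) -> representative
# output value for weighted-average defuzzification of the winning possibility.
rules = [
    ("high", "many", 0.9),   # confident prediction backed by much search
    ("high", "few",  0.7),
    ("medium", "many", 0.5),
    ("medium", "few",  0.5),
    ("low", "many", 0.1),
    ("low", "few",  0.3),
]

def winning_possibility(win_rate: float, num_simulations: float) -> float:
    """Mamdani-style inference with weighted-average defuzzification."""
    num, den = 0.0, 0.0
    for wr_term, sim_term, out in rules:
        strength = min(win_rate_sets[wr_term](win_rate),
                       sim_sets[sim_term](num_simulations))
        num += strength * out
        den += strength
    return num / den if den > 0 else 0.5

if __name__ == "__main__":
    # e.g. darkforest reports a 0.62 win rate after 2500 simulations
    print(f"winning possibility: {winning_possibility(0.62, 2500):.3f}")
```

In an actual FML document, the fuzzy sets above would be written as fuzzy variable and term definitions and the rule table as a rule base; the Python only mirrors that structure in executable form.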
The objective of this paper is to use FML to construct the knowledge base and rule base of the proposed agent and then predict the winner of the game based on the information from the darkforest Go open source and extracted sub-games. In this paper, in addition to the information provided by the darkforest program, we have an additional system to show information about the game, such as the current game situation.

According to the first-stage prediction results of the darkforest Go engine [8, 9] and the second-stage inferred results of the FML assessment engine [18], we further introduce the third-stage FML-based decision support engine to predict the winner of the game. Next, we choose seven games from the 60 games played by Master against top professional Go players in Dec. 2016 and Jan. 2017, and three games of amateur Go players, to evaluate the performance. Finally, we combine playing Go with the fourth-stage robot engine to report the real-time situation to Go players, as sketched below. The experimental results show that the proposed approach is feasible.
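Read as a pipeline, the four stages can be wired together as in the following sketch. Every function here is a stub with hypothetical names and data shapes (the real components are the darkforest program, the FML engines, and PALRO), so the sketch only fixes the order and direction of data flow.

```python
# A sketch of how the four stages described above might be chained.
# All function names and data shapes are hypothetical; the darkforest call is
# stubbed with synthetic numbers rather than the real engine.

from dataclasses import dataclass
from typing import List

@dataclass
class MovePrediction:
    move_no: int
    win_rate: float        # darkforest's predicted win rate for Black
    num_simulations: int   # MCTS simulations behind the prediction

def darkforest_predict(game_record: List[str]) -> List[MovePrediction]:
    """Stage 1 (stub): per-move predictions from the darkforest Go engine."""
    return [MovePrediction(i + 1, 0.5 + 0.002 * i, 3000)
            for i in range(len(game_record))]

def fml_assess(p: MovePrediction) -> float:
    """Stage 2 (stub): FML assessment engine, cf. winning_possibility() above."""
    return p.win_rate  # placeholder for the fuzzy inference

def decision_support(possibilities: List[float]) -> str:
    """Stage 3 (stub): aggregate partial-game assessments into a winner call."""
    tail = possibilities[-10:]  # weight the late game, an assumed heuristic
    return "Black" if sum(tail) / len(tail) > 0.5 else "White"

def robot_report(winner: str) -> None:
    """Stage 4 (stub): PALRO reports the situation to the human players."""
    print(f"PALRO: the predicted winner is {winner}.")

if __name__ == "__main__":
    moves = ["Q16", "D16", "R4", "D3", "C5"] * 20  # dummy 100-move record
    preds = darkforest_predict(moves)
    robot_report(decision_support([fml_assess(p) for p in preds]))
```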
The remainder of this paper is organized as follows: Section II introduces the Dynamic DarkForest Go (DyNaDF) cloud platform for the game of Go application. Section III describes the proposed FML-based prediction agent by introducing the system structure and the FML-based decision support engine. The experimental results are shown in Section IV. Finally, conclusions are given in Section V.

The authors would like to thank the Ministry of Science and Technology of Taiwan for its financial support under grants MOST 105-2622-E-024-003-CC2 and MOST 105-2221-E-024-017.

II. DYNADF CLOUD PLATFORM FOR GAME OF GO

A. Structure of Dynamic Darkforest Cloud Platform for Go

Fig. 1 shows the structure of the DyNaDF cloud platform; its brief description is as follows: (1) The DyNaDF cloud platform for the game of Go application is composed of a playing-Go platform located at National University of Tainan (NUTN) and the National Center for High-Performance Computing (NCHC) in Taiwan, a darkforest Go engine located at Osaka Prefecture University (OPU) in Japan, and the robot PALRO from Tokyo Metropolitan University (TMU) in Japan. (2) Human Go players visit the DyNaDF platform at NUTN / NCHC to play against the darkforest Go engine at OPU. (3) The FML assessment engine infers the current game situation based on the prediction information from darkforest and stores the results in the database. (4) PALRO receives the game situation via the Internet and reports it to the human Go players. (5) Humans can learn more from the game's comments via the Go eBook.

[Fig. 1 combines a site map (OPU and TMU in Japan; NUTN / NCHC in Taiwan) with platform screenshots: a machine-recommendation panel listing the moves played and the top suggested moves with their statistics (e.g., suggest 1: H2, Num simulation: 689, Win rate: 0.320), and, for the game Black: Jie Ke (9P) vs. White: Master (Komi 6.5, Simulation Number 3000, Dec. 31, 2016), two charts plotting the winning rate (%) and the number of simulations against the move number for Black and White.]

Fig. 1. Structure of the dynamic darkforest cloud platform for Go.

B. Introduction to Dynamic Darkforest Cloud Platform

This subsection introduces the developed DyNaDF platform, including a demonstration game platform, a machine recommendation platform, and an FML assessment engine.
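Since the playing-Go platform and the darkforest Go engine sit at different sites (NUTN / NCHC and OPU), the machine-recommendation panel in Fig. 1 presumably obtains its suggestions over the network. The sketch below shows one plausible shape for that exchange; the endpoint URL and JSON fields are invented for illustration, as this section does not document the actual interface.

```python
# A sketch of how the playing-Go platform at NUTN/NCHC might query the remote
# darkforest Go engine at OPU for move suggestions. The endpoint URL and JSON
# fields are hypothetical placeholders, not the platform's real interface.

import json
from urllib import request

DARKFOREST_URL = "http://example-opu-host/darkforest/suggest"  # placeholder

def fetch_suggestions(moves: list) -> list:
    """POST the current move list; return suggested moves with statistics."""
    payload = json.dumps({"moves": moves}).encode("utf-8")
    req = request.Request(DARKFOREST_URL, data=payload,
                          headers={"Content-Type": "application/json"})
    with request.urlopen(req, timeout=10) as resp:
        return json.load(resp)["suggestions"]

def show_recommendations(suggestions: list) -> None:
    """Render suggestions the way the machine-recommendation panel does."""
    for i, s in enumerate(suggestions, 1):
        print(f"suggest {i}: {s['move']} "
              f"(Num simulation: {s['num_simulation']}, "
              f"Win rate: {s['win_rate']:.3f})")

if __name__ == "__main__":
    # e.g. after Black Q16, White D16, Black R4 ...
    show_recommendations(fetch_suggestions(["Q16", "D16", "R4"]))
```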