Quality-Based Rewards for Monte-Carlo Tree Search Simulations

Total Page:16

File Type:pdf, Size:1020Kb

Quality-Based Rewards for Monte-Carlo Tree Search Simulations Quality-based Rewards for Monte-Carlo Tree Search Simulations Tom Pepels and Mandy J .W. Tak and Marc Lanctot and Mark H. M. Winands1 Abstract. Monte-Carlo Tree Search is a best-first search technique In this paper, two techniques are proposed for determining the based on simulations to sample the state space of a decision-making quality of a simulation based on properties of play-outs. The first, problem. In games, positions are evaluated based on estimates ob- Relative Bonus, assesses the quality of a simulation based on its tained from rewards of numerous randomized play-outs. Generally, length. The second, Qualitative Bonus, formulates a quality assess- rewards from play-outs are discrete values representing the outcome ment of the terminal state. Adjusting results in a specific way using of the game (loss, draw, or win), e.g., r 2 {−1; 0; 1g, which are these values leads to increased performance in six distinct two-player backpropagated from expanded leaf nodes to the root node. How- games. Moreover, we determine the advantage of using the Relative ever, a play-out may provide additional information. In this paper, Bonus in the General Game Playing agent CADIAPLAYER [4], win- we introduce new measures for assessing the a posteriori quality of ner of the International GGP Competition in 2007, 2008, and 2012. a simulation. We show that altering the rewards of play-outs based The paper is structured as follows. First, the general MCTS frame- on their assessed quality improves results in six distinct two-player work is discussed in Section 2. Next, two different techniques for as- games and in the General Game Playing agent CADIAPLAYER. We sessing the quality of play-outs are detailed in Section 3. Section 4 propose two specific enhancements, the Relative Bonus and Qual- explains how rewards can be altered using the quality measures from itative Bonus. Both are used as control variates, a variance reduc- the previous section. This is followed by pseudo-code outlining the tion method for statistical simulation. Relative Bonus is based on the proposed algorithm in Section 5. Finally, the performance of the pro- number of moves made during a simulation and Qualitative Bonus re- posed enhancements is determined in Section 6, accompanied by a lies on a domain-dependent assessment of the game’s terminal state. discussion and conclusion in Section 7. We show that the proposed enhancements, both separate and com- bined, lead to significant performance increases in the domains dis- 2 MONTE-CARLO TREE SEARCH cussed. Monte-Carlo Tree Search (MCTS) is a simulation-based search method [7, 12]. MCTS grows a search tree incrementally over time, 1 INTRODUCTION by expanding a leaf node of the tree every simulation. Values of the rewards stored at nodes, when averaged over the results of numerous Monte-Carlo Tree Search (MCTS) [7, 12] is a simulation-based simulations, represent an estimate of the win probability of simula- best-first search technique for decision-making problems. Recently, tions that pass through the node. Each simulation consist of two parts, MCTS has shown to improve performance in various domains, such 1) the selection step, where moves are selected and played inside as the two-player games Go [17], Lines of Action [25], and Hex [1]. the tree according to the selection policy, and 2) the play-out step, Moreover, MCTS has seen successes in other domains such as real- where moves are played according to a simulation strategy, outside time strategy games [5], arcade games such as Ms Pac-Man [14] and the tree. At the end of each play-out a terminal state is reached and the Physical Traveling Salesman [15], but also in real-life domains the result r, usually expressed numerically in some discrete range, such as optimization, scheduling, and security [5]. e.g., r 2 {−1; 0; 1g representing a loss, draw or win, respectively, is Standard MCTS runs simulated play-outs until a terminal state is backpropagated along the tree from the expanded leaf to the root. reached. The returned reward signal represents the outcome of the In its basic form, MCTS does not require an evaluation function. game’s final position (win, draw, or loss), but any other informa- Nonetheless, in most domains it is beneficial to add some domain tion about the play-out is disregarded. Several techniques for de- knowledge to the play-out policy. MCTS can be terminated at any termining the quality of simulations have been previously proposed. time, for instance when some computational limit is reached, to se- Early cut-offs terminates a play-out and returns a heuristic value of lect a move to return. The move to make is selected by choosing ei- the state [24]. Evaluating the final score of a game, e.g., using the ther the child of the root with the highest number of visits, the highest number of points achieved, and combining it with the outcome has average reward, or a combination [6]. shown to improve results in points-based games [19]. However, for During the selection step, a policy is required to explore the tree some domains, a heuristic evaluation may not be available or too to decide on promising options. The UCT [12] is derived from the time-consuming. Additionally, many games do not have point-based UCB1 policy [2] for maximizing the rewards of a multi-armed ban- scores and only have the distinct possible outcomes, win, draw, or dit. In UCT, each node is treated as a bandit problem whose arms loss. are the moves that lead to different child nodes. UCT balances the 1 Games and AI Group, Department of Knowledge Engineering exploitation of rewarding nodes whilst allowing exploration of less (DKE), Maastricht University, Maastricht, The Netherlands, email: often visited nodes. Consider a node p with children I(p), then the ftom.pepels,mandy.tak,marc.lanctot,[email protected] policy determining which child i to select is defined as: ( r ) ∗ ln np i = argmaxi2I(p) vi + C ; (1) ni where vi is the score of the child i based on the average result of simulations that visited it, np and ni are the visit counts of the current node and its child, respectively. C is the exploration constant to tune. 3 ASSESSING SIMULATION QUALITY In this section, we discuss two quality assessments of the terminal state of a simulation. First, in Subsection 3.1 the length of a simula- tion is discussed as a measure of its quality. Second, in Subsection 3.2 a quality assessment of the terminal state of a match is consid- ered. In the next section we establish how these quantities can be used to enhance the rewards of MCTS simulations. Figure 1. A single MCTS simulation [9]. 3.1 Simulation Length states directly in order to augment the information used by MCTS. In some domains MCTS’ performance is improved by using either a The first assessment of a simulation’s quality is the length of the sim- static, or an early cut-off of the simulations and evaluating this po- ulated game played. Consider a single MCTS simulation as depicted sition. However, a heuristic evaluation may not be available or too in Figure 1, then we can define two separate distances: time-consuming. In this paper, we consider any available informa- 1. The number of moves made from the root S to the leaf N, dSN , tion that can help describe the quality of a terminal state, such as 2. The number of moves required to reach T, the simulation’s termi- how convincing of a win it was based on in-game scores. nal state, from N during play-out dNT . As before, consider a single MCTS simulation as depicted in Fig- ure 1. When a terminal state is reached, a quality assessment function The length of the simulation is defined as the sum of these distances: is called to evaluate the position with respect to the winning player. This measure q, should reflect the quality of a terminal state. For d = d + d ; (2) SN NT instance, in a game with material such as Breakthrough, Chess or i.e., the total number of moves made before reaching the terminal Checkers, an evaluation can be based on scoring the remaining mate- state of the simulation T from S. A play-out policy chooses moves rial of the winning player. For a racing game such as Chinese Check- at each state uniformly random, is rule-based, reactive, or combined ers, the inverse of the number of pieces the opponent has in his target with a source of expert or heuristic information such as an -greedy base can be considered. As such, the quality is based on the a poste- policy [20, 21]. Alternative methods have been proposed, such as riori evaluation of the terminal state. Having witnessed the states and using low-level αβ searches [24], and methods that learn a strat- actions performed from S to T, the score is based on an assessment egy online, such as the Last-Good-Reply policy [3], Move-average of T given the progression S ... N ... T (see Figure 1). Sampling Technique (MAST) [10], or N-Grams [22]. However, any Score-based bonuses have previously been shown to enhance the play-out policy balances computational effort required to select high- performance of MCTS in Go [8, 26], Mancala [16], and Blokus- quality moves, with the number of simulations performed in the al- Duo [19]. As detailed in the next section, our approach differs from lowed search time. Therefore, moves sampled from the play-out pol- previous work because the bonuses are normalized based on their icy are far from optimal.
Recommended publications
  • CALIFORNIA STATE UNIVERSITY, NORTHRIDGE Havannah, A
    CALIFORNIA STATE UNIVERSITY, NORTHRIDGE Havannah, a Monte Carlo Approach A thesis submitted in partial fulfillment of the requirements For the degree of Master of Science in Computer Science By Roberto Nahue December 2014 The Thesis of Roberto Nahue is approved: ____________________________________ ________________ Professor Jeff Wiegley Date ____________________________________ ________________ Professor John Noga Date ___________________________________ ________________ Professor Richard Lorentz, Chair Date California State University, Northridge ii ACKNOWLEDGEMENTS Thanks to Professor Richard Lorentz for your constant support and your vast knowledge of computer game algorithms. You made it possible to bring Wanderer to life and you have helped me bring this thesis to completion. Thanks to my family for their support and especially my daughter who gave me that last push to complete my work. iii Table of Contents Signature Page .................................................................................................................... ii Acknowledgements ............................................................................................................ iii List of Figures .................................................................................................................... vi List of Tables ..................................................................................................................... vii Abstract ...........................................................................................................................
    [Show full text]
  • On the Huge Benefit of Decisive Moves in Monte-Carlo Tree Search Algorithms Fabien Teytaud, Olivier Teytaud
    On the Huge Benefit of Decisive Moves in Monte-Carlo Tree Search Algorithms Fabien Teytaud, Olivier Teytaud To cite this version: Fabien Teytaud, Olivier Teytaud. On the Huge Benefit of Decisive Moves in Monte-Carlo Tree Search Algorithms. IEEE Conference on Computational Intelligence and Games, Aug 2010, Copenhagen, Denmark. inria-00495078 HAL Id: inria-00495078 https://hal.inria.fr/inria-00495078 Submitted on 25 Jun 2010 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. On the Huge Benefit of Decisive Moves in Monte-Carlo Tree Search Algorithms Fabien Teytaud, Olivier Teytaud TAO (Inria), LRI, UMR 8623(CNRS - Univ. Paris-Sud), bat 490 Univ. Paris-Sud 91405 Orsay, France Abstract— Monte-Carlo Tree Search (MCTS) algorithms, Algorithm 1 The UCT algorithm in short. nextState(s,m) including upper confidence Bounds (UCT), have very good is the implementation of the rules of the game, and the results in the most difficult board games, in particular the ChooseMove() function is defined in Alg. 2. The constant game of Go. More recently these methods have been successfully k is to be tuned empirically. introduce in the games of Hex and Havannah.
    [Show full text]
  • Arxiv:1605.04715V1 [Cs.CC] 16 May 2016 of Hex Has Acquired a Special Spot in the Heart of Abstract Game Aficionados
    On the Complexity of Connection Games Edouard´ Bonnet [email protected] Sztaki, Hungarian Academy of Sciences Florian Jamain [email protected] Lamsade, Universit´eParis-Dauphine Abdallah Saffidine [email protected] Cse, The University of New South Wales Abstract In this paper, we study three connection games among the most widely played: havannah, twixt, and slither. We show that determining the outcome of an arbitrary input position is PSPACE-complete in all three cases. Our reductions are based on the popular graph problem generalized geography and on hex itself. We also consider the complexity of generalizations of hex parameterized by the length of the solution and establish that while short generalized hex is W[1]-hard, short hex is FPT. Finally, we prove that the ultra-weak solution to the empty starting position in hex cannot be fully adapted to any of these three games. Keywords: Complexity, Havannah, Twixt, Hex, Slither, PSPACE 1. Introduction Since its independent inventions in 1942 and 1948 by the poet and mathe- matician Piet Hein and the economist and mathematician John Nash, the game arXiv:1605.04715v1 [cs.CC] 16 May 2016 of hex has acquired a special spot in the heart of abstract game aficionados. Its purity and depth has lead Jack van Rijswijck to conclude his PhD thesis with the following hyperbole [1]. Hex has a Platonic existence, independent of human thought. If ever we find an extraterrestrial civilization at all, they will know hex, without any doubt. Hex not only exerts a fascination on players, but it is the root of the field of connection games which is being actively explored by game designers and researchers alike [2].
    [Show full text]
  • A Simple Yet Deep 2-Player Board Game
    Dominect: A Simple yet Deep 2-Player Board Game Oswin Aichholzer1, Maarten Löffler2, Jayson Lynch3, Zuzana Masárová4, Joachim Orthaber1, Irene Parada5, Rosna Paul1, Daniel Perz1, Birgit Vogtenhuber1, and Alexandra Weinberger1 1Graz University of Technology, Austria; {oaich, ropaul, daperz, bvogt, aweinber}@ist.tugraz.at, [email protected] 2Utrecht University, the Netherlands; m.loffl[email protected] 3University of Waterloo, Canada; [email protected] 4IST Austria; [email protected] 5TU Eindhoven, the Netherlands; [email protected] Abstract In this work we introduce the perfect information 2-player game Dominect, which has recently been invented by two of the authors. Despite being a game with quite simple rules, Dominect reveals a surprisingly high depth of complexity. Next to the rules, this work comes with a webpage containing a print-cut-glue-and-play version of the game and further information. Moreover, we report on first results concerning the development of winning strategies, as well as a PSPACE-hardness result for deciding whether a given game position is a winning position. Introduction and Game Description Dominect is a finite, deterministic perfect information 2-player game developed by Oswin Aichholzer and Maarten Löffler during the 33rd Bellairs Winter Workshop on Computational Geometry in 2018. It belongs to the class of connection games, where a typical goal is to form a path connecting two opposite sides of the game board; see for example [1, 8]. Prominent members of this class are Hex (aka Con-tac-tix), TwixT, and Tak. But many others like Havannah, Connect, Gale (aka Bridg-It), or the Shannon switching game are also rather popular, as games or mathematical puzzles or both.
    [Show full text]
  • Playing and Solving Havannah
    University of Alberta Playing and Solving Havannah by Timo Ewalds A thesis submitted to the Faculty of Graduate Studies and Research in partial fulfillment of the requirements for the degree of Master of Science Department of Computing Science c Timo Ewalds Spring 2012 Edmonton, Alberta Permission is hereby granted to the University of Alberta Libraries to reproduce single copies of this thesis and to lend or sell such copies for private, scholarly or scientific research purposes only. Where the thesis is converted to, or otherwise made available in digital form, the University of Alberta will advise potential users of the thesis of these terms. The author reserves all other publication and other rights in association with the copyright in the thesis and, except as herein before provided, neither the thesis nor any substantial portion thereof may be printed or otherwise reproduced in any material form whatsoever without the author's prior written permission. Library and Archives Bibliothèque et Canada Archives Canada Published Heritage Direction du Branch Patrimoine de l'édition 395 Wellington Street 395, rue Wellington Ottawa ON K1A 0N4 Ottawa ON K1A 0N4 Canada Canada Your file Votre référence ISBN: 978-0-494-90390-2 Our file Notre référence ISBN: 978-0-494-90390-2 NOTICE: AVIS: The author has granted a non- L'auteur a accordé une licence non exclusive exclusive license allowing Library and permettant à la Bibliothèque et Archives Archives Canada to reproduce, Canada de reproduire, publier, archiver, publish, archive, preserve, conserve, sauvegarder, conserver, transmettre au public communicate to the public by par télécommunication ou par l'Internet, prêter, telecommunication or on the Internet, distribuer et vendre des thèses partout dans le loan, distrbute and sell theses monde, à des fins commerciales ou autres, sur worldwide, for commercial or non- support microforme, papier, électronique et/ou commercial purposes, in microform, autres formats.
    [Show full text]
  • MONTE-CARLO TWIXT Janik Steinhauer
    MONTE-CARLO TWIXT Janik Steinhauer Master Thesis 10-08 Thesis submitted in partial fulfilment of the requirements for the degree of Master of Science of Artificial Intelligence at the Faculty of Humanities and Sciences of Maastricht University Thesis committee: Dr. M. H. M. Winands Dr. ir. J. W. H. M. Uiterwijk J. A. M. Nijssen M.Sc. M. P. D. Schadd M.Sc. Maastricht University Department of Knowledge Engineering Maastricht, The Netherlands 17 June 2010 Preface This master thesis is the product of a research project in the program \Arti¯cial Intelligence" at Maas- tricht University. The work was supervised by Dr. Mark H. M. Winands. During the study, he gave lectures in the courses \Game AI" and \Intelligent Search Techniques". Especially the latter one con- centrates on how to build a strong AI for classic board games, most often two-player perfect-information zero-sum games. In the game of TwixT, few serious programs have been written that play at the strength of human amateurs. The probably most successful program so far is based on pattern matching. So it was an interesting question if it is possible to build a strong AI using a search-based approach, i.e., Monte- Carlo Tree Search. Further, it was not obvious how and which pieces of knowledge can be included in the search. So it has been an interesting research project with several surprising results and many challenging problems. I want to thank all people who helped me with my master project. Of greatest help to me was Dr. Mark Winands, who gave me a lot of feedback on my ideas.
    [Show full text]
  • Pruning Playouts in Monte-Carlo Tree Search for the Game of Havannah
    The 9th International Conference on Computers and Games Pruning playouts in Monte-Carlo Tree Search for the game of Havannah Joris Dugu´ep´eroux Ahmad Mazyad Fabien Teytaud Julien Dehos LISIC - Universit´edu Littoral C^oted'Opale June 2016 Dugu´ep´eroux,Mazyad, Teytaud, Dehos CG2016 - Pruning playouts in MCTS for the game of Havannah 1/22 Outline 1 The game of Havannah 2 Monte Carlo Tree Search & beyond 3 Proposed method 4 Results 5 Conclusion Dugu´ep´eroux,Mazyad, Teytaud, Dehos CG2016 - Pruning playouts in MCTS for the game of Havannah 2/22 Outline 1 The game of Havannah 2 Monte Carlo Tree Search & beyond 3 Proposed method 4 Results 5 Conclusion Dugu´ep´eroux,Mazyad, Teytaud, Dehos CG2016 - Pruning playouts in MCTS for the game of Havannah 2/22 The game of Havannah Presentation Invented by Christian Freeling in 1979. 2-player connection game. Hexagonal board of hexagonal cells. At each turn a player has to put a stone in an empty cell. To win a player has to realize one of these shapes: fork, bridge, ring. Dugu´ep´eroux,Mazyad, Teytaud, Dehos CG2016 - Pruning playouts in MCTS for the game of Havannah 3/22 The game of Havannah The winning shapes white wins with a ring white wins with a bridge white wins with a fork draw Dugu´ep´eroux,Mazyad, Teytaud, Dehos CG2016 - Pruning playouts in MCTS for the game of Havannah 4/22 Outline 1 The game of Havannah 2 Monte Carlo Tree Search & beyond 3 Proposed method 4 Results 5 Conclusion Dugu´ep´eroux,Mazyad, Teytaud, Dehos CG2016 - Pruning playouts in MCTS for the game of Havannah 4/22 Monte Carlo Tree Search & beyond MCTS principle Compute a good move to play.
    [Show full text]
  • On the Complexity of Connection Games Edouard Bonnet, Florian Jamain, Abdallah Saffidine
    On the Complexity of Connection Games Edouard Bonnet, Florian Jamain, Abdallah Saffidine To cite this version: Edouard Bonnet, Florian Jamain, Abdallah Saffidine. On the Complexity of Connection Games. The- oretical Computer Science, Elsevier, 2016, 644, pp.2-28. 10.1016/j.tcs.2016.06.033. hal-01994450 HAL Id: hal-01994450 https://hal.archives-ouvertes.fr/hal-01994450 Submitted on 25 Jan 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. On the Complexity of Connection Games Édouard Bonnet [email protected] Sztaki, Hungarian Academy of Sciences Florian Jamain [email protected] Lamsade, Université Paris-Dauphine Abdallah Saffidine [email protected] Cse, The University of New South Wales, Abstract In this paper, we study three connection games among the most widely played: havannah, twixt, and slither. We show that determining the outcome of an arbitrary input position is PSPACE-complete in all three cases. Our reductions are based on the popular graph problem generalized geography and on hex itself. We also consider the complexity of generalizations of hex parameterized by the length of the solution and establish that while short generalized hex is W[1]-hard, short hex is FPT.
    [Show full text]
  • Embed the Rules
    60 Game Design Patterns Embed the Rules Cameron Browne, Queensland University of Technology (QUT) This article examines the concept of embedding the rules of a game in its equipment, as a general pattern for good game design. Several examples demonstrate the benefits of doing so. An analogy is drawn with the Japanese concept of poka-yoke, or mistake-proofing, in manufacturing design. 1 Introduction For example, consider the Heptalion puzzle shown in Figure 1, in which the aim is to place HIS series on Game Design Patterns aims to the five tiles, each showing a pair of symbols, to T explore fundamental principles that encour- exactly cover the set of matching symbols on the age good game designs. While this term is remi- left [3]. These rules are trivially simple and in- niscent of the ‘game programming patterns’ de- stantly intuitive; no player who has played the scribed in the book of the same name [1], its use game should ever need to reread them. there refers to software programming practices in video game development, whereas a ‘game design pattern’ here refers to any practice that encourages good designs in games and puzzles.1 This first installment in the series looks at the concept of embedding the rules in the equipment. To ‘embed the rules’ means to use rele- vant features of the game’s physical components (board, pieces, environment, etc.), to enforce im- plicit rules which then do not need to be explicitly stated to players. This might also be described as Figure 1. ‘hiding the rules in the equipment’, or phrased as A Heptalion challenge and pieces.
    [Show full text]
  • Hex (Board Game)
    Hex (board game) Hex is a strategy board game for two players played on a hexagonal grid, theoretically of any size and several possible Hex shapes, but traditionally as an 11×11 rhombus. Players alternate placing markers or stones on unoccupied spaces in an attempt to link their opposite sides of the board in an unbroken chain. One player must win; there are no draws. The game has deep strategy, sharp tactics and a profound mathematical underpinning related to the Brouwer fixed-point theorem. It was invented in the 1940s independently by two mathematicians, Piet Hein and John Nash. The game was first marketed as a board game in Denmark under the name Con-tac-tix, and Parker Brothers marketed a version of it in 1952 called Hex; they are no longer in production. Hex can 11×11 Hex gameboard showing a also be played with paper and pencil on hexagonally ruled graph winning configuration for Blue paper. Years 1942–present Hex-related research is current in the areas of topology, graph active and matroid theory, combinatorics, game theory and artificial Genre(s) Board game intelligence. Abstract strategy game Connection game Contents Players 2 Setup time None Game type Playing 30 minutes – 2 hours History time (11×11 board) Published games Random None Shannon's hex machine chance Research timeline Skill(s) Strategy, tactics Automatons required Game play Strategy Mathematical theory Computational complexity of generalizations Computed strategies for smaller boards Game tree of 11 by 11 Hex Variants Rectangular grids and paper and pencil Board sizes Misère hex Blockbusters Y Havannah Projex Competition See also References Further reading External links Game type Hex is a connection game, and can be classified as a Maker-Breaker game, a particular type of positional game.
    [Show full text]
  • Strategic Potential in Games
    University College Dublin Lecture for AI for Games and Puzzles Strategic Potential in Games Cameron Browne Maastricht University 8/11/2018 Outline Prior work: • Automated Game Design • Measuring Game Quality Current work: • Reconstruction of Ancient Games Importance of Strategic Potential Connection Games Games in which players aim to complete a given type of connection with their pieces: • Connecting goals, • Completing a loop, • Gathering pieces, • etc. Hex: • Invented 1940s • Connect own opposite sides • Source of all connection games Connection Games Most players know a few: • Hex, Havannah, TwixT, Bridg-It, Y, Star, … Connection Games: Variations on a Theme (2005): • Survey of genre • Found over 200 • Most games involve some aspect of connection Connection Games Almost all are combinations of same few rules: • Hex: Connect own opposite sides • Chameleon: Connect any opposite sides • Jade: Connect opposite sides or all sides • Havannah: Connect three sides or corners or form loop • Unlur: Connect two or three sides • Y: Connect three sides (triangular board) • TwixT: Connect opposite sides (square board with bridge moves) • Quax: Connect opposite sides (square board with bridge moves) • Gonnect: Connect opposite sides (square board with surround capture) All Games Almost all games are new combinations of existing ideas Truly revolutionary ideas in games are rare: • e.g. Connection as a goal – Inspired by a mathematical problem • “Nothing new under the sun” Obvious question: • Why not evolve rule sets to create new games? Evolutionary
    [Show full text]
  • On the Complexity of Connection Games
    On the Complexity of Connection Games Édouard Bonnet [email protected] Sztaki, Hungarian Academy of Sciences Florian Jamain [email protected] Lamsade, Université Paris-Dauphine Abdallah Saffidine [email protected] Cse, The University of New South Wales, Abstract In this paper, we study three connection games among the most widely played: havannah, twixt, and slither. We show that determining the outcome of an arbitrary input position is PSPACE-complete in all three cases. Our reductions are based on the popular graph problem generalized geography and on hex itself. We also consider the complexity of generalizations of hex parameterized by the length of the solution and establish that while short generalized hex is W[1]-hard, short hex is FPT. Finally, we prove that the ultra-weak solution to the empty starting position in hex cannot be fully adapted to any of these three games. Keywords: Complexity, Havannah, Twixt, Hex, Slither, PSPACE 1. Introduction Since its independent inventions in 1942 and 1948 by the poet and mathe- matician Piet Hein and the economist and mathematician John Nash, the game of hex has acquired a special spot in the heart of abstract game aficionados. Its purity and depth has lead Jack van Rijswijck to conclude his PhD thesis with the following hyperbole [1]. Hex has a Platonic existence, independent of human thought. If ever we find an extraterrestrial civilization at all, they will know hex, without any doubt. Hex not only exerts a fascination on players, but it is the root of the field of connection games which is being actively explored by game designers and researchers alike [2].
    [Show full text]