Constructing an Evaluation Function for Playing Backgammon

KTH – Royal Institute of Technology School of Computer Science and Communication DD143X Degree Project in Computer Science,First Level Constructing an Evaluation Function for Playing Backgammon Authors: Supervisor: Anders Sjoqvist¨ Johan Boye [email protected] Andre´ Stenlund [email protected] 14 June 2011 Abstract In this project we describe a few variations of the game backgammon. We explain different algorithms, used to play backgammon and other similar games, and develop our own rule engine and a set of players. By developing an evaluation function for playing board games, we aim to find out if it’s possible to construct a simple computerized player that outperforms other players, and which properties such a player should have. The player programs that we have developed are very primitive, but there are distinct tendencies revealing which players dominate the others. The winning player’s only strategy is to hit the opponent’s lone checkers, while protecting its own. Sammanfattning I denna uppsats beskriver vi några varianter av spelet backgammon. Vi går igenom olika algoritmer som anvands¨ for¨ att spela backgammon och andra liknande spel, och utvecklar vår egen regelmotor och en uppsattning¨ spelare. Vi vill skapa en evalueringsfunktion for¨ att spela bradspel¨ så att vi ska kunna ta reda på om det går att konstruera en enkel datorspelare som vinner over¨ andra spelare, och vilka egenskaper en sådan spelare ska ha. Spelarprogrammen som vi utvecklat ar¨ mycket primitiva, men det ar¨ andå¨ tydligt vilka spelare som tenderar att dominiera andra. Den vinnande spelaren har som sin enda strategi att slå ut motståndarens ensamma pjaser,¨ medan den skyddar sina egna. Contents Contents Contents 1 Introduction ............................................... 3 1.1 Statement of collaboration.....................................3 2 Problem statement ............................................ 4 3 Playing backgammon .......................................... 4 3.1 Terminology and basic rules....................................4 3.2 Rules of classic backgammon...................................4 3.3 Rules of Greek backgammon...................................5 3.3.1 Portes............................................5 3.3.2 Fevga.............................................5 3.3.3 Plakoto............................................6 4 Computer algorithms .......................................... 6 4.1 Minimax...............................................7 4.2 Expectimax..............................................7 4.3 Alpha-beta pruning.........................................8 4.4 *-Minimax..............................................8 4.5 Knowledge-based evaluation...................................8 4.6 Neural networks..........................................8 5 Constructing our own evaluation function .............................. 9 5.1 Research process..........................................9 5.2 Choice of game rules........................................9 5.3 Choice of algorithm......................................... 10 5.4 Rule engine............................................. 10 5.5 AI players.............................................. 11 5.6 Automated test framework.................................... 13 6 Results ................................................... 13 7 Conclusions ................................................ 14 8 Further research ............................................. 14 9 Acknowledgements ........................................... 15 References ................................................... 15 2 1 Introduction 1.1 Statement of collaboration 1 Introduction solve is to find the true probability of winning from a given game state when both players are rational, it turns out that we don’t know whether the prob- Artificial intelligence for computerized gameplay lem is in P.[3] That makes it unlikely that we will has been studied for a long time, but not until re- find a simple algorithm for solving it anytime soon, cently has the topic gained publicity. [6, p. xix] and requires us to create methods for optimizing While some of the algorithms have few appli- our guesses. cations outside the domain of human-imitating gameplaying opponents, they still have quite a In this project, we intend to present a few dif- few things to teach us. Firstly, in their algorith- ferent games played on the the same backgam- mic forms they can improve our understanding mon board and explain our reasons for choosing of probability, tactics and consequences of pre- a certain set of rules. Furthermore, we will in- vious decision-making. Secondly, they serve as troduce algorithms, frequently used when imple- measurements of the development in the com- menting backgammon artificial intelligence (AI) puter industry, as they are one the easiest compre- players. Lastly, we’ll describe our own experi- hensible ways of comparing human intelligence ments for constructing an evaluation function, the with computational thoroughness (especially in conclusions we drew and discuss possibilities for the well-known matches between computers and future research. human world champions). Thirdly, the bene- fits of the development of efficient gaming opponents shouldn’t be underestimated, as they pro- vide recreation as well as sharpening of our logical 1.1 Statement of collaboration and tactical abilities, which is often benefitial in other situations in our lives. This thesis has been written entirely by Anders The game of backgammon is particularly interest- Sjoqvist¨ and Andre´ Stenlund. It’s our firm belief ing, and is one of the most studied problems. [7] that every task must be assigned to exactly one in- It’s a two-player zero-sum game with a chance ele- dividual, or else it might be overlooked. Accord- ment. Since we might experience “bad luck” with ing to this principle, Andre´ was assigned the main the dice throws, the chance element makes it im- responsibility of coding the rule engine and game- possible to develop a strategy that is guaranteed playing software, as well as running the tests. An- to perform well in at least 50 % of the games in the ders, on the other hand, was responsible for finding short run, as even a novice player can beat a skilled appropriate literature, deciding about algorithms one thanks to luck (but note that this won’t happen and writing the report. The division of responsibil- in the long run, when “the luck runs out”). Devel- ities also roughly represents how the work was di- opment of new software for playing backgammon vided, although we made sure that we both agreed can improve the availability of games for different on everything. In addition, some tasks were done platforms, as well as promote further research of jointly, as exemplified by the separate rule engine the topic. Backgammon is also theoretically inter- written by Anders and parts of the literature search esting; if we decide that the problem we want to which was made by Andre.´ 3 3.2 Rules of classic backgammon 3 Playing backgammon 2 Problem statement ing the two dice, which must be thrown together and display an unambiguous result (i.e. if at least We want to use Alpha-beta pruning and Expecti- one die lands on the floor or stops in a tilted posi- max to find an efficient backgammon AI evaluation tion, both dice must be thrown again). The check- function, in order to increase our own understand- ers are then, if possible, moved around the board ing, summarize some important algorithms and and placed on some of the 24 points. The points are draw conclusions about the behavior of a few sim- located on both sides of the board, with 12 on each ple tactics. Preferably, these conclusions should be side, and divided into groups of six by the verti- able to help game developers or enable future re- cal bar. If a point is occupied by a single checker, search. Specifically, this is the question we want to it’s called a blot. Depending on the variant of the answer: game, the blot might be hit by the opponent and placed on the bar. The six final points along the “Is it possible to construct an evaluation path of movement for one of the players is called function that, using simple and humanly that player’s home board. If a player has a checker comprehensible rules, with statistical sig- on the bar, he must enter it into the opposing home nificance can outperform other evalua- board before he can make any other move. If there tion functions in backgammon?” are no possibilities to enter the checker, given the numbers on the dice, the player must wait a turn. 3 Playing backgammon Once all the checkers are gathered in the own home board, the player may start bearing off the checkers. The information in this section is available from various sources, but we recommend using 3.2 Rules of classic backgammon Backgammon Galore for reference. [1] They also have some information about how backgammon Figure 1 depicts the initial board setup in classic programs work. backgammon. The game is played with two dice, We will hereby explain the rules of a few games. indicating the movement of the checkers. A dice throw of 3; 5 means that two checkers may be The rules are provided for completeness, to ensure f g that a reader who has already played backgammon moved 3 and 5 points, respectively, or that one understands the rules in the same way that we do. checker may be moved twice, a total of 8 points It’s out of the scope of this thesis to try to teach a (but only in case a temporary stop on the 3rd or beginner how to play, and although the rules that 5th point is

Constructing an Evaluation Function for Playing Backgammon

Alpha-Beta Pruning

Combinatorial Game Theory

Game Tree Search

Lecture Notes Part 2

Combinatorial Game Theory: an Introduction to Tree Topplers

CONNECT6 I-Chen Wu1, Dei-Yen Huang1, and Hsiu-Chen Chang1

Part 4: Game Theory II Sequential Games

Alpha-Beta Pruning

Game Tree Search Complete Control of Environment ■ State Does Not Change Unless the Agent (Robot) Changes It

How Game Complexity Affects the Playing Behavior of Synthetic Agents

Games Solved: Now and in the Future by H

CS 520: Intro AI Jingjin Yu | Rutgers Games