A Gamut of Games

Total Page:16

File Type:pdf, Size:1020Kb

A Gamut of Games AI Magazine Volume 22 Number 3 (2001) (© AAAI) Articles A Gamut of Games Jonathan Schaeffer I In 1950, Claude Shannon published his seminal make this amazing feat possible. Often over- work on how to program a computer to play chess. looked, however, is that this result was also a Since then, developing game-playing programs testament to human abilities. Considering the that can compete with (and even exceed) the abil- formidable computing power that DEEP BLUE ities of the human world champions has been a used in its 1997 exhibition match against long-sought-after goal of the AI research commu- world chess champion Garry Kasparov nity. In Shannon’s time, it would have seemed (machine: 200,000,000 chess positions a sec- unlikely that only a scant 50 years would be need- ond; man: 2 a second), one can only admire ed to develop programs that play world-class backgammon, checkers, chess, Othello, and Scrab- the human champions for withstanding the ble. These remarkable achievements are the result technological onslaught for so long. of a better understanding of the problems being Computer game research was started by solved, major algorithmic insights, and tremen- some of the luminaries in computing science dous advances in hardware technology. Computer history. In 1950, Claude Shannon published games research is one of the important success sto- his seminal paper that laid out the framework ries of AI. This article reviews the past successes, for building high-performance game-playing current projects, and future research directions for programs (Shannon 1950). In 1951, Alan Tur- AI using computer games as a research test bed. ing (1953) did a hand simulation of his com- puter chess algorithm (a lack of resources pre- vented him from actually programming it); the algorithm lost to a weak human player. ames are ideal domains for exploring Around this time, Arthur Samuel began work the capabilities of computational intel- on his famous checkers-playing program, the Gligence. The rules are fixed, the scope of first program to achieve notable success the problem is constrained, and the interac- against human opposition (Samuel 1967, tions of the players are well defined. Contrast 1959). By 1958, Alan Newell and Herb Simon the game world with the real world—the game had begun their investigations into chess, of life—where the rules often change, the which eventually led to fundamental results scope of the problem is almost limitless, and for AI and cognitive science (Newell, Shaw, and the participants interact in an infinite number Simon 1958). An impressive lineup to say the of ways. Games can be a microcosm of the real least! world (for example, the role of game theory in In the half century since Shannon’s paper, economics, social interaction, and animal enormous progress has been made in con- behavior), and successfully achieving high structing high-performance game-playing pro- computer performance in a nontrivial game grams. In Shannon’s time, it would have can be a stepping stone toward solving more seemed unlikely that within a scant 50 years challenging real-world problems. checkers (8 8 draughts), Othello, and Scrab- Historically, games have been a popular ble programs would exist that exceed the abil- choice for demonstrating new research ideas in ities of the best human players,1,2 and back- AI. Indeed, one of the early goals of AI was to gammon and chess programs could play at a build a program capable of defeating the level comparable to the human world champi- human world chess champion in a match. This on. These remarkable accomplishments are the challenge proved to be more difficult than was result of a better understanding of the anticipated; the AI literature is replete with problems being solved, major algorithmic optimistic predictions. It eventually took insights, and tremendous advances in hard- almost 50 years to complete the task—a ware technology. The work on computer remarkably short time when one considers the games has been one of the most successful and software and hardware advances needed to visible results of AI research. For some games, Copyright © 2001, American Association for Artificial Intelligence. All rights reserved. 0738-4602-2000 / $2.00 FALL 2001 29 Articles one could argue that the Turing test has been these enhancements include iterative deepen- passed (Krol 1999). ing, caching previously seen subtree results This article discusses the progress made in (transposition tables), successor reordering, developing programs for the classic board and search extensions and reductions, probabilistic card games. For a number of games, a short his- cutoffs, and parallel search. The results are tru- tory of the progress in building a world-class ly amazing. Even though there is an exponen- program for the game is given, along with a tial difference between the best case and the brief description of the strongest program. In worst case for an alpha-beta search, most high- each case, a single feature of the program that performance game-playing programs are Considering is a major contributor to the program’s searching within a small constant of the best strength is highlighted. The histories are neces- case (Plaat et al. 1996). the sarily brief. I apologize in advance to the many Sadly, the community of researchers in- formidable hard-working researchers and hobbyists whose volved in this work has done a relatively poor computing work is not mentioned here. job of selling the technology, resulting in many Enabling Technologies briefly summarizes of the ideas being reinvented for other power that some of the major advances in technology that domains. For example, many search tech- DEEP BLUE facilitated the construction of world-champi- niques pioneered with alpha-beta have become onship–caliber programs. Success in Computer standard in other search domains, with few used in its Games reports the past successes where comput- realizing the lineage of the ideas. 1997 ers have met or exceeded the best human play- At the heart of many game-playing programs ers (backgammon, checkers, chess, Othello, and is an evaluation function. Early on, game exhibition Scrabble). Current Research Efforts highlights developers quickly encountered the knowl- match against games of current academic interest (bridge, go, edge-acquisition bottleneck and traded quality world chess and poker). The Future of Computer Games dis- of knowledge for speed of the program. Simple cusses some of the future challenges of using evaluation functions, linear combinations of champion games as a research test bed for AI. easily identifiable features, were the mainstay Garry Although this article emphasizes the AI of computer game programs for many decades. viewpoint, one should not underestimate the Alternative approaches, such as modeling Kasparov engineering effort required to build these pro- human cognitive processes, turned out to be (machine: grams. One need only look at the recent suc- much harder to do than initially expected and cess of the DEEP BLUE chess machine to appreci- generally resulted in poor performance. Game 200,000,000 ate the effort required. This project spanned 8 programmers quickly learned that a little chess years (12 if one includes the pre-IBM time) and heuristic knowledge, when combined with positions a included several full-time people, extensive deep search, can produce amazing perfor- computing resources, chip design, and grand mance results. Indeed, one could argue that the second; man: master consultation. Some of the case studies viability of brute-force search, once a term with 2 a second), hint at the amount of work required to con- negative connotations in the AI community, is struct these systems. In all cases, the successes one of the main research results from games- one can only reported in this article are the result of consis- related research (Ginsberg 1996b). admire the tent progress over many years. In the last decade, new techniques have human moved to the forefront of games research. Two Enabling Technologies in particular are mentioned here because they champions for are likely to play a more prominent role in the withstanding The biggest advances in computer game play- near future: (1) Monte Carlo simulation and (2) ing have come as a result of work done on the temporal-difference learning. the alpha-beta search algorithm. This algorithm Monte Carlo simulation has successfully technological received the most attention because of the been applied to games with imperfect or non- research community’s preoccupation with deterministic information. In these games, it is onslaught for chess. With the DEEP BLUE victory over world too expensive to search all possible outcomes. so long. chess champion Garry Kasparov, interest in Instead, only a representative sample is chosen methods suitable for chess has waned and been to give a statistical profile of the outcome. This replaced by activity in other games. One could technique has been successful in bridge, poker, argue that the chess victory removed a ball and and Scrabble. shackle that was stifling the creativity of Temporal-difference learning is the direct researchers who were building high-perfor- descendent of Samuel’s machine learning mance game-playing systems. research (Sutton 1988). Here, a database of The alpha-beta research led to a plethora of games (possibly generated by computer self- search enhancements, which significantly play) can be used to bootstrap a program to improved the efficiency of the search. Some of find a good combination of knowledge fea- 30 AI MAGAZINE Articles tures. The algorithm has successfully been ences” (Berliner 1980a, p. 215). Thus, assessing applied to backgammon and has recently the results of a five-game match is difficult. shown promise in chess and checkers (Schaef- Afterwards, Berliner (1980b, p. 71) analyzed fer, Hlynka, and Jussila 2001). the program’s play and concluded that The most obvious hardware advance is sim- There was no doubt that BKG9.8 played ply speed.
Recommended publications
  • University of Alberta Library Release
    University of Alberta Library Release Form Name of Author: Morgan Hugh Kan Title of Thesis: Postgame Analysis of Poker Decisions Degree: Master of Science Year this Degree Granted: 2007 Permission is hereby granted to the University of Alberta Library to reproduce sin- gle copies of this thesis and to lend or sell such copies for private, scholarly or scientific research purposes only. The author reserves all other publication and other rights in association with the copyright in the thesis, and except as herein before provided, neither the thesis nor any substantial portion thereof may be printed or otherwise reproduced in any material form whatever without the author's prior written permission. Morgan Hugh Kan Date: University of Alberta POSTGAME ANALYSIS OF POKER DECISIONS by Morgan Hugh Kan A thesis submitted to the Faculty of Graduate Studies and Research in partial ful- fillment of the requirements for the degree of Master of Science. Department of Computing Science Edmonton, Alberta Spring 2007 University of Alberta Faculty of Graduate Studies and Research The undersigned certify that they have read, and recommend to the Faculty of Grad- uate Studies and Research for acceptance, a thesis entitled Postgame Analysis of Poker Decisions submitted by Morgan Hugh Kan in partial fulfillment of the re- quirements for the degree of Master of Science. Jonathan Schaeffer Supervisor Michael Bowling Michael Carbonaro External Examiner Date: To my parents, Janet and Chay Kan, and my sister, Megan Kan, I would never have made it this far without them. Abstract In Artificial Intelligence research, evaluation is a recurring theme. A newly crafted game-playing program is interesting if it can be shown to be better by some mea- sure.
    [Show full text]
  • Catur Komputer Dari Wikipedia Bahasa Indonesia, Ensiklopedia Bebas
    Deep Blue Dari Wikipedia bahasa Indonesia, ensiklopedia bebas Belum Diperiksa Deep Blue Deep Blue adalah sebuah komputer catur buatan IBM. Deep Blue adalah komputer pertama yang memenangkan sebuah permainan catur melawan seorang juara dunia (Garry Kasparov) dalam waktu standar sebuah turnamen catur. Kemenangan pertamanya (dalam pertandingan atau babak pertama) terjadi pada 10 Februari 1996, dan merupakan permainan yang sangat terkenal. Namun Kasparov kemudian memenangkan 3 pertandingan lainnya dan memperoleh hasil remis pada 2 pertandingan selanjutnya, sehingga mengalahkan Deep Blue dengan hasil 4-2. Deep Blue lalu diupgrade lagi secara besar-besaran dan kembali bertanding melawan Kasparov pada Mei 1997. Dalam pertandingan enam babak tersebut Deep Blue menang dengan hasil 3,5- 2,5. Babak terakhirnya berakhir pada 11 Mei. Deep Blue menjadi komputer pertama yang mengalahkan juara dunia bertahan. Komputer ini saat ini sudah "dipensiunkan" dan dipajang di Museum Nasional Sejarah Amerika (National Museum of American History),Amerika Serikat. http://id.wikipedia.org/wiki/Deep_Blue Catur komputer Dari Wikipedia bahasa Indonesia, ensiklopedia bebas Komputer catur dengan layar LCD pada 1990-an Catur komputer adalah arsitektur komputer yang memuat perangkat keras dan perangkat lunak komputer yang mampu bermain caturtanpa kendali manusia. Catur komputer berfungsi sebagai alat hiburan sendiri (yang membolehkan pemain latihan atau hiburan jika lawan manusia tidak ada), sebagai alat bantu kepada analisis catur, untuk pertandingan catur komputer dan penelitian untuk kognisi manusia. Kategori Deep Blue (chess computer) From Wikipedia, the free encyclopedia Deep Blue Deep Blue was a chess-playing computer developed by IBM. On May 11, 1997, the machine, with human intervention between games, won the second six-game match against world champion Garry Kasparov, two to one, with three draws.[1] Kasparov accused IBM of cheating and demanded a rematch.
    [Show full text]
  • Multilinear Algebra and Chess Endgames
    Games of No Chance MSRI Publications Volume 29, 1996 Multilinear Algebra and Chess Endgames LEWIS STILLER Abstract. This article has three chief aims: (1) To show the wide utility of multilinear algebraic formalism for high-performance computing. (2) To describe an application of this formalism in the analysis of chess endgames, and results obtained thereby that would have been impossible to compute using earlier techniques, including a win requiring a record 243 moves. (3) To contribute to the study of the history of chess endgames, by focusing on the work of Friedrich Amelung (in particular his apparently lost analysis of certain six-piece endgames) and that of Theodor Molien, one of the founders of modern group representation theory and the first person to have systematically numerically analyzed a pawnless endgame. 1. Introduction Parallel and vector architectures can achieve high peak bandwidth, but it can be difficult for the programmer to design algorithms that exploit this bandwidth efficiently. Application performance can depend heavily on unique architecture features that complicate the design of portable code [Szymanski et al. 1994; Stone 1993]. The work reported here is part of a project to explore the extent to which the techniques of multilinear algebra can be used to simplify the design of high- performance parallel and vector algorithms [Johnson et al. 1991]. The approach is this: Define a set of fixed, structured matrices that encode architectural primitives • of the machine, in the sense that left-multiplication of a vector by this matrix is efficient on the target architecture. Formulate the application problem as a matrix multiplication.
    [Show full text]
  • Table of Contents 129
    Table of Contents 129 TABLE OF CONTENTS Table of Contents ......................................................................................................................................................129 Science and Checkers (H.J. van den Herik) .............................................................................................................129 Searching Solitaire in Real Time (R. Bjarnason, P. Tadepalli, and A. Fern)........................................................ 131 An Efficient Approach to Solve Mastermind Optimally (L-T. Huang, S-T. Chen, S-Ch. Huang, and S.-S. Lin) ...................................................................................................................................... 143 Note: ................................................................................................................................................................. 150 Gentlemen, Stop your Engines! (G. McC. Haworth).......................................................................... 150 Information for Contributors............................................................................................................................. 157 News, Information, Tournaments, and Reports: ......................................................................................................158 The 12th Computer Olympiad (Continued) (H.J. van den Herik, M.H.M. Winands, and J. Hellemons).158 DAM 2.2 Wins Draughts Tournament (T. Tillemans) ........................................................................158
    [Show full text]
  • SCIENCE VOLUME 18, No 2, FALL 2007
    SCIENCE VOLUME 18, No 2, FALL 2007 FACULTY OF SCIENCE ALUMNI MAGAZINE contourswww.ualberta.ca/science UNIVERSITY Reading CREATES our body’s SPACE chemicals for INSTITUTE better diagnosis Honouring Mama ALUMNUS TURNS Lu’s spirit of GRIEF INTO HOPE giving Fine arts ARCTIC PONDS students liven DRYING UP up lab space through games SCIENCEcontours Science Contours is published twice MESSAGE FROM THE DEAN a year by the faculty of Science to provide current information on its many activities. The magazine is distributed to alumni and friends of the Faculty of An Intersection Between Arts and Science Science. Phase 1 of our new Dean of Science Centennial Centre for Gregory Taylor Interdisciplinary Sci- ence (CCIS) was near- Assistant Dean, External Relations Claudia Wood ing completion when I happened upon a stu- External Relations Team dent art competition on Katherine Captain, Emily Lennstrom, campus. It occurred to Michel Proulx, Traci Toshack, me then that it would be Kevin Websdale wonderful to have stu- Fine arts students Editor dents help us put some Wendy Lung and Ansun Michel Proulx finishing touches on our Yan enjoy their work new space. Graphic Design After a few conversations, two different classes - one in Inte- Studio X Design grative/Exhibit Design and the other in Printmaking - were tasked with coming up with ideas. The results are astounding. Contributing writers To begin with, the students had to wrap their heads around the Bev Betkowski, Kris Connor, Michel purpose and uniqueness of the building. CCIS is one of the few Proulx, Isabela Varela facilities in the world to house interdisciplinary teams under one roof and Phase 1 is no different.
    [Show full text]
  • Oral History of Hans Berliner
    Oral History of Hans Berliner Interviewed by: Gardner Hendrie Recorded: March 7, 2005 Riviera Beach, Florida Total Running Time: 2:33:00 CHM Reference number: X3131.2005 © 2005 Computer History Museum CHM Ref: X3131.2005 © 2005 Computer History Museum Page 1 of 65 Q: Who has graciously agreed to do an oral history for the Computer History Museum. Thank you very much, Hans. Hans Berliner: Oh, you’re most welcome. Q: O.k. I think where I’d like to start is maybe a little further back than you might expect. I’d like to know if you could share with us a little bit about your family background. The environment that you grew up in. Your mother and father, what they did. Your brothers and sisters. Hans Berliner: O.k. Q: Where you were born. That sort of thing. Hans Berliner: O.k. I was born in Berlin in 1929, and we immigrated to the United States, very fortunately, in 1937, to Washington, D.C. As far as the family goes, my great uncle, who was my grandfather’s brother, was involved in telephone work at the turn of the previous century. And he actually owned the patent on the carbon receiver for the telephone. And they started a telephone company in Hanover, Germany, based upon his telephone experience. And he, later on, when Edison had patented the cylinder for recording, he’d had enough experience with sound recording that he said, “that’s pretty stupid”. And he decided to do the recording on a disc, and he successfully defended his patent in the Supreme Court, and so the patent on the phono disc belongs to Emile Berliner, who was my grand uncle.
    [Show full text]
  • THE PSYCHOLOGY of CHESS: an Interview with Author Fernand Gobet
    FM MIKE KLEIN, the Chess Journalists Chess Life of America's (CJA) 2017 Chess Journalist of the Year, stepped OCTOBER to the other side of the camera for this month's cover shoot. COLUMNS For a list of all CJA CHESS TO ENJOY / ENTERTAINMENT 12 award winners, It Takes A Second please visit By GM Andy Soltis chessjournalism.org. 14 BACK TO BASICS / READER ANNOTATIONS The Spirits of Nimzo and Saemisch By GM Lev Alburt 16 IN THE ARENA / PLAYER OF THE MONTH Slugfest at the U.S. Juniors By GM Robert Hess 18 LOOKS AT BOOKS / SHOULD I BUY IT? Training Games By John Hartmann 46 SOLITAIRE CHESS / INSTRUCTION Go Bogo! By Bruce Pandolfini 48 THE PRACTICAL ENDGAME / INSTRUCTION How to Write an Endgame Thriller By GM Daniel Naroditsky DEPARTMENTS 5 OCTOBER PREVIEW / THIS MONTH IN CHESS LIFE AND US CHESS NEWS 20 GRAND PRIX EVENT / WORLD OPEN 6 COUNTERPLAY / READERS RESPOND Luck and Skill at the World Open BY JAMAAL ABDUL-ALIM 7 US CHESS AFFAIRS / GM Illia Nyzhnyk wins clear first at the 2018 World Open after a NEWS FOR OUR MEMBERS lucky break in the penultimate round. 8 FIRST MOVES / CHESS NEWS FROM AROUND THE U.S. 28 COVER STORY / ATTACKING CHESS 9 FACES ACROSS THE BOARD / How Practical Attacking Chess is Really Conducted BY AL LAWRENCE BY IM ERIK KISLIK 51 TOURNAMENT LIFE / OCTOBER The “secret sauce” to good attacking play isn’t what you think it is. 71 CLASSIFIEDS / OCTOBER CHESS PSYCHOLOGY / GOBET SOLUTIONS / OCTOBER 32 71 The Psychology of Chess: An Interview with Author 72 MY BEST MOVE / PERSONALITIES Fernand Gobet THIS MONTH: FM NATHAN RESIKA BY DR.
    [Show full text]
  • English for It Students
    УКРАЇНА НАЦІОНАЛЬНИЙ УНІВЕРСИТЕТ БІОРЕСУРСІВ І ПРИРОДОКОРИСТУВАННЯ УКРАЇНИ Кафедра англійської філології Ямнич Н. Ю., Данькевич Л. Р. ENGLISH FOR IT STUDENTS 1 УДК: 811.111(072) Навчальний посібник з англійської мови розрахований на студентів вищих навчальних закладів зі спеціальностей «Комп”ютерні науки» та «Програмна інженерія». Мета видання – сприяти розвитку і вдосконаленню у студентів комунікативних навичок з фаху, навичок читання та письма і закріплення навичок з граматики, а також активізувати навички автономного навчання. Посібник охоплює теми актуальні у сучасному інформаційному середовищі, що подаються на основі автентичних професійно спрямованих текстів, метою яких є розвиток у студентів мовленнєвої фахової компетенції, що сприятиме розвитку логічного мислення. Добір навчального матеріалу відповідає вимогам навчальної програми з англійської мови. Укладачі: Л.Р. Данькевич, Н.Ю. Ямнич, Рецензенти: В. В. Коломійцева, к. філол. н., доцент кафедри сучасної української мови інституту філології Київського національного університету імені Тараса Шевченка В.І. Ковальчук, д. пед. наук, професор,завідувач кафедри методики навчання та управління навчальними закладами НУБіП України Кравченко Н. К., д. філ. наук, професор кафедри англійської філології і філософії мови ім. професора О. М. Мороховського, КНЛУ Навчальний посібник з англійської мови для студентів факультету інформаційних технологій. – К.: «Компринт», 2017. – 608 с. ISBN Видання здійснено за авторським редагуванням Відповідальний за випуск: Н.Ю.Ямнич ISBN © Н. Ю. Ямнич, Л.Д. Данькевич, 2017 2 CONTENTS Unit 1 Higher Education 5 Language practice. Overview of verb tenses 18 Unit 2 Jobs and careers 25 Language practice. Basic sentence structures 89 Unit 3 Tied to technology 103 Language practice. Modal verbs 115 Unit 4 Computers 135 Language practice. Passive 153 Unit 5 Communication. E –commerce 167 Language practice.
    [Show full text]
  • Building a Champion Level Computer Poker Player
    University of Alberta Library Release Form Name of Author: Michael Bradley Johanson Title of Thesis: Robust Strategies and Counter-Strategies: Building a Champion Level Computer Poker Player Degree: Master of Science Year this Degree Granted: 2007 Permission is hereby granted to the University of Alberta Library to reproduce single copies of this thesis and to lend or sell such copies for private, scholarly or scientific research purposes only. The author reserves all other publication and other rights in association with the copyright in the thesis, and except as herein before provided, neither the thesis nor any substantial portion thereof may be printed or otherwise reproduced in any material form whatever without the author’s prior written permission. Michael Bradley Johanson Date: Too much chaos, nothing gets finished. Too much order, nothing gets started. — Hexar’s Corollary University of Alberta ROBUST STRATEGIES AND COUNTER-STRATEGIES: BUILDING A CHAMPION LEVEL COMPUTER POKER PLAYER by Michael Bradley Johanson A thesis submitted to the Faculty of Graduate Studies and Research in partial fulfillment of the requirements for the degree of Master of Science. Department of Computing Science Edmonton, Alberta Fall 2007 University of Alberta Faculty of Graduate Studies and Research The undersigned certify that they have read, and recommend to the Faculty of Graduate Studies and Research for acceptance, a thesis entitled Robust Strategies and Counter-Strategies: Building a Champion Level Computer Poker Player submitted by Michael Bradley Johanson in partial fulfillment of the requirements for the degree of Master of Science. Michael Bowling Supervisor Duane Szafron Michael Carbonaro External Examiner Date: To my family: my parents Brad and Sue Johanson, and my brother, Jeff Johanson.
    [Show full text]
  • Biography of N. N. Luzin
    BIOGRAPHY OF N. N. LUZIN http://theor.jinr.ru/~kuzemsky/Luzinbio.html BIOGRAPHY OF N. N. LUZIN (1883-1950) born December 09, 1883, Irkutsk, Russia. died January 28, 1950, Moscow, Russia. Biographic Data of N. N. Luzin: Nikolai Nikolaevich Luzin (also spelled Lusin; Russian: НиколайНиколаевич Лузин) was a Soviet/Russian mathematician known for his work in descriptive set theory and aspects of mathematical analysis with strong connections to point-set topology. He was the co-founder of "Luzitania" (together with professor Dimitrii Egorov), a close group of young Moscow mathematicians of the first half of the 1920s. This group consisted of the higly talented and enthusiastic members which form later the core of the famous Moscow school of mathematics. They adopted his set-theoretic orientation, and went on to apply it in other areas of mathematics. Luzin started studying mathematics in 1901 at Moscow University, where his advisor was professor Dimitrii Egorov (1869-1931). Professor Dimitrii Fedorovihch Egorov was a great scientist and talented teacher; in addition he was a person of very high moral principles. He was a Russian and Soviet mathematician known for significant contributions to the areas of differential geometry and mathematical analysis. Egorov was devoted and openly practicized member of Russian Orthodox Church and active parish worker. This activity was the reason of his conflicts with Soviet authorities after 1917 and finally led him to arrest and exile to Kazan (1930) where he died from heavy cancer. From 1910 to 1914 Luzin studied at Gottingen, where he was influenced by Edmund Landau. He then returned to Moscow and received his Ph.D.
    [Show full text]
  • Mohex 2.0: a Pattern-Based MCTS Hex Player
    MoHex 2.0: a pattern-based MCTS Hex player Shih-Chieh Huang1,2, Broderick Arneson2, Ryan B. Hayward2, Martin M¨uller2, and Jakub Pawlewicz3 1 DeepMind Technologies 2 Computing Science, University of Alberta 3 Institute of Informatics, University of Warsaw Abstract. In recent years the Monte Carlo tree search revolution has spread from computer Go to many areas, including computer Hex. MCTS Hex players now outperform traditional knowledge-based alpha-beta search players, and the reigning Computer Olympiad Hex gold medallist is the MCTS player MoHex. In this paper we show how to strengthen Mo- Hex, and observe that — as in computer Go — using learned patterns in priors and replacing a hand-crafted simulation policy with a softmax pol- icy that uses learned patterns can significantly increase playing strength. The result is MoHex 2.0, about 250 Elo stronger than MoHex on the 11×11 board, and 300 Elo stronger on 13×13. 1 Introduction In the 1940s Piet Hein [22] and independently John Nash [26–28] invented Hex, the classic two-player alternate-turn connection game. The game is easy to im- plement — in the 1950s Claude Shannon and E.F. Moore built an analogue Hex player based on electrical circuits [29] — but difficult to master, and has often been used as a testbed for artificial intelligence research. Around 2006 Monte Carlo tree search appeared in Go Go [11] and soon spread to other domains. The four newest Olympiad Hex competitors — MoHex from 2008 [4], Yopt from 2009 [3], MIMHex from 2010 [5], Panoramex from 2011 [20] — all use MCTS.
    [Show full text]
  • Penultimate Self-Produced Version. Published in the Journal Paradigmi
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Archivio della ricerca - Università degli studi di Napoli Federico II Penultimate self-produced version. Published in the journal Paradigmi 33(3), pp. 163-178. DOI: 10.3280/PARA2015-003010 GUGLIELMO TAMBURRINI* MACHINE INTELLIGENCE SPORTS AS RESEARCH PROGRAMS Abstract: Games and competitions have played a significant role throughout the history of artificial intelligence and robotics. Machine intelligence games are examined here from a distinctive methodological perspective, focusing on their use as generators of multidisciplinary research programs. In particular, Robocup is analyzed as an exemplary case of contemporary research program developing from machine intelligence games. These research programs arising are schematized in terms of framework building, subgoaling, and outcome appraisal processes. The latter process is found to involve a rather intricate system of rewards and penalties, which take into account the double allegiance of participating scientists, trading and sharing interchanges taking place in a multidisciplinary research environment, in addition to expected industrial payoffs and a variety of other fringe research benefits in the way of research outreach and results dissemination, recruitment of junior researchers and students enrollment. Key words: double allegiance of scientists, machine intelligence games, methodology of AI, methodology of robotics, multidisciplinary research, research gamification. Parole chiave: doppia
    [Show full text]