Designing a Single Chip Chess Grandmaster While Knowing Nothing About Chess*

Total Page:16

File Type:pdf, Size:1020Kb

Designing a Single Chip Chess Grandmaster While Knowing Nothing About Chess* Designing a Single Chip Chess Grandmaster While Knowing Nothing About Chess* Feng-hsiung Hsu IBM T. J. Watson Research Center [email protected] (*Well, I did know close to nothing about chess...) Outline Project History Problem Descriptions Design Philosophy Chip Architecture System Performance World Chess Champion on a PC? Project History Started as a one-person student project at Carnegie Mellon ('85) First Grandmaster Level Chess Machine (Deep Thought, '88) Became an IBM Project in '89 New Chip Design in Full Swing from '92 First Release Chip in '95 ('96 Match) Second Release Chip in '97 ('97 Rematch) Problem Descriptions (Speed) FIDE Rating 3600 3400 3200 3000 Gary Kasparov Deep Blue 2800 Target Zone 2600 Deep Blue Prototype 2400 Deep Thought Hitech 2200 Belle at 8 2000 Belle at 7 1800 Belle at 6 1600 Belle at 5 1400 1200 Belle at 4 4 5 678910 11 12 13 Normalized search depth Problem Descriptions Search speed appears to be a necessary condition Speed can be obtained by increasing the level of integration (single chip chess machines) and by using massive parallelism But is speed alone enough? Problem Descriptions Speed alone might not be enough Deep Thought was a tournament GM But not a match GM Human Grandmasters, in serious matches, learn from computers' mistakes, exploit the weaknesses, and drive a truck thru the gaping holes Need to have very few weaknesses, and only weaknesses that are difficult to exploit Design Philosophy Chip speed is secondary Integration level is paramount Encapsulate every chess evaluation terms from chess books Create evaluation terms to deal with every known computer weaknesses Add hooks to handle new weaknesses, if they appear, with external FPGA hardware Put everything on one chip Deep Blue Architecture DB BOARD P2SC MEMORY I/O DB BOARD Scalable parallel system 30 node IBM RS/6000 SP supercomputer 16 chess accelerator chips per node 200 million chess positions per second average Chess Machine Basics Move Move Repetition Generator Stack Detector Move Bus (19-bit) Fast Status Eval Signals Evaluation Alphabeta Search Function Control Control Slow Signals Eval Move Systolic Eval Gen Array Pipelined (8x1) Evaluation (8x8) RAMs Pipelined Post-evaluation Logic Post-evaluation Pipelined Game Phase Search Control Endgame Control Repetition King Endgame Detector Piece and ROM Placement Pawn Tables Array Move Tables Search Stack Datapath Chip Architecture Entry e4 ... No Legal? Leaf? No e5 to Parent ... Yes Yes Early Fast Eval d4 >= in <= Exit Window? Move ... Move Evaluation Generator Generation Yes Function d Continue to Parent Slow Eval Qd4 No Has Yes Return move? Cutoff? Yes No Next Chip Architecture (Move Gen) The move generator is an extension of the move generator used in Deep Thought 8 by 8 combinatorial array, one cell per square of the chess board Intercell wiring corresponding to the way chess pieces move Can generate checking and check evasion moves Also "generalized checking" moves Chip Architecture (Eval) A fully parallel implementation of the evaluation function is too big Use a pipelined implementation Divide the evaluation into two parts Fast evaluation in one cycle, including all the big terms ''Slow'' evaluation computed in a 8-cycle sequence (+3 additional cycles), one cycle per column of the chess board Use small RAMs instead few large RAMs Fast and Slow Evaluation Fast Eval 1 cycle Slow Evaluation ... 8 + 3 cycles (if needed) Time Time when a move was made Fast Eval to slow eval Game Phase Control mat adj Endgame Draw? Logic & ROMs Endgame Adjustment to Endgame Evaluation Endgame Adjustment to search control King Logic & pawn run? Pawn Array Piece Incremental Evaluation to search control Placement Piece Placement Table Aggr+to Both colors (1024x10) To Vict+to Alphabeta White Only Mux Aggr+fr (512x10) Search Control Aggr+fr Black Only Mux Vict+to (512x10) AggrIsWhite Endgame ROM Interface KP vs K Piece Counts Address KR vs KP to Generators Endgame adder tree XORed Piece KQ vs KP Locations for each piece type KRP vs KP Main Slow Evaluation Flow PC type RAM 1 Rank 8 r to l signals RAM 2 Piece Reg & Right Transformation to Merge and Adder tree Left to Signals Rank 2 alphabeta search control RAM N-1 Rank 1 RAM N to Left to right scan alphabeta Dev RAM search control FSMs Pin RAM Adder Tree REGs Chip Statistics 3-level metal 0.6-micron 5-V CMOS 1.5 million transistors, 1 W 1.4 cm x 1.4 cm 2.5 million chess positions/sec 25 to 100 billion general purpose instructions per second A single chip system appears to play at strong Grandmaster and possibly Super Grandmaster level System Performance A 480-chip system was assembled for the 1997 Rematch with Kasparov System sustained speed was about 200 million chess positions per second, or roughly 1/5 of the potential peak speed First match win ever by a computer over World Chess Champion (3.5 to 2.5) 78 million web hits (1997 figure) 5-7 billion impressions from Rematch alone IBM stock went up the next day World Chess Champion on a PC? 0.6-micron CMOS not state of the art Design was not speed optimized 30 million chess positions/sec per chip appears possible with, say, a 0.35-micron process A small array of chips plugged into a PC could be sufficient to beat Kasparov, if a single chip is not sufficient Shogi (Japanese Chess) next?.
Recommended publications
  • Oral History of Hans Berliner
    Oral History of Hans Berliner Interviewed by: Gardner Hendrie Recorded: March 7, 2005 Riviera Beach, Florida Total Running Time: 2:33:00 CHM Reference number: X3131.2005 © 2005 Computer History Museum CHM Ref: X3131.2005 © 2005 Computer History Museum Page 1 of 65 Q: Who has graciously agreed to do an oral history for the Computer History Museum. Thank you very much, Hans. Hans Berliner: Oh, you’re most welcome. Q: O.k. I think where I’d like to start is maybe a little further back than you might expect. I’d like to know if you could share with us a little bit about your family background. The environment that you grew up in. Your mother and father, what they did. Your brothers and sisters. Hans Berliner: O.k. Q: Where you were born. That sort of thing. Hans Berliner: O.k. I was born in Berlin in 1929, and we immigrated to the United States, very fortunately, in 1937, to Washington, D.C. As far as the family goes, my great uncle, who was my grandfather’s brother, was involved in telephone work at the turn of the previous century. And he actually owned the patent on the carbon receiver for the telephone. And they started a telephone company in Hanover, Germany, based upon his telephone experience. And he, later on, when Edison had patented the cylinder for recording, he’d had enough experience with sound recording that he said, “that’s pretty stupid”. And he decided to do the recording on a disc, and he successfully defended his patent in the Supreme Court, and so the patent on the phono disc belongs to Emile Berliner, who was my grand uncle.
    [Show full text]
  • Computer Chess Methods
    COMPUTER CHESS METHODS T.A. Marsland Computing Science Department, University of Alberta, EDMONTON, Canada T6G 2H1 ABSTRACT Article prepared for the ENCYCLOPEDIA OF ARTIFICIAL INTELLIGENCE, S. Shapiro (editor), D. Eckroth (Managing Editor) to be published by John Wiley, 1987. Final draft; DO NOT REPRODUCE OR CIRCULATE. This copy is for review only. Please do not cite or copy. Acknowledgements Don Beal of London, Jaap van den Herik of Amsterdam and Hermann Kaindl of Vienna provided helpful responses to my request for de®nitions of technical terms. They and Peter Frey of Evanston also read and offered constructive criticism of the ®rst draft, thus helping improve the basic organization of the work. In addition, my many friends and colleagues, too numerous to mention individually, offered perceptive observations. To all these people, and to Ken Thompson for typesetting the chess diagrams, I offer my sincere thanks. The experimental work to support results in this article was possible through Canadian Natural Sciences and Engineering Research Council Grant A7902. December 15, 1990 COMPUTER CHESS METHODS T.A. Marsland Computing Science Department, University of Alberta, EDMONTON, Canada T6G 2H1 1. HISTORICAL PERSPECTIVE Of the early chess-playing machines the best known was exhibited by Baron von Kempelen of Vienna in 1769. Like its relations it was a conjurer's box and a grand hoax [1, 2]. In contrast, about 1890 a Spanish engineer, Torres y Quevedo, designed a true mechanical player for king-and-rook against king endgames. A later version of that machine was displayed at the Paris Exhibition of 1914 and now resides in a museum at Madrid's Polytechnic University [2].
    [Show full text]
  • Deep Blue System Overview
    Deep Blue System Overview Feng-hsiung Hsu, Murray S. Campbell, and A. Joseph Hoane, Jr. IBM T. J. Watson Research Center Abstract By the late 1970s, it became clear that faster hard- ware speed was the single most effective way to improve One of the oldest Grand Challenge problems in com- program performance, when used in conjunction with puter science is the creation of a World Championship brute force searching techniques. Joe Condon and Ken level chess computer. Combining VLSI custom circuits, Thompson [2] of Bell Laboratories proceeded to build dedicated massively parallel C@ search engines, and the famous Belle special-purpose chess automaton. In various new search algorithms, Deep Blue is designed 1982, searching about 8 plies (4 moves by White, and to be such a computer. This paper gives an overview 4 moves by Black) in chess middle games, it became of the system, and examines the prospects of reaching the first computer to play at USCF National Master the goal. level and thus claimed the first Fredkin Intermediate Prize. Thompson’s self-play experiments [13, 4] with Belle, however, showed that the performance increase 1 Introduction started to seriously taper off when the search depth exceeded 8 plies. At shallower depths, Belle gained Charles Babbage, back in 1840s, contemplated the pos- about 200 rating points (one full chess Class) for each sibility of playing chess using his Analytical Engine. additional ply, or close to 100 rating points for each Claude Shannon, at the dawn of the modern computer doubling of search speed. (To search one extra ply re- age, presented his seminal paper [10] on computer chess quires a six-fold increase in search speed on average.
    [Show full text]
  • Game Playing in the Real World
    GGaammee PPllaayyiinngg iinn tthhee RReeaall WWoorrlldd Next time: Knowledge Representation Reading: Chapter 7.1-7.3 1 WWhhaatt mmaatttteerrss?? ° Speed? ° Knowledge? ° Intelligence? ° (And what counts as intelligence?) ° Human vs. machine characteristics 2 ° The decisive game of the match was Game 2, which left a scare in my memory … we saw something that went well beyond our wildest expectations of how well a computer would be able to foresee the long-term positional consequences of its decisions. The machine refused to move to a position that had a decisive short-term advantage – showing a very human sense of danger. I think this moment could mark a revolution in computer science that could earn IBM and the Deep Blue team a Nobel Prize. Even today, weeks later, no other chess-playing program in the world has been able to evaluate correctly the consequences of Deep Blue’s position. (Kasparov, 1997) 3 QQuuootteess ffrroomm IIEEEEEE aarrttiiccllee ° Why, then, do the very best grandmasters still hold their own against the silicon beasts? ° The side with the extra half-move won 3 games out of four, corresponding to a 200-point gap in chess rating – roughly the difference between a typically grandmaster (about 2600) and Kasparov (2830) ° An opening innovation on move nine gave Kasparov not merely the superior game but one that Fritz could not understand ° Kasparov made sure that Fritz would never see the light at the end of that tunnel by making the tunnel longer. 4 DDeeeepp BBlluuee –– AA CCaassee SSttuuddyy Goals ° Win a match against
    [Show full text]
  • Modern Masters of an Ancient Game
    AI Magazine Volume 18 Number 4 (1997) (© AAAI) Modern Masters of an Ancient Game Carol McKenna Hamilton and Sara Hedberg he $100,000 Fredkin Prize for on Gary Kasparov in the final game of tute of Technology Computer Science Computer Chess, created in a tied, six-game match last May 11. Professor Edward Fredkin to encourage T1980 to honor the first program Kasparov had beaten the machine in continued research progress in com- to beat a reigning world chess champi- an earlier match held in February 1996. puter chess. The prize was three-tiered. on, was awarded to the inventors of The Fredkin Prize was awarded un- The first award of $5,000 was given to the Deep Blue chess machine Tuesday, der the auspices of AAAI; funds had two scientists from Bell Laboratories July 29, at the annual meeting of the been held in trust at Carnegie Mellon who in 1981 developed the first chess American Association for Artificial In- University. machine to achieve master status. telligence (AAAI) in Providence, The Fredkin Prize was originally es- Seven years later, the intermediate Rhode Island. tablished at Carnegie Mellon Universi- prize of $10,000 for the first chess ma- Deep Blue beat world chess champi- ty 17 years ago by Massachusetts Insti- chine to reach international master status was awarded in 1988 to five Carnegie Mellon graduate students who built Deep Thought, the precur- sor to Deep Blue, at the university. Soon thereafter, the team moved to IBM, where they have been ever since, working under wraps on Deep Blue. The $100,000 third tier of the prize was awarded at AAAI–97 to this IBM team, who built the first computer chess machine that beat a world chess champion.
    [Show full text]
  • Chess As Played by Artificial Intelligence
    Chess as Played by Artificial Intelligence Filemon Jankuloski, Adrijan Božinovski1 University American College of Skopje, Skopje, Macedonia [email protected] [email protected] Abstract. In this paper, we explore the evolution of how chess machinery plays chess. The purpose of the paper is to discuss the several different algorithms and methods which are implemented into chess machinery systems to either make chess playing more efficient or increase the playing strength of said chess machinery. Not only do we look at methods and algorithms implemented into chess machinery, but we also take a look at the many accomplishments achieved by these machines over a 70 year period. We will analyze chess machines from their conception up until their absolute domination of the competitive chess scene. This includes chess machines which utilize anything from brute force approaches to deep learning and neural networks. Finally, the paper is concluded by discussing the future of chess machinery and their involvement in both the casual and competitive chess scene. Keywords: Chess · Algorithm · Search · Artificial Intelligence 1 Introduction Artificial intelligence has become increasingly prevalent in our current society. We can find artificial intelligence being used in many aspects of our daily lives, such as through the use of Smartphones and autonomous driving vehicles such as the Tesla. There is no clear and concise definition for Artificial Intelligence, since it is a subject which still has massive space for improvement and has only recently been making most of its significant advancements. Leading textbooks on AI, however, define it as the study of “intelligent agents”, which can be represented by any device that perceives its environment and takes actions that maximizes its chances of achieving its goals [1].
    [Show full text]
  • Deep Thought Wins Fredkin Intermediate Prize Hans Berliner
    AI Magazine Volume 10 Number 2 (1989) (© AAAI) CHESS REPORT Deep Thought Wins Fredkin Intermediate Prize Hans Berliner ince May 1988, Deep Thought almost all potential contenders. The S (DT), the creation of a team of two first place finishers drew with students at Carnegie Mellon Universi- each other. Hitech led the field at the ty, has been attracting a lot of notice. halfway point but lost to DT in round In the Fredkin Masters Open, May 3 and threw away a winning position 28–30, DT tied for second in a field of against Fidelity in round four (because over 20 masters and ahead of three of a programming bug). The level of other computers, including Hitech play in this tournament was by far the and Chiptest (the winner of the 1987 best ever in a computer event, and the North American Computer Champi- winners clearly deserved their top onships). In August at the U.S. Open, places. DT scored 8.5, 3.5 to tie for eigh- Ten days later, DT achieved the teenth place with Arnold Denker greatest computer success to date. It among others. Its performance was tied for first with GM Tony Miles in marred by hardware and software the prestigious Software Toolworks bugs. However, DT astounded every- Open in Los Angeles with a score of one by beating International Master 6.5, 1.5. Several GMs played in this (IM) Igor Ivanov, the perennial winner tournament, including former World of the U.S. Grand Prix circuit prize, Champion Mikhail Tal of the USSR. who is generally regarded to be as In the tournament, DT became the strong as the average Grandmaster first computer to beat a GM in a regu- (GM).
    [Show full text]
  • Results of Acm's Eighteenth Computer Chess
    MEETINGREPORT RESULTS OF ACM’S EIGHTEENTH COMPUTER CHESS CHAMPIONSHIP MONTY NEWBORN and DANNY KOPEC CHIPTEST-M, a chess-playing computer program devel- at least one level deeper than its opponents searched oped by a team of three graduate students at Carnegie- and apparently too much of an advantage for any of Mellon University, took first place at ACM’s 18th North them to overcome. The program is only two years old American Computer Chess Championship held at the and has less chess knowledge than its main rivals, but ACM/IEEE-CS Fall Joint Computer Conference in Dal- the enormous tree that it searches more than makes up las, Texas last October. With the strongest field ever for these shortcomings. assembled for a computer chess event, including cur- Hsu expects to improve the program over the coming rent World Champion CRAY BLITZ and former World year by introducing parallelism. His chip can be repli- Champion BELLE, CHIPTEST-M overpowered the field cated, and with the talent that has created the current of 12 with a perfect 4-0 performance to capture the version, a parallel version can be anticipated which $2000 first place prize. En route to winning the champi- will be significantly stronger. The program was as- onship, the program defeated CRAY BLITZ in the third signed a 2584 performance rating at the Dallas event round and then routinely disposed of SUN PHOENIX in by Ken Thompson’s rating program. This salys that the final fourth round. CRAY BLITZ finished in second CHIPTEST-M was playing at the level of a Grandmaster place, while SUN PHOENIX settled for third.
    [Show full text]
  • The Copyright Law of the United States (Title 17, U.S
    NOTICE WARNING CONCERNING COPYRIGHT RESTRICTIONS: The copyright law of the United States (title 17, U.S. Code) governs the making of photocopies or other reproductions of copyrighted material. Any copying of this document without permission of its author may be prohibited by law. CMU-CS-86-103 Two Designs of Functional Units for VLSI Based Chess Machi nes Feng-hsiung Hsu Department of Computer Science Carnegie-Mellon University Pittsburgk Pennsylvania 15213 Abstract Brute force chess automata searching 8 plies (4 full moves), or deeper have been dominating the computer chess scene in recent years and have reached master level performance. One interesting quesdon is whether 3 or 4 additional plies coupled with an improved evaluation scheme will bring forth world championship level performance. Assuming an optimistic branching ratio of 5, speedup of at least one hundredfold over the best current chess automaton would be necessary to reach die 11 to 12 plies per move range. One way to obtain such speedup is to improve the gate utilization and then parallelize die search process. In this paper, two new designs of functional units with higher gate efficiency tfian previous designs in the literature will be presented. The first design is for move generation only, and is essentially a refinement of the move generator used in the Belle chess automaton, the first certified computer chess master. The second design is a general scheme diat can be used for evaluating a class of chess-specific functions, besides generating moves. A move generator based on die second design will be described. Applications of the same general scheme to board evaluation will be briefly discussed.
    [Show full text]
  • The Startech Massively Parallel Chess Program Bradley C
    The StarTech Massively Parallel Chess Program Bradley C. Kuszmaul [email protected] http://theory.lcs.mit.edu/Äbradley Laboratory for Computer Science Massachusetts Institute of Technology NE43-247, 545 Technology Square Cambridge, MA 02139 January 11, 1995 Abstract chess, the StarTech project has produced some interest- ing computer chess technology. This paper explains how The StarTech massively parallel chess program, running StarTech works and how it performs. on a 512-processor Connection Machine CM-5 supercom- The chess knowledge of StarTechÐwhich includes the puter, tied for third place at the 1993 ACM International opening book of precomputed moves at the beginning Computer Chess Championship. StarTech employs the of the game, the endgame databases, the static position- Jamboree search algorithm, a natural extension of J. Pearl's evaluation function, and the time-control strategyÐis Scout search algorithm, to ®nd parallelism in game-tree based on H. Berliner's Hitech program [BE89]. Hitech searches. StarTech's work-stealing scheduler distributes runs on special-purpose hardware built in the mid 1980's the work speci®ed by the search algorithm across the pro- and searches in the range of 100,000 to 200,000 positions cessors of the CM-5. StarTech uses a global transposition per second. Berliner provided me with an implementa- table shared among the processors. StarTech has an infor- tion of Hitech written in C (without any search extensions mally estimated rating of over 2400 USCF. except for quiescence with check extension) that runs at Two performance measures help in understanding the 2,000 to 5,000 positions per second.
    [Show full text]
  • White Knight Review Chess E-Magazine
    Chess E-Magazine Interactive E-Magazine Volume 2 • Issue 1 January/February 2011 CHESS THEMES IN MUSIC Turk, Ajeeb, Mephisto:-the great Chess Automatons Women and Chess The Jerome Gambit Commander in Chess-Presidents Who Played Chess White Knight Review Chess E-Magazine Table of Contents EDITORIAL- ”My Move” 3 contents HISTORY- Turk, Ajeeb, Mephisto: 4 the great chess Automatrons INTERACTIVE CONTENT HISTORY- Computers and Chess 7 ________________ BOOK REVIEW-White King and Red Queen: 13 • Click on title in How the Cold War Was Fought on the Chess board Table of Contents to move directly to page. FEATURE- Chess Themes in Music 14 • Click on “White Knight Review” on FEATURE- Commander-in-Chess 18 the top of each page to return to Presidents Who Played Chess Table of Contents. • Click on red type to BIOGRAPHY- Grandmaster Larry Evans 22 continue to next page HUMOR- Chess Daffinitions 24 • Click on ads to go to their websites NEWS - Chess News around the World 25 • Click on email to open up email program FEATURE-Women and Chess 26 • Click up URLs to go to ANNOTATED GAME - The Jerome Gambit 30 websites. COMMENTARY- “Ask Bill” 31 January/February 2011 White Knight Review January/February 2011 My Move [email protected] editorial -Jerry Wall I was recently sitting in the airport in Puerto Rico waiting on my flight when I noticed a nice family sitting by and saw that the father and son had a miniature chess board set up and were playing a game of chess to pass the time. I quietly watched until the young boy White Knight (perhaps 12) shouted “checkmate” and the father smiled and shook his head.
    [Show full text]
  • Hitech Defeats Denker in AGS Challenge Match
    AI Magazine Volume 10 Number 1 (1989) (© AAAI) HITECH CHESS REPORT Hitech Defeats Denker in AGS Challenge Match Hans Berliner t has been difficult to find a forum third game and lost this match too. In I where the best chess programs can the final game, Hitech played truly play comparable humans with inter- brilliantly, and reacted to a tiny error national titles. In response to this by Denker with an impressive series need, Shelby Lyman, the host of past of moves that ended in a sacrifice Public Broadcasting Station (PBS) which brought home the win. Every- series on world chess championship one seemed impressed with Hitech’s matches, organized the AGS Chal- performance throughout the match, lenge Match at the New School for although it was clear the audience Social Research in New York City. was divided in its sympathies. Denker Funding for this event was provided was generous in his praise of Hitech, by AGS Computers, Inc., a New Jer- saying, “The machine gave me a real sey–based software firm. The match trimming; I am very impressed.” The was held September 22–25, with one final game of the match follows with game played each day, and was widely a few annotations. covered by the international press. Participating were Hitech, at 2407 Game 4 then the highest-rated computer in the world, and International Grand- White: Hitech master Arnold S. Denker, a former Black: A. S. Denker U.S. champion. Denker’s rating of 2410 was comparable to that of WB Hitech. He had recently placed eigh- 1.
    [Show full text]