Heuristic Evaluation Functions for General Game Playing

Total Page:16

File Type:pdf, Size:1020Kb

Heuristic Evaluation Functions for General Game Playing University of California Los Angeles Heuristic Evaluation Functions for General Game Playing A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Computer Science by James Edmond Clune III 2008 c Copyright by James Edmond Clune III 2008 The dissertation of James Edmond Clune III is approved. Adnan Y. Darwiche Thomas S. Ferguson Alan C. Kay Todd D. Millstein Richard E. Korf, Committee Chair University of California, Los Angeles 2008 ii Table of Contents 1 Introduction ................................ 1 1.1 General Game Playing: The Problem . 1 1.2 The Importance of General Game Playing . 2 1.2.1 Competitive Performance Metric . 2 1.2.2 Flexible Software Capabilities . 3 1.2.3 Game-Oriented Programming . 3 1.2.4 Generality as a Key Aspect of Intelligence . 4 1.2.5 A Timely Problem . 4 1.3 Overview of the Project . 5 1.4 Overview of the Dissertation . 6 2 Philosophy and Vision ......................... 8 2.1 The Ubiquity of Game Models . 8 2.2 Game-Oriented Programming . 10 3 Literature Review ............................ 13 3.1 General Game Playing . 13 3.1.1 Barney Pell’s Metagamer . 13 3.1.2 AAAI General Game Playing Competition . 14 3.1.3 Other General Game Playing Systems . 15 3.2 Automated Planning . 16 3.2.1 Classical Planning . 16 iii 3.2.2 International Planning Competition . 17 3.2.3 Planning Techniques . 18 3.3 Discovery Systems . 20 3.3.1 Feature Discovery . 20 3.3.2 AM and Eurisko . 21 3.3.3 Learning Heuristic Evaluation Functions . 23 4 General Game Playing Framework .................. 25 4.1 Game Description Language (GDL) . 25 4.2 GGP Protocol . 28 4.3 AAAI GGP Competition . 29 5 Abstract-Model Based Heuristic Evaluation Functions ..... 31 5.1 Overview . 31 5.2 Feature Identification . 32 5.2.1 Candidate Expressions . 33 5.2.2 Expression Interpretations . 34 5.2.3 Stability . 35 5.3 Abstract Model . 37 5.3.1 Payoff . 37 5.3.2 Mobility . 38 5.3.3 Termination . 39 5.4 Heuristic Evaluation Function . 40 5.5 Anytime Algorithm . 42 iv 5.6 Use of Evaluation Function in Game-Play . 42 5.7 Evaluation Function Results . 43 5.7.1 Racetrack Corridor . 44 5.7.2 Othello . 45 5.7.3 Chess . 46 5.7.4 Chinese Checkers . 47 6 Techniques Specific to Single-Player Games ............ 49 6.1 Motivation and Overview . 49 6.2 Heuristic Evaluation Function Construction . 50 6.3 Search Algorithms . 52 6.3.1 Uninformed Search . 53 6.3.2 Informed Search . 55 6.4 Algorithm Composition . 56 6.5 Summary . 57 7 Rollout-Based Monte Carlo Methods ................ 58 7.1 Introduction . 58 7.2 Use of Heuristic Evaluation Functions . 60 7.3 Use of Action Heuristics . 60 7.4 Automatic Construction of Action Heuristics . 61 8 Alpha-Beta Minimax versus Monte Carlo Methods ....... 64 8.1 Overview . 64 v 8.2 Randomly Generated Synthetic Games . 66 8.3 Experiments . 68 8.4 Results . 68 8.5 Real Games . 72 9 Empirical Results: AAAI GGP Competitions ........... 78 9.1 First Annual GGP Competition . 78 9.2 Second Annual GGP Competition . 80 9.3 Third Annual GGP Competition . 82 9.4 Summary . 84 10 Discussion and Conclusions ...................... 85 10.1 Summary . 85 10.2 Discussion and Future Work . 86 10.3 Conclusion . 87 A Interpreting Heuristic Evaluation Functions ............ 88 A.1 Chess . 89 A.2 Chinese Checkers . 90 A.3 Othello . 92 B Engineering Considerations ...................... 94 B.1 Reasoning Module . 94 B.2 Multi-Processor Utilization . 95 vi References ................................... 97 vii List of Figures 4.1 A State in Tic-Tac-Toe . 26 5.1 Overall Evaluation Function . 41 5.2 Racetrack Corridor (initial position) . 44 5.3 Racetrack Corridor (with some walls placed) . 44 5.4 Chinese Checkers . 48 6.1 Flow-Chart for Solving Single-Player Problems . 51 8.1 Two-Player Games from AAAI Tournaments . 76 8.2 Results by Branching Factor . 77 viii List of Tables 5.1 Abstract Model Parameters . 37 8.1 Zero-Sum Games . 69 8.2 Varying Time Per Move . 70 8.3 Nonzero-Sum Games . 71 8.4 Games from AAAI Tournaments . 73 9.1 AAAI 2006 GGP Competition Leaderboard . 79 9.2 AAAI 2007 GGP Competition: Results of Preliminary Rounds . 82 ix Acknowledgments I am indebted to my adviser, Rich Korf, for his advice and encouragement. He was supportive from the project’s conception and remained so throughout. Many technical discussions with Rich contributed to this work by improving my under- standing of central issues. He read multiple writeups and provided insightful comments and constructive criticism. He steered me in helpful directions, yet gave me freedom to pursue my own ideas. He also sponsored me as a research assistant, which helped me finish this dissertation. I would like to thank Michael Genesereth for sponsoring the AAAI general game playing competition, without which this work would not have been possible. Also, thanks to the Stanford General Game Playing group including Nat Love, Eric Schkufza, David Haley, and Tim Hinrichs. I’d like to also thank the other competitors in the AAAI general game playing tournaments for open discussions amidst friendly competition. Thanks to Alan Kay for encouraging discussions, especially in helping broaden my vision beyond “games” in the traditional sense. The philosophical issues ex- plored in Chapter 2, particularly the view of game-oriented programming lan- guages, came largely from interactions with Alan. I’m grateful to the past and present UCLA AI grads who have helped me through numerous discussions and proofreading, particularly Alex Dow and Alex Fukunaga. Thanks to Barney Pell for an encouraging and helpful discussion early in the project. x Abstract of the Dissertation Heuristic Evaluation Functions for General Game Playing by James Edmond Clune III Doctor of Philosophy in Computer Science University of California, Los Angeles, 2008 Professor Richard E. Korf, Chair A general game-playing program plays games that it has not previously encoun- tered. A game manager program sends the game-playing programs a description of a game’s rules and objectives in a game description language. The game- playing programs compete by sending messages over a network indicating their moves until the game is completed. The class of games covered is intentionally broad, including games of one or more players with alternating or simultaneous moves, with arbitrary numeric payoffs. This research explores the problem of constructing an effective general game- playing program, with an emphasis on techniques for automatically constructing effective heuristic evaluation functions from game descriptions. A technique based on abstract models of games is presented. The abstract model treats mobility, payoff and termination as the most salient elements of a game. Each of these aspects are quantified in terms of stable features. Evidence is presented that the technique produces heuristic evaluation functions that are both comprehensible and effective. Empirical work includes a series of general game-playing programs that placed xi first or second for the three consecutive years of the AAAI General Game-Playing Competition. xii CHAPTER 1 Introduction 1.1 General Game Playing: The Problem The idea of general game playing (GGP) is to create a computer program that effectively plays games that it has not previously encountered. A game manager program sends the game playing programs a description of a game in a well- defined game description language. The description specifies the goal of the game, the legal moves, the initial game state, and the termination conditions. The game manager also sends information about what role the program will play (black or white, naughts or crosses, etc), a start time (time allowed for pre-game analysis), and a move time (time allowed per move once game play begins). The game playing programs compete by sending messages over a network indicating their moves until the game is completed. The class of games covered is intentionally broad, including games of one or more players with alternating or simultaneous moves, with arbitrary numeric payoffs. The immediate goal of the research is to develop techniques that allow us to create GGP programs that win games. The techniques are implemented and their effectiveness is evaluated empirically by competing against programs embodying alternative techniques. My work on general game playing techniques emphasizes heuristic evaluation functions. These are functions from game states to numbers used to assess the de- sirability of non-terminal states for particular players. In my opinion, automatic 1 construction of effective heuristic evaluation functions from game descriptions is the central challenge of general game playing. However, it is a subject that is best dealt with in the context of a complete general game player employing some type of game-tree search. In recognition of this, search techniques are covered in this dissertation as well. 1.2 The Importance of General Game Playing Here I describe several reasons why GGP is an important research problem. 1.2.1 Competitive Performance Metric In his doctoral dissertation [Pel93], Barney Pell argues that a primary reason for computer game playing research has been the competitive performance metric for intelligence, namely the presumed link between winning games and intelligent behavior. Pell describes a problem with this presumption: Unfortunately, the use of such a link has proved problematic: we have been able to produce strong programs for some games through specialized engineering methods, the extreme case being special-purpose hardware, and through analysis of the games by humans instead of by programs themselves. Consequently, it now appears that increased understanding and automation of intelligent processing is neither nec- essary nor sufficient for strong performance in game-playing. That is, it appears that we can construct strong game-playing programs with- out doing much of interest from an AI perspective, and conversely, we can make significant advances in AI that do not result in strong game-playing programs.
Recommended publications
  • AAAI 2008 Workshop Reports
    University of Pennsylvania ScholarlyCommons Technical Reports (CIS) Department of Computer & Information Science 5-2009 AAAI 2008 Workshop Reports Mark Dredze University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/cis_reports Recommended Citation Mark Dredze, "AAAI 2008 Workshop Reports", . May 2009. Sarabjot Singh Anand, Razvan Bunescu, Vitor Carvalho, Jan Chomicki, Vincent Conitzer, Michael T Cox, Virginia Dignum, Zachary Dodds, Mark Dredze, David Furcy, Evgeniy Gabrilovich, Mehmet H Göker, Hans Guesgen, Haym Hirsh, Dietmar Jannach, Ulrich Junker. (2009, April). AAAI 2008 Workshop Reports. AI Magazine, 30(1), 108-118. Copyright AAAI 2009. The copies do not imply AAAI endorsement of a product or a service of the employer, and that the copies are not for sale. Publisher URL: http://proquest.umi.com/ pqdlink?did=1680350011&sid=1&Fmt=2&clientId=3748&RQT=309&VName=PQD This paper is posted at ScholarlyCommons. https://repository.upenn.edu/cis_reports/904 For more information, please contact [email protected]. AAAI 2008 Workshop Reports Abstract AAAI was pleased to present the AAAI-08 Workshop Program, held Sunday and Monday, July 13-14, in Chicago, Illinois, USA. The program included the following 15 workshops: Advancements in POMDP Solvers; AI Education Workshop Colloquium; Coordination, Organizations, Institutions, and Norms in Agent Systems, Enhanced Messaging; Human Implications of Human-Robot Interaction; Intelligent Techniques for Web Personalization and Recommender Systems; Metareasoning: Thinking about Thinking; Multidisciplinary Workshop on Advances in Preference Handling; Search in Artificial Intelligence and Robotics; Spatial and Temporal Reasoning; Trading Agent Design and Analysis; Transfer Learning for Complex Tasks; What Went Wrong and Why: Lessons from AI Research and Applications; and Wikipedia and Artificial Intelligence: An vE olving Synergy.
    [Show full text]
  • The Founder of the Creative Destruction Lab Describes Its Moonshot Mission to Create a Canadian AI Ecosystem
    The founder of the Creative Destruction Lab describes its moonshot mission to create a Canadian AI ecosystem. Thought Leader Interview: by Karen Christensen Describe what happens at the Creative Destruction Lab by companies that went through the Lab. When we finished our (CDL). fifth year in June 2017, we had exceeded $1.4 billion in equity The CDL is a seed-stage program for massively scalable science- value created. based companies. Some start-ups come from the University of Toronto community, but we now also receive applications from What exactly does the Lab provide to entrepreneurs? Europe, the U.S. (including Silicon Valley), Israel and Asia. Start-up founders benefit from a structured, objectives-oriented We launched the program in September 2012, and each process that increases their probability of success. The process is autumn since, we’ve admitted a new cohort of start-ups into orchestrated by the CDL team, while CDL Fellows and Associ- the program. Most companies that we admit have developed a ates generate the objectives. Objective-setting is a cornerstone working prototype or proof of concept. The most common type of the process. Every eight weeks the Fellows and Associates set of founder is a recently graduated PhD in Engineering or Com- three objectives for the start-ups to achieve, at the exclusion of puter Science who has spent several years working on a problem everything else. In other words, they define clear goals for an and has invented something at the frontier of their field. eight week ‘sprint’. Objectives can be business, technnology or The program does not guarantee financing, but the majority HR-oriented.
    [Show full text]
  • A Strategic Metagame Player for General Chesslike Games
    From: AAAI Technical Report FS-93-02. Compilation copyright © 1993, AAAI (www.aaai.org). All rights reserved. A Strategic Metagame Player for General Chess-Like Games Barney Pell* Computer Laboratory University of Cambridge Cambridge, CB2 3QG, UK b dp((~cl c am. ac Abstract game will transfer also to new games. Such criteria in- clude the use of learning and planning, and the ability This paper reviews the concept of Metagameand to play more than one game. discusses the implelnentation of METAGAMER,a program which plays Metagame in the class of 1.1 Bias when using Existing Games symmetric chess-like games, which includes chess, However,even this generality-oriented research is sub- Chinese-chess, checkers, draughts, and Shogi. The ject to a potential methodological bias. As the human l)rogram takes as input the rules of a specific game researchers know at the time of program-development and analyses those rules to construct for that game which specific game or games the program will be tested an efficient representation and an evaluation func- on, it is possible that they import the results of their tion, both for use with a generic search engine. The own understanding of the game dh’ectly into their pro- strategic analysis performed by the program relates gn’am. In this case, it becomes difficult to determine a set of general knowledgesources to the details of whether the subsequent performance of the program the particular game. Amongother properties, this is due to the general theory it implements, or merely analysis determines the relative value of the differ- to the insightful observations of its developer about the eat pieces in a given game.
    [Show full text]
  • Lukas Biewald
    Lukas Biewald 823 Capp St., San Francisco, CA 94110 (650) 804 5470 lukas@crowdflower.com Awards Netexplorateur of the Year http://www.netexplorateur.org/ (2010) TechCrunch 50 http://techcrunch50.com (2009) First Place, Caltech Turing Tournament http://turing.ssel.caltech.edu Work Founder, CEO: CrowdFlower 2008 — Present • Built leading Labor-on-Demand company from the ground up. Designed and coded core technology. Built customer base up from nothing to 100 customers including several Fortune-500 companies. Hired 25 employees. • Raised two rounds of financing totalling $6.2 million in 2009 in tough economic climate from well-known angels and VCs including Trinity, Bessemer, Founders´ Fund, Auren Hoffman, Aydin Senkut, Manu Kumar, Gary Kremen and Travis Kalanick. Senior Scientist: Powerset 2007 — 2008 • As the 20th employee, created and led two core teams: metrics and search ranking. • Developed the Amazon Mechanical Turk interface, identified as a key technology in Microsoft’s $100 million acquisition. Relevance Engineer, Engineering Manager: Yahoo! Inc. 2005 — 2007 • Technical lead for research, development and deployment of new web search ranking functions in five major markets. • Chosen as most productive employee in Applied Science business unit year end review. Research Assistant: Stanford AI Lab (sail.stanford.edu) 2003 — 2004 • Researched word-sense disambiguation, machine translation and Japanese word segmentation. • Published two papers with over 100 citations. Education Stanford University 1998 — 2004 BS in Mathematical and Computational Science with Honors (2003) MS in Computer Science with Concentration in Artificial Intelligence (2004) Publications David Vickrey, Lukas Biewald, Marc Teyssier, and Daphne Koller Word-Sense Disambiguation for Machine Translation. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2005.
    [Show full text]
  • Autonomous Horizons: the Way Forward Is a Product of the Office Air University Press 600 Chennault Circle, Bldg 1405 of the US Air Force Chief Scientist (AF/ST)
    Autonomous Horizons The Way Forward A vision for Air Force senior leaders of the potential for autonomous systems, and a general framework for the science and technology community to advance the state of the art Dr. Greg L. Zacharias Chief Scientist of the United States Air Force 2015–2018 The second volume in a series introduced by: Autonomous Horizons: Autonomy in the Air Force – A Path to the Future, Volume 1: Human Autonomy Teaming (AF/ST TR 15-01) March 2019 Air University Press Curtis E. LeMay Center for Doctrine Development and Education Maxwell AFB, Alabama Chief of Staff, US Air Force Library of Congress Cataloging-in-Publication Data Gen David L. Goldfein Names: Zacharias, Greg, author. | Air University (U.S.). Press, publisher. Commander, Air Education and Training | United States. Department of Defense. United States Air Force. Command Title: Autonomous horizons : the way forward / by Dr. Greg L. Zacha- Lt Gen Steven L. Kwast rias. Description: First edition. | Maxwell Air Force Base, AL : AU Press, 2019. “Chief Scientist for the United States Air Force.” | Commander and President, Air University Lt Gen Anthony J. Cotton “January 2019.” |Includes bibliographical references. Identifiers: LCCN 2018061682 | ISBN 9781585662876 Commander, Curtis E. LeMay Center for Subjects: LCSH: Aeronautics, Military—Research—United States. | Doctrine Development and Education United States. Air Force—Automation. | Artificial intelligence— Maj Gen Michael D. Rothstein Military applications—United States. | Intelligent control systems. | Autonomic
    [Show full text]
  • The Semantic
    The Fate of the Semantic Web Technology experts and stakeholders who participated in a recent survey believe online information will continue to be organized and made accessible in smarter and more useful ways in coming years, but there is stark dispute about whether the improvements will match the visionary ideals of those who are working to build the semantic web. Janna Quitney Anderson, Elon University Lee Rainie, Pew Research Center’s Internet & American Life Project May 4, 2010 Pew Research Center’s Internet & American Life Project An initiative of the Pew Research Center 1615 L St., NW – Suite 700 Washington, D.C. 20036 202‐419‐4500 | pewInternet.org This publication is part of a Pew Research Center series that captures people’s expectations for the future of the Internet, in the process presenting a snapshot of current attitudes. Find out more at: http://www.pewInternet.org/topics/Future‐of‐the‐Internet.aspx and http://www.imaginingtheinternet.org. 1 Overview Sir Tim Berners‐Lee, the inventor of the World Wide Web, has worked along with many others in the Internet community for more than a decade to achieve his next big dream: the semantic web. His vision is a web that allows software agents to carry out sophisticated tasks for users, making meaningful connections between bits of information so “computers can perform more of the tedious work involved in finding, combining, and acting upon information on the web.”1 The concept of the semantic web has been fluid and evolving and never quite found a concrete expression and easily‐understood application that could be grasped readily by ordinary Internet users.
    [Show full text]
  • Will Impact Higher Education WEB
    Foroughi Web 3.0: How “the Internet of Everything” will impact Higher Education WEB 3.0: HOW THE “INTERNET OF EVERYTHING” WILL IMPACT HIGHER EDUCATION Abbas Foroughi College of Business, University of Southern Indiana [email protected] 812-465-1667 ABSTRACT Universities are finding that adopting Web 2.0 is necessary to remain competitive, while, at the same time, the evolution to Web 3.0 is on the horizon. This paper discusses Web 3.0, how it builds on Web 2.0, examples of current applications of Web 3.0, the potential benefits for higher education and lifelong learning which experts predict from Web 3.0 and concerns that have also been voiced. INTRODUCTION Over the past two decades, many institutions of higher learning and their students have reaped benefits from using Web 2.0 tools such as Wikis, Facebook, blogs, tagging, LinkedIn, virtual reality, social bookmarking, mashing, rss, podcasts, folksonomies, ePortfolios, chatrooms, and similar technologies, which facilitate communication, give participants a feeling of group membership, and are user-friendly. Specific benefits include: learning-related benefits: (Anderson, 2007; Alexander, 2006; Wesch, 2009; Reuben, 2008; McDonald, 2009; Brainard, 2007; Thompson, 2008; Minocha, 2009), such as facilitation of collaborative learning, development of independent learning skills, problem solving, team work, reflective learning, quick/early feedback from instructors, overcoming isolation of geographical distances, peer-to- peer support/feedback, visibility of students’ work, integration of multimedia assets, and the creation of informal relations between educators and students. Universities are finding that adopting Web 2.0 is necessary to remain competitive, while, at the same time, the evolution to Web 3.0 is on the horizon.
    [Show full text]
  • Lorien Pratt, Ph.D. Curriculum Vita
    Lorien Pratt, Ph.D. Curriculum Vita Sunnyvale, CA USA +1.303.589.7476 [email protected] Blog: www.lorienpratt.com SUMMARY Professional machine learning and decision intelligence consultant, developer, entrepreneur, writer, evangelist. Author of Link: How Decision Intelligence Connects Data, Actions, and Outcomes for a Better World. Co-organizer of the annual Responsible AI/DI Summit. Two TEDx talks. Inventor of machine learning inductive transfer. Co-Founder and Chief Scientist of Quantellia, LLC. Co-inventor and evangelist for Decision Intelligence. Over three decades delivering applied machine learning solutions, including for the Human Genome Project, the US Department of Energy, budget analysis for a $100 million annual spend for the Administrative Office (AO) of the US Courts, award-winning forensic analysis for the Colorado Bureau of Investigation, RBS, DARPA, and many more. EDUCATION Ph.D., Computer Science, Rutgers University, 5/93. Thesis: Transferring previously learned back-propagation neural networks to new learning tasks. M.S., Computer Science, Rutgers University, 5/88. A.B., Computer Science, Dartmouth College, 5/83. CONSULTING A variety of engagements including the following: Regular invited speaker to user groups, executive roundtables, keynotes, webinars, and workshops for organizations and companies like SAP, HP, Unisys, the TM Forum, Comptel, SAS. Machine learning consulting on a wide variety of projects for cyber security, web advertising, financial prediction, medical decision making, cable company capital expenditure, LIDAR image analysis, and more. Five-star rated consultant on the Experfy platform (www.experfy.com), from the Harvard Innovation Laboratory. Strategic consulting in decision intelligence, analytics, and telecom OSS/BSS to companies and organizations like the US Department of Defense, Solvina, ConceptWave, the Colorado Bureau of Investigation (CBI), Sun, IBM, Comptel, SAP, HP, Amdocs, Telcordia (now Ericsson).
    [Show full text]
  • Knowledge-Based General Game Playing
    Knowledge-Based General Game Playing Dissertation zur Erlangung des akademischen Grades Doktor rerum naturalium (Dr. rer. nat.) vorgelegt an der Technischen Universitat¨ Dresden Fakultat¨ Informatik eingereicht von Dipl.-Inf. Stephan Schiffel geboren am 15. 09. 1980 in Sebnitz Gutachter: Prof. Michael Thielscher, University of New South Wales Prof. Yngvi Bj¨ornsson, H´ask´olinn´ıReykjav´ık Datum der Verteidigung: 29.07.2011 Abstract General Game Playing (GGP) is concerned with the development of systems that can play well an arbitrary game solely by being given the rules of the game. This problem is considerably harder than traditional artificial intelligence (AI) game playing. Writing a player for a particular game allows to focus on the design of elaborate strategies and libraries that are specific to this game. Systems able to play arbitrary, previously unknown games cannot be given game-specific knowledge. They rather need to be endowed with high-level cognitive abilities such as general strategic thinking and abstract reasoning. This makes General Game Playing a good example of a challenge problem, which encompasses a variety of AI research areas including knowledge representation and reasoning, heuristic search, planning, and learning. Two fundamentally different approaches to GGP exist today, we call them knowledge-free and knowledge-based approach. The knowledge-free approach considers the game rules as a black box used for generating legal moves and successor states of the game. Players using this approach run Monte-Carlo simulations of the game to estimate the value of states of the game. On the other hand, the knowledge-based approach considers the components of a state (e.
    [Show full text]
  • Ary, a General Game Playing Program
    Ary, a general game playing program Jean M´ehat,Tristan Cazenave Abstract We present our program Ary that won the 2009 and 2010 General Game Playing competition. In General Game Playing (GGP), players have to play games they never encountered before. The rules are trans- mitted as a description in a specific language and the player have to au- tomatically analyse the rules and select a method of playing. 1 Introduction The field of program playing games is an extensively explored field of computer science and more particularly of artificial intelligence as it is a domain where common methods of reasonning are to be applied, while offering a closed world, completely described by the game rules. Moreover the quality of the play is easy to evaluate quantitatively by observing the matches outcomes, and a human expertise has been developped. Great successes have been obtained: in many games, including chess, programs routinely outplay even the best human player. The program usually embed a important body of knowledge that is specific of the game they play. This knowledge is used by the designer beforehand and limit somewhat the generality of the program. While a program like Deep Blue is able to play well chess, it can not play a match of Checker, or even Tic Tac Toe: while an expert in its domain, the playing program is limited to one game in its abilities, and these are not easily extended to other domains or even to other games. The Logic Group at the university of Stanford addresses this limitation with GGP. In a GGP match, the players receive the rules of the game they have to play in a specific langage called Game Description Language from a Game Master.
    [Show full text]
  • Barney Pell WSDM 2017 Conference Handout
    Crucial Questions for Applying AI in Startups Barney Pell, Ph.D. Founder: Powerset, Moon Express, LocoMobi Associate Founder: Singularity University Fellow: Creative Disruption Lab Advisor: CrowdFlower, Uber/Otto, LinkedIn, … [email protected] @barneyp My background • I'm a long-time technologist, entrepreneur, investor and advisor • Specialty and passion for applications of Artificial Intelligence (AI) • SRI • NASA • Powerset • Microsoft (Bing, Cortana) • Singularity University • Creative Disruption Lab • Participated in hundreds of AI application efforts! Challenges for Startups Applying AI • AI technology is advancing rapidly • But: Applying AI successfully is hard, with many issues leading to failure • Art of the application is harder than the technology, and less understood • Needs of the application often drive (new) requirements on technology • Opportunity: • Ask the key questions early on can dramatically increase chances of success. • May also lead to different design and approach AI Goldilocks Zone Our opportunity Customers need We can do AI Goldilocks Zone Alternatives Others can do Customers need We can do AI Goldilocks Zone Too easy Others can do Customers need Just right We can do Too hard Goldilocks Zone Questions • Who are the target customers? Which is our best market segment? • Which of their problems or needs are An AI Start-up should the most important ones we are offer a solution to planning to address? real customer needs that actually works • How well do our capabilities align and is much better with the core
    [Show full text]
  • Design of the Remote Agent Experiment for Spacecraft Autonomy 1 3 3 1 3 3 Douglas E
    Design of the Remote Agent Experiment for Spacecraft Autonomy 1 3 3 1 3 3 Douglas E. Bernard , Gregory A. Dorais , Chuck Fry , Edward B. Gamble Jr. , Bob Kanefsky , James Kurien , 3 2 4 4 3 1 William Millar , Nicola Muscettola , P. Pandurang Nayak , Barney Pell , Kanna Rajan , Nicolas Rouquette , 1 5 Benjamin Smith , Brian C. Williams 1 2, 3, 4, 5 Jet Propulsion Laboratory, NASA Ames Research Center, MS 269-2 California Institute of Technology Moffett Field, CA 94035 2 4800 Oak Grove Drive Recom Technologies @ Ames 3 Pasadena, CA 91109 Caelum Research @ Ames 4 818-354-2597 RIACS @ Ames [email protected], 650-604-4756 [email protected] exploration. Where human exploration is desired, robotic ABSTRACT—This paper describes the Remote Agent flight precursors can help identify and map candidate landing experiment for spacecraft commanding and control. In the sites, find resources, and demonstrate experimental Remote Agent approach, the operational rules and technologies. constraints are encoded in the flight software. The software may be considered to be an autonomous “remote agent” of Current spacecraft control technology relies heavily on a the spacecraft operators in the sense that the operators rely relatively large and highly skilled mission operations team on the agent to achieve particular goals. that generates detailed time-ordered sequences of commands or macros to step the spacecraft through each The experiment will be executed during the flight of desired activity. Each sequence is carefully constructed in NASA’s Deep Space One technology validation mission. such a way as to ensure that all known operational During the experiment, the spacecraft will not be given the constraints are satisfied.
    [Show full text]