Building a Champion Level Computer Poker Player
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Most Important Fundamental Rule of Poker Strategy
The Thirty-Third International FLAIRS Conference (FLAIRS-33) Most Important Fundamental Rule of Poker Strategy Sam Ganzfried,1 Max Chiswick 1Ganzfried Research Abstract start playing very quickly, even the best experts spend many years (for some an entire lifetime) learning and improving. Poker is a large complex game of imperfect information, Humans learn and improve in a variety of ways. The most which has been singled out as a major AI challenge prob- obvious are reading books, hiring coaches, poker forums and lem. Recently there has been a series of breakthroughs culmi- nating in agents that have successfully defeated the strongest discussions, and simply playing a lot to practice. Recently human players in two-player no-limit Texas hold ’em. The several popular software tools have been developed, most 1 strongest agents are based on algorithms for approximating notably PioSolver, where players solve for certain situa- Nash equilibrium strategies, which are stored in massive bi- tions, given assumptions for the hands the player and oppo- nary files and unintelligible to humans. A recent line of re- nent can have. This is based on the new concept of endgame search has explored approaches for extrapolating knowledge solving (Ganzfried and Sandholm 2015), where strategies from strong game-theoretic strategies that can be understood are computed for the latter portion of the game given fixed by humans. This would be useful when humans are the ulti- strategies for the trunk (which are input by the user). While mate decision maker and allow humans to make better deci- it has been pointed out that theoretically this approach is sions from massive algorithmically-generated strategies. -
“Algoritmos Para Um Jogador Inteligente De Poker” Autor: Vinícius
Universidade Federal De Santa Catarina Centro Tecnológico Bacharelado em Ciências da Computação “Algoritmos para um jogador inteligente de Poker” Autor: Vinícius Sousa Fazio Florianópolis/SC, 2008 Universidade Federal De Santa Catarina Centro Tecnológico Bacharelado em Ciências da Computação “Algoritmos para um jogador inteligente de Poker” Autor: Vinícius Sousa Fazio Orientador: Mauro Roisenberg Banca: Benjamin Luiz Franklin Banca: João Rosaldo Vollertt Junior Banca: Ricardo Azambuja Silveira Florianópolis/SC, 2008 AGRADECIMENTOS Agradeço a todos que me ajudaram no desenvolvimento deste trabalho, em especial ao professor Mauro Roisenberg e aos colegas de trabalho João Vollertt e Benjamin Luiz Franklin e ao Ricardo Azambuja Silveira pela participação na banca avaliadora. Algoritmos para um jogador inteligente de Poker – Vinícius Sousa Fazio 4 RESUMO Poker é um jogo simples de cartas que envolve aposta, blefe e probabilidade de vencer. O objetivo foi inventar e procurar algoritmos para jogadores artificiais evolutivos e estáticos que jogassem bem poker. O jogador evolutivo criado utiliza aprendizado por reforço, onde o jogo é dividido em uma grande matriz de estados com dados de decisões e recompensas. O jogador toma a decisão que obteve a melhor recompensa média em cada estado. Para comparar a eficiência do jogador, várias disputas com jogadores que tomam decisões baseados em fórmulas simples foram feitas. Diversas disputas foram feitas para comparar a eficiência de cada algoritmo e os resultados estão demonstrados graficamente no trabalho. Palavras-Chave: Aprendizado por Reforço, Inteligência Artificial, Poker; Algoritmos para um jogador inteligente de Poker – Vinícius Sousa Fazio 5 SUMÁRIO 1. Introdução........................................................................................................12 2. Fundamentação Teórica..................................................................................15 2.1. Regras do Poker Texas Hold'em................................................................16 2.1.1. -
Evolutionary Game Theory: ESS, Convergence Stability, and NIS
Evolutionary Ecology Research, 2009, 11: 489–515 Evolutionary game theory: ESS, convergence stability, and NIS Joseph Apaloo1, Joel S. Brown2 and Thomas L. Vincent3 1Department of Mathematics, Statistics and Computer Science, St. Francis Xavier University, Antigonish, Nova Scotia, Canada, 2Department of Biological Sciences, University of Illinois, Chicago, Illinois, USA and 3Department of Aerospace and Mechanical Engineering, University of Arizona, Tucson, Arizona, USA ABSTRACT Question: How are the three main stability concepts from evolutionary game theory – evolutionarily stable strategy (ESS), convergence stability, and neighbourhood invader strategy (NIS) – related to each other? Do they form a basis for the many other definitions proposed in the literature? Mathematical methods: Ecological and evolutionary dynamics of population sizes and heritable strategies respectively, and adaptive and NIS landscapes. Results: Only six of the eight combinations of ESS, convergence stability, and NIS are possible. An ESS that is NIS must also be convergence stable; and a non-ESS, non-NIS cannot be convergence stable. A simple example shows how a single model can easily generate solutions with all six combinations of stability properties and explains in part the proliferation of jargon, terminology, and apparent complexity that has appeared in the literature. A tabulation of most of the evolutionary stability acronyms, definitions, and terminologies is provided for comparison. Key conclusions: The tabulated list of definitions related to evolutionary stability are variants or combinations of the three main stability concepts. Keywords: adaptive landscape, convergence stability, Darwinian dynamics, evolutionary game stabilities, evolutionarily stable strategy, neighbourhood invader strategy, strategy dynamics. INTRODUCTION Evolutionary game theory has and continues to make great strides. -
University of Alberta Library Release
University of Alberta Library Release Form Name of Author: Morgan Hugh Kan Title of Thesis: Postgame Analysis of Poker Decisions Degree: Master of Science Year this Degree Granted: 2007 Permission is hereby granted to the University of Alberta Library to reproduce sin- gle copies of this thesis and to lend or sell such copies for private, scholarly or scientific research purposes only. The author reserves all other publication and other rights in association with the copyright in the thesis, and except as herein before provided, neither the thesis nor any substantial portion thereof may be printed or otherwise reproduced in any material form whatever without the author's prior written permission. Morgan Hugh Kan Date: University of Alberta POSTGAME ANALYSIS OF POKER DECISIONS by Morgan Hugh Kan A thesis submitted to the Faculty of Graduate Studies and Research in partial ful- fillment of the requirements for the degree of Master of Science. Department of Computing Science Edmonton, Alberta Spring 2007 University of Alberta Faculty of Graduate Studies and Research The undersigned certify that they have read, and recommend to the Faculty of Grad- uate Studies and Research for acceptance, a thesis entitled Postgame Analysis of Poker Decisions submitted by Morgan Hugh Kan in partial fulfillment of the re- quirements for the degree of Master of Science. Jonathan Schaeffer Supervisor Michael Bowling Michael Carbonaro External Examiner Date: To my parents, Janet and Chay Kan, and my sister, Megan Kan, I would never have made it this far without them. Abstract In Artificial Intelligence research, evaluation is a recurring theme. A newly crafted game-playing program is interesting if it can be shown to be better by some mea- sure. -
Texas Hold'em Poker
11/5/2018 Rules of Card Games: Texas Hold'em Poker Pagat Poker Poker Rules Poker Variants Home Page > Poker > Variations > Texas Hold'em DE EN Choose your language Texas Hold'em Introduction Players and Cards The Deal and Betting The Showdown Strategy Variations Pineapple ‑ Crazy Pineapple ‑ Crazy Pineapple Hi‑Lo Irish Casino Versions Introduction Texas Hold'em is a shared card poker game. Each player is dealt two private cards and there are five face up shared (or "community") cards on the table that can be used by anyone. In the showdown the winner is the player who can make the best five‑card poker hand from the seven cards available. Since the 1990's, Texas Hold'em has become one of the most popular poker games worldwide. Its spread has been helped firstly by a number of well publicised televised tournaments such as the World Series of Poker and secondly by its success as an online game. For many people nowadays, poker has become synonymous with Texas Hold'em. This page assumes some familiarity with the general rules and terminology of poker. See the poker rules page for an introduction to these, and the poker betting and poker hand ranking pages for further details. Players and Cards From two to ten players can take part. In theory more could play, but the game would become unwieldy. A standard international 52‑card pack is used. The Deal and Betting Texas Hold'em is usually played with no ante, but with blinds. When there are more than two players, the player to dealer's left places a small blind, and the next player to the left a big blind. -
In the Light of Evolution VIII: Darwinian Thinking in the Social Sciences Brian Skyrmsa,1, John C
FROM THE ACADEMY: COLLOQUIUM INTRODUCTION COLLOQUIUM INTRODUCTION In the light of evolution VIII: Darwinian thinking in the social sciences Brian Skyrmsa,1, John C. Aviseb,1, and Francisco J. Ayalab,1 norms are seen as a society’s way of coor- aDepartment of Logic and Philosophy of Science and bDepartment of Ecology and dinating on one of these equilibria. This, Evolutionary Biology, University of California, Irvine, CA 92697 rather than enforcement of out-of-equilibrium behavior, is seen as the function of these Darwinian thinking in the social sciences was Evolutionary game theory was initially norms. Fairness norms, however, may involve inaugurated by Darwin himself in The De- seen by some social scientists as just a way interpersonal comparisons of utility. This is scent of Man and Selection in Relation to to provide a low rationality foundation for true of egalitarian norms, and of the norms Sex high rationality equilibrium concepts. A Nash of proportionality suggested by Aristotle. In (8, 9). Despite various misappropriations ’ of the Darwinian label, true Darwinian think- equilibrium is a rest point of a large class of Binmore s account, the standards for such ing continued in the social sciences. However, adaptive dynamics. If the dynamics converges interpersonal comparisons themselves evolve. toarestpoint,itconvergestoNash.Thismay Komarova(22),inanarticlethatcould with the advent of evolutionary game theory Social Dynamics (10) there has been an explosion of Darwin- nothappeninallgames,butitmayhappen equally well be in ,discusses ian analysis in the social sciences. And it has in large classes of games, depending on the the speed evolution of complex phenotypes in led to reciprocal contributions from the social dynamics in question. -
Proquest Dissertations
Sons of Barabbas or Sons of Descartes? An evolutionary game-theoretic view of psychopathy by Lloyd D. Balbuena A thesis submitted to the Faculty of Graduate Studies and Postdoctoral Affairs in partial fulfilment of the requirements for the degree of Doctor of Philosophy in Cognitive Science Carleton University Ottawa, Ontario ©2010, Lloyd D. Balbuena Library and Archives Bibliotheque et 1*1 Canada Archives Canada Published Heritage Direction du Branch Patrimoine de I'edition 395 Wellington Street 395, rue Wellington OttawaONK1A0N4 OttawaONK1A0N4 Canada Canada Your file Votre reference ISBN: 978-0-494-79631-3 Our file Notre reference ISBN: 978-0-494-79631-3 NOTICE: AVIS: The author has granted a non L'auteur a accorde une licence non exclusive exclusive license allowing Library and permettant a la Bibliotheque et Archives Archives Canada to reproduce, Canada de reproduire, publier, archiver, publish, archive, preserve, conserve, sauvegarder, conserver, transmettre au public communicate to the public by par telecommunication ou par I'lnternet, preter, telecommunication or on the Internet, distribuer et vendre des theses partout dans le loan, distribute and sell theses monde, a des fins commerciales ou autres, sur worldwide, for commercial or non support microforme, papier, electronique et/ou commercial purposes, in microform, autres formats. paper, electronic and/or any other formats. The author retains copyright L'auteur conserve la propriete du droit d'auteur ownership and moral rights in this et des droits moraux qui protege cette these. Ni thesis. Neither the thesis nor la these ni des extraits substantiels de celle-ci substantial extracts from it may be ne doivent etre imprimes ou autrement printed or otherwise reproduced reproduits sans son autorisation. -
Pokerface: Emotion Based Game-Play Techniques for Computer Poker Players
University of Kentucky UKnowledge University of Kentucky Master's Theses Graduate School 2004 POKERFACE: EMOTION BASED GAME-PLAY TECHNIQUES FOR COMPUTER POKER PLAYERS Lucas Cockerham University of Kentucky, [email protected] Right click to open a feedback form in a new tab to let us know how this document benefits ou.y Recommended Citation Cockerham, Lucas, "POKERFACE: EMOTION BASED GAME-PLAY TECHNIQUES FOR COMPUTER POKER PLAYERS" (2004). University of Kentucky Master's Theses. 224. https://uknowledge.uky.edu/gradschool_theses/224 This Thesis is brought to you for free and open access by the Graduate School at UKnowledge. It has been accepted for inclusion in University of Kentucky Master's Theses by an authorized administrator of UKnowledge. For more information, please contact [email protected]. ABSTRACT OF THESIS POKERFACE: EMOTION BASED GAME-PLAY TECHNIQUES FOR COMPUTER POKER PLAYERS Numerous algorithms/methods exist for creating computer poker players. This thesis compares and contrasts them. A set of poker agents for the system PokerFace are then introduced. A survey of the problem of facial expression recognition is included in the hopes it may be used to build a better computer poker player. KEYWORDS: Poker, Neural Networks, Arti¯cial Intelligence, Emotion Recognition, Facial Action Coding System Lucas Cockerham 7/30/2004 POKERFACE: EMOTION BASED GAME-PLAY TECHNIQUES FOR COMPUTER POKER PLAYERS By Lucas Cockerham Dr. Judy Goldsmith Dr. Grzegorz Wasilkowski 7/30/2004 RULES FOR THE USE OF THESES Unpublished theses submitted for the Master's degree and deposited in the University of Kentucky Library are as a rule open for inspection, but are to be used only with due regard to the rights of the authors. -
On the Behavior of Strategies in Hawk-Dove Game
Journal of Game Theory 2016, 5(1): 9-15 DOI: 10.5923/j.jgt.20160501.02 On the Behavior of Strategies in Hawk-Dove Game Essam EL-Seidy Department of Mathematics, Faculty of Science, Ain Shams University, Cairo, Egypt Abstract The emergence of cooperative behavior in human and animal societies is one of the fundamental problems in biology and social sciences. In this article, we study the evolution of cooperative behavior in the hawk dove game. There are some mechanisms like kin selection, group selection, direct and indirect reciprocity can evolve the cooperation when it works alone. Here we combine two mechanisms together in one population. The transformed matrices for each combination are determined. Some properties of cooperation like risk-dominant (RD) and advantageous (AD) are studied. The property of evolutionary stable (ESS) for strategies used in this article is discussed. Keywords Hawk-dove game, Evolutionary game dynamics, of evolutionary stable strategies (ESS), Kin selection, Direct and Indirect reciprocity, Group selection difference between both games has a rather profound effect 1. Introduction on the success of both strategies. In particular, whilst by the Prisoner’s Dilemma spatial structure often facilitates Game theory provides a quantitative framework for cooperation this is often not the case in the hawk–dove game analyzing the behavior of rational players. The theory of ([14]). iterated games in particular provides a systematic framework The Hawk-Dove game ([18]), also known as the snowdrift to explore the players' relationship in a long-term. It has been game or the chicken game, is used to study a variety of topics, an important tool in the behavioral and biological sciences from the evolution of cooperation to nuclear brinkmanship and it has been often invoked by economists, political ([8], [22]). -
SCIENCE VOLUME 18, No 2, FALL 2007
SCIENCE VOLUME 18, No 2, FALL 2007 FACULTY OF SCIENCE ALUMNI MAGAZINE contourswww.ualberta.ca/science UNIVERSITY Reading CREATES our body’s SPACE chemicals for INSTITUTE better diagnosis Honouring Mama ALUMNUS TURNS Lu’s spirit of GRIEF INTO HOPE giving Fine arts ARCTIC PONDS students liven DRYING UP up lab space through games SCIENCEcontours Science Contours is published twice MESSAGE FROM THE DEAN a year by the faculty of Science to provide current information on its many activities. The magazine is distributed to alumni and friends of the Faculty of An Intersection Between Arts and Science Science. Phase 1 of our new Dean of Science Centennial Centre for Gregory Taylor Interdisciplinary Sci- ence (CCIS) was near- Assistant Dean, External Relations Claudia Wood ing completion when I happened upon a stu- External Relations Team dent art competition on Katherine Captain, Emily Lennstrom, campus. It occurred to Michel Proulx, Traci Toshack, me then that it would be Kevin Websdale wonderful to have stu- Fine arts students Editor dents help us put some Wendy Lung and Ansun Michel Proulx finishing touches on our Yan enjoy their work new space. Graphic Design After a few conversations, two different classes - one in Inte- Studio X Design grative/Exhibit Design and the other in Printmaking - were tasked with coming up with ideas. The results are astounding. Contributing writers To begin with, the students had to wrap their heads around the Bev Betkowski, Kris Connor, Michel purpose and uniqueness of the building. CCIS is one of the few Proulx, Isabela Varela facilities in the world to house interdisciplinary teams under one roof and Phase 1 is no different. -
Bargaining with Neighbors: Is Justice Contagious? (1999)
Journal of Philosophy, Inc. Bargaining with Neighbors: Is Justice Contagious? Author(s): Jason Alexander and Brian Skyrms Source: The Journal of Philosophy, Vol. 96, No. 11 (Nov., 1999), pp. 588-598 Published by: Journal of Philosophy, Inc. Stable URL: http://www.jstor.org/stable/2564625 Accessed: 12-05-2016 19:03 UTC Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at http://about.jstor.org/terms JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. Journal of Philosophy, Inc. is collaborating with JSTOR to digitize, preserve and extend access to The Journal of Philosophy This content downloaded from 128.91.58.254 on Thu, 12 May 2016 19:03:20 UTC All use subject to http://about.jstor.org/terms 588 THEJOURNAL OF PHILOSOPHY BARGAINING WITH NEIGHBORS: IS JUSTICE CONTAGIOUS ? What is justice? The question is llarder to answer in some cases than in others. We focus on the easiest case of dis- tributive justice. Two individuals are to decide how to dis- tribute a windfall of a certain amount of money. Neither is especially entitled, or especially needy, or especially anything their positions are entirely symmetric. Their utilities derived from the dis- tribution may be taken, for all intents and purposes, simply as the amount of money received. -
Early Round Bluffing in Poker Author(S): California Jack Cassidy Source: the American Mathematical Monthly, Vol
Early Round Bluffing in Poker Author(s): California Jack Cassidy Source: The American Mathematical Monthly, Vol. 122, No. 8 (October 2015), pp. 726-744 Published by: Mathematical Association of America Stable URL: http://www.jstor.org/stable/10.4169/amer.math.monthly.122.8.726 Accessed: 23-12-2015 19:20 UTC Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at http://www.jstor.org/page/ info/about/policies/terms.jsp JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. Mathematical Association of America is collaborating with JSTOR to digitize, preserve and extend access to The American Mathematical Monthly. http://www.jstor.org This content downloaded from 128.32.135.128 on Wed, 23 Dec 2015 19:20:53 UTC All use subject to JSTOR Terms and Conditions Early Round Bluffing in Poker California Jack Cassidy Abstract. Using a simplified form of the Von Neumann and Morgenstern poker calculations, the author explores the effect of hand volatility on bluffing strategy, and shows that one should never bluff in the first round of Texas Hold’Em. 1. INTRODUCTION. The phrase “the mathematics of bluffing” often brings a puzzled response from nonmathematicians. “Isn’t that an oxymoron? Bluffing is psy- chological,” they might say, or, “Bluffing doesn’t work in online poker.