Two Player Zero Sum Multi-Stage Game Analysis Using Coevolutionary Algorithm

Total Page:16

File Type:pdf, Size:1020Kb

Two Player Zero Sum Multi-Stage Game Analysis Using Coevolutionary Algorithm Wright State University CORE Scholar Browse all Theses and Dissertations Theses and Dissertations 2019 Two Player Zero Sum Multi-Stage Game Analysis Using Coevolutionary Algorithm Sumedh Sopan Nagrale Wright State University Follow this and additional works at: https://corescholar.libraries.wright.edu/etd_all Part of the Electrical and Computer Engineering Commons Repository Citation Nagrale, Sumedh Sopan, "Two Player Zero Sum Multi-Stage Game Analysis Using Coevolutionary Algorithm" (2019). Browse all Theses and Dissertations. 2175. https://corescholar.libraries.wright.edu/etd_all/2175 This Thesis is brought to you for free and open access by the Theses and Dissertations at CORE Scholar. It has been accepted for inclusion in Browse all Theses and Dissertations by an authorized administrator of CORE Scholar. For more information, please contact [email protected]. TWO PLAYER ZERO SUM MULTI-STAGE GAME ANALYSIS USING COEVOLUTIONARY ALGORITHM. A Thesis submitted in partial fulfilment of the requirements for the degree of Master of Science in Electrical Engineering By SUMEDH SOPAN NAGRALE B.E.,University of Mumbai, 2012 2019 Wright State University WRIGHT STATE UNIVERSITY GRADUATE SCHOOL April 24, 2019 I HEREBY RECOMMEND THAT THE THESIS PREPARED UNDER MY SUPERVISION BY Sumedh Sopan Nagrale ENTITLED Two Player Zero Sum Multi-Stage Game Analysis Using Coevolutionary Algorithm BE ACCEPTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DE- GREE OF Master of Science in Electrical Engineering. Luther Palmer, III, Ph.D. Thesis Director. Fred D. Garber, Ph.D. Chair, Department of Electrical Engineering Committee on Final Examination: Luther Palmer, III, Ph.D. Pradeep Misra, Ph.D. Xiaodong Zhang, Ph.D. Barry Milligan, Ph.D. Interim Dean of the Graduate School ABSTRACT Nagrale, Sumedh Sopan. M.S.E.E, Department of Electrical Engineering, Wright State University, 2019. Two Player Zero Sum Multi-Stage Game Analysis Using Coevolutionary Algorithm. A New Two player zero sum multistage simultaneous Game has been developed from a real-life sit- uation of dispute between two individual. Research identifies a multistage game as a multi-objective optimization problem and solves it using Coevolutionary algorithm which converges to a solution from pareto optimal solution. A comparison is done between individual stage behaviour and multistage be- haviour. Further, simulations over a range for Crossover rate, Mutation rate and Number of interaction is done to narrow down the range for a range with optimal computation speed. A relationship has been observed which identifies a relationship between population size, number of interactions, crossover rate, mutation rate and computational time. A point from the obtained range is then selected and applied to a new game to see if the point from the narrowed range works. iii Contents 1 Background 1 1.1 Game Theory . .1 1.2 Solution Concept . .7 1.2.1 Minimax Theorem: . .7 1.2.2 Nash Equilibrium . .7 1.2.3 Rationalization . .7 1.3 Contemporary . .9 2 Introduction 10 3 Game 1: Attacker’s Game or Horizontal Movement Game 12 3.0.1 Players . 12 3.0.2 Environment . 12 3.0.3 Actions . 12 3.0.4 States . 13 3.0.5 Representation choice . 13 3.0.6 Utility . 13 3.0.7 Normal form representation of the states . 13 3.0.8 Transition of states based on actions taken . 14 3.1 Game theoretic solution for the Attack Game . 15 4 Methodology 19 4.1 Co-Evolutionary Algorithm . 20 4.2 Solving s0 as an example by Hand . 21 4.2.1 Population and Gene . 21 4.2.2 Fitness Function . 22 4.2.3 Selection: . 24 4.2.4 Crossover: . 25 4.2.5 Mutation: . 26 4.2.6 Fitness Evaluation: . 26 iv 5 Simulation 27 5.1 Simulation for Individual States using CoEv . 27 5.1.1 Individual Stage Game S0 simulation . 27 5.1.2 Individual Stage Game S1 simulation . 28 5.1.3 Individual Stage Game S2 simulation . 29 5.1.4 Observation . 30 5.1.5 Analysis . 31 5.1.6 Conclusion for the simulation . 31 5.2 Simulations for Multistage Game using CoEv Algorithm . 32 5.2.1 Simulation . 32 5.2.2 Observations . 32 5.2.3 Analysis . 33 5.2.4 Conclusion for the simulation . 33 5.3 Simulation for Parameters with optimal Computation time . 34 5.3.1 Simulation . 34 5.3.2 Observation . 36 5.3.3 Analysis . 36 5.4 Derivation for Computational time . 36 5.4.1 Relation between Generation, Interaction and Time taken . 38 5.4.2 Comparison between values Actual time and real time taken for Interaction . 41 5.5 Conclusion . 42 6 Game 2: Movement Game 43 6.0.1 Players . 43 6.0.2 Environment . 43 6.0.3 Actions . 43 6.0.4 States . 44 6.0.5 Utility . 44 6.0.6 Representation choice . 44 6.0.7 Normal form representation of the states . 44 6.1 Simulation . 46 6.2 Observation and Analysis . 46 6.3 Conclusion . 46 7 Discussion 47 7.1 Future Work . 47 7.1.1 Multiple task Learning . 47 7.1.2 Thesis Extension . 47 7.1.3 Application of research . 48 v 8 Conclusion 49 9 Summary 50 Bibliography 51 vi List of Figures 3.1 Game I States and their equivalent representation s0,s1,s2 ............... 13 3.2 There are multiple Nash equilibriums (R,R) , (R,L), (L,R),(L,L) . 15 3.3 pure stage game S1 equilibrium (L,R) . 15 3.4 pure stage game S2 equilibrium (R,L) . 16 4.1 Genetic Algorithm Outline . 20 4.2 Coevolutionary Algorithm Outline . 21 5.1 State S0 simulation I . 28 5.2 State S0 simulation II . 28 5.3 State S1 simulation I . 29 5.4 State S1 simulation II . 29 5.5 State S2 simulation I . 30 5.6 State S2 simulation II . 30 5.7 State S0 + S1 + S2 simulation I . 32 5.8 Interactions 24 and 48 . 34 5.9 Interactions 72 and 96 . 34 5.10 Interactions 120 and 144 . 35 5.11 Interactions 168 and 192 . 35 5.12 Interactions 216 and 240 . 35 5.13 Plot for G Vs I, Computation time Vs I and Total time Vs I . 39 5.14 Plot for G Vs I, Computation time Vs I and Total time Vs I . 39 5.15 Plot for G Vs I, Computation time Vs I and Total time Vs I . 39 5.16 Plot for G Vs I, Computation time Vs I and Total time Vs I . 39 5.17 PS vsI.......................................... 41 5.18 Game I States and their equivalent representation s0,s1,s2 ............... 42 5.19 Game I States and their equivalent representation s0,s1,s2 ............... 42 6.1 Game II States s0,s1,s2,s3 ............................... 44 6.2 State S0 + S1 + S2 + S3 simulation I . 46 vii List of Tables 1.1 3x3 Matrix: Rock-Paper-Scissor Normal-Form Game . .5 3.1 4x4 Matrix: s0 Normal-Form Game . 14 3.2 4x4 Matrix: s1 Normal-Form Game . 14 3.3 4x4 Matrix: s2 Normal-Form Game . 14 3.4 4x4 Matrix: s0 Reduced Normal-Form Game by Iterative deletion of dominant strategy 16 3.5 4x4 Matrix: s1 Reduced Normal-Form Game by Iterative deletion of dominant strategy 17 3.6 Expected payoff for P5 for mixing different probabilities . 18 4.1 solution for P4 s0 ................................... 19 4.2 Gene example for P4 s0 population . 22 4.3 P4 s0 example and corresponding binary encoding . 23 4.4 Game Matrix in state s0 ................................. 23 4.5 fitness function value P4 s0 population . 23 4.6 P4 s0 population with binary encoded values . 24 4.7 P4 s0 binary encoded.
Recommended publications
  • Labsi Working Papers
    UNIVERSITY OF SIENA S.N. O’ H iggins Arturo Palomba Patrizia Sbriglia Second Mover Advantage and Bertrand Dynamic Competition: An Experiment May 2010 LABSI WORKING PAPERS N. 28/2010 SECOND MOVER ADVANTAGE AND BERTRAND DYNAMIC COMPETITION: AN EXPERIMENT § S.N. O’Higgins University of Salerno [email protected] Arturo Palomba University of Naples II [email protected] Patrizia Sbriglia §§ University of Naples II [email protected] Abstract In this paper we provide an experimental test of a dynamic Bertrand duopolistic model, where firms move sequentially and their informational setting varies across different designs. Our experiment is composed of three treatments. In the first treatment, subjects receive information only on the costs and demand parameters and on the price’ choices of their opponent in the market in which they are positioned (matching is fixed); in the second and third treatments, subjects are also informed on the behaviour of players who are not directly operating in their market. Our aim is to study whether the individual behaviour and the process of equilibrium convergence are affected by the specific informational setting adopted. In all treatments we selected students who had previously studied market games and industrial organization, conjecturing that the specific participants’ expertise decreased the chances of imitation in treatment II and III. However, our results prove the opposite: the extra information provided in treatment II and III strongly affects the long run convergence to the market equilibrium. In fact, whilst in the first session, a high proportion of markets converge to the Nash-Bertrand symmetric solution, we observe that a high proportion of markets converge to more collusive outcomes in treatment II and more competitive outcomes in treatment III.
    [Show full text]
  • 1 Sequential Games
    1 Sequential Games We call games where players take turns moving “sequential games”. Sequential games consist of the same elements as normal form games –there are players, rules, outcomes, and payo¤s. However, sequential games have the added element that history of play is now important as players can make decisions conditional on what other players have done. Thus, if two people are playing a game of Chess the second mover is able to observe the …rst mover’s initial move prior to making his initial move. While it is possible to represent sequential games using the strategic (or matrix) form representation of the game it is more instructive at …rst to represent sequential games using a game tree. In addition to the players, actions, outcomes, and payo¤s, the game tree will provide a history of play or a path of play. A very basic example of a sequential game is the Entrant-Incumbent game. The game is described as follows: Consider a game where there is an entrant and an incumbent. The entrant moves …rst and the incumbent observes the entrant’sdecision. The entrant can choose to either enter the market or remain out of the market. If the entrant remains out of the market then the game ends and the entrant receives a payo¤ of 0 while the incumbent receives a payo¤ of 2. If the entrant chooses to enter the market then the incumbent gets to make a choice. The incumbent chooses between …ghting entry or accommodating entry. If the incumbent …ghts the entrant receives a payo¤ of 3 while the incumbent receives a payo¤ of 1.
    [Show full text]
  • Finitely Repeated Games
    Repeated games 1: Finite repetition Universidad Carlos III de Madrid 1 Finitely repeated games • A finitely repeated game is a dynamic game in which a simultaneous game (the stage game) is played finitely many times, and the result of each stage is observed before the next one is played. • Example: Play the prisoners’ dilemma several times. The stage game is the simultaneous prisoners’ dilemma game. 2 Results • If the stage game (the simultaneous game) has only one NE the repeated game has only one SPNE: In the SPNE players’ play the strategies in the NE in each stage. • If the stage game has 2 or more NE, one can find a SPNE where, at some stage, players play a strategy that is not part of a NE of the stage game. 3 The prisoners’ dilemma repeated twice • Two players play the same simultaneous game twice, at ! = 1 and at ! = 2. • After the first time the game is played (after ! = 1) the result is observed before playing the second time. • The payoff in the repeated game is the sum of the payoffs in each stage (! = 1, ! = 2) • Which is the SPNE? Player 2 D C D 1 , 1 5 , 0 Player 1 C 0 , 5 4 , 4 4 The prisoners’ dilemma repeated twice Information sets? Strategies? 1 .1 5 for each player 2" for each player D C E.g.: (C, D, D, C, C) Subgames? 2.1 5 D C D C .2 1.3 1.5 1 1.4 D C D C D C D C 2.2 2.3 2 .4 2.5 D C D C D C D C D C D C D C D C 1+1 1+5 1+0 1+4 5+1 5+5 5+0 5+4 0+1 0+5 0+0 0+4 4+1 4+5 4+0 4+4 1+1 1+0 1+5 1+4 0+1 0+0 0+5 0+4 5+1 5+0 5+5 5+4 4+1 4+0 4+5 4+4 The prisoners’ dilemma repeated twice Let’s find the NE in the subgames.
    [Show full text]
  • Chapter 16 Oligopoly and Game Theory Oligopoly Oligopoly
    Chapter 16 “Game theory is the study of how people Oligopoly behave in strategic situations. By ‘strategic’ we mean a situation in which each person, when deciding what actions to take, must and consider how others might respond to that action.” Game Theory Oligopoly Oligopoly • “Oligopoly is a market structure in which only a few • “Figuring out the environment” when there are sellers offer similar or identical products.” rival firms in your market, means guessing (or • As we saw last time, oligopoly differs from the two ‘ideal’ inferring) what the rivals are doing and then cases, perfect competition and monopoly. choosing a “best response” • In the ‘ideal’ cases, the firm just has to figure out the environment (prices for the perfectly competitive firm, • This means that firms in oligopoly markets are demand curve for the monopolist) and select output to playing a ‘game’ against each other. maximize profits • To understand how they might act, we need to • An oligopolist, on the other hand, also has to figure out the understand how players play games. environment before computing the best output. • This is the role of Game Theory. Some Concepts We Will Use Strategies • Strategies • Strategies are the choices that a player is allowed • Payoffs to make. • Sequential Games •Examples: • Simultaneous Games – In game trees (sequential games), the players choose paths or branches from roots or nodes. • Best Responses – In matrix games players choose rows or columns • Equilibrium – In market games, players choose prices, or quantities, • Dominated strategies or R and D levels. • Dominant Strategies. – In Blackjack, players choose whether to stay or draw.
    [Show full text]
  • Cooperation Spillovers in Coordination Games*
    Cooperation Spillovers in Coordination Games* Timothy N. Casona, Anya Savikhina, and Roman M. Sheremetab aDepartment of Economics, Krannert School of Management, Purdue University, 403 W. State St., West Lafayette, IN 47906-2056, U.S.A. bArgyros School of Business and Economics, Chapman University, One University Drive, Orange, CA 92866, U.S.A. November 2009 Abstract Motivated by problems of coordination failure observed in weak-link games, we experimentally investigate behavioral spillovers for order-statistic coordination games. Subjects play the minimum- and median-effort coordination games simultaneously and sequentially. The results show the precedent for cooperative behavior spills over from the median game to the minimum game when the games are played sequentially. Moreover, spillover occurs even when group composition changes, although the effect is not as strong. We also find that the precedent for uncooperative behavior does not spill over from the minimum game to the median game. These findings suggest guidelines for increasing cooperative behavior within organizations. JEL Classifications: C72, C91 Keywords: coordination, order-statistic games, experiments, cooperation, minimum game, behavioral spillover Corresponding author: Timothy Cason, [email protected] * We thank Yan Chen, David Cooper, John Duffy, Vai-Lam Mui, seminar participants at Purdue University, and participants at Economic Science Association conferences for helpful comments. Any remaining errors are ours. 1. Introduction Coordination failure is often the reason for the inefficient performance of many groups, ranging from small firms to entire economies. When agents’ actions have strategic interdependence, even when they succeed in coordinating they may be “trapped” in an equilibrium that is objectively inferior to other equilibria. Coordination failure and inefficient coordination has been an important theme across a variety of fields in economics, ranging from development and macroeconomics to mechanism design for overcoming moral hazard in teams.
    [Show full text]
  • 570: Minimax Sample Complexity for Turn-Based Stochastic Game
    Minimax Sample Complexity for Turn-based Stochastic Game Qiwen Cui1 Lin F. Yang2 1School of Mathematical Sciences, Peking University 2Electrical and Computer Engineering Department, University of California, Los Angeles Abstract guarantees are rather rare due to complex interaction be- tween agents that makes the problem considerably harder than single agent reinforcement learning. This is also known The empirical success of multi-agent reinforce- as non-stationarity in MARL, which means when multi- ment learning is encouraging, while few theoret- ple agents alter their strategies based on samples collected ical guarantees have been revealed. In this work, from previous strategy, the system becomes non-stationary we prove that the plug-in solver approach, proba- for each agent and the improvement can not be guaranteed. bly the most natural reinforcement learning algo- One fundamental question in MBRL is that how to design rithm, achieves minimax sample complexity for efficient algorithms to overcome non-stationarity. turn-based stochastic game (TBSG). Specifically, we perform planning in an empirical TBSG by Two-players turn-based stochastic game (TBSG) is a two- utilizing a ‘simulator’ that allows sampling from agents generalization of Markov decision process (MDP), arbitrary state-action pair. We show that the em- where two agents choose actions in turn and one agent wants pirical Nash equilibrium strategy is an approxi- to maximize the total reward while the other wants to min- mate Nash equilibrium strategy in the true TBSG imize it. As a zero-sum game, TBSG is known to have and give both problem-dependent and problem- Nash equilibrium strategy [Shapley, 1953], which means independent bound.
    [Show full text]
  • Formation of Coalition Structures As a Non-Cooperative Game
    Formation of coalition structures as a non-cooperative game Dmitry Levando, ∗ Abstract We study coalition structure formation with intra and inter-coalition externalities in the introduced family of nested non-cooperative si- multaneous finite games. A non-cooperative game embeds a coalition structure formation mechanism, and has two outcomes: an alloca- tion of players over coalitions and a payoff for every player. Coalition structures of a game are described by Young diagrams. They serve to enumerate coalition structures and allocations of players over them. For every coalition structure a player has a set of finite strategies. A player chooses a coalition structure and a strategy. A (social) mechanism eliminates conflicts in individual choices and produces final coalition structures. Every final coalition structure is a non-cooperative game. Mixed equilibrium always exists and consists arXiv:2107.00711v1 [cs.GT] 1 Jul 2021 of a mixed strategy profile, payoffs and equilibrium coalition struc- tures. We use a maximum coalition size to parametrize the family ∗Moscow, Russian Federation, independent. No funds, grants, or other support was received. Acknowledge: Nick Baigent, Phillip Bich, Giulio Codognato, Ludovic Julien, Izhak Gilboa, Olga Gorelkina, Mark Kelbert, Anton Komarov, Roger Myerson, Ariel Rubinstein, Shmuel Zamir. Many thanks for advices and for discussion to participants of SIGE-2015, 2016 (an earlier version of the title was “A generalized Nash equilibrium”), CORE 50 Conference, Workshop of the Central European Program in Economic Theory, Udine (2016), Games 2016 Congress, Games and Applications, (2016) Lisbon, Games and Optimization at St Etienne (2016). All possible mistakes are only mine. E-mail for correspondence: dlevando (at) hotmail.ru.
    [Show full text]
  • Finding Strategic Game Equivalent of an Extensive Form Game
    Finding Strategic Game Equivalent of an Extensive Form Game • In an extensive form game, a strategy for a player should specify what action the player will choose at each information set. That is, a strategy is a complete plan for playing a game for a particular player. • Therefore to find the strategic game equivalent of an extensive form game we should follow these steps: 1. First we need to find all strategies for every player. To do that first we find all information sets for every player (including the singleton information sets). If there are many of them we may label them (such as Player 1’s 1st info. set, Player 1’s 2nd info. set, etc.) 2. A strategy should specify what action the player will take in every information set which belongs to him/her. We find all combinations of actions the player can take at these information sets. For example if a player has 3 information sets where • the first one has 2 actions, • the second one has 2 actions, and • the third one has 3 actions then there will be a total of 2 × 2 × 3 = 12 strategies for this player. 3. Once the strategies are obtained for every player the next step is finding the payoffs. For every strategy profile we find the payoff vector that would be obtained in case these strategies are played in the extensive form game. Example: Let’s find the strategic game equivalent of the following 3-level centipede game. (3,1)r (2,3)r (6,2)r (4,6)r (12,4)r (8,12)r S S S S S S r 1 2 1 2 1 2 (24,8) C C C C C C 1.
    [Show full text]
  • How to Solve Strategic Games? Dominant Strategies
    How to Solve Strategic Games? There are three main concepts to solve strategic games: 1. Dominant Strategies & Dominant Strategy Equilibrium 2. Dominated Strategies & Iterative Elimination of Dominated Strategies 3. Nash Equilibrium Dominant Strategies • Astrategyisadominant strategy for a player if it yields the best payoff (for that player) no matter what strategies the other players choose. • If all players have a dominant strategy, then it is natural for them to choose the dominant strategies and we reach a dominant strategy equilibrium. Example (Prisoner’s Dilemma): Prisoner 2 Confess Deny Prisoner 1 Confess -10, -10 -1, -25 Deny -25, -1 -3, -3 Confess is a dominant strategy for both players and therefore (Confess,Confess) is a dominant strategy equilibrium yielding the payoff vector (-10,-10). Example (Time vs. Newsweek): Newsweek AIDS BUDGET Time AIDS 35,35 70,30 BUDGET 30,70 15,15 The AIDS story is a dominant strategy for both Time and Newsweek. Therefore (AIDS,AIDS) is a dominant strategy equilibrium yielding both magazines a market share of 35 percent. Example: Player 2 XY A 5,2 4,2 Player 1 B 3,1 3,2 C 2,1 4,1 D 4,3 5,4 • Here Player 1 does not have a single strategy that “beats” every other strategy. Therefore she does not have a dominant strategy. • On the other hand Y is a dominant strategy for Player 2. Example (with 3 players): P3 A B P2 P2 LR LR U 3,2,1 2,1,1 U 1,1,2 2,0,1 P1 M 2,2,0 1,2,1 M 1,2,0 1,0,2 D 3,1,2 1,0,2 D 0,2,3 1,2,2 Here • U is a dominant strategy for Player 1, L is a dominant strategy for Player 2, B is a dominant strategy for Player 3, • and therefore (U;L;B) is a dominant strategy equilibrium yielding a payoff of (1,1,2).
    [Show full text]
  • Building a Poker Playing Agent Based on Game Logs Using Supervised Learning
    FACULDADE DE ENGENHARIA DA UNIVERSIDADE DO PORTO Building a Poker Playing Agent based on Game Logs using Supervised Learning Luís Filipe Guimarães Teófilo Mestrado Integrado em Engenharia Informática e Computação Orientador: Professor Doutor Luís Paulo Reis 26 de Julho de 2010 Building a Poker Playing Agent based on Game Logs using Supervised Learning Luís Filipe Guimarães Teófilo Mestrado Integrado em Engenharia Informática e Computação Aprovado em provas públicas pelo Júri: Presidente: Professor Doutor António Augusto de Sousa Vogal Externo: Professor Doutor José Torres Orientador: Professor Doutor Luís Paulo Reis ____________________________________________________ 26 de Julho de 2010 iv Resumo O desenvolvimento de agentes artificiais que jogam jogos de estratégia provou ser um domínio relevante de investigação, sendo que investigadores importantes na área das ciências de computadores dedicaram o seu tempo a estudar jogos como o Xadrez e as Damas, obtendo resultados notáveis onde o jogador artificial venceu os melhores jogadores humanos. No entanto, os jogos estocásticos com informação incompleta trazem novos desafios. Neste tipo de jogos, o agente tem de lidar com problemas como a gestão de risco ou o tratamento de informação não fiável, o que torna essencial modelar adversários, para conseguir obter bons resultados. Nos últimos anos, o Poker tornou-se um fenómeno de massas, sendo que a sua popularidade continua a aumentar. Na Web, o número de jogadores aumentou bastante, assim como o número de casinos online, tornando o Poker numa indústria bastante rentável. Além disso, devido à sua natureza estocástica de informação imperfeita, o Poker provou ser um problema desafiante para a inteligência artificial. Várias abordagens foram seguidas para criar um jogador artificial perfeito, sendo que já foram feitos progressos nesse sentido, como o melhoramento das técnicas de modelação de oponentes.
    [Show full text]
  • Credible Commitments 2.1 Game Trees and Subgame Perfection
    R.E.Marks 2000 Week 8-1 2. Credible Commitments 2.1 Game Trees and Subgame Perfection What if one player moves first? Use a game tree, in which the players, their actions, and the timing of their actions are explicit. Allow three choices for each of the two players, Alpha and Beta: Do Not Expand (DNE), Small, and Large expansions. The payoff matrix for simultaneous moves is: The Capacity Game Beta ________________________________DNE Small Large L L L L DNE $18, $18 $15, $20 $9, $18 _L_______________________________L L L L L L L Alpha SmallL $20, $15L $16, $16L $8, $12 L L________________________________L L L L L L L Large_L_______________________________ $18, $9L $12, $8L $0, $0 L TABLE 1. The payoff matrix (Alpha, Beta) R.E.Marks 2000 Week 8-2 The game tree. If Alpha preempts Beta, by making its capacity decision before Beta does, then use the game tree: Alpha L S DNE Beta Beta Beta L S DNE L S DNE L S DNE 0 12 18 8 16 20 9 15 18 0 8 9 12 16 15 18 20 18 Figure 1. Game Tree, Payoffs: Alpha’s, Beta’s Use subgame perfect Nash equilibrium, in which each player chooses the best action for itself at each node it might reach, and assumes similar behaviour on the part of the other. R.E.Marks 2000 Week 8-3 2.1.1 Backward Induction With complete information (all know what each has done), we can solve this by backward induction: 1. From the end (final payoffs), go up the tree to the first parent decision nodes.
    [Show full text]
  • The Mathematical Analysis of Games, Focusing on Variance
    The Mathematical Analysis of Games, Focusing on Variance Dr. ir. Tom Verhoeff, TU Eindhoven Last November, I gave a lunch talk on game analysis for W.I.S.V. Mathematically interesting questions are: Given the game's state, `Christiaan Huygens'. If you missed it, you can catch up here. how do you decide your best move? Which player can force the best Games, Mathematics, and Decisions outcome from the start? What is the (long-term) optimum result? Games have always been loved by mathematicians. Because of their A Strategic Coin Game (usually) well-defined rules, games admit a formal analysis, sometimes Let us dive into a simple strategic coin game. The two players, with surprising results. It is impossible to provide comprehensive Alice and Bob, simultaneously and secretly each choose 0 (head) or coverage in this article. Instead, I will give an overview and focus on 1 (tail)1. After committing their choices, the payoff is determined by the role of variance, which is often overlooked. the matrix in Figure 2. Alice receives e 2 from Bob when the choices J¨orgBewersdorff wrote a nice book on applying mathematics to the Bob chooses analysis of games [2]. I recommend it to the mathematically inclined reader. It classifies games according to the sources of uncertainty 0 1 that confront the players, also see Figure 1. m m Combinatorial Games generate uncertainty by the many ways in which 0 " 1 2 moves can be combined, in spite of the fact that the players have Alice chooses m open access to all information.
    [Show full text]