Week 7: Games in Extensive Form

Total Page:16

File Type:pdf, Size:1020Kb

Week 7: Games in Extensive Form Battle of the Sexes Stackelberg Essential Ingredients Game Trees Backward Induction Subgame Perfection Week 7: Games in Extensive Form Dr Daniel Sgroi Reading: 1. Osborne chapter 5 (and also 6, 7); 2. Snyder & Nicholson, pp. 255{260. With thanks to Peter J. Hammond. EC202, University of Warwick, Term 2 1 of 33 Battle of the Sexes Stackelberg Essential Ingredients Game Trees Backward Induction Subgame Perfection Sequential BoS, I 1 BS XX 2 XXX2 BS @ b BS @ r @@ r @@ 2; 1 0; 0 0; 0 1; 2 r r r r Consider the BoS game, with the key difference that player 1 who prefers (B; B) to (S; S) moves first, and player 2 observes player 1's move before replying. This difference affects the normal or strategic form, because player 2 now has four instead of two strategies, as follows: 1. B always (which we denote by b); 2. S always (which we denote by s); 3. B if B and S if S (which we denote by m for match); 4. S if B and B if S (which we denote by x for cross). EC202, University of Warwick, Term 2 2 of 33 Battle of the Sexes Stackelberg Essential Ingredients Game Trees Backward Induction Subgame Perfection Sequential BoS, II Here is the normal form of sequential BoS, with (multiple) best responses starred as usual. P2 b s m x B 2∗ 0 2∗ 0∗ 1∗ 0 1∗ 0 ∗ ∗ P1 S 0 1 1 0 0 2∗ 2∗ 0 There are three pure strategy Nash equilibria, and many mixed strategy equilibria (for example, B with any mixture of b and m). EC202, University of Warwick, Term 2 3 of 33 Battle of the Sexes Stackelberg Essential Ingredients Game Trees Backward Induction Subgame Perfection Sequential BoS, III P2 b s m x B 2∗ 0 2∗ 0∗ 1∗ 0 1∗ 0 ∗ ∗ P1 S 0 1 1 0 0 2∗ 2∗ 0 Player P2's strategy x is strictly dominated by m, while both b and s are weakly dominated by m. If we eliminate all weakly dominated strategies once, then eliminate 1's strictly dominated strategy, only (B; m) remains. This is a refinement of the set of Nash equilibria. EC202, University of Warwick, Term 2 4 of 33 Battle of the Sexes Stackelberg Essential Ingredients Game Trees Backward Induction Subgame Perfection Sequential BoS, IV An alternative way to justify P2 choosing strategy m considers the two different subgames that arise after player 1 has chosen either B or S. In each of these player 2 can also choose B or S: after B, player 2's best response is B; after S, player 2's best response is S. Thus player 2 matches by choosing m in the full game. Player 1's best response is B. In effect, player 1's move anticipates player 2's response. The result is player 1's favoured outcome, so moving first confers an advantage in this game. Player 1 has a “first-mover advantage". EC202, University of Warwick, Term 2 5 of 33 Battle of the Sexes Stackelberg Essential Ingredients Game Trees Backward Induction Subgame Perfection Sequential Cornot, I Consider the Cournot duopoly model where firms have zero cost, and inverse demand is p = maxf0; 100 − qg, with q = q1 + q2. Firm i's best response, taking the other firm’s qj as fixed, 1 solves maxqi (100 − qi − qj )qi , so qi = 50 − 2 qj . After allowing for the possibility of a corner solution, we derive 1 the best response (BR) function bi (qj ) = maxf0; 50 − 2 qj g. As in sequential BoS, suppose that before firm 2 chooses q2, it will have observed firm 1's choice of q1. Firm 2's strategy space becomes the set of all response functions q1 7! r2(q1) from R+ to R+. A strategy for firm 2 that weakly dominates all others is to choose r2(·) to be the best response function b2(·). When making its move, firm 2 not only has the deterministic belief, but the certain knowledge that q1 has been played. EC202, University of Warwick, Term 2 6 of 33 Battle of the Sexes Stackelberg Essential Ingredients Game Trees Backward Induction Subgame Perfection Sequential Cornot, II Assuming common knowledge of rationality, what should firm 1 do? A deterministic belief about q2 would be na¨ıve, since firm 1 knows that a rational firm 2 1 will choose b2(q1) = maxf0; 50 − 2 q1g in response to its choice of q1. This, in turn, means that a rational firm 1 would replace the “fixed” q2 in its profit function with firm 2's best response b2(q1) to 1's choice of q1. That is, firm 1 now chooses q1 to solve max[100 − q1 − b2(q1)] q1: q1 EC202, University of Warwick, Term 2 7 of 33 Battle of the Sexes Stackelberg Essential Ingredients Game Trees Backward Induction Subgame Perfection Stackelberg, I Firm 1 chooses q1 to solve max[100 − q1 − b2(q1)] q1; q1 1 where b2(q1) = maxf0; 50 − 2 q1g. In the interior case, the maximand 1 1 is [100 − q1 − (50 − 2 q1)] q1 = (50 − 2 q1) q1, with derivative 50 − q1, which is ? 0 as q1 7 50. So, anticipating firm 2's response b2(q1) to q1, firm 1's optimal strategy is q1 = 50. After this firm 2 responds by choosing q2 = b2(50) = 25. The price is 100 − q1 − q2 = 100 − 75 = 25. This is the Stackelberg equilibrium. EC202, University of Warwick, Term 2 8 of 33 Battle of the Sexes Stackelberg Essential Ingredients Game Trees Backward Induction Subgame Perfection Stackelberg, II Given q1 = 50, q2 = 25, and p = 25, the resulting profits (revenues) are π1 = 25 · 50 = 1; 250 and π2 = 25 · 25 = 625. Recall that in the original Cournot equilibrium (with simultaneous moves) 1 the quantities were q1 = q2 = 100=3 = 33 3 , 1 the price was p = 100=3 = 33 3 , 1 and the profits were π1 = π2 = 10; 000=9 = 1; 111 9 . So firm 1 has a first-mover advantage, relative to both firm 2 and the Cournot equilibrium. Total profits, however, are lower. EC202, University of Warwick, Term 2 9 of 33 Battle of the Sexes Stackelberg Essential Ingredients Game Trees Backward Induction Subgame Perfection Extensive Form Games One obvious drawback (sometimes an advantage) of the normal form of a game is its failure to capture time. A player's strategy describes one complete plan for the whole game. Payoff functions represent how players value the consequences of strategy profiles. But how is the order of moves captured? A game in extensive form captures sequential strategic situations when there is a well defined order of moves. Our analyses of sequential BoS and sequential Cournot suggest that the order of moves can affect what we would regard as a reasonable outcome of the game. It illustrates a solution concept that captures an important idea of sequential rationality. EC202, University of Warwick, Term 2 10 of 33 Battle of the Sexes Stackelberg Essential Ingredients Game Trees Backward Induction Subgame Perfection Players and Payoffs The extensive form formally captures situations where: 1. players may move sequentially; 2. what players know, when it is their turn to move, may depend on players' previous choices. Like the normal or strategic form, any extensive form game has two key ingredients in its description: (EF1) The set of players, N (EF2) The players' payoffs (ui (·))i2N as functions of everybody's actions. EC202, University of Warwick, Term 2 11 of 33 Battle of the Sexes Stackelberg Essential Ingredients Game Trees Backward Induction Subgame Perfection Move Order and Feasible Sets To allow sequential play to be represented, we extend the notion of pure strategy by specifying two separate parts: first, as before, what players can do; second, when (and with what information) they can do it. In the Stackelberg example, as in the Cournot game, players can choose any quantity they like. But we also needed to specify that player 1 moves first, and only then does player 2 move (after observing 1's prior choice). Thus, in general we need two extra components in order to model allowable move sequences: (EF3) The order of moves. (EF4) What choices players have when they do move (their feasible sets). EC202, University of Warwick, Term 2 12 of 33 Battle of the Sexes Stackelberg Essential Ingredients Game Trees Backward Induction Subgame Perfection Players' Knowledge Simultaneous moves in the normal form represent the situation where players know nothing about other players' prior moves. In the extensive form, it is not the chronological order of play that matters, but what players know when they choose. In both the Cournot and Stackelberg models of duopoly, firm 1 may indeed move some time before firm 2. But if firm 2 must choose without observing firm 1's choice, then the Cournot model of simultaneous choice still applies. In contrast, if firm 1's choice is revealed before firm 2 moves, then we should use the sequential Stackelberg model. In general, we need to specify how information and knowledge change over time: (EF5) Players' information when making their moves. EC202, University of Warwick, Term 2 13 of 33 Battle of the Sexes Stackelberg Essential Ingredients Game Trees Backward Induction Subgame Perfection Chance Moves Exogenous random events vastly enrich the strategic decision problems we can describe. Example Suppose firm A embarks on a research and development (R&D) project that may succeed or fail. Competing firm B can adapt its strategy to the outcome of firm A's R&D project, by waiting to see if it succeeds or not.
Recommended publications
  • Game Theory 2: Extensive-Form Games and Subgame Perfection
    Game Theory 2: Extensive-Form Games and Subgame Perfection 1 / 26 Dynamics in Games How should we think of strategic interactions that occur in sequence? Who moves when? And what can they do at different points in time? How do people react to different histories? 2 / 26 Modeling Games with Dynamics Players Player function I Who moves when Terminal histories I Possible paths through the game Preferences over terminal histories 3 / 26 Strategies A strategy is a complete contingent plan Player i's strategy specifies her action choice at each point at which she could be called on to make a choice 4 / 26 An Example: International Crises Two countries (A and B) are competing over a piece of land that B occupies Country A decides whether to make a demand If Country A makes a demand, B can either acquiesce or fight a war If A does not make a demand, B keeps land (game ends) A's best outcome is Demand followed by Acquiesce, worst outcome is Demand and War B's best outcome is No Demand and worst outcome is Demand and War 5 / 26 An Example: International Crises A can choose: Demand (D) or No Demand (ND) B can choose: Fight a war (W ) or Acquiesce (A) Preferences uA(D; A) = 3 > uA(ND; A) = uA(ND; W ) = 2 > uA(D; W ) = 1 uB(ND; A) = uB(ND; W ) = 3 > uB(D; A) = 2 > uB(D; W ) = 1 How can we represent this scenario as a game (in strategic form)? 6 / 26 International Crisis Game: NE Country B WA D 1; 1 3X; 2X Country A ND 2X; 3X 2; 3X I Is there something funny here? I Is there something funny here? I Specifically, (ND; W )? I Is there something funny here?
    [Show full text]
  • 1 Sequential Games
    1 Sequential Games We call games where players take turns moving “sequential games”. Sequential games consist of the same elements as normal form games –there are players, rules, outcomes, and payo¤s. However, sequential games have the added element that history of play is now important as players can make decisions conditional on what other players have done. Thus, if two people are playing a game of Chess the second mover is able to observe the …rst mover’s initial move prior to making his initial move. While it is possible to represent sequential games using the strategic (or matrix) form representation of the game it is more instructive at …rst to represent sequential games using a game tree. In addition to the players, actions, outcomes, and payo¤s, the game tree will provide a history of play or a path of play. A very basic example of a sequential game is the Entrant-Incumbent game. The game is described as follows: Consider a game where there is an entrant and an incumbent. The entrant moves …rst and the incumbent observes the entrant’sdecision. The entrant can choose to either enter the market or remain out of the market. If the entrant remains out of the market then the game ends and the entrant receives a payo¤ of 0 while the incumbent receives a payo¤ of 2. If the entrant chooses to enter the market then the incumbent gets to make a choice. The incumbent chooses between …ghting entry or accommodating entry. If the incumbent …ghts the entrant receives a payo¤ of 3 while the incumbent receives a payo¤ of 1.
    [Show full text]
  • Equilibrium Refinements
    Equilibrium Refinements Mihai Manea MIT Sequential Equilibrium I In many games information is imperfect and the only subgame is the original game. subgame perfect equilibrium = Nash equilibrium I Play starting at an information set can be analyzed as a separate subgame if we specify players’ beliefs about at which node they are. I Based on the beliefs, we can test whether continuation strategies form a Nash equilibrium. I Sequential equilibrium (Kreps and Wilson 1982): way to derive plausible beliefs at every information set. Mihai Manea (MIT) Equilibrium Refinements April 13, 2016 2 / 38 An Example with Incomplete Information Spence’s (1973) job market signaling game I The worker knows her ability (productivity) and chooses a level of education. I Education is more costly for low ability types. I Firm observes the worker’s education, but not her ability. I The firm decides what wage to offer her. In the spirit of subgame perfection, the optimal wage should depend on the firm’s beliefs about the worker’s ability given the observed education. An equilibrium needs to specify contingent actions and beliefs. Beliefs should follow Bayes’ rule on the equilibrium path. What about off-path beliefs? Mihai Manea (MIT) Equilibrium Refinements April 13, 2016 3 / 38 An Example with Imperfect Information Courtesy of The MIT Press. Used with permission. Figure: (L; A) is a subgame perfect equilibrium. Is it plausible that 2 plays A? Mihai Manea (MIT) Equilibrium Refinements April 13, 2016 4 / 38 Assessments and Sequential Rationality Focus on extensive-form games of perfect recall with finitely many nodes. An assessment is a pair (σ; µ) I σ: (behavior) strategy profile I µ = (µ(h) 2 ∆(h))h2H: system of beliefs ui(σjh; µ(h)): i’s payoff when play begins at a node in h randomly selected according to µ(h), and subsequent play specified by σ.
    [Show full text]
  • Notes on Sequential and Repeated Games
    Notes on sequential and repeated games 1 Sequential Move Games Thus far we have examined games in which players make moves simultaneously (or without observing what the other player has done). Using the normal (strategic) form representation of a game we can identify sets of strategies that are best responses to each other (Nash Equilibria). We now focus on sequential games of complete information. We can still use the normal form representation to identify NE but sequential games are richer than that because some players observe other players’decisions before they take action. The fact that some actions are observable may cause some NE of the normal form representation to be inconsistent with what one might think a player would do. Here’sa simple game between an Entrant and an Incumbent. The Entrant moves …rst and the Incumbent observes the Entrant’s action and then gets to make a choice. The Entrant has to decide whether or not he will enter a market or not. Thus, the Entrant’s two strategies are “Enter” or “Stay Out”. If the Entrant chooses “Stay Out” then the game ends. The payo¤s for the Entrant and Incumbent will be 0 and 2 respectively. If the Entrant chooses “Enter” then the Incumbent gets to choose whether or not he will “Fight”or “Accommodate”entry. If the Incumbent chooses “Fight”then the Entrant receives 3 and the Incumbent receives 1. If the Incumbent chooses “Accommodate”then the Entrant receives 2 and the Incumbent receives 1. This game in normal form is Incumbent Fight if Enter Accommodate if Enter .
    [Show full text]
  • Algorithms and Collusion - Background Note by the Secretariat
    Unclassified DAF/COMP(2017)4 Organisation for Economic Co-operation and Development 9 June 2017 English - Or. English DIRECTORATE FOR FINANCIAL AND ENTERPRISE AFFAIRS COMPETITION COMMITTEE Cancels & replaces the same document of 17 May 2017. Algorithms and Collusion - Background Note by the Secretariat 21-23 June 2017 This document was prepared by the OECD Secretariat to serve as a background note for Item 10 at the 127th Meeting of the Competition Committee on 21-23 June 2017. The opinions expressed and arguments employed herein do not necessarily reflect the official views of the Organisation or of the governments of its member countries. More documentation related to this discussion can be found at http://www.oecd.org/daf/competition/algorithms-and-collusion.htm Please contact Mr. Antonio Capobianco if you have any questions about this document [E-mail: [email protected]] JT03415748 This document, as well as any data and map included herein, are without prejudice to the status of or sovereignty over any territory, to the delimitation of international frontiers and boundaries and to the name of any territory, city or area. 2 │ DAF/COMP(2017)4 Algorithms and Collusion Background note by the Secretariat Abstract The combination of big data with technologically advanced tools, such as pricing algorithms, is increasingly diffused in today everyone’s life, and it is changing the competitive landscape in which many companies operate and the way in which they make commercial and strategic decisions. While the size of this phenomenon is to a large extent unknown, there are a growing number of firms using computer algorithms to improve their pricing models, customise services and predict market trends.
    [Show full text]
  • Evolutionary Game Theory: ESS, Convergence Stability, and NIS
    Evolutionary Ecology Research, 2009, 11: 489–515 Evolutionary game theory: ESS, convergence stability, and NIS Joseph Apaloo1, Joel S. Brown2 and Thomas L. Vincent3 1Department of Mathematics, Statistics and Computer Science, St. Francis Xavier University, Antigonish, Nova Scotia, Canada, 2Department of Biological Sciences, University of Illinois, Chicago, Illinois, USA and 3Department of Aerospace and Mechanical Engineering, University of Arizona, Tucson, Arizona, USA ABSTRACT Question: How are the three main stability concepts from evolutionary game theory – evolutionarily stable strategy (ESS), convergence stability, and neighbourhood invader strategy (NIS) – related to each other? Do they form a basis for the many other definitions proposed in the literature? Mathematical methods: Ecological and evolutionary dynamics of population sizes and heritable strategies respectively, and adaptive and NIS landscapes. Results: Only six of the eight combinations of ESS, convergence stability, and NIS are possible. An ESS that is NIS must also be convergence stable; and a non-ESS, non-NIS cannot be convergence stable. A simple example shows how a single model can easily generate solutions with all six combinations of stability properties and explains in part the proliferation of jargon, terminology, and apparent complexity that has appeared in the literature. A tabulation of most of the evolutionary stability acronyms, definitions, and terminologies is provided for comparison. Key conclusions: The tabulated list of definitions related to evolutionary stability are variants or combinations of the three main stability concepts. Keywords: adaptive landscape, convergence stability, Darwinian dynamics, evolutionary game stabilities, evolutionarily stable strategy, neighbourhood invader strategy, strategy dynamics. INTRODUCTION Evolutionary game theory has and continues to make great strides.
    [Show full text]
  • SEQUENTIAL GAMES with PERFECT INFORMATION Example
    SEQUENTIAL GAMES WITH PERFECT INFORMATION Example 4.9 (page 105) Consider the sequential game given in Figure 4.9. We want to apply backward induction to the tree. 0 Vertex B is owned by player two, P2. The payoffs for P2 are 1 and 3, with 3 > 1, so the player picks R . Thus, the payoffs at B become (0, 3). 00 Next, vertex C is also owned by P2 with payoffs 1 and 0. Since 1 > 0, P2 picks L , and the payoffs are (4, 1). Player one, P1, owns A; the choice of L gives a payoff of 0 and R gives a payoff of 4; 4 > 0, so P1 chooses R. The final payoffs are (4, 1). 0 00 We claim that this strategy profile, { R } for P1 and { R ,L } is a Nash equilibrium. Notice that the 0 00 strategy profile gives a choice at each vertex. For the strategy { R ,L } fixed for P2, P1 has a maximal payoff by choosing { R }, ( 0 00 0 00 π1(R, { R ,L }) = 4 π1(R, { R ,L }) = 4 ≥ 0 00 π1(L, { R ,L }) = 0. 0 00 In the same way, for the strategy { R } fixed for P1, P2 has a maximal payoff by choosing { R ,L }, ( 00 0 00 π2(R, {∗,L }) = 1 π2(R, { R ,L }) = 1 ≥ 00 π2(R, {∗,R }) = 0, where ∗ means choose either L0 or R0. Since no change of choice by a player can increase that players own payoff, the strategy profile is called a Nash equilibrium. Notice that the above strategy profile is also a Nash equilibrium on each branch of the game tree, mainly starting at either B or starting at C.
    [Show full text]
  • The Three Types of Collusion: Fixing Prices, Rivals, and Rules Robert H
    University of Baltimore Law ScholarWorks@University of Baltimore School of Law All Faculty Scholarship Faculty Scholarship 2000 The Three Types of Collusion: Fixing Prices, Rivals, and Rules Robert H. Lande University of Baltimore School of Law, [email protected] Howard P. Marvel Ohio State University, [email protected] Follow this and additional works at: http://scholarworks.law.ubalt.edu/all_fac Part of the Antitrust and Trade Regulation Commons, and the Law and Economics Commons Recommended Citation The Three Types of Collusion: Fixing Prices, Rivals, and Rules, 2000 Wis. L. Rev. 941 (2000) This Article is brought to you for free and open access by the Faculty Scholarship at ScholarWorks@University of Baltimore School of Law. It has been accepted for inclusion in All Faculty Scholarship by an authorized administrator of ScholarWorks@University of Baltimore School of Law. For more information, please contact [email protected]. ARTICLES THE THREE TYPES OF COLLUSION: FIXING PRICES, RIVALS, AND RULES ROBERTH. LANDE * & HOWARDP. MARVEL** Antitrust law has long held collusion to be paramount among the offenses that it is charged with prohibiting. The reason for this prohibition is simple----collusion typically leads to monopoly-like outcomes, including monopoly profits that are shared by the colluding parties. Most collusion cases can be classified into two established general categories.) Classic, or "Type I" collusion involves collective action to raise price directly? Firms can also collude to disadvantage rivals in a manner that causes the rivals' output to diminish or causes their behavior to become chastened. This "Type 11" collusion in turn allows the colluding firms to raise prices.3 Many important collusion cases, however, do not fit into either of these categories.
    [Show full text]
  • Kranton Duke University
    The Devil is in the Details – Implications of Samuel Bowles’ The Moral Economy for economics and policy research October 13 2017 Rachel Kranton Duke University The Moral Economy by Samuel Bowles should be required reading by all graduate students in economics. Indeed, all economists should buy a copy and read it. The book is a stunning, critical discussion of the interplay between economic incentives and preferences. It challenges basic premises of economic theory and questions policy recommendations based on these theories. The book proposes the path forward: designing policy that combines incentives and moral appeals. And, therefore, like such as book should, The Moral Economy leaves us with much work to do. The Moral Economy concerns individual choices and economic policy, particularly microeconomic policies with goals to enhance the collective good. The book takes aim at laws, policies, and business practices that are based on the classic Homo economicus model of individual choice. The book first argues in great detail that policies that follow from the Homo economicus paradigm can backfire. While most economists would now recognize that people are not purely selfish and self-interested, The Moral Economy goes one step further. Incentives can amplify the selfishness of individuals. People might act in more self-interested ways in a system based on incentives and rewards than they would in the absence of such inducements. The Moral Economy warns economists to be especially wary of incentives because social norms, like norms of trust and honesty, are critical to economic activity. The danger is not only of incentives backfiring in a single instance; monetary incentives can generally erode ethical and moral codes and social motivations people can have towards each other.
    [Show full text]
  • 8. Maxmin and Minmax Strategies
    CMSC 474, Introduction to Game Theory 8. Maxmin and Minmax Strategies Mohammad T. Hajiaghayi University of Maryland Outline Chapter 2 discussed two solution concepts: Pareto optimality and Nash equilibrium Chapter 3 discusses several more: Maxmin and Minmax Dominant strategies Correlated equilibrium Trembling-hand perfect equilibrium e-Nash equilibrium Evolutionarily stable strategies Worst-Case Expected Utility For agent i, the worst-case expected utility of a strategy si is the minimum over all possible Husband Opera Football combinations of strategies for the other agents: Wife min u s ,s Opera 2, 1 0, 0 s-i i ( i -i ) Football 0, 0 1, 2 Example: Battle of the Sexes Wife’s strategy sw = {(p, Opera), (1 – p, Football)} Husband’s strategy sh = {(q, Opera), (1 – q, Football)} uw(p,q) = 2pq + (1 – p)(1 – q) = 3pq – p – q + 1 We can write uw(p,q) For any fixed p, uw(p,q) is linear in q instead of uw(sw , sh ) • e.g., if p = ½, then uw(½,q) = ½ q + ½ 0 ≤ q ≤ 1, so the min must be at q = 0 or q = 1 • e.g., minq (½ q + ½) is at q = 0 minq uw(p,q) = min (uw(p,0), uw(p,1)) = min (1 – p, 2p) Maxmin Strategies Also called maximin A maxmin strategy for agent i A strategy s1 that makes i’s worst-case expected utility as high as possible: argmaxmin ui (si,s-i ) si s-i This isn’t necessarily unique Often it is mixed Agent i’s maxmin value, or security level, is the maxmin strategy’s worst-case expected utility: maxmin ui (si,s-i ) si s-i For 2 players it simplifies to max min u1s1, s2 s1 s2 Example Wife’s and husband’s strategies
    [Show full text]
  • Subgame Perfect (-)Equilibrium in Perfect Information Games
    Subgame perfect (-)equilibrium in perfect information games J´anosFlesch∗ April 15, 2016 Abstract We discuss recent results on the existence and characterization of subgame perfect ({)equilibrium in perfect information games. The game. We consider games with perfect information and deterministic transitions. Such games can be given by a directed tree.1 In this tree, each node is associated with a player, who controls this node. The outgoing arcs at this node represent the actions available to this player at this node. We assume that each node has at least one successor, rather than having terminal nodes.2 Play of the game starts at the root. At any node z that play visits, the player who controls z has to choose one of the actions at z, which brings play to a next node. This induces an infinite path in the tree from the root, which we call a play. Depending on this play, each player receives a payoff. Note that these payoffs are fairly general. This setup encompasses the case when the actions induce instantaneous rewards which are then aggregated into a payoff, possibly by taking the total discounted sum or the long-term average. It also includes payoff functions considered in the literature of computer science (reachability games, etc.). Subgame-perfect (-)equilibrium. We focus on pure strategies for the players. A central solution concept in such games is subgame-perfect equilibrium, which is a strategy profile that induces a Nash equilibrium in every subgame, i.e. when starting at any node in the tree, no player has an incentive to deviate individually from his continuation strategy.
    [Show full text]
  • Subgame-Perfect Equilibria in Mean-Payoff Games
    Subgame-perfect Equilibria in Mean-payoff Games Léonard Brice ! Université Gustave Eiffel, France Jean-François Raskin ! Université Libre de Bruxelles, Belgium Marie van den Bogaard ! Université Gustave Eiffel, France Abstract In this paper, we provide an effective characterization of all the subgame-perfect equilibria in infinite duration games played on finite graphs with mean-payoff objectives. To this end, we introduce the notion of requirement, and the notion of negotiation function. We establish that the plays that are supported by SPEs are exactly those that are consistent with the least fixed point of the negotiation function. Finally, we show that the negotiation function is piecewise linear, and can be analyzed using the linear algebraic tool box. As a corollary, we prove the decidability of the SPE constrained existence problem, whose status was left open in the literature. 2012 ACM Subject Classification Software and its engineering: Formal methods; Theory of compu- tation: Logic and verification; Theory of computation: Solution concepts in game theory. Keywords and phrases Games on graphs, subgame-perfect equilibria, mean-payoff objectives. Digital Object Identifier 10.4230/LIPIcs... 1 Introduction The notion of Nash equilibrium (NE) is one of the most important and most studied solution concepts in game theory. A profile of strategies is an NE when no rational player has an incentive to change their strategy unilaterally, i.e. while the other players keep their strategies. Thus an NE models a stable situation. Unfortunately, it is well known that, in sequential games, NEs suffer from the problem of non-credible threats, see e.g. [18]. In those games, some NE only exists when some players do not play rationally in subgames and so use non-credible threats to force the NE.
    [Show full text]