Stochastic Games

Total Page:16

File Type:pdf, Size:1020Kb

Stochastic Games PERSPECTIVE PERSPECTIVE Stochastic games Eilon Solana,1 and Nicolas Vieilleb,1 aDepartment of Statistics and Operations Research, School of Mathematical Sciences, Tel Aviv University, Tel Aviv 6997800, Israel; and bDepartment of Economics and Decision Sciences, HEC Paris, 78 351 Jouy-en-Josas, France Edited by Jose A. Scheinkman, Columbia University, New York, NY, and approved September 22, 2015 (received for review July 9, 2015) In 1953, Lloyd Shapley contributed his paper “Stochastic games” to PNAS. In this paper, he defined the model of stochastic games, which were the first general dynamic model of a game to be defined, and proved that it admits a stationary equilibrium. In this Perspective, we summarize the historical context and the impact of Shapley’s contribution. game theory | stochastic games The 1950s were the decade in which game available actions. The collection of actions the first t stages, as well as the actions that the theory was shaped. After von Neumann and that the players choose, together with the players played in the first t − 1stages.A Morgenstern’s Theory of Games and Economic current state, determine the stage payoff that strategy of a player is a prescription how Behavior (1) was published in 1944, a group of each player receives, as well as a probability to play the game; that is, a function that assigns young and bright researchers started working distribution according to which the new state to every finite history an action to play should on game theory, each of whom published pa- that the play will visit. that history occur. A behavior strategy of a pers that opened new areas. In 1950, John Stochastic games extend the model of stra- player is a function that assigns to every finite Nash published two papers, one on the con- tegic-form games, which is attributable to von history a lottery over the set of available actions. cept of Nash equilibrium (2) and the other on Neumann (8), to dynamic situations in which A zero-sum game is defined to have a the bargaining problem (3). In 1953, Lloyd the environment changes in response to the “value” v if (i) player 1 has a strategy (which Shapley published two papers, one on stochas- players’ choices. They also extend the model is then said to be “optimal”), which ensures tic games (4) and the other on the Shapley of Markov decision problems, which was de- that his expected overall payoff over time value for coalitional games (5). In 1957, veloped by several researchers at the RAND does not fall below v,nomatterwhatisthe Harold Kuhn (6) provided a formulation Corporation in 1949–1952, to competitive sit- strategy followed by player 2, and (ii)ifthe for extensive-form games, and Duncan Luce uations with more than one decision maker. symmetric property holds when exchanging and Howard Raiffa published their book The complexity of stochastic games stems Games and Decisions: Introduction and Critical the roles of the two players. Shapley proved fromthefactthatthechoicesmadebythe the existence of a value. Because the param- Survey (7). These researchers and others, like players have two, sometimes contradictory, eters that define the game are independent Kenneth Arrow, Richard Bellman, and David effects. First, together with the current state, of time, the situation that the players face if Blackwell, interacted, worked and played to- ’ the players actions determine the immediate today the play is in a certain state is the same gether, exchanged ideas, and developed the payoff that each player receives. Second, the situation they would face tomorrow if to- theory of dynamic games as we know it today. current state and the players’ actions influence While in graduate school at Princeton, morrow the play is in that state. In particular, the choice of the new state, which determines one expects to have optimal strategies that Shapley befriended Martin Shubik and John the potential of future payoffs. In particular, Nash. The former became a well-known are stationary Markov, that is, they depend when choosing his actions, each player has to only on the current state of the game. Shapley economist, and the latter defined the concept balance these forces, a decision that may often of Nash equilibrium, the most prevalent proved that indeed such optimal strategies be difficult. Although this dichotomy is also solution concept in game theory and eco- exist, and characterized the value as the present in one-player sequential decision nomictheorytodate.In2013,Shapleywas unique fixed point of a nonlinear functional problems, the presence of additional players awarded the Nobel Prize in economics, 8 y operator—a two-player version of the dynamic who maximize their own goals adds com- after Nash was awarded the prize. programming principle. plexity to the analysis of the situation. The existence of stationary Markov opti- ’ 1. Stochastic Games Shapley s paper in PNAS introduced the mal strategies implies that, to play well, a Stochastic games model dynamic interactions model of stochastic games with positive player needs to know only the current state. in which the environment changes in response stopping probabilities, assuming a finite set In particular, the value of the game does not ’ ’ “ to players behavior. In Shapley swords, In a of states, and a finite set of possible actions change if players receive partial information stochastic game the play proceeds by steps for each player in each state. In this model, ’ from position to position, according to tran- thecurrentstateandtheplayers actions Author contributions: E.S. and N.V. wrote the paper. sition probabilities controlled jointly by the determine the probability that the game will The authors declare no conflict of interest. two players” (4). terminate once the current stage is over. His This article is a PNAS Direct Submission. A stochastic game is played by a set of paper deals with two-player zero-sum games, This article is part of the special series of PNAS 100th Anniversary players. In each stage of the game, the play is so that the gain of player 1 is always equal to articles to commemorate exceptional research published in PNAS ’ over the last century. See the companion article, “Stochastic in a given state (or position, in Shapley slan- the loss of player 2. ” t Games on page 1095 in issue 10 of volume 39. guage), taken from a set of states, and every A history of length in a stochastic game is 1To whom correspondence may be addressed. Email: eilons@post. player chooses an action from a set of the sequence of states that the game visited in tau.ac.il or [email protected]. www.pnas.org/cgi/doi/10.1073/pnas.1513508112 PNAS | November 10, 2015 | vol. 112 | no. 45 | 13743–13746 Downloaded by guest on September 26, 2021 on each other’s actions, and/or if they forget initial wealth, repeatedly play a zero-sum game on algebraic properties of the value of the previously visited states. that affects their wealth, until one player is discounted game, as a function of the dis- Inthefollowingparagraphs,webriefly ruined and declared loser. Everett (18) studied count factor, proven by Bewley and Kohlberg describe some of the directions in which the the so-called “recursive games,” in which ter- (22). One can view the monetary policy of theory has developed following Shapley’s mination probabilities are endogenous, and a the central bank, that of changing the interest groundbreaking paper. payoff is only received upon termination of the rate from time to time, as an implementation game, as a function of the terminating state. of this type of strategy. 2. Discounted Stochastic Games Gillette (19) introduced a deceptively simple- The theory of the non–zero-sum case has ’ Shapley s model is equivalent to one in which looking game, inspired by playground prac- flourished in the last two decades but has not players discount their future payoffs accord- tice, whose analysis resisted all attempts until been settled yet. A collection of strategies, one ing to a discount factor that depends on the Blackwell and Ferguson (20) provided its for each player, is a uniform «-equilibrium if current state and on the players’ actions. The solution. bydeviatingnoplayercanprofitmorethan«, game is called “discounted” if all stopping In a different but related framework, provided the game is sufficiently long or the probabilities equal the same constant, and one Aumann, Maschler, and Stearns (ref. 21, discount factor sufficiently close to 1, and if minusthisconstantiscalledthe“discount reprinted in 1995; see pp. 130–139) conceptu- the payoff vector is almost independent of the factor.” Models of discounted stochastic games alized a notion of a uniform solution to duration of the game. Existence of uniform are prevalent in economics, where the discount infinite-horizon games, in which payoffs are «-equilibria has been established in few cases: factor has a clear economic interpretation. received in each round. This approach views Vieille (25, 26) for two-player games, Solan A large literature followed, relaxing both the the infinite-horizon game as an idealized (27) for three-player games in which the state zero-sum and the finiteness assumptions of model for long-run interactions, in which the of the game changes at most once, and Solan Shapley’spaper.Innon–zero-sum games, a duration of the game is sufficiently long and/or and Vieille (28), Flesch et al. (29), and Simon collection of strategies, one for each player, is a the discount factor is sufficiently close to 1, and (30) in other classes of stochastic games. The “(Nash) equilibrium” if no player can profit by both are not perfectly known. It aims at pro- question in its most generality remains open. deviating from his strategy, assuming all other viding solutions whose optimality properties A weaker concept than Nash equilibrium players follow their prescribed strategies.
Recommended publications
  • Frequently Asked Questions in Mathematics
    Frequently Asked Questions in Mathematics The Sci.Math FAQ Team. Editor: Alex L´opez-Ortiz e-mail: [email protected] Contents 1 Introduction 4 1.1 Why a list of Frequently Asked Questions? . 4 1.2 Frequently Asked Questions in Mathematics? . 4 2 Fundamentals 5 2.1 Algebraic structures . 5 2.1.1 Monoids and Groups . 6 2.1.2 Rings . 7 2.1.3 Fields . 7 2.1.4 Ordering . 8 2.2 What are numbers? . 9 2.2.1 Introduction . 9 2.2.2 Construction of the Number System . 9 2.2.3 Construction of N ............................... 10 2.2.4 Construction of Z ................................ 10 2.2.5 Construction of Q ............................... 11 2.2.6 Construction of R ............................... 11 2.2.7 Construction of C ............................... 12 2.2.8 Rounding things up . 12 2.2.9 What’s next? . 12 3 Number Theory 14 3.1 Fermat’s Last Theorem . 14 3.1.1 History of Fermat’s Last Theorem . 14 3.1.2 What is the current status of FLT? . 14 3.1.3 Related Conjectures . 15 3.1.4 Did Fermat prove this theorem? . 16 3.2 Prime Numbers . 17 3.2.1 Largest known Mersenne prime . 17 3.2.2 Largest known prime . 17 3.2.3 Largest known twin primes . 18 3.2.4 Largest Fermat number with known factorization . 18 3.2.5 Algorithms to factor integer numbers . 18 3.2.6 Primality Testing . 19 3.2.7 List of record numbers . 20 3.2.8 What is the current status on Mersenne primes? .
    [Show full text]
  • Introduction to Computational Social Choice
    1 Introduction to Computational Social Choice Felix Brandta, Vincent Conitzerb, Ulle Endrissc, J´er^omeLangd, and Ariel D. Procacciae 1.1 Computational Social Choice at a Glance Social choice theory is the field of scientific inquiry that studies the aggregation of individual preferences towards a collective choice. For example, social choice theorists|who hail from a range of different disciplines, including mathematics, economics, and political science|are interested in the design and theoretical evalu- ation of voting rules. Questions of social choice have stimulated intellectual thought for centuries. Over time the topic has fascinated many a great mind, from the Mar- quis de Condorcet and Pierre-Simon de Laplace, through Charles Dodgson (better known as Lewis Carroll, the author of Alice in Wonderland), to Nobel Laureates such as Kenneth Arrow, Amartya Sen, and Lloyd Shapley. Computational social choice (COMSOC), by comparison, is a very young field that formed only in the early 2000s. There were, however, a few precursors. For instance, David Gale and Lloyd Shapley's algorithm for finding stable matchings between two groups of people with preferences over each other, dating back to 1962, truly had a computational flavor. And in the late 1980s, a series of papers by John Bartholdi, Craig Tovey, and Michael Trick showed that, on the one hand, computational complexity, as studied in theoretical computer science, can serve as a barrier against strategic manipulation in elections, but on the other hand, it can also prevent the efficient use of some voting rules altogether. Around the same time, a research group around Bernard Monjardet and Olivier Hudry also started to study the computational complexity of preference aggregation procedures.
    [Show full text]
  • Stable Matching: Theory, Evidence, and Practical Design
    THE PRIZE IN ECONOMIC SCIENCES 2012 INFORMATION FOR THE PUBLIC Stable matching: Theory, evidence, and practical design This year’s Prize to Lloyd Shapley and Alvin Roth extends from abstract theory developed in the 1960s, over empirical work in the 1980s, to ongoing efforts to fnd practical solutions to real-world prob- lems. Examples include the assignment of new doctors to hospitals, students to schools, and human organs for transplant to recipients. Lloyd Shapley made the early theoretical contributions, which were unexpectedly adopted two decades later when Alvin Roth investigated the market for U.S. doctors. His fndings generated further analytical developments, as well as practical design of market institutions. Traditional economic analysis studies markets where prices adjust so that supply equals demand. Both theory and practice show that markets function well in many cases. But in some situations, the standard market mechanism encounters problems, and there are cases where prices cannot be used at all to allocate resources. For example, many schools and universities are prevented from charging tuition fees and, in the case of human organs for transplants, monetary payments are ruled out on ethical grounds. Yet, in these – and many other – cases, an allocation has to be made. How do such processes actually work, and when is the outcome efcient? Matching theory The Gale-Shapley algorithm Analysis of allocation mechanisms relies on a rather abstract idea. If rational people – who know their best interests and behave accordingly – simply engage in unrestricted mutual trade, then the outcome should be efcient. If it is not, some individuals would devise new trades that made them better of.
    [Show full text]
  • A New Approach for Dynamic Stochastic Fractal Search with Fuzzy Logic for Parameter Adaptation
    fractal and fractional Article A New Approach for Dynamic Stochastic Fractal Search with Fuzzy Logic for Parameter Adaptation Marylu L. Lagunes, Oscar Castillo * , Fevrier Valdez , Jose Soria and Patricia Melin Tijuana Institute of Technology, Calzada Tecnologico s/n, Fracc. Tomas Aquino, Tijuana 22379, Mexico; [email protected] (M.L.L.); [email protected] (F.V.); [email protected] (J.S.); [email protected] (P.M.) * Correspondence: [email protected] Abstract: Stochastic fractal search (SFS) is a novel method inspired by the process of stochastic growth in nature and the use of the fractal mathematical concept. Considering the chaotic stochastic diffusion property, an improved dynamic stochastic fractal search (DSFS) optimization algorithm is presented. The DSFS algorithm was tested with benchmark functions, such as the multimodal, hybrid, and composite functions, to evaluate the performance of the algorithm with dynamic parameter adaptation with type-1 and type-2 fuzzy inference models. The main contribution of the article is the utilization of fuzzy logic in the adaptation of the diffusion parameter in a dynamic fashion. This parameter is in charge of creating new fractal particles, and the diversity and iteration are the input information used in the fuzzy system to control the values of diffusion. Keywords: fractal search; fuzzy logic; parameter adaptation; CEC 2017 Citation: Lagunes, M.L.; Castillo, O.; Valdez, F.; Soria, J.; Melin, P. A New Approach for Dynamic Stochastic 1. Introduction Fractal Search with Fuzzy Logic for Metaheuristic algorithms are applied to optimization problems due to their charac- Parameter Adaptation. Fractal Fract. teristics that help in searching for the global optimum, while simple heuristics are mostly 2021, 5, 33.
    [Show full text]
  • Stochastic Game Theory Applications for Power Management in Cognitive Networks
    STOCHASTIC GAME THEORY APPLICATIONS FOR POWER MANAGEMENT IN COGNITIVE NETWORKS A thesis submitted to Kent State University in partial fulfillment of the requirements for the degree of Master of Digital Science by Sham Fung Feb 2014 Thesis written by Sham Fung M.D.S, Kent State University, 2014 Approved by Advisor , Director, School of Digital Science ii TABLE OF CONTENTS LIST OF FIGURES . vi LIST OF TABLES . vii Acknowledgments . viii Dedication . ix 1 BACKGROUND OF COGNITIVE NETWORK . 1 1.1 Motivation and Requirements . 1 1.2 Definition of Cognitive Networks . 2 1.3 Key Elements of Cognitive Networks . 3 1.3.1 Learning and Reasoning . 3 1.3.2 Cognitive Network as a Multi-agent System . 4 2 POWER ALLOCATION IN COGNITIVE NETWORKS . 5 2.1 Overview . 5 2.2 Two Communication Paradigms . 6 2.3 Power Allocation Scheme . 8 2.3.1 Interference Constraints for Primary Users . 8 2.3.2 QoS Constraints for Secondary Users . 9 2.4 Related Works on Power Management in Cognitive Networks . 10 iii 3 GAME THEORY|PRELIMINARIES AND RELATED WORKS . 12 3.1 Non-cooperative Game . 14 3.1.1 Nash Equilibrium . 14 3.1.2 Other Equilibriums . 16 3.2 Cooperative Game . 16 3.2.1 Bargaining Game . 17 3.2.2 Coalition Game . 17 3.2.3 Solution Concept . 18 3.3 Stochastic Game . 19 3.4 Other Types of Games . 20 4 GAME THEORETICAL APPROACH ON POWER ALLOCATION . 23 4.1 Two Fundamental Types of Utility Functions . 23 4.1.1 QoS-Based Game . 23 4.1.2 Linear Pricing Game .
    [Show full text]
  • Introduction to Stochastic Processes - Lecture Notes (With 33 Illustrations)
    Introduction to Stochastic Processes - Lecture Notes (with 33 illustrations) Gordan Žitković Department of Mathematics The University of Texas at Austin Contents 1 Probability review 4 1.1 Random variables . 4 1.2 Countable sets . 5 1.3 Discrete random variables . 5 1.4 Expectation . 7 1.5 Events and probability . 8 1.6 Dependence and independence . 9 1.7 Conditional probability . 10 1.8 Examples . 12 2 Mathematica in 15 min 15 2.1 Basic Syntax . 15 2.2 Numerical Approximation . 16 2.3 Expression Manipulation . 16 2.4 Lists and Functions . 17 2.5 Linear Algebra . 19 2.6 Predefined Constants . 20 2.7 Calculus . 20 2.8 Solving Equations . 22 2.9 Graphics . 22 2.10 Probability Distributions and Simulation . 23 2.11 Help Commands . 24 2.12 Common Mistakes . 25 3 Stochastic Processes 26 3.1 The canonical probability space . 27 3.2 Constructing the Random Walk . 28 3.3 Simulation . 29 3.3.1 Random number generation . 29 3.3.2 Simulation of Random Variables . 30 3.4 Monte Carlo Integration . 33 4 The Simple Random Walk 35 4.1 Construction . 35 4.2 The maximum . 36 1 CONTENTS 5 Generating functions 40 5.1 Definition and first properties . 40 5.2 Convolution and moments . 42 5.3 Random sums and Wald’s identity . 44 6 Random walks - advanced methods 48 6.1 Stopping times . 48 6.2 Wald’s identity II . 50 6.3 The distribution of the first hitting time T1 .......................... 52 6.3.1 A recursive formula . 52 6.3.2 Generating-function approach .
    [Show full text]
  • The Portfolio Optimization Performance During Malaysia's
    Modern Applied Science; Vol. 14, No. 4; 2020 ISSN 1913-1844 E-ISSN 1913-1852 Published by Canadian Center of Science and Education The Portfolio Optimization Performance during Malaysia’s 2018 General Election by Using Noncooperative and Cooperative Game Theory Approach Muhammad Akram Ramadhan bin Ibrahim1, Pah Chin Hee1, Mohd Aminul Islam1 & Hafizah Bahaludin1 1Department of Computational and Theoretical Sciences Kulliyyah of Science International Islamic University Malaysia, 25200 Kuantan, Pahang, Malaysia Correspondence: Pah Chin Hee, Department of Computational and Theoretical Sciences, Kulliyyah of Science, International Islamic University Malaysia, 25200 Kuantan, Pahang, Malaysia. Received: February 10, 2020 Accepted: February 28, 2020 Online Published: March 4, 2020 doi:10.5539/mas.v14n4p1 URL: https://doi.org/10.5539/mas.v14n4p1 Abstract Game theory approach is used in this study that involves two types of games which are noncooperative and cooperative. Noncooperative game is used to get the equilibrium solutions from each payoff matrix. From the solutions, the values then be used as characteristic functions of Shapley value solution concept in cooperative game. In this paper, the sectors are divided into three groups where each sector will have three different stocks for the game. This study used the companies that listed in Bursa Malaysia and the prices of each stock listed in this research obtained from Datastream. The rate of return of stocks are considered as an input to get the payoff from each stock and its coalition sectors. The value of game for each sector is obtained using Shapley value solution concepts formula to find the optimal increase of the returns. The Shapley optimal portfolio, naive diversification portfolio and market portfolio performances have been calculated by using Sharpe ratio.
    [Show full text]
  • Subgame-Perfect Ε-Equilibria in Perfect Information Games With
    Subgame-Perfect -Equilibria in Perfect Information Games with Common Preferences at the Limit Citation for published version (APA): Flesch, J., & Predtetchinski, A. (2016). Subgame-Perfect -Equilibria in Perfect Information Games with Common Preferences at the Limit. Mathematics of Operations Research, 41(4), 1208-1221. https://doi.org/10.1287/moor.2015.0774 Document status and date: Published: 01/11/2016 DOI: 10.1287/moor.2015.0774 Document Version: Publisher's PDF, also known as Version of record Document license: Taverne Please check the document version of this publication: • A submitted manuscript is the version of the article upon submission and before peer-review. There can be important differences between the submitted version and the official published version of record. People interested in the research are advised to contact the author for the final version of the publication, or visit the DOI to the publisher's website. • The final author version and the galley proof are versions of the publication after peer review. • The final published version features the final layout of the paper including the volume, issue and page numbers. Link to publication General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain • You may freely distribute the URL identifying the publication in the public portal.
    [Show full text]
  • Introduction to Computational Social Choice Felix Brandt, Vincent Conitzer, Ulle Endriss, Jérôme Lang, Ariel D
    Introduction to Computational Social Choice Felix Brandt, Vincent Conitzer, Ulle Endriss, Jérôme Lang, Ariel D. Procaccia To cite this version: Felix Brandt, Vincent Conitzer, Ulle Endriss, Jérôme Lang, Ariel D. Procaccia. Introduction to Computational Social Choice. Felix Brandt, Vincent Conitzer, Ulle Endriss, Jérôme Lang, Ariel D. Procaccia, Handbook of Computational Social Choice, Cambridge University Press, pp.1-29, 2016, 9781107446984. 10.1017/CBO9781107446984. hal-01493516 HAL Id: hal-01493516 https://hal.archives-ouvertes.fr/hal-01493516 Submitted on 21 Mar 2017 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. 1 Introduction to Computational Social Choice Felix Brandta, Vincent Conitzerb, Ulle Endrissc, J´er^omeLangd, and Ariel D. Procacciae 1.1 Computational Social Choice at a Glance Social choice theory is the field of scientific inquiry that studies the aggregation of individual preferences towards a collective choice. For example, social choice theorists|who hail from a range of different disciplines, including mathematics, economics, and political science|are interested in the design and theoretical evalu- ation of voting rules. Questions of social choice have stimulated intellectual thought for centuries. Over time the topic has fascinated many a great mind, from the Mar- quis de Condorcet and Pierre-Simon de Laplace, through Charles Dodgson (better known as Lewis Carroll, the author of Alice in Wonderland), to Nobel Laureates such as Kenneth Arrow, Amartya Sen, and Lloyd Shapley.
    [Show full text]
  • 1 Stochastic Processes and Their Classification
    1 1 STOCHASTIC PROCESSES AND THEIR CLASSIFICATION 1.1 DEFINITION AND EXAMPLES Definition 1. Stochastic process or random process is a collection of random variables ordered by an index set. ☛ Example 1. Random variables X0;X1;X2;::: form a stochastic process ordered by the discrete index set f0; 1; 2;::: g: Notation: fXn : n = 0; 1; 2;::: g: ☛ Example 2. Stochastic process fYt : t ¸ 0g: with continuous index set ft : t ¸ 0g: The indices n and t are often referred to as "time", so that Xn is a descrete-time process and Yt is a continuous-time process. Convention: the index set of a stochastic process is always infinite. The range (possible values) of the random variables in a stochastic process is called the state space of the process. We consider both discrete-state and continuous-state processes. Further examples: ☛ Example 3. fXn : n = 0; 1; 2;::: g; where the state space of Xn is f0; 1; 2; 3; 4g representing which of four types of transactions a person submits to an on-line data- base service, and time n corresponds to the number of transactions submitted. ☛ Example 4. fXn : n = 0; 1; 2;::: g; where the state space of Xn is f1; 2g re- presenting whether an electronic component is acceptable or defective, and time n corresponds to the number of components produced. ☛ Example 5. fYt : t ¸ 0g; where the state space of Yt is f0; 1; 2;::: g representing the number of accidents that have occurred at an intersection, and time t corresponds to weeks. ☛ Example 6. fYt : t ¸ 0g; where the state space of Yt is f0; 1; 2; : : : ; sg representing the number of copies of a software product in inventory, and time t corresponds to days.
    [Show full text]
  • Deterministic and Stochastic Prisoner's Dilemma Games: Experiments in Interdependent Security
    NBER TECHNICAL WORKING PAPER SERIES DETERMINISTIC AND STOCHASTIC PRISONER'S DILEMMA GAMES: EXPERIMENTS IN INTERDEPENDENT SECURITY Howard Kunreuther Gabriel Silvasi Eric T. Bradlow Dylan Small Technical Working Paper 341 http://www.nber.org/papers/t0341 NATIONAL BUREAU OF ECONOMIC RESEARCH 1050 Massachusetts Avenue Cambridge, MA 02138 August 2007 We appreciate helpful discussions in designing the experiments and comments on earlier drafts of this paper by Colin Camerer, Vince Crawford, Rachel Croson, Robyn Dawes, Aureo DePaula, Geoff Heal, Charles Holt, David Krantz, Jack Ochs, Al Roth and Christian Schade. We also benefited from helpful comments from participants at the Workshop on Interdependent Security at the University of Pennsylvania (May 31-June 1 2006) and at the Workshop on Innovation and Coordination at Humboldt University (December 18-20, 2006). We thank George Abraham and Usmann Hassan for their help in organizing the data from the experiments. Support from NSF Grant CMS-0527598 and the Wharton Risk Management and Decision Processes Center is gratefully acknowledged. The views expressed herein are those of the author(s) and do not necessarily reflect the views of the National Bureau of Economic Research. © 2007 by Howard Kunreuther, Gabriel Silvasi, Eric T. Bradlow, and Dylan Small. All rights reserved. Short sections of text, not to exceed two paragraphs, may be quoted without explicit permission provided that full credit, including © notice, is given to the source. Deterministic and Stochastic Prisoner's Dilemma Games: Experiments in Interdependent Security Howard Kunreuther, Gabriel Silvasi, Eric T. Bradlow, and Dylan Small NBER Technical Working Paper No. 341 August 2007 JEL No. C11,C12,C22,C23,C73,C91 ABSTRACT This paper examines experiments on interdependent security prisoner's dilemma games with repeated play.
    [Show full text]
  • Stochastic Games with Hidden States∗
    Stochastic Games with Hidden States¤ Yuichi Yamamoto† First Draft: March 29, 2014 This Version: January 14, 2015 Abstract This paper studies infinite-horizon stochastic games in which players ob- serve noisy public information about a hidden state each period. We find that if the game is connected, the limit feasible payoff set exists and is invariant to the initial prior about the state. Building on this invariance result, we pro- vide a recursive characterization of the equilibrium payoff set and establish the folk theorem. We also show that connectedness can be replaced with an even weaker condition, called asymptotic connectedness. Asymptotic con- nectedness is satisfied for generic signal distributions, if the state evolution is irreducible. Journal of Economic Literature Classification Numbers: C72, C73. Keywords: stochastic game, hidden state, connectedness, stochastic self- generation, folk theorem. ¤The author thanks Naoki Aizawa, Drew Fudenberg, Johannes Horner,¨ Atsushi Iwasaki, Michi- hiro Kandori, George Mailath, Takeaki Sunada, and Masatoshi Tsumagari for helpful conversa- tions, and seminar participants at various places. †Department of Economics, University of Pennsylvania. Email: [email protected] 1 1 Introduction When agents have a long-run relationship, underlying economic conditions may change over time. A leading example is a repeated Bertrand competition with stochastic demand shocks. Rotemberg and Saloner (1986) explore optimal col- lusive pricing when random demand shocks are i.i.d. each period. Haltiwanger and Harrington (1991), Kandori (1991), and Bagwell and Staiger (1997) further extend the analysis to the case in which demand fluctuations are cyclic or persis- tent. One of the crucial assumptions of these papers is that demand shocks are publicly observable before firms make their decisions in each period.
    [Show full text]