Lecture Notes: GAME THEORY
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Lecture 4 Rationalizability & Nash Equilibrium Road
Lecture 4 Rationalizability & Nash Equilibrium 14.12 Game Theory Muhamet Yildiz Road Map 1. Strategies – completed 2. Quiz 3. Dominance 4. Dominant-strategy equilibrium 5. Rationalizability 6. Nash Equilibrium 1 Strategy A strategy of a player is a complete contingent-plan, determining which action he will take at each information set he is to move (including the information sets that will not be reached according to this strategy). Matching pennies with perfect information 2’s Strategies: HH = Head if 1 plays Head, 1 Head if 1 plays Tail; HT = Head if 1 plays Head, Head Tail Tail if 1 plays Tail; 2 TH = Tail if 1 plays Head, 2 Head if 1 plays Tail; head tail head tail TT = Tail if 1 plays Head, Tail if 1 plays Tail. (-1,1) (1,-1) (1,-1) (-1,1) 2 Matching pennies with perfect information 2 1 HH HT TH TT Head Tail Matching pennies with Imperfect information 1 2 1 Head Tail Head Tail 2 Head (-1,1) (1,-1) head tail head tail Tail (1,-1) (-1,1) (-1,1) (1,-1) (1,-1) (-1,1) 3 A game with nature Left (5, 0) 1 Head 1/2 Right (2, 2) Nature (3, 3) 1/2 Left Tail 2 Right (0, -5) Mixed Strategy Definition: A mixed strategy of a player is a probability distribution over the set of his strategies. Pure strategies: Si = {si1,si2,…,sik} σ → A mixed strategy: i: S [0,1] s.t. σ σ σ i(si1) + i(si2) + … + i(sik) = 1. If the other players play s-i =(s1,…, si-1,si+1,…,sn), then σ the expected utility of playing i is σ σ σ i(si1)ui(si1,s-i) + i(si2)ui(si2,s-i) + … + i(sik)ui(sik,s-i). -
Lecture Notes
GRADUATE GAME THEORY LECTURE NOTES BY OMER TAMUZ California Institute of Technology 2018 Acknowledgments These lecture notes are partially adapted from Osborne and Rubinstein [29], Maschler, Solan and Zamir [23], lecture notes by Federico Echenique, and slides by Daron Acemoglu and Asu Ozdaglar. I am indebted to Seo Young (Silvia) Kim and Zhuofang Li for their help in finding and correcting many errors. Any comments or suggestions are welcome. 2 Contents 1 Extensive form games with perfect information 7 1.1 Tic-Tac-Toe ........................................ 7 1.2 The Sweet Fifteen Game ................................ 7 1.3 Chess ............................................ 7 1.4 Definition of extensive form games with perfect information ........... 10 1.5 The ultimatum game .................................. 10 1.6 Equilibria ......................................... 11 1.7 The centipede game ................................... 11 1.8 Subgames and subgame perfect equilibria ...................... 13 1.9 The dollar auction .................................... 14 1.10 Backward induction, Kuhn’s Theorem and a proof of Zermelo’s Theorem ... 15 2 Strategic form games 17 2.1 Definition ......................................... 17 2.2 Nash equilibria ...................................... 17 2.3 Classical examples .................................... 17 2.4 Dominated strategies .................................. 22 2.5 Repeated elimination of dominated strategies ................... 22 2.6 Dominant strategies .................................. -
The Monty Hall Problem in the Game Theory Class
The Monty Hall Problem in the Game Theory Class Sasha Gnedin∗ October 29, 2018 1 Introduction Suppose you’re on a game show, and you’re given the choice of three doors: Behind one door is a car; behind the others, goats. You pick a door, say No. 1, and the host, who knows what’s behind the doors, opens another door, say No. 3, which has a goat. He then says to you, “Do you want to pick door No. 2?” Is it to your advantage to switch your choice? With these famous words the Parade Magazine columnist vos Savant opened an exciting chapter in mathematical didactics. The puzzle, which has be- come known as the Monty Hall Problem (MHP), has broken the records of popularity among the probability paradoxes. The book by Rosenhouse [9] and the Wikipedia entry on the MHP present the history and variations of the problem. arXiv:1107.0326v1 [math.HO] 1 Jul 2011 In the basic version of the MHP the rules of the game specify that the host must always reveal one of the unchosen doors to show that there is no prize there. Two remaining unrevealed doors may hide the prize, creating the illusion of symmetry and suggesting that the action does not matter. How- ever, the symmetry is fallacious, and switching is a better action, doubling the probability of winning. There are two main explanations of the paradox. One of them, simplistic, amounts to just counting the mutually exclusive cases: either you win with ∗[email protected] 1 switching or with holding the first choice. -
More Experiments on Game Theory
More Experiments on Game Theory Syngjoo Choi Spring 2010 Experimental Economics (ECON3020) Game theory 2 Spring2010 1/28 Playing Unpredictably In many situations there is a strategic advantage associated with being unpredictable. Experimental Economics (ECON3020) Game theory 2 Spring2010 2/28 Playing Unpredictably In many situations there is a strategic advantage associated with being unpredictable. Experimental Economics (ECON3020) Game theory 2 Spring2010 2/28 Mixed (or Randomized) Strategy In the penalty kick, Left (L) and Right (R) are the pure strategies of the kicker and the goalkeeper. A mixed strategy refers to a probabilistic mixture over pure strategies in which no single pure strategy is played all the time. e.g., kicking left with half of the time and right with the other half of the time. A penalty kick in a soccer game is one example of games of pure con‡icit, essentially called zero-sum games in which one player’s winning is the other’sloss. Experimental Economics (ECON3020) Game theory 2 Spring2010 3/28 The use of pure strategies will result in a loss. What about each player playing heads half of the time? Matching Pennies Games In a matching pennies game, each player uncovers a penny showing either heads or tails. One player takes both coins if the pennies match; otherwise, the other takes both coins. Left Right Top 1, 1, 1 1 Bottom 1, 1, 1 1 Does there exist a Nash equilibrium involving the use of pure strategies? Experimental Economics (ECON3020) Game theory 2 Spring2010 4/28 Matching Pennies Games In a matching pennies game, each player uncovers a penny showing either heads or tails. -
Incomplete Information I. Bayesian Games
Bayesian Games Debraj Ray, November 2006 Unlike the previous notes, the material here is perfectly standard and can be found in the usual textbooks: see, e.g., Fudenberg-Tirole. For the examples in these notes (except for the very last section), I draw heavily on Martin Osborne’s excellent recent text, An Introduction to Game Theory, Oxford University Press. Obviously, incomplete information games — in which one or more players are privy to infor- mation that others don’t have — has enormous applicability: credit markets / auctions / regulation of firms / insurance / bargaining /lemons / public goods provision / signaling / . the list goes on and on. 1. A Definition A Bayesian game consists of 1. A set of players N. 2. A set of states Ω, and a common prior µ on Ω. 3. For each player i a set of actions Ai and a set of signals or types Ti. (Can make actions sets depend on type realizations.) 4. For each player i, a mapping τi :Ω 7→ Ti. 5. For each player i, a vN-M payoff function fi : A × Ω 7→ R, where A is the product of the Ai’s. Remarks A. Typically, the mapping τi tells us player i’s type. The easiest case is just when Ω is the product of the Ti’s and τi is just the appropriate projection mapping. B. The prior on states need not be common, but one view is that there is no loss of generality in assuming it (simply put all differences in the prior into differences in signals and redefine the state space and si-mappings accordingly). -
8. Maxmin and Minmax Strategies
CMSC 474, Introduction to Game Theory 8. Maxmin and Minmax Strategies Mohammad T. Hajiaghayi University of Maryland Outline Chapter 2 discussed two solution concepts: Pareto optimality and Nash equilibrium Chapter 3 discusses several more: Maxmin and Minmax Dominant strategies Correlated equilibrium Trembling-hand perfect equilibrium e-Nash equilibrium Evolutionarily stable strategies Worst-Case Expected Utility For agent i, the worst-case expected utility of a strategy si is the minimum over all possible Husband Opera Football combinations of strategies for the other agents: Wife min u s ,s Opera 2, 1 0, 0 s-i i ( i -i ) Football 0, 0 1, 2 Example: Battle of the Sexes Wife’s strategy sw = {(p, Opera), (1 – p, Football)} Husband’s strategy sh = {(q, Opera), (1 – q, Football)} uw(p,q) = 2pq + (1 – p)(1 – q) = 3pq – p – q + 1 We can write uw(p,q) For any fixed p, uw(p,q) is linear in q instead of uw(sw , sh ) • e.g., if p = ½, then uw(½,q) = ½ q + ½ 0 ≤ q ≤ 1, so the min must be at q = 0 or q = 1 • e.g., minq (½ q + ½) is at q = 0 minq uw(p,q) = min (uw(p,0), uw(p,1)) = min (1 – p, 2p) Maxmin Strategies Also called maximin A maxmin strategy for agent i A strategy s1 that makes i’s worst-case expected utility as high as possible: argmaxmin ui (si,s-i ) si s-i This isn’t necessarily unique Often it is mixed Agent i’s maxmin value, or security level, is the maxmin strategy’s worst-case expected utility: maxmin ui (si,s-i ) si s-i For 2 players it simplifies to max min u1s1, s2 s1 s2 Example Wife’s and husband’s strategies -
(501B) Problem Set 5. Bayesian Games Suggested Solutions by Tibor Heumann
Dirk Bergemann Department of Economics Yale University Microeconomic Theory (501b) Problem Set 5. Bayesian Games Suggested Solutions by Tibor Heumann 1. (Market for Lemons) Here I ask that you work out some of the details in perhaps the most famous of all information economics models. By contrast to discussion in class, we give a complete formulation of the game. A seller is privately informed of the value v of the good that she sells to a buyer. The buyer's prior belief on v is uniformly distributed on [x; y] with 3 0 < x < y: The good is worth 2 v to the buyer. (a) Suppose the buyer proposes a price p and the seller either accepts or rejects p: If she accepts, the seller gets payoff p−v; and the buyer gets 3 2 v − p: If she rejects, the seller gets v; and the buyer gets nothing. Find the optimal offer that the buyer can make as a function of x and y: (b) Show that if the buyer and the seller are symmetrically informed (i.e. either both know v or neither party knows v), then trade takes place with probability 1. (c) Consider a simultaneous acceptance game in the model with private information as in part a, where a price p is announced and then the buyer and the seller simultaneously accept or reject trade at price p: The payoffs are as in part a. Find the p that maximizes the probability of trade. [SOLUTION] (a) We look for the subgame perfect equilibrium, thus we solve by back- ward induction. -
Oligopolistic Competition
Lecture 3: Oligopolistic competition EC 105. Industrial Organization Mattt Shum HSS, California Institute of Technology EC 105. Industrial Organization (Mattt Shum HSS,Lecture California 3: Oligopolistic Institute of competition Technology) 1 / 38 Oligopoly Models Oligopoly: interaction among small number of firms Conflict of interest: Each firm maximizes its own profits, but... Firm j's actions affect firm i's profits PC: firms are small, so no single firm’s actions affect other firms’ profits Monopoly: only one firm EC 105. Industrial Organization (Mattt Shum HSS,Lecture California 3: Oligopolistic Institute of competition Technology) 2 / 38 Oligopoly Models Oligopoly: interaction among small number of firms Conflict of interest: Each firm maximizes its own profits, but... Firm j's actions affect firm i's profits PC: firms are small, so no single firm’s actions affect other firms’ profits Monopoly: only one firm EC 105. Industrial Organization (Mattt Shum HSS,Lecture California 3: Oligopolistic Institute of competition Technology) 2 / 38 Oligopoly Models Oligopoly: interaction among small number of firms Conflict of interest: Each firm maximizes its own profits, but... Firm j's actions affect firm i's profits PC: firms are small, so no single firm’s actions affect other firms’ profits Monopoly: only one firm EC 105. Industrial Organization (Mattt Shum HSS,Lecture California 3: Oligopolistic Institute of competition Technology) 2 / 38 Oligopoly Models Oligopoly: interaction among small number of firms Conflict of interest: Each firm maximizes its own profits, but... Firm j's actions affect firm i's profits PC: firms are small, so no single firm’s actions affect other firms’ profits Monopoly: only one firm EC 105. -
BAYESIAN EQUILIBRIUM the Games Like Matching Pennies And
BAYESIAN EQUILIBRIUM MICHAEL PETERS The games like matching pennies and prisoner’s dilemma that form the core of most undergrad game theory courses are games in which players know each others’ preferences. Notions like iterated deletion of dominated strategies, and rationalizability actually go further in that they exploit the idea that each player puts him or herself in the shoes of other players and imagines that the others do the same thing. Games and reasoning like this apply to situations in which the preferences of the players are common knowledge. When we want to refer to situations like this, we usually say that we are interested in games of complete information. Most of the situations we study in economics aren’t really like this since we are never really sure of the motivation of the players we are dealing with. A bidder on eBay, for example, doesn’t know whether other bidders are interested in a good they would like to bid on. Once a bidder sees that another has submit a bid, he isn’t sure exactly how much the good is worth to the other bidder. When players in a game don’t know the preferences of other players, we say a game has incomplete information. If you have dealt with these things before, you may have learned the term asymmetric information. This term is less descriptive of the problem and should probably be avoided. The way we deal with incomplete information in economics is to use the ap- proach originally described by Savage in 1954. If you have taken a decision theory course, you probably learned this approach by studying Anscombe and Aumann (last session), while the approach we use in game theory is usually attributed to Harsanyi. -
Arxiv:1709.04326V4 [Cs.AI] 19 Sep 2018
Learning with Opponent-Learning Awareness , Jakob Foerster† ‡ Richard Y. Chen† Maruan Al-Shedivat‡ University of Oxford OpenAI Carnegie Mellon University Shimon Whiteson Pieter Abbeel‡ Igor Mordatch University of Oxford UC Berkeley OpenAI ABSTRACT 1 INTRODUCTION Multi-agent settings are quickly gathering importance in machine Due to the advent of deep reinforcement learning (RL) methods that learning. This includes a plethora of recent work on deep multi- allow the study of many agents in rich environments, multi-agent agent reinforcement learning, but also can be extended to hierar- RL has flourished in recent years. However, most of this recent chical reinforcement learning, generative adversarial networks and work considers fully cooperative settings [13, 14, 34] and emergent decentralised optimization. In all these settings the presence of mul- communication in particular [11, 12, 22, 32, 40]. Considering future tiple learning agents renders the training problem non-stationary applications of multi-agent RL, such as self-driving cars, it is obvious and often leads to unstable training or undesired final results. We that many of these will be only partially cooperative and contain present Learning with Opponent-Learning Awareness (LOLA), a elements of competition and conflict. method in which each agent shapes the anticipated learning of The human ability to maintain cooperation in a variety of com- the other agents in the environment. The LOLA learning rule in- plex social settings has been vital for the success of human societies. cludes an additional term that accounts for the impact of one agent’s Emergent reciprocity has been observed even in strongly adversar- policy on the anticipated parameter update of the other agents. -
Bayesian Games Professors Greenwald 2018-01-31
Bayesian Games Professors Greenwald 2018-01-31 We describe incomplete-information, or Bayesian, normal-form games (formally; no examples), and corresponding equilibrium concepts. 1 A Bayesian Model of Interaction A Bayesian, or incomplete information, game is a generalization of a complete-information game. Recall that in a complete-information game, the game is assumed to be common knowledge. In a Bayesian game, in addition to what is common knowledge, players may have private information. This private information is captured by the notion of an epistemic type, which describes a player’s knowledge. The Bayesian-game formalism makes two simplifying assumptions: • Any information that is privy to any of the players pertains only to utilities. In all realizations of a Bayesian game, the number of players and their actions are identical. • Players maintain beliefs about the game (i.e., about utilities) in the form of a probability distribution over types. Prior to receiving any private information, this probability distribution is common knowledge: i.e., it is a common prior. After receiving private information, players conditional on this information to update their beliefs. As a consequence of the com- mon prior assumption, any differences in beliefs can be attributed entirely to differences in information. Rational players are again assumed to maximize their utility. But further, they are assumed to update their beliefs when they obtain new information via Bayes’ rule. Thus, in a Bayesian game, in addition to players, actions, and utilities, there is a type space T = ∏i2[n] Ti, where Ti is the type space of player i.There is also a common prior F, which is a probability distribution over type profiles. -
Reinterpreting Mixed Strategy Equilibria: a Unification Of
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Research Papers in Economics Reinterpreting Mixed Strategy Equilibria: A Unification of the Classical and Bayesian Views∗ Philip J. Reny Arthur J. Robson Department of Economics Department of Economics University of Chicago Simon Fraser University September, 2003 Abstract We provide a new interpretation of mixed strategy equilibria that incor- porates both von Neumann and Morgenstern’s classical concealment role of mixing as well as the more recent Bayesian view originating with Harsanyi. For any two-person game, G, we consider an incomplete information game, , in which each player’s type is the probability he assigns to the event thatIG his mixed strategy in G is “found out” by his opponent. We show that, generically, any regular equilibrium of G can be approximated by an equilibrium of in which almost every type of each player is strictly opti- mizing. This leadsIG us to interpret i’s equilibrium mixed strategy in G as a combination of deliberate randomization by i together with uncertainty on j’s part about which randomization i will employ. We also show that such randomization is not unusual: For example, i’s randomization is nondegen- erate whenever the support of an equilibrium contains cyclic best replies. ∗We wish to thank Drew Fudenberg, Motty Perry and Hugo Sonnenschein for very helpful comments. We are especially grateful to Bob Aumann for a stimulating discussion of the litera- ture on mixed strategies that prompted us to fit our work into that literature and ultimately led us to the interpretation we provide here, and to Hari Govindan whose detailed suggestions per- mitted a substantial simplification of the proof of our main result.