Noisy Directional Learning and the Logit Equilibrium*
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
CSC304 Lecture 4 Game Theory
CSC304 Lecture 4 Game Theory (Cost sharing & congestion games, Potential function, Braess’ paradox) CSC304 - Nisarg Shah 1 Recap • Nash equilibria (NE) ➢ No agent wants to change their strategy ➢ Guaranteed to exist if mixed strategies are allowed ➢ Could be multiple • Pure NE through best-response diagrams • Mixed NE through the indifference principle CSC304 - Nisarg Shah 2 Worst and Best Nash Equilibria • What can we say after we identify all Nash equilibria? ➢ Compute how “good” they are in the best/worst case • How do we measure “social good”? ➢ Game with only rewards? Higher total reward of players = more social good ➢ Game with only penalties? Lower total penalty to players = more social good ➢ Game with rewards and penalties? No clear consensus… CSC304 - Nisarg Shah 3 Price of Anarchy and Stability • Price of Anarchy (PoA) • Price of Stability (PoS) “Worst NE vs optimum” “Best NE vs optimum” Max total reward Max total reward Min total reward in any NE Max total reward in any NE or or Max total cost in any NE Min total cost in any NE Min total cost Min total cost PoA ≥ PoS ≥ 1 CSC304 - Nisarg Shah 4 Revisiting Stag-Hunt Hunter 2 Stag Hare Hunter 1 Stag (4 , 4) (0 , 2) Hare (2 , 0) (1 , 1) • Max total reward = 4 + 4 = 8 • Three equilibria ➢ (Stag, Stag) : Total reward = 8 ➢ (Hare, Hare) : Total reward = 2 1 2 1 2 ➢ ( Τ3 Stag – Τ3 Hare, Τ3 Stag – Τ3 Hare) 1 1 1 1 o Total reward = ∗ ∗ 8 + 1 − ∗ ∗ 2 ∈ (2,8) 3 3 3 3 • Price of stability? Price of anarchy? CSC304 - Nisarg Shah 5 Revisiting Prisoner’s Dilemma John Stay Silent Betray Sam -
Potential Games. Congestion Games. Price of Anarchy and Price of Stability
8803 Connections between Learning, Game Theory, and Optimization Maria-Florina Balcan Lecture 13: October 5, 2010 Reading: Algorithmic Game Theory book, Chapters 17, 18 and 19. Price of Anarchy and Price of Staility We assume a (finite) game with n players, where player i's set of possible strategies is Si. We let s = (s1; : : : ; sn) denote the (joint) vector of strategies selected by players in the space S = S1 × · · · × Sn of joint actions. The game assigns utilities ui : S ! R or costs ui : S ! R to any player i at any joint action s 2 S: any player maximizes his utility ui(s) or minimizes his cost ci(s). As we recall from the introductory lectures, any finite game has a mixed Nash equilibrium (NE), but a finite game may or may not have pure Nash equilibria. Today we focus on games with pure NE. Some NE are \better" than others, which we formalize via a social objective function f : S ! R. Two classic social objectives are: P sum social welfare f(s) = i ui(s) measures social welfare { we make sure that the av- erage satisfaction of the population is high maxmin social utility f(s) = mini ui(s) measures the satisfaction of the most unsatisfied player A social objective function quantifies the efficiency of each strategy profile. We can now measure how efficient a Nash equilibrium is in a specific game. Since a game may have many NE we have at least two natural measures, corresponding to the best and the worst NE. We first define the best possible solution in a game Definition 1. -
Lecture Notes
GRADUATE GAME THEORY LECTURE NOTES BY OMER TAMUZ California Institute of Technology 2018 Acknowledgments These lecture notes are partially adapted from Osborne and Rubinstein [29], Maschler, Solan and Zamir [23], lecture notes by Federico Echenique, and slides by Daron Acemoglu and Asu Ozdaglar. I am indebted to Seo Young (Silvia) Kim and Zhuofang Li for their help in finding and correcting many errors. Any comments or suggestions are welcome. 2 Contents 1 Extensive form games with perfect information 7 1.1 Tic-Tac-Toe ........................................ 7 1.2 The Sweet Fifteen Game ................................ 7 1.3 Chess ............................................ 7 1.4 Definition of extensive form games with perfect information ........... 10 1.5 The ultimatum game .................................. 10 1.6 Equilibria ......................................... 11 1.7 The centipede game ................................... 11 1.8 Subgames and subgame perfect equilibria ...................... 13 1.9 The dollar auction .................................... 14 1.10 Backward induction, Kuhn’s Theorem and a proof of Zermelo’s Theorem ... 15 2 Strategic form games 17 2.1 Definition ......................................... 17 2.2 Nash equilibria ...................................... 17 2.3 Classical examples .................................... 17 2.4 Dominated strategies .................................. 22 2.5 Repeated elimination of dominated strategies ................... 22 2.6 Dominant strategies .................................. -
The Logic of Costly Punishment Reversed: Expropriation of Free-Riders and Outsiders∗,†
The logic of costly punishment reversed: expropriation of free-riders and outsiders∗ ,† David Hugh-Jones Carlo Perroni University of East Anglia University of Warwick and CAGE January 5, 2017 Abstract Current literature views the punishment of free-riders as an under-supplied pub- lic good, carried out by individuals at a cost to themselves. It need not be so: often, free-riders’ property can be forcibly appropriated by a coordinated group. This power makes punishment profitable, but it can also be abused. It is easier to contain abuses, and focus group punishment on free-riders, in societies where coordinated expropriation is harder. Our theory explains why public goods are un- dersupplied in heterogenous communities: because groups target minorities instead of free-riders. In our laboratory experiment, outcomes were more efficient when coordination was more difficult, while outgroup members were targeted more than ingroup members, and reacted differently to punishment. KEY WORDS: Cooperation, costly punishment, group coercion, heterogeneity JEL CLASSIFICATION: H1, H4, N4, D02 ∗We are grateful to CAGE and NIBS grant ES/K002201/1 for financial support. We would like to thank Mark Harrison, Francesco Guala, Diego Gambetta, David Skarbek, participants at conferences and presentations including EPCS, IMEBESS, SAET and ESA, and two anonymous reviewers for their comments. †Comments and correspondence should be addressed to David Hugh-Jones, School of Economics, University of East Anglia, [email protected] 1 Introduction Deterring free-riding is a central element of social order. Most students of collective action believe that punishing free-riders is costly to the punisher, but benefits the community as a whole. -
Potential Games and Competition in the Supply of Natural Resources By
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by ASU Digital Repository Potential Games and Competition in the Supply of Natural Resources by Robert H. Mamada A Dissertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy Approved March 2017 by the Graduate Supervisory Committee: Carlos Castillo-Chavez, Co-Chair Charles Perrings, Co-Chair Adam Lampert ARIZONA STATE UNIVERSITY May 2017 ABSTRACT This dissertation discusses the Cournot competition and competitions in the exploita- tion of common pool resources and its extension to the tragedy of the commons. I address these models by using potential games and inquire how these models reflect the real competitions for provisions of environmental resources. The Cournot models are dependent upon how many firms there are so that the resultant Cournot-Nash equilibrium is dependent upon the number of firms in oligopoly. But many studies do not take into account how the resultant Cournot-Nash equilibrium is sensitive to the change of the number of firms. Potential games can find out the outcome when the number of firms changes in addition to providing the \traditional" Cournot-Nash equilibrium when the number of firms is fixed. Hence, I use potential games to fill the gaps that exist in the studies of competitions in oligopoly and common pool resources and extend our knowledge in these topics. In specific, one of the rational conclusions from the Cournot model is that a firm’s best policy is to split into separate firms. In real life, we usually witness the other way around; i.e., several firms attempt to merge and enjoy the monopoly profit by restricting the amount of output and raising the price. -
Kranton Duke University
The Devil is in the Details – Implications of Samuel Bowles’ The Moral Economy for economics and policy research October 13 2017 Rachel Kranton Duke University The Moral Economy by Samuel Bowles should be required reading by all graduate students in economics. Indeed, all economists should buy a copy and read it. The book is a stunning, critical discussion of the interplay between economic incentives and preferences. It challenges basic premises of economic theory and questions policy recommendations based on these theories. The book proposes the path forward: designing policy that combines incentives and moral appeals. And, therefore, like such as book should, The Moral Economy leaves us with much work to do. The Moral Economy concerns individual choices and economic policy, particularly microeconomic policies with goals to enhance the collective good. The book takes aim at laws, policies, and business practices that are based on the classic Homo economicus model of individual choice. The book first argues in great detail that policies that follow from the Homo economicus paradigm can backfire. While most economists would now recognize that people are not purely selfish and self-interested, The Moral Economy goes one step further. Incentives can amplify the selfishness of individuals. People might act in more self-interested ways in a system based on incentives and rewards than they would in the absence of such inducements. The Moral Economy warns economists to be especially wary of incentives because social norms, like norms of trust and honesty, are critical to economic activity. The danger is not only of incentives backfiring in a single instance; monetary incentives can generally erode ethical and moral codes and social motivations people can have towards each other. -
Recency, Records and Recaps: Learning and Non-Equilibrium Behavior in a Simple Decision Problem*
Recency, Records and Recaps: Learning and Non-equilibrium Behavior in a Simple Decision Problem* DREW FUDENBERG, Harvard University ALEXANDER PEYSAKHOVICH, Facebook Nash equilibrium takes optimization as a primitive, but suboptimal behavior can persist in simple stochastic decision problems. This has motivated the development of other equilibrium concepts such as cursed equilibrium and behavioral equilibrium. We experimentally study a simple adverse selection (or “lemons”) problem and find that learning models that heavily discount past information (i.e. display recency bias) explain patterns of behavior better than Nash, cursed or behavioral equilibrium. Providing counterfactual information or a record of past outcomes does little to aid convergence to optimal strategies, but providing sample averages (“recaps”) gets individuals most of the way to optimality. Thus recency effects are not solely due to limited memory but stem from some other form of cognitive constraints. Our results show the importance of going beyond static optimization and incorporating features of human learning into economic models. Categories & Subject Descriptors: J.4 [Social and Behavioral Sciences] Economics Author Keywords & Phrases: Learning, behavioral economics, recency, equilibrium concepts 1. INTRODUCTION Understanding when repeat experience can lead individuals to optimal behavior is crucial for the success of game theory and behavioral economics. Equilibrium analysis assumes all individuals choose optimal strategies while much research in behavioral economics -
Public Goods Agreements with Other-Regarding Preferences
NBER WORKING PAPER SERIES PUBLIC GOODS AGREEMENTS WITH OTHER-REGARDING PREFERENCES Charles D. Kolstad Working Paper 17017 http://www.nber.org/papers/w17017 NATIONAL BUREAU OF ECONOMIC RESEARCH 1050 Massachusetts Avenue Cambridge, MA 02138 May 2011 Department of Economics and Bren School, University of California, Santa Barbara; Resources for the Future; and NBER. Comments from Werner Güth, Kaj Thomsson and Philipp Wichardt and discussions with Gary Charness and Michael Finus have been appreciated. Outstanding research assistance from Trevor O’Grady and Adam Wright is gratefully acknowledged. Funding from the University of California Center for Energy and Environmental Economics (UCE3) is also acknowledged and appreciated. The views expressed herein are those of the author and do not necessarily reflect the views of the National Bureau of Economic Research. NBER working papers are circulated for discussion and comment purposes. They have not been peer- reviewed or been subject to the review by the NBER Board of Directors that accompanies official NBER publications. © 2011 by Charles D. Kolstad. All rights reserved. Short sections of text, not to exceed two paragraphs, may be quoted without explicit permission provided that full credit, including © notice, is given to the source. Public Goods Agreements with Other-Regarding Preferences Charles D. Kolstad NBER Working Paper No. 17017 May 2011, Revised June 2012 JEL No. D03,H4,H41,Q5 ABSTRACT Why cooperation occurs when noncooperation appears to be individually rational has been an issue in economics for at least a half century. In the 1960’s and 1970’s the context was cooperation in the prisoner’s dilemma game; in the 1980’s concern shifted to voluntary provision of public goods; in the 1990’s, the literature on coalition formation for public goods provision emerged, in the context of coalitions to provide transboundary pollution abatement. -
The Impact of Beliefs on Effort in Telecommuting Teams
The impact of beliefs on effort in telecommuting teams E. Glenn Dutcher, Krista Jabs Saral Working Papers in Economics and Statistics 2012-22 University of Innsbruck http://eeecon.uibk.ac.at/ University of Innsbruck Working Papers in Economics and Statistics The series is jointly edited and published by - Department of Economics - Department of Public Finance - Department of Statistics Contact Address: University of Innsbruck Department of Public Finance Universitaetsstrasse 15 A-6020 Innsbruck Austria Tel: + 43 512 507 7171 Fax: + 43 512 507 2970 E-mail: [email protected] The most recent version of all working papers can be downloaded at http://eeecon.uibk.ac.at/wopec/ For a list of recent papers see the backpages of this paper. The Impact of Beliefs on E¤ort in Telecommuting Teams E. Glenn Dutchery Krista Jabs Saralz February 2014 Abstract The use of telecommuting policies remains controversial for many employers because of the perceived opportunity for shirking outside of the traditional workplace; a problem that is potentially exacerbated if employees work in teams. Using a controlled experiment, where individuals work in teams with varying numbers of telecommuters, we test how telecommut- ing a¤ects the e¤ort choice of workers. We …nd that di¤erences in productivity within the team do not result from shirking by telecommuters; rather, changes in e¤ort result from an individual’s belief about the productivity of their teammates. In line with stereotypes, a high proportion of both telecommuting and non-telecommuting participants believed their telecommuting partners were less productive. Consequently, lower expectations of partner productivity resulted in lower e¤ort when individuals were partnered with telecommuters. -
Online Learning and Game Theory
Online Learning and Game Theory Your guide: Avrim Blum Carnegie Mellon University [Algorithmic Economics Summer School 2012] Itinerary • Stop 1: Minimizing regret and combining advice. – Randomized Wtd Majority / Multiplicative Weights alg – Connections to minimax-optimality in zero-sum games • Stop 2: Bandits and Pricing – Online learning from limited feedback (bandit algs) – Use for online pricing/auction problems • Stop 3: Internal regret & Correlated equil – Internal regret minimization – Connections to correlated equilib. in general-sum games • Stop 4: Guiding dynamics to higher-quality equilibria – Helpful “nudging” of simple dynamics in potential games Stop 1: Minimizing regret and combining expert advice Consider the following setting… Each morning, you need to pick one of N possible routes to drive to work. Robots But traffic is different each day. R Us 32 min Not clear a priori which will be best. When you get there you find out how long your route took. (And maybe others too or maybe not.) Is there a strategy for picking routes so that in the long run, whatever the sequence of traffic patterns has been, you’ve done nearly as well as the best fixed route in hindsight? (In expectation, over internal randomness in the algorithm) Yes. “No-regret” algorithms for repeated decisions A bit more generally: Algorithm has N options. World chooses cost vector. Can view as matrix like this (maybe infinite # cols) World – life - fate Algorithm At each time step, algorithm picks row, life picks column. Alg pays cost for action chosen. Alg gets column as feedback (or just its own cost in the “bandit” model). Need to assume some bound on max cost. -
Correlated Equilibria and Communication in Games Françoise Forges
Correlated equilibria and communication in games Françoise Forges To cite this version: Françoise Forges. Correlated equilibria and communication in games. Computational Complex- ity. Theory, Techniques, and Applications, pp.295-704, 2012, 10.1007/978-1-4614-1800-9_45. hal- 01519895 HAL Id: hal-01519895 https://hal.archives-ouvertes.fr/hal-01519895 Submitted on 9 May 2017 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Correlated Equilibrium and Communication in Games Françoise Forges, CEREMADE, Université Paris-Dauphine Article Outline Glossary I. De…nition of the Subject and its Importance II. Introduction III. Correlated Equilibrium: De…nition and Basic Properties IV. Correlated Equilibrium and Communication V. Correlated Equilibrium in Bayesian Games VI. Related Topics and Future Directions VII. Bibliography Acknowledgements The author wishes to thank Elchanan Ben-Porath, Frédéric Koessler, R. Vijay Krishna, Ehud Lehrer, Bob Nau, Indra Ray, Jérôme Renault, Eilon Solan, Sylvain Sorin, Bernhard von Stengel, Tristan Tomala, Amparo Ur- bano, Yannick Viossat and, especially, Olivier Gossner and Péter Vida, for useful comments and suggestions. Glossary Bayesian game: an interactive decision problem consisting of a set of n players, a set of types for every player, a probability distribution which ac- counts for the players’ beliefs over each others’ types, a set of actions for every player and a von Neumann-Morgenstern utility function de…ned over n-tuples of types and actions for every player. -
Analyzing Just-In-Time Purchasing Strategy in Supply Chains Using an Evolutionary Game Approach
Bulletin of the JSME Vol.14, No.5, 2020 Journal of Advanced Mechanical Design, Systems, and Manufacturing Analyzing just-in-time purchasing strategy in supply chains using an evolutionary game approach Ziang LIU* and Tatsushi NISHI** *Graduate School of Engineering Science, Osaka University 1-3 Machikaneyama-Cho, Toyonaka City, Osaka 560-8531, Japan E-mail: [email protected] **Graduate School of Natural Science and Technology, Okayama University 3-1-1 Tsushima-naka, Kita-ku, Okayama City, Okayama 700-8530, Japan Received: 18 December 2019; Revised: 30 March 2020; Accepted: 9 April 2020 Abstract Many researchers have focused on the comparison between the JIT model and the EOQ model. However, few of them studied this problem from an evolutionary perspective. In this paper, a JIT purchasing with the single- setup-multi-delivery model is introduced to compare the total costs of the JIT model and the EOQ model. Also, we extend the classical JIT-EOQ models to a two-echelon supply chain which consists of one manufacturer and one supplier. Considering the bounded rationality of players and the quickly changing market, an evolutionary game model is proposed to discuss how these factors impact the strategy selection of the companies. And the evolutionarily stable strategy of the proposed model is analyzed. Based on the analysis, we derive the conditions when the supply chain system will choose the JIT strategy and propose a contract method to ensure that the system converges to the JIT strategy. Several numerical experiments are provided to observe the JIT and EOQ purchasing strategy selection of the manufacturer and the supplier.