A Course in Networks and Markets

Total Page:16

File Type:pdf, Size:1020Kb

A Course in Networks and Markets A Course in Networks and Markets Rafael Pass Cornell Tech Last updated: September 26, 2017 c 2018 Rafael Pass All rights reserved. No part of this book may be reproduced in any form by any electronic or mechanical means (including photocopying, recording, or information storage and retrieval) without permission by the author. ISBN: 10 9 8 7 6 5 4 3 2 1 First pre-edition: February 2017 Introduction In this course, using tools from game theory and graph theory, we explore how network structures and network effects play a role in economic and information markets. Let us start by providing an overview through a few examples. Markets with Network Effects: iPhone vs. Android Consider a mar- ket with two competing mobile phone brands, where the buyers are connected through a social network (see Figure 0.1). Each phone has some intrinsic value to a buyer, but the actual value of phone is affected by how many of the buyer's friends (i.e., the nodes connected to them in the social network) have the same phone|this is referred to a network effect: For instance, even if I prefer iPhones in isolation (i.e., they have a higher intrinsic value to me), I may prefer to get an Android phone (or even switch to one, if I currently have an iPhone), if enough of my friends have an Android. Some questions that naturally arise are: • Will there eventually be a stable solution, where everyone is happy with their phone, or will people keep switching phones? • If we arrive at a stable solution, what will it look like? (For instance, will everyone eventually have the same phone, or can we get a market for both?) A A A A I I A I Figure 0.1: An example of a small social network for the Android/iPhone game. Nodes correspond to buyers. We draw an edge between nodes that are friends. i • If we want to market iPhones, to which “influential" individuals should we offer discounts in order to most efficiently take over the market? • How should we set the price of a phone to best market it? (Perhaps start low and increase the price as more people buy it?) We will study models that allow us to answer these questions. Note that this type of modeling is not only useful to study the spread of products, but can also be used to reason about the spread of (e.g., political) news or other information (or disinformation) in a social networks. For instance, I may post a news article on my Facebook wall if many of my friends do it. The Role of Beliefs. In the example above, to market a phone, it may suf- fice that enough people believe that their friends will buy the phone. If people believe that their friends will buy the phone (accomplished e.g., by advertis- ing), their perceived value of the phone will increase, and they will be more likely to buy it|we get a \self-fulfilling prophecy". As we shall see, in some situations, it may even be enough that there exist people who believe there exist people who believe (etc.) that enough people will buy the phone for this effect to happen|that is, so-called higher-level beliefs can have a large impact. We will study models for discussing and analyzing such higher-level beliefs| perhaps surprisingly, networks will prove useful also for modeling higher-level beliefs. We shall next use these models to shed light on the emergence of bubbles and crashes in economic markets. More generally, we will discuss how crowds process information and how and why the following phenomena can occur: • The wisdom of crowds: In some situations, the aggregate behavior of a group can give a significantly better estimate of the \truth" than any one individual (e.g., prediction teams outperforming single analysts in elections). • The foolishness of crowds: In other situations, \misinformation" can be circulated through a social network in \information cascades" (e.g., the spread of urban legends/\fake news" through a social network). Matching Markets, Auctions and Voting. Let us finally consider a quite different type of market. Assume we have three people A; B; C and three houses called H1;H2;H3. The people may have some constraints on what houses are acceptable to them; we can depict the situation using a graph as shown in Figure 0.2. Can we find a \matching" (i.e., pairing) between people and houses that respects these acceptability constraints? In this simple Figure 0.2: The \acceptability" graph in a matching problem. We draw a edge between a person and a house if the person finds the house acceptable. example, it is easy to see that A can be matched with H2, B with H1, and C with H3; we will study algorithms for solving this problem, and more generally, understanding when a matching where everyone gets matched exists. Consider, now, a variant of this problem where everyone finds all houses acceptable, but everyone prefers H1 to H2, and H2 to H3. How should we now assign houses to people? Note that no matter how we assign houses to people, 2 people will be unhappy with their house (in the sense that they would have preferred a different house)! The key for overcoming this problem is to assign prices to the three houses. This gives rise to the following questions: • Can we set prices for these three houses so that everyone can be matched with their most preferred house (taking into account the price of the house)? Indeed, we will show that such, so-called, \market-clearing prices" are guaranteed to exist (and the hope is that the market will converge on these prices over time). • Can we design a mechanism that incentivizes people to truthfully report how much each house is worth to them, so that we can assign houses to people in a way that maximizes the total \happiness" of all the peo- ple? Indeed, we shall study the Vickrey-Clark-Groves (VCG) auction mechanism that enables doing this. We next note that the methods we use to provide answers to the above ques- tions form the basis for the auction mechanisms used in sponsored search, where advertisers bid on \slots" for sponsored results in Internet search queries (and need to pay to get their advertisement shown)|in this context, the goal is to find a matching between advertisers and slots. We will also consider the \standard" (non-sponsored) web search problem: think of it as matching webpages with \slots" in the search ranking, but the difference with the sponsored search problem is that now there are no payments. We will discuss the \relevance" algorithms used by search engines (e.g., Google's PageRank algorithm) to determine how (non-paying) pages returned by a search should be ordered. Here, the network structure of the Internet will be the central factor for computing a relevance score. The basic idea behind these methods is to implement a voting mechanism whereby other pages \vote" for each page's relevance by linking to it. Finally, we will discuss voting schemes (e.g., for presidential elections) more generally, and investigate why such schemes typically are susceptible to \strategic voting", where voters are incentivized to not truthfully report their actual preferences (for instance, if your favorite candidate in the US presidential election is a third-party candidate, you may be inclined to vote for your second choice). Outline of the course. The course is divided into four main parts. • Part 1: Games and Graphs. In Part 1, we first introduce basic concepts from game theory (the study of how rational agents, trying to maximize their utility, interact) and graph theory (the study of graphs, mathemati- cal constructs used to model networks of interconnected nodes). We then use concepts from both to analyze \networked coordination games" on social networks|such games provide a framework for analyzing situa- tions similar to the Android/iPhone game discussed above. • Part 2: Markets on Networks. In Part 2, we begin by introducing some more advanced algorithms for exploring graphs, and then use these al- gorithms to explore various different types of markets on networks (in- cluding e.g., the above-discussed market for matching houses to people). • Part 3: Mechanisms for Networks. In Part 3, we discuss mechanisms for taming the above-mentioned auctions, web search, voting, and matching, problems. • Part 4: The Role of Beliefs. Finally, in Part 4, we discuss various ways of modeling players' beliefs and knowledge, and explore how players' beliefs (and the above-mentioned higher-level beliefs) play a role in auctions and markets. Comparison with Easley-Kleinberg. The topics covered here, as well as the whole premise of using a combination of game-theory and graph-theory to study markets, is heavily inspired by Easley and Kleinberg's (EK) beau- tiful book \Networks, Crowds and Markets" [EK10]. However, whereas our selection of topics closely follows EK, our treatment of many (but not all) of the topics is somewhat different. In particular, our goal is to provide a for- mal treatment, with full proofs, of the simplest models exhibiting the above- described phenomena, while only making assumptions about the rationality of agents. As such, we are also covering fewer topics than EK: in particular, we are simply assuming that the network structure (e.g., the social-network in the first example) is exoneously given|we do not consider how this network is formed, and do not discuss properties on it. There is a number of beau- tiful models and results regarding the structure of social networks (e.g., the Barabasi-Albert preferential attachment model [BA99], Watts-Strogatz small worlds model [WS98], and Kleinberg's decentralized search model [Kle00]), which are discussed in depth in EK.
Recommended publications
  • Game Theory- Prisoners Dilemma Vs Battle of the Sexes EXCERPTS
    Lesson 14. Game Theory 1 Lesson 14 Game Theory c 2010, 2011 ⃝ Roberto Serrano and Allan M. Feldman All rights reserved Version C 1. Introduction In the last lesson we discussed duopoly markets in which two firms compete to sell a product. In such markets, the firms behave strategically; each firm must think about what the other firm is doing in order to decide what it should do itself. The theory of duopoly was originally developed in the 19th century, but it led to the theory of games in the 20th century. The first major book in game theory, published in 1944, was Theory of Games and Economic Behavior,byJohnvon Neumann (1903-1957) and Oskar Morgenstern (1902-1977). We will return to the contributions of Von Neumann and Morgenstern in Lesson 19, on uncertainty and expected utility. Agroupofpeople(orteams,firms,armies,countries)areinagame if their decision problems are interdependent, in the sense that the actions that all of them take influence the outcomes for everyone. Game theory is the study of games; it can also be called interactive decision theory. Many real-life interactions can be viewed as games. Obviously football, soccer, and baseball games are games.Butsoaretheinteractionsofduopolists,thepoliticalcampaignsbetweenparties before an election, and the interactions of armed forces and countries. Even some interactions between animal or plant species in nature can be modeled as games. In fact, game theory has been used in many different fields in recent decades, including economics, political science, psychology, sociology, computer science, and biology. This brief lesson is not meant to replace a formal course in game theory; it is only an in- troduction.
    [Show full text]
  • Prisoners of Reason Game Theory and Neoliberal Political Economy
    C:/ITOOLS/WMS/CUP-NEW/6549131/WORKINGFOLDER/AMADAE/9781107064034PRE.3D iii [1–28] 11.8.2015 9:57PM Prisoners of Reason Game Theory and Neoliberal Political Economy S. M. AMADAE Massachusetts Institute of Technology C:/ITOOLS/WMS/CUP-NEW/6549131/WORKINGFOLDER/AMADAE/9781107064034PRE.3D iv [1–28] 11.8.2015 9:57PM 32 Avenue of the Americas, New York, ny 10013-2473, usa Cambridge University Press is part of the University of Cambridge. It furthers the University’s mission by disseminating knowledge in the pursuit of education, learning, and research at the highest international levels of excellence. www.cambridge.org Information on this title: www.cambridge.org/9781107671195 © S. M. Amadae 2015 This publication is in copyright. Subject to statutory exception and to the provisions of relevant collective licensing agreements, no reproduction of any part may take place without the written permission of Cambridge University Press. First published 2015 Printed in the United States of America A catalog record for this publication is available from the British Library. Library of Congress Cataloging in Publication Data Amadae, S. M., author. Prisoners of reason : game theory and neoliberal political economy / S.M. Amadae. pages cm Includes bibliographical references and index. isbn 978-1-107-06403-4 (hbk. : alk. paper) – isbn 978-1-107-67119-5 (pbk. : alk. paper) 1. Game theory – Political aspects. 2. International relations. 3. Neoliberalism. 4. Social choice – Political aspects. 5. Political science – Philosophy. I. Title. hb144.a43 2015 320.01′5193 – dc23 2015020954 isbn 978-1-107-06403-4 Hardback isbn 978-1-107-67119-5 Paperback Cambridge University Press has no responsibility for the persistence or accuracy of URLs for external or third-party Internet Web sites referred to in this publication and does not guarantee that any content on such Web sites is, or will remain, accurate or appropriate.
    [Show full text]
  • Repeated Games
    REPEATED GAMES 1 Early PD experiments In 1950, Merrill Flood and Melvin Dresher (at RAND) devised an experiment to test Nash’s theory about defection in a two-person prisoners’ dilemma. Experimental Design – They asked two friends to play the PD 100 times. – They measured the success of Nash’s equilibrium concept by counting the number of times the players chose {D;D}. 2 Flood and Dresher’s results Player 1 cooperated in 68 rounds Player 2 cooperated in 78 rounds Both cooperated in 60 of last 89 rounds Flood Dresher Nash 3 Flood and Dresher’s results Player 1 cooperated in 68 rounds Player 2 cooperated in 78 rounds Both cooperated in 60 of last 89 rounds Wait a Ha! That jerk I can’tI’mOh a be Ha! Nash second... Nash was genius...%&@#!wrong! was wrong! wrong! Flood Dresher Nash 4 Nash’s response “If this experiment were conducted with various different players rotating the competition and with no information given to a player of what choices the others have been making until the end of all trials, then the experimental results would have been quite different, for this modification of procedure would remove the interaction between the trials.” 5 Nash’s response “The flaw in this experiment as a test of equilibrium point theory is that the experiment really amounts to having the players play one large multimove game. One cannot...think of the thing as a sequence of independent games...there is too much interaction.” In other words, Nash said that repeating the game changes the game itself.
    [Show full text]
  • Putting Auction Theory to Work
    Putting Auction Theory to Work Paul Milgrom With a Foreword by Evan Kwerel © 2003 “In Paul Milgrom's hands, auction theory has become the great culmination of game theory and economics of information. Here elegant mathematics meets practical applications and yields deep insights into the general theory of markets. Milgrom's book will be the definitive reference in auction theory for decades to come.” —Roger Myerson, W.C.Norby Professor of Economics, University of Chicago “Market design is one of the most exciting developments in contemporary economics and game theory, and who can resist a master class from one of the giants of the field?” —Alvin Roth, George Gund Professor of Economics and Business, Harvard University “Paul Milgrom has had an enormous influence on the most important recent application of auction theory for the same reason you will want to read this book – clarity of thought and expression.” —Evan Kwerel, Federal Communications Commission, from the Foreword For Robert Wilson Foreword to Putting Auction Theory to Work Paul Milgrom has had an enormous influence on the most important recent application of auction theory for the same reason you will want to read this book – clarity of thought and expression. In August 1993, President Clinton signed legislation granting the Federal Communications Commission the authority to auction spectrum licenses and requiring it to begin the first auction within a year. With no prior auction experience and a tight deadline, the normal bureaucratic behavior would have been to adopt a “tried and true” auction design. But in 1993 there was no tried and true method appropriate for the circumstances – multiple licenses with potentially highly interdependent values.
    [Show full text]
  • A Beautiful Math : John Nash, Game Theory, and the Modern Quest for a Code of Nature / Tom Siegfried
    A BEAUTIFULA BEAUTIFUL MATH MATH JOHN NASH, GAME THEORY, AND THE MODERN QUEST FOR A CODE OF NATURE TOM SIEGFRIED JOSEPH HENRY PRESS Washington, D.C. Joseph Henry Press • 500 Fifth Street, NW • Washington, DC 20001 The Joseph Henry Press, an imprint of the National Academies Press, was created with the goal of making books on science, technology, and health more widely available to professionals and the public. Joseph Henry was one of the founders of the National Academy of Sciences and a leader in early Ameri- can science. Any opinions, findings, conclusions, or recommendations expressed in this volume are those of the author and do not necessarily reflect the views of the National Academy of Sciences or its affiliated institutions. Library of Congress Cataloging-in-Publication Data Siegfried, Tom, 1950- A beautiful math : John Nash, game theory, and the modern quest for a code of nature / Tom Siegfried. — 1st ed. p. cm. Includes bibliographical references and index. ISBN 0-309-10192-1 (hardback) — ISBN 0-309-65928-0 (pdfs) 1. Game theory. I. Title. QA269.S574 2006 519.3—dc22 2006012394 Copyright 2006 by Tom Siegfried. All rights reserved. Printed in the United States of America. Preface Shortly after 9/11, a Russian scientist named Dmitri Gusev pro- posed an explanation for the origin of the name Al Qaeda. He suggested that the terrorist organization took its name from Isaac Asimov’s famous 1950s science fiction novels known as the Foun- dation Trilogy. After all, he reasoned, the Arabic word “qaeda” means something like “base” or “foundation.” And the first novel in Asimov’s trilogy, Foundation, apparently was titled “al-Qaida” in an Arabic translation.
    [Show full text]
  • The Art and Practice of Economics Research: Lessons from Leading Minds (Edward Elgar Publishing, 2012) Simon W. Bowmaker, New Yo
    The Art and Practice of Economics Research: Lessons from Leading Minds (Edward Elgar Publishing, 2012) Simon W. Bowmaker, New York University Interview with Robert E. Lucas, Jr. University of Chicago (July 25, 2011) 1 Robert Lucas was born in Yakima, Washington in 1937 and graduated with both a BA in history and a PhD in economics from the University of Chicago in 1959 and 1964 respectively. Between 1964 and 1975, he taught at the Graduate School of Industrial Administration at Carnegie Mellon University, before moving to the University of Chicago, where he has remained ever since, currently serving as the John Dewey Distinguished Service Professor in Economics and the College. Professor Lucas is widely acknowledged as one of the most influential macroeconomists of the twentieth century. He is perhaps best known for his work on the development of the theory of rational expectations, but he has also made significant contributions to the theory of investment, the theory of endogenous growth, the theory of asset pricing and the theory of money. In addition, he introduced the ‘Lucas critique’ of the use of econometric models in policy design. His most-cited articles in chronological order include ‘Expectations and the Neutrality of Money,’ Journal of Economic Theory (1972), ‘Some International Evidence on Output-Inflation Tradeoffs,’ American Economic Review (1973), ‘Econometric Policy Evaluation: A Critique,’ Carnegie-Rochester Conference Series on Public Policy (1976), ‘Asset Prices in an Exchange Economy,’ Econometrica (1978), and ‘On the Mechanics of Economic Development,’ Journal of Monetary Economics (1988). His books include Studies in Business- Cycle Theory (MIT Press, 1983), Recursive Methods in Economic Dynamics (Harvard University Press, 1989), co-authored with Nancy Stokey and Edward Prescott, and Lectures on Economic Growth (Harvard University Press, 2004).
    [Show full text]
  • Game Theory and Its Practical Applications
    University of Northern Iowa UNI ScholarWorks Presidential Scholars Theses (1990 – 2006) Honors Program 1997 Game theory and its practical applications Angela M. Koos University of Northern Iowa Let us know how access to this document benefits ouy Copyright ©1997 Angela M. Koos Follow this and additional works at: https://scholarworks.uni.edu/pst Part of the Other Economics Commons Recommended Citation Koos, Angela M., "Game theory and its practical applications" (1997). Presidential Scholars Theses (1990 – 2006). 100. https://scholarworks.uni.edu/pst/100 This Open Access Presidential Scholars Thesis is brought to you for free and open access by the Honors Program at UNI ScholarWorks. It has been accepted for inclusion in Presidential Scholars Theses (1990 – 2006) by an authorized administrator of UNI ScholarWorks. For more information, please contact [email protected]. Game Theory and its Practical Applications A Presidential Scholar Senior Thesis University of Northern Iowa by Angela M. Koos Spring 1997 Dr. Ken Brown, 7 Dfrte Thesis and Major Advisor ,~-,, Dr. Ed Rathmell, Date Chair of Presidential Scholars Board Table of Contents Section Page(s) I. Historical Overview 1 I.A. Early Contributions to Game Theory 1 - 3 LB. John von Neumann, the RAND Corporation, and the Arms Race 3 - 7 LC. John Nash 7 - 8 I.D. Other Contributions to Game Theory 9 II. Defining Game Theory 9 - 12 II.A. Formal Representations of Games 12 - 13 II.A. I. Extensive Form 13 - 24 II.A.2. Normal Form 24 - 25 III. The Minimax Theorem 25 - 26 III.A. Preliminary Comments 26 - 27 III.B. The Theorem 27 - 28 IV.
    [Show full text]
  • Why Is Behavioral Game Theory a Game for Economists? the Concept of Beliefs in Equilibrium∗
    Why is behavioral game theory a game for economists? The concept of beliefs in equilibrium∗ Michiru Nagatsu Chiara Lisciandray May 7, 2021 Abstract The interdisciplinary exchange between economists and psychologists has so far been more active and fruitful in the modifications of Expected Util- ity Theory than in those of Game Theory. We argue that this asymmetry may be explained by economists' specific way of doing equilibrium analysis of aggregate-level outcomes in their practice, and by psychologists' reluctance to fully engage with such practice. We focus on the notion of belief that is em- bedded in economists' practice of equilibrium analysis, more specifically Nash equilibrium, and argue that its difference from the psychological counterpart is one of the factors that makes interdisciplinary exchange in behavioral game theory more difficult. 1 Introduction One of the most influential texts published in the behavioral and social sciences in the first half of the twentieth century was, according to many, Von Neumann and Morgenstern's (vNM) Theory of Games and Economic Behavior (1944). Not only did the book lay the foundation of game theory, which has become the essential research tool in contemporary economics, it also influenced several other disciplines beyond economics, from political science to linguistics and biology. Two of the most important contributions of Theory of Games and Economic Be- havior to economics are the axiomatic derivation of expected utility (in the second edition of 1947) and the minimax solution to zero-sum games. John Nash general- ized vNM's existence proof of equilibrium in non-zero-sum games in 1951, thereby ∗To appear in Egashira, Taishido, Hands and M¨aki(eds.) (2018) A Genealogy of Self-interest in Economics (Springer).
    [Show full text]
  • Algorithmic Collusion in the Digital Single Market
    This is a repository copy of EU Competition Policy: Algorithmic Collusion in the Digital Single Market. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/160267/ Article: Stewart-Moreno, Alexander (2020) EU Competition Policy: Algorithmic Collusion in the Digital Single Market. York Law Review, 1. pp. 49-82. 10.15124/yao-y99rnjtm Reuse Items deposited in White Rose Research Online are protected by copyright, with all rights reserved unless indicated otherwise. They may be downloaded and/or printed for private study, or other acts as permitted by national copyright laws. The publisher or other rights holders may allow further reproduction and re-use of the full text version. This is indicated by the licence information on the White Rose Research Online record for the item. Takedown If you consider content in White Rose Research Online to be in breach of UK law, please notify us by emailing [email protected] including the URL of the record and the reason for the withdrawal request. [email protected] https://eprints.whiterose.ac.uk/ EU Competition Policy: Algorithmic Collusion in the Digital Single Market Alexander Stewart-Moreno Abstract E-commerce promises a digital environment with ‘more perfect’ market characteristics. Although consumers may benefit from digital efficiencies, firms’ exploitation of such benefits may require new policy to regulate in line with the European Commission’s Digital Single Market Strategy. Price-setting algorithms are central to this dichotomy, as faster and more transparent pricing strategies could conceivably maintain algorithmic price-fixing cartels – which Article 101 of the Treaty on the Functioning of the European Union may prove inadequate in tackling.
    [Show full text]
  • Chronology of Game Theory
    Chronology of Game Theory http://www.econ.canterbury.ac.nz/personal_pages/paul_walker/g... Home | UC Home | Econ. Department | Chronology of Game Theory | Nobel Prize A Chronology of Game Theory by Paul Walker September 2012 | Ancient | 1700 | 1800 | 1900 | 1950 | 1960 | 1970 | 1980 | 1990 | Nobel Prize | 2nd Nobel Prize | 3rd Nobel Prize | 0-500AD The Babylonian Talmud is the compilation of ancient law and tradition set down during the first five centuries A.D. which serves as the basis of Jewish religious, criminal and civil law. One problem discussed in the Talmud is the so called marriage contract problem: a man has three wives whose marriage contracts specify that in the case of this death they receive 100, 200 and 300 respectively. The Talmud gives apparently contradictory recommendations. Where the man dies leaving an estate of only 100, the Talmud recommends equal division. However, if the estate is worth 300 it recommends proportional division (50,100,150), while for an estate of 200, its recommendation of (50,75,75) is a complete mystery. This particular Mishna has baffled Talmudic scholars for two millennia. In 1985, it was recognised that the Talmud anticipates the modern theory of cooperative games. Each solution corresponds to the nucleolus of an appropriately defined game. 1713 In a letter dated 13 November 1713 Francis Waldegrave provided the first, known, minimax mixed strategy solution to a two-person game. Waldegrave wrote the letter, about a two-person version of the card game le Her, to Pierre-Remond de Montmort who in turn wrote to Nicolas Bernoulli, including in his letter a discussion of the Waldegrave solution.
    [Show full text]
  • Several Remarks on the Role of Certain Positional
    Foundations of Management, Vol. 8 (2016), ISSN 2080-7279 DOI: 10.1515/fman-2016-0022 289 SEVERAL REMARKS ON THE ROLE OF CERTAIN POSITIONAL AND SOCIAL GAMES IN THE CREATION OF THE SELECTED STATISTICAL AND ECONOMIC APPLICATIONS Ewa DRABIK Warsaw University of Technology, Faculty of Management, Warsaw, Poland e-mail: [email protected] Abstract: The game theory was created on the basis of social as well as gambling games, such as chess, poker, baccarat, hex, or one-armed bandit. The aforementioned games lay solid foundations for analogous mathematical models (e.g., hex), artificial intelligence algorithms (hex), theoretical anal- ysis of computational complexity attributable to various numerical problems (baccarat), as well as illustration of several economic dilemmas particularly in the case where the winner takes every- thing (e.g., noughts and crosses). A certain gambling games, such as a horse racing, may be successful- ly applied to verify a wide spectrum of market mechanism, for example, market effectiveness or customer behavior in light of incoming information regarding a specific product. One of a lot appli- cations of the slot machine (one-armed bandit) is asymptotically efficient allocation rule, which was as- signed by T.L. Lai and H. Robbins (1985). In the next years, the rule was developed by another and was named a multi-armed. The aim of the paper is to discuss these social games along with their potential mathematical models, which are governed by the rules predominantly applicable to the social and natural sciences. Key words: positional games, social games, gambling games, one-armed bandit, chess. 1 Introduction Melvin Dresher working at RAND in 1950), gender discrimination game, chicken game, or Colonel Blot- The humanity has been accompanied by gambling to game were created.
    [Show full text]
  • Learning with Bandit Feedback in Potential Games
    Learning with Bandit Feedback in Potential Games Johanne Cohen LRI-CNRS, Université Paris-Sud,Université Paris-Saclay, France [email protected] Amélie Héliou LIX, Ecole Polytechnique, CNRS, AMIBio, Inria, Université Paris-Saclay [email protected] Panayotis Mertikopoulos Univ. Grenoble Alpes, CNRS, Inria, LIG, F-38000, Grenoble, France [email protected] Abstract This paper examines the equilibrium convergence properties of no-regret learning with exponential weights in potential games. To establish convergence with mini- mal information requirements on the players’ side, we focus on two frameworks: the semi-bandit case (where players have access to a noisy estimate of their payoff vectors, including strategies they did not play), and the bandit case (where players are only able to observe their in-game, realized payoffs). In the semi-bandit case, we show that the induced sequence of play converges almost surely to a Nash equilibrium at a quasi-exponential rate. In the bandit case, the same result holds for "-approximations of Nash equilibria if we introduce an exploration factor " > 0 that guarantees that action choice probabilities never fall below ". In particular, if the algorithm is run with a suitably decreasing exploration factor, the sequence of play converges to a bona fide Nash equilibrium with probability 1. 1 Introduction Given the manifest complexity of computing Nash equilibria, a central question that arises is whether such outcomes could result from a dynamic process in which players act on empirical information on their strategies’ performance over time. This question becomes particularly important when the players’ view of the game is obstructed by situational uncertainty and the “fog of war”: for instance, when deciding which route to take to work each morning, a commuter is typically unaware of how many other commuters there are at any given moment, what their possible strategies are, how to best respond to their choices, etc.
    [Show full text]