Equilibrium Computation — Dagstuhl Seminar —
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Equilibrium Refinements
Equilibrium Refinements Mihai Manea MIT Sequential Equilibrium I In many games information is imperfect and the only subgame is the original game. subgame perfect equilibrium = Nash equilibrium I Play starting at an information set can be analyzed as a separate subgame if we specify players’ beliefs about at which node they are. I Based on the beliefs, we can test whether continuation strategies form a Nash equilibrium. I Sequential equilibrium (Kreps and Wilson 1982): way to derive plausible beliefs at every information set. Mihai Manea (MIT) Equilibrium Refinements April 13, 2016 2 / 38 An Example with Incomplete Information Spence’s (1973) job market signaling game I The worker knows her ability (productivity) and chooses a level of education. I Education is more costly for low ability types. I Firm observes the worker’s education, but not her ability. I The firm decides what wage to offer her. In the spirit of subgame perfection, the optimal wage should depend on the firm’s beliefs about the worker’s ability given the observed education. An equilibrium needs to specify contingent actions and beliefs. Beliefs should follow Bayes’ rule on the equilibrium path. What about off-path beliefs? Mihai Manea (MIT) Equilibrium Refinements April 13, 2016 3 / 38 An Example with Imperfect Information Courtesy of The MIT Press. Used with permission. Figure: (L; A) is a subgame perfect equilibrium. Is it plausible that 2 plays A? Mihai Manea (MIT) Equilibrium Refinements April 13, 2016 4 / 38 Assessments and Sequential Rationality Focus on extensive-form games of perfect recall with finitely many nodes. An assessment is a pair (σ; µ) I σ: (behavior) strategy profile I µ = (µ(h) 2 ∆(h))h2H: system of beliefs ui(σjh; µ(h)): i’s payoff when play begins at a node in h randomly selected according to µ(h), and subsequent play specified by σ. -
CSC304 Lecture 4 Game Theory
CSC304 Lecture 4 Game Theory (Cost sharing & congestion games, Potential function, Braess’ paradox) CSC304 - Nisarg Shah 1 Recap • Nash equilibria (NE) ➢ No agent wants to change their strategy ➢ Guaranteed to exist if mixed strategies are allowed ➢ Could be multiple • Pure NE through best-response diagrams • Mixed NE through the indifference principle CSC304 - Nisarg Shah 2 Worst and Best Nash Equilibria • What can we say after we identify all Nash equilibria? ➢ Compute how “good” they are in the best/worst case • How do we measure “social good”? ➢ Game with only rewards? Higher total reward of players = more social good ➢ Game with only penalties? Lower total penalty to players = more social good ➢ Game with rewards and penalties? No clear consensus… CSC304 - Nisarg Shah 3 Price of Anarchy and Stability • Price of Anarchy (PoA) • Price of Stability (PoS) “Worst NE vs optimum” “Best NE vs optimum” Max total reward Max total reward Min total reward in any NE Max total reward in any NE or or Max total cost in any NE Min total cost in any NE Min total cost Min total cost PoA ≥ PoS ≥ 1 CSC304 - Nisarg Shah 4 Revisiting Stag-Hunt Hunter 2 Stag Hare Hunter 1 Stag (4 , 4) (0 , 2) Hare (2 , 0) (1 , 1) • Max total reward = 4 + 4 = 8 • Three equilibria ➢ (Stag, Stag) : Total reward = 8 ➢ (Hare, Hare) : Total reward = 2 1 2 1 2 ➢ ( Τ3 Stag – Τ3 Hare, Τ3 Stag – Τ3 Hare) 1 1 1 1 o Total reward = ∗ ∗ 8 + 1 − ∗ ∗ 2 ∈ (2,8) 3 3 3 3 • Price of stability? Price of anarchy? CSC304 - Nisarg Shah 5 Revisiting Prisoner’s Dilemma John Stay Silent Betray Sam -
Potential Games. Congestion Games. Price of Anarchy and Price of Stability
8803 Connections between Learning, Game Theory, and Optimization Maria-Florina Balcan Lecture 13: October 5, 2010 Reading: Algorithmic Game Theory book, Chapters 17, 18 and 19. Price of Anarchy and Price of Staility We assume a (finite) game with n players, where player i's set of possible strategies is Si. We let s = (s1; : : : ; sn) denote the (joint) vector of strategies selected by players in the space S = S1 × · · · × Sn of joint actions. The game assigns utilities ui : S ! R or costs ui : S ! R to any player i at any joint action s 2 S: any player maximizes his utility ui(s) or minimizes his cost ci(s). As we recall from the introductory lectures, any finite game has a mixed Nash equilibrium (NE), but a finite game may or may not have pure Nash equilibria. Today we focus on games with pure NE. Some NE are \better" than others, which we formalize via a social objective function f : S ! R. Two classic social objectives are: P sum social welfare f(s) = i ui(s) measures social welfare { we make sure that the av- erage satisfaction of the population is high maxmin social utility f(s) = mini ui(s) measures the satisfaction of the most unsatisfied player A social objective function quantifies the efficiency of each strategy profile. We can now measure how efficient a Nash equilibrium is in a specific game. Since a game may have many NE we have at least two natural measures, corresponding to the best and the worst NE. We first define the best possible solution in a game Definition 1. -
Lecture Notes
GRADUATE GAME THEORY LECTURE NOTES BY OMER TAMUZ California Institute of Technology 2018 Acknowledgments These lecture notes are partially adapted from Osborne and Rubinstein [29], Maschler, Solan and Zamir [23], lecture notes by Federico Echenique, and slides by Daron Acemoglu and Asu Ozdaglar. I am indebted to Seo Young (Silvia) Kim and Zhuofang Li for their help in finding and correcting many errors. Any comments or suggestions are welcome. 2 Contents 1 Extensive form games with perfect information 7 1.1 Tic-Tac-Toe ........................................ 7 1.2 The Sweet Fifteen Game ................................ 7 1.3 Chess ............................................ 7 1.4 Definition of extensive form games with perfect information ........... 10 1.5 The ultimatum game .................................. 10 1.6 Equilibria ......................................... 11 1.7 The centipede game ................................... 11 1.8 Subgames and subgame perfect equilibria ...................... 13 1.9 The dollar auction .................................... 14 1.10 Backward induction, Kuhn’s Theorem and a proof of Zermelo’s Theorem ... 15 2 Strategic form games 17 2.1 Definition ......................................... 17 2.2 Nash equilibria ...................................... 17 2.3 Classical examples .................................... 17 2.4 Dominated strategies .................................. 22 2.5 Repeated elimination of dominated strategies ................... 22 2.6 Dominant strategies .................................. -
Price of Competition and Dueling Games
Price of Competition and Dueling Games Sina Dehghani ∗† MohammadTaghi HajiAghayi ∗† Hamid Mahini ∗† Saeed Seddighin ∗† Abstract We study competition in a general framework introduced by Immorlica, Kalai, Lucier, Moitra, Postlewaite, and Tennenholtz [19] and answer their main open question. Immorlica et al. [19] considered classic optimization problems in terms of competition and introduced a general class of games called dueling games. They model this competition as a zero-sum game, where two players are competing for a user’s satisfaction. In their main and most natural game, the ranking duel, a user requests a webpage by submitting a query and players output an or- dering over all possible webpages based on the submitted query. The user tends to choose the ordering which displays her requested webpage in a higher rank. The goal of both players is to maximize the probability that her ordering beats that of her opponent and gets the user’s at- tention. Immorlica et al. [19] show this game directs both players to provide suboptimal search results. However, they leave the following as their main open question: “does competition be- tween algorithms improve or degrade expected performance?” (see the introduction for more quotes) In this paper, we resolve this question for the ranking duel and a more general class of dueling games. More precisely, we study the quality of orderings in a competition between two players. This game is a zero-sum game, and thus any Nash equilibrium of the game can be described by minimax strategies. Let the value of the user for an ordering be a function of the position of her requested item in the corresponding ordering, and the social welfare for an ordering be the expected value of the corresponding ordering for the user. -
Prophylaxy Copie.Pdf
Social interactions and the prophylaxis of SI epidemics on networks Géraldine Bouveret, Antoine Mandel To cite this version: Géraldine Bouveret, Antoine Mandel. Social interactions and the prophylaxis of SI epi- demics on networks. Journal of Mathematical Economics, Elsevier, 2021, 93, pp.102486. 10.1016/j.jmateco.2021.102486. halshs-03165772 HAL Id: halshs-03165772 https://halshs.archives-ouvertes.fr/halshs-03165772 Submitted on 17 Mar 2021 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Social interactions and the prophylaxis of SI epidemics on networkssa G´eraldineBouveretb Antoine Mandel c March 17, 2021 Abstract We investigate the containment of epidemic spreading in networks from a nor- mative point of view. We consider a susceptible/infected model in which agents can invest in order to reduce the contagiousness of network links. In this setting, we study the relationships between social efficiency, individual behaviours and network structure. First, we characterise individual and socially efficient behaviour using the notions of communicability and exponential centrality. Second, we show, by computing the Price of Anarchy, that the level of inefficiency can scale up to lin- early with the number of agents. -
Strategic Behavior in Queues
Strategic behavior in queues Lecturer: Moshe Haviv1 Dates: 31 January – 4 February 2011 Abstract: The course will first introduce some concepts borrowed from non-cooperative game theory to the analysis of strategic behavior in queues. Among them: Nash equilibrium, socially optimal strategies, price of anarchy, evolutionarily stable strategies, avoid the crowd and follow the crowd. Various decision models will be considered. Among them: to join or not to join an M/M/1 or an M/G/1 queue, when to abandon the queue, when to arrive to a queue, and from which server to seek service (if at all). We will also look at the application of cooperative game theory concepts to queues. Among them: how to split the cost of waiting among customers and how to split the reward gained when servers pooled their resources. Program: 1. Basic concepts in strategic behavior in queues: Unobservable and observable queueing models, strategy profiles, to avoid or to follow the crowd, Nash equilibrium, evolutionarily stable strategy, social optimization, the price of anarchy. 2. Examples: to queue or not to queue, priority purchasing, retrials and abandonment, server selection. 3. Competition between servers. Examples: price war, capacity competition, discipline competition. 4. When to arrive to a queue so as to minimize waiting and tardiness costs? Examples: Poisson number of arrivals, fluid approximation. 5. Basic concepts in cooperative game theory: The Shapley value, the core, the Aumann-Shapley prices. Ex- amples: Cooperation among servers, charging customers based on the externalities they inflict on others. Bibliography: [1] M. Armony and M. Haviv, Price and delay competition between two service providers, European Journal of Operational Research 147 (2003) 32–50. -
Pure and Bayes-Nash Price of Anarchy for Generalized Second Price Auction
Pure and Bayes-Nash Price of Anarchy for Generalized Second Price Auction Renato Paes Leme Eva´ Tardos Department of Computer Science Department of Computer Science Cornell University, Ithaca, NY Cornell University, Ithaca, NY [email protected] [email protected] Abstract—The Generalized Second Price Auction has for advertisements and slots higher on the page are been the main mechanism used by search companies more valuable (clicked on by more users). The bids to auction positions for advertisements on search pages. are used to determine both the assignment of bidders In this paper we study the social welfare of the Nash equilibria of this game in various models. In the full to slots, and the fees charged. In the simplest model, information setting, socially optimal Nash equilibria are the bidders are assigned to slots in order of bids, and known to exist (i.e., the Price of Stability is 1). This paper the fee for each click is the bid occupying the next is the first to prove bounds on the price of anarchy, and slot. This auction is called the Generalized Second Price to give any bounds in the Bayesian setting. Auction (GSP). More generally, positions and payments Our main result is to show that the price of anarchy is small assuming that all bidders play un-dominated in the Generalized Second Price Auction depend also on strategies. In the full information setting we prove a bound the click-through rates associated with the bidders, the of 1.618 for the price of anarchy for pure Nash equilibria, probability that the advertisement will get clicked on by and a bound of 4 for mixed Nash equilibria. -
Efficiency of Mechanisms in Complex Markets
EFFICIENCY OF MECHANISMS IN COMPLEX MARKETS A Dissertation Presented to the Faculty of the Graduate School of Cornell University in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy by Vasileios Syrgkanis August 2014 ⃝c 2014 Vasileios Syrgkanis ALL RIGHTS RESERVED EFFICIENCY OF MECHANISMS IN COMPLEX MARKETS Vasileios Syrgkanis, Ph.D. Cornell University 2014 We provide a unifying theory for the analysis and design of efficient simple mechanisms for allocating resources to strategic players, with guaranteed good properties even when players participate in many mechanisms simultaneously or sequentially and even when they use learning algorithms to identify how to play and have incomplete information about the parameters of the game. These properties are essential in large scale markets, such as electronic marketplaces, where mechanisms rarely run in isolation and the environment is too complex to assume that the market will always converge to the classic economic equilib- rium or that the participants will have full knowledge of the competition. We propose the notion of a smooth mechanism, and show that smooth mech- anisms possess all the aforementioned desiderata in large scale markets. We fur- ther give guarantees for smooth mechanisms even when players have budget constraints on their payments. We provide several examples of smooth mech- anisms and show that many simple mechanisms used in practice are smooth (such as formats of position auctions, uniform price auctions, proportional bandwidth allocation mechanisms, greedy combinatorial auctions). We give algorithmic characterizations of which resource allocation algorithms lead to smooth mechanisms when accompanied by appropriate payment schemes and show a strong connection with greedy algorithms on matroids. -
How Proper Is the Dominance-Solvable Outcome? Yukio Koriyama, Matias Nunez
How proper is the dominance-solvable outcome? Yukio Koriyama, Matias Nunez To cite this version: Yukio Koriyama, Matias Nunez. How proper is the dominance-solvable outcome?. 2014. hal- 01074178 HAL Id: hal-01074178 https://hal.archives-ouvertes.fr/hal-01074178 Preprint submitted on 13 Oct 2014 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. ECOLE POLYTECHNIQUE CENTRE NATIONAL DE LA RECHERCHE SCIENTIFIQUE HOW PROPER IS THE DOMINANCE-SOLVABLE OUTCOME? Yukio KORIYAMA Matías NÚÑEZ September 2014 Cahier n° 2014-24 DEPARTEMENT D'ECONOMIE Route de Saclay 91128 PALAISEAU CEDEX (33) 1 69333033 http://www.economie.polytechnique.edu/ mailto:[email protected] How proper is the dominance-solvable outcome?∗ Yukio Koriyamay and Mat´ıas Nu´ nez˜ z September 2014 Abstract We examine the conditions under which iterative elimination of weakly dominated strategies refines the set of proper outcomes of a normal-form game. We say that the proper inclusion holds in terms of outcome if the set of outcomes of all proper equilibria in the reduced game is included in the set of all proper outcomes of the original game. We show by examples that neither dominance solvability nor the transference of decision-maker indiffer- ence condition (TDI∗ of Marx and Swinkels [1997]) implies proper inclusion. -
Informational Incentives for Congestion Games
Informational Incentives for Congestion Games Hamidreza Tavafoghi and Demosthenis Teneketzis network, leading to a decrease in social welfare. In addition Abstract— We investigate the problems of designing public to the above-mentioned theoretical and experimental works, and private information disclosure mechanisms by a principal there are empirical evidences that identify negative impacts in a transportation network so as to improve the overall conges- tion. We show that perfect disclosure of information about the of information provision on the network’s congestion [16], routes’ conditions is not optimal. The principal can improve the [26], [28], [15], [10]. Therefore, it is important to investigate congestion (i.e. social welfare) by providing coordinated routing how to design appropriate information provision mechanisms recommendation to drivers based on the routes’ conditions. in a manner that is socially beneficial and leads to a reduction When the uncertainty about the routes’ conditions is high in the overall congestion in the transportation network. relative to the ex-ante difference in the routes’ conditions (the value of information is high), we show that the socially efficient In this paper, we study the problem of designing socially routing outcome is achievable using a private information optimal information disclosure mechanisms. We consider a disclosure mechanism. Furthermore, we study the problem congestion game [25], [29] in a parallel two-link network. of optimal dynamic private information disclosure mechanism We consider an information provider (principal) who wants design in a dynamic two-time step setting. We consider different to disclose information about the condition of the network pieces of information that drivers may observe and learn from at t = 1 and investigate qualitative properties of an optimal to a fixed population of drivers (agents). -
Intrinsic Robustness of the Price of Anarchy
Intrinsic Robustness of the Price of Anarchy Tim Roughgarden Department of Computer Science Stanford University 353 Serra Mall, Stanford, CA 94305 [email protected] ABSTRACT ble to eliminate in many situations, with large networks be- The price of anarchy, defined as the ratio of the worst-case ing one obvious example. The past ten years have provided objective function value of a Nash equilibrium of a game and an encouraging counterpoint to this widespread equilibrium that of an optimal outcome, quantifies the inefficiency of self- inefficiency: in a number of interesting application domains, ish behavior. Remarkably good bounds on this measure have decentralized optimization by competing individuals prov- been proved in a wide range of application domains. How- ably approximates the optimal outcome. ever, such bounds are meaningful only if a game’s partici- A rigorous guarantee of this type requires a formal behav- pants successfully reach a Nash equilibrium. This drawback ioral model, in order to define“the outcome of self-interested motivates inefficiency bounds that apply more generally to behavior”. The majority of previous research studies pure- weaker notions of equilibria, such as mixed Nash equilibria, strategy Nash equilibria, defined as follows. Each player i correlated equilibria, or to sequences of outcomes generated selects a strategy si from a set Si (like a path in a network). by natural experimentation strategies, such as simultaneous The cost Ci(s) incurred by a player i in a game is a function regret-minimization. of the entire vector s of players’ chosen strategies, which We prove a general and fundamental connection between is called a strategy profile or an outcome.