Stat155 Game Theory Lecture 21: Midterm 2 Review

Total Page:16

File Type:pdf, Size:1020Kb

Stat155 Game Theory Lecture 21: Midterm 2 Review Midterm 2 Stat155 Game Theory Lecture 21: Midterm 2 Review “This is an open book exam: you can use any printed or written material, but you cannot use a laptop, tablet, or phone (or any device that can communicate). There are three questions, each consisting of three parts. Peter Bartlett Each part of each question carries equal weight. Answer each question in the space provided.” November 8, 2016 1 / 59 2 / 59 Topics Series and parallel games Series and parallel games. A series game: Both players first play G1, then both play G2. Two-player general-sum games Value is V1 + V2. Payoff matrices, dominant strategies, safety strategies, Nash eq. Multiplayer general-sum games A parallel game: Both players simultaneously decide which game to Utility functions, Nash equilibria. Nash’s Theorem play, and an action in that game. Congestion games and potential games If they choose the same game, they get the payoff from that game. Congestion games: Every congestion game has a pure Nash equilibrium If they choose different games, the payoff is 0. Potential games: Every congestion game is a potential game Value is1 /(1/V1 + 1/V2). Criticisms of Nash equilibria Evolutionarily stable strategies Electrical networks ESS and Nash equilibria Multiplayer ESS Correlated equilibrium Price of anarchy Braess’s paradox The impact of adding edges Classes of latencies Pigou networks 3 / 59 4 / 59 Topics Cooperative versus noncooperative games Series and parallel games. Two-player general-sum games Payoff matrices, dominant strategies, safety strategies, Nash eq. Multiplayer general-sum games Utility functions, Nash equilibria. Nash’s Theorem Noncooperative games Congestion games and potential games Players play their strategies simultaneously. Congestion games: Every congestion game has a pure Nash equilibrium Potential games: Every congestion game is a potential game They might communicate (or see a common signal; e.g., a traffic Criticisms of Nash equilibria signal), but there’s no enforced agreement. Evolutionarily stable strategies Natural solution concepts: ESS and Nash equilibria Nash equilibrium, correlated equilibrium. Multiplayer ESS No improvement from unilaterally deviating. Correlated equilibrium Price of anarchy Braess’s paradox The impact of adding edges Classes of latencies Pigou networks 5 / 59 6 / 59 General-sum games General-sum games Dominated pure strategies Notation A pure strategy e for Player I is dominated by e in payoff matrix A if, for A two-person general-sum game is specified by two payoff matrices, i i 0 m n all j 1,..., n , A, B R × . ∈ { } ∈ aij ai j . Simultaneously, Player I chooses i 1,..., m and the Player II ≤ 0 ∈ { } chooses j 1,..., n . Similarly, a pure strategy ej for Player II is dominated by ej0 in payoff ∈ { } matrix B if, for all i 1,..., m , Player I receives payoff aij . ∈ { } Player II receives payoff bij . b b . ij ≤ ij0 7 / 59 8 / 59 General-sum games General-sum games Nash equilibria Safety strategies A pair( x , y ) ∆m ∆n is a Nash equilibrium for payoff matrices ∗m n∗ ∈ × A safety strategy for Player I is an x ∆m that satisfies A, B R × if ∗ ∈ ∈ min x>Ay = max min x>Ay. max x>Ay = x>Ay , y ∆n ∗ x ∆m y ∆n x ∆m ∗ ∗ ∗ ∈ ∈ ∈ ∈ max x>By = x>By . x maximizes the worst case expected gain for Player I. y ∆n ∗ ∗ ∗ ∗ ∈ Similarly, a safety strategy for Player II is a y ∆n that satisfies ∗ ∈ If Player I plays x and Player II plays y , neither player has an min x>By = max min x>By. ∗ ∗ x ∆m ∗ y ∆n x ∆m incentive to unilaterally deviate. ∈ ∈ ∈ x is a best response to y , y is a best response to x . y maximizes the worst case expected gain for Player II. ∗ ∗ ∗ ∗ ∗ In general-sum games, there might be many Nash equilibria, with different payoff vectors. 9 / 59 10 / 59 Comparing two-player general-sum and zero-sum games Comparing two-player general-sum and zero-sum games Zero-sum games Zero-sum games 1 A pair of safety strategies is a Nash equilibrium (minimax theorem) 4 If each player has an equalizing mixed strategy 2 Hence, there is always a Nash equilibrium. (that is, x>A = v1> and Ay = v1), 3 If there are multiple Nash equilibria, they form a convex set, and the then this pair of strategies is a Nash equilibrium. expected payoff is identical within that set. (from the principle of indifference) Thus, any two Nash equilibria give the same payoff. General-sum games General-sum games 4 If each player has an equalizing mixed strategy 1 A pair of safety strategies might be unstable. for their opponent’s payoff matrix (Opponent aims to maximize their payoff, not minimize mine.) (that is, x>B = v21> and Ay = v11), 2 There is always a Nash equilibrium (Nash’s Theorem). then this pair of strategies is a Nash equilibrium. 3 There can be multiple Nash equilibria, with different payoff vectors. 11 / 59 12 / 59 Topics Multiplayer general-sum games Series and parallel games. Two-player general-sum games Notation Payoff matrices, dominant strategies, safety strategies, Nash eq. A k-person general-sum game is specified by k utility functions, Multiplayer general-sum games uj : S1 S2 Sk R. Utility functions, Nash equilibria. Nash’s Theorem × × · · · × → Player j can choose strategies s S . Congestion games and potential games j ∈ j Congestion games: Every congestion game has a pure Nash equilibrium Simultaneously, each player chooses a strategy. Potential games: Every congestion game is a potential game Player j receives payoff uj (s1,..., sk ). Criticisms of Nash equilibria Evolutionarily stable strategies ESS and Nash equilibria k = 2: u1(i, j) = aij , u2(i, j) = bij . Multiplayer ESS For s = (s1,..., sk ), let s i denote the strategies without the ith one: Correlated equilibrium − Price of anarchy s i = (s1,..., si 1, si+1,..., sk ). Braess’s paradox − − The impact of adding edges And write( si , s i ) as the full vector. Classes of latencies − Pigou networks 13 / 59 14 / 59 Multiplayer general-sum games Multiplayer general-sum games Definition A sequence( x ,..., x ) ∆ ∆ (called a strategy profile) is a 1∗ k∗ ∈ S1 × · · · × Sk Definition Nash equilibrium for utility functions u1,..., uk if, for each player j 1,..., k , A vector( s ,..., s ) S S is a pure Nash equilibrium for utility ∈ { } 1∗ k∗ ∈ 1 × · · · × k functions u1,..., uk if, for each player j 1,..., k , ∈ { } max uj (xj , x∗ j ) = uj (xj∗, x∗ j ). xj ∆S − − ∈ j max uj (sj , s∗ j ) = uj (sj∗, s∗ j ). sj Sj − − ∈ Here, we define uj (x∗) = Es1 x1,...,sk xk uj (s1,..., sk ) If the players play these sj∗, nobody has an incentive to unilaterally ∼ ∼ deviate: each player’s strategy is a best response to the other players’ = x (s ) x (s )u (s ,..., s ). 1 1 ··· k k j 1 k strategies. s S ,...,s S 1∈ 1X k ∈ k If the players play these mixed strategies xj∗, nobody has an incentive to unilaterally deviate: each player’s mixed strategy is a best response to the other players’ mixed strategies. 15 / 59 16 / 59 Multiplayer general-sum games Multiplayer general-sum games Nash’s Theorem (1951) Every finite general-sum game has a Nash equilibrium. Theorem Consider a strategy profile x ∆ ∆ . Let Proof Idea (two players) ∈ S1 × · · · × Sk T = s S : x (s) > 0 . The following statements are equivalent. We find an “improvement” map M(x, y) = (ˆx, yˆ), so that i { ∈ i i } 1 x is a Nash equilibrium. 1 xˆ>Ay > x>Ay (orˆx>Ay = x>Ay, if x was a best response to y), 2 For each i, there is a ci such that 2 x>Byˆ > x>By (or x>Byˆ = x>By, if y was a best response to x), 1 For all si Si , ui (si , x i ) ci . No better response outside Ti 3 ∈ − ≤ ←− M is continuous. 2 For si Ti , ui (si , x i ) = ci . Indifferent within Ti ∈ − ←− It’s easy to find a map like this. For instance, take a step in the direction of increasing payoff: Compare with the principle of indifference in the zero-sum case. xˆ = (x + ηAy),ˆ y = (y + ηB x). P∆m P∆n > A Nash equilibrium is a fixed point of M. The existence of a Nash equilibrium follows from Brouwer’s fixed-point theorem. 17 / 59 18 / 59 Nash’s Theorem Topics Series and parallel games. Two-player general-sum games Payoff matrices, dominant strategies, safety strategies, Nash eq. Multiplayer general-sum games Utility functions, Nash equilibria. Nash’s Theorem Congestion games and potential games Brouwer’s Fixed-Point Theorem Congestion games: Every congestion game has a pure Nash equilibrium A continuous map f : K K from a convex, closed, bounded K Rd Potential games: Every congestion game is a potential game → ⊆ has a fixed point, that is, an x K satisfying f (x) = x. Criticisms of Nash equilibria ∈ Evolutionarily stable strategies ESS and Nash equilibria Multiplayer ESS Correlated equilibrium Price of anarchy Braess’s paradox The impact of adding edges Classes of latencies Pigou networks 19 / 59 20 / 59 Congestion games: games with a pure Nash equilibrium Congestion games Definition A congestion game has Example k players A m facilities 1,..., m (e.g., edges) { } (1,2,4) (2,3,5) For player i, there is a set Si of strategies that are sets of facilities, s 1,..., m (e.g., paths) ⊆ { } k I, II, III S (1,2,6) T For facility j, there is a cost vector cj R , where cj (n) is the cost of ∈ facility j when it is used by n players. (2,3,5) (2,4,8) For a sequence s = (s1,..., sn), the utilities of the players are defined by B cost (s) = u (s) = c (n (s)), i − i j j j s X∈ i where n (s) = i : j s is the number of players using facility j.
Recommended publications
  • CSC304 Lecture 4 Game Theory
    CSC304 Lecture 4 Game Theory (Cost sharing & congestion games, Potential function, Braess’ paradox) CSC304 - Nisarg Shah 1 Recap • Nash equilibria (NE) ➢ No agent wants to change their strategy ➢ Guaranteed to exist if mixed strategies are allowed ➢ Could be multiple • Pure NE through best-response diagrams • Mixed NE through the indifference principle CSC304 - Nisarg Shah 2 Worst and Best Nash Equilibria • What can we say after we identify all Nash equilibria? ➢ Compute how “good” they are in the best/worst case • How do we measure “social good”? ➢ Game with only rewards? Higher total reward of players = more social good ➢ Game with only penalties? Lower total penalty to players = more social good ➢ Game with rewards and penalties? No clear consensus… CSC304 - Nisarg Shah 3 Price of Anarchy and Stability • Price of Anarchy (PoA) • Price of Stability (PoS) “Worst NE vs optimum” “Best NE vs optimum” Max total reward Max total reward Min total reward in any NE Max total reward in any NE or or Max total cost in any NE Min total cost in any NE Min total cost Min total cost PoA ≥ PoS ≥ 1 CSC304 - Nisarg Shah 4 Revisiting Stag-Hunt Hunter 2 Stag Hare Hunter 1 Stag (4 , 4) (0 , 2) Hare (2 , 0) (1 , 1) • Max total reward = 4 + 4 = 8 • Three equilibria ➢ (Stag, Stag) : Total reward = 8 ➢ (Hare, Hare) : Total reward = 2 1 2 1 2 ➢ ( Τ3 Stag – Τ3 Hare, Τ3 Stag – Τ3 Hare) 1 1 1 1 o Total reward = ∗ ∗ 8 + 1 − ∗ ∗ 2 ∈ (2,8) 3 3 3 3 • Price of stability? Price of anarchy? CSC304 - Nisarg Shah 5 Revisiting Prisoner’s Dilemma John Stay Silent Betray Sam
    [Show full text]
  • Potential Games. Congestion Games. Price of Anarchy and Price of Stability
    8803 Connections between Learning, Game Theory, and Optimization Maria-Florina Balcan Lecture 13: October 5, 2010 Reading: Algorithmic Game Theory book, Chapters 17, 18 and 19. Price of Anarchy and Price of Staility We assume a (finite) game with n players, where player i's set of possible strategies is Si. We let s = (s1; : : : ; sn) denote the (joint) vector of strategies selected by players in the space S = S1 × · · · × Sn of joint actions. The game assigns utilities ui : S ! R or costs ui : S ! R to any player i at any joint action s 2 S: any player maximizes his utility ui(s) or minimizes his cost ci(s). As we recall from the introductory lectures, any finite game has a mixed Nash equilibrium (NE), but a finite game may or may not have pure Nash equilibria. Today we focus on games with pure NE. Some NE are \better" than others, which we formalize via a social objective function f : S ! R. Two classic social objectives are: P sum social welfare f(s) = i ui(s) measures social welfare { we make sure that the av- erage satisfaction of the population is high maxmin social utility f(s) = mini ui(s) measures the satisfaction of the most unsatisfied player A social objective function quantifies the efficiency of each strategy profile. We can now measure how efficient a Nash equilibrium is in a specific game. Since a game may have many NE we have at least two natural measures, corresponding to the best and the worst NE. We first define the best possible solution in a game Definition 1.
    [Show full text]
  • Lecture Notes
    GRADUATE GAME THEORY LECTURE NOTES BY OMER TAMUZ California Institute of Technology 2018 Acknowledgments These lecture notes are partially adapted from Osborne and Rubinstein [29], Maschler, Solan and Zamir [23], lecture notes by Federico Echenique, and slides by Daron Acemoglu and Asu Ozdaglar. I am indebted to Seo Young (Silvia) Kim and Zhuofang Li for their help in finding and correcting many errors. Any comments or suggestions are welcome. 2 Contents 1 Extensive form games with perfect information 7 1.1 Tic-Tac-Toe ........................................ 7 1.2 The Sweet Fifteen Game ................................ 7 1.3 Chess ............................................ 7 1.4 Definition of extensive form games with perfect information ........... 10 1.5 The ultimatum game .................................. 10 1.6 Equilibria ......................................... 11 1.7 The centipede game ................................... 11 1.8 Subgames and subgame perfect equilibria ...................... 13 1.9 The dollar auction .................................... 14 1.10 Backward induction, Kuhn’s Theorem and a proof of Zermelo’s Theorem ... 15 2 Strategic form games 17 2.1 Definition ......................................... 17 2.2 Nash equilibria ...................................... 17 2.3 Classical examples .................................... 17 2.4 Dominated strategies .................................. 22 2.5 Repeated elimination of dominated strategies ................... 22 2.6 Dominant strategies ..................................
    [Show full text]
  • Potential Games and Competition in the Supply of Natural Resources By
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by ASU Digital Repository Potential Games and Competition in the Supply of Natural Resources by Robert H. Mamada A Dissertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy Approved March 2017 by the Graduate Supervisory Committee: Carlos Castillo-Chavez, Co-Chair Charles Perrings, Co-Chair Adam Lampert ARIZONA STATE UNIVERSITY May 2017 ABSTRACT This dissertation discusses the Cournot competition and competitions in the exploita- tion of common pool resources and its extension to the tragedy of the commons. I address these models by using potential games and inquire how these models reflect the real competitions for provisions of environmental resources. The Cournot models are dependent upon how many firms there are so that the resultant Cournot-Nash equilibrium is dependent upon the number of firms in oligopoly. But many studies do not take into account how the resultant Cournot-Nash equilibrium is sensitive to the change of the number of firms. Potential games can find out the outcome when the number of firms changes in addition to providing the \traditional" Cournot-Nash equilibrium when the number of firms is fixed. Hence, I use potential games to fill the gaps that exist in the studies of competitions in oligopoly and common pool resources and extend our knowledge in these topics. In specific, one of the rational conclusions from the Cournot model is that a firm’s best policy is to split into separate firms. In real life, we usually witness the other way around; i.e., several firms attempt to merge and enjoy the monopoly profit by restricting the amount of output and raising the price.
    [Show full text]
  • Price of Competition and Dueling Games
    Price of Competition and Dueling Games Sina Dehghani ∗† MohammadTaghi HajiAghayi ∗† Hamid Mahini ∗† Saeed Seddighin ∗† Abstract We study competition in a general framework introduced by Immorlica, Kalai, Lucier, Moitra, Postlewaite, and Tennenholtz [19] and answer their main open question. Immorlica et al. [19] considered classic optimization problems in terms of competition and introduced a general class of games called dueling games. They model this competition as a zero-sum game, where two players are competing for a user’s satisfaction. In their main and most natural game, the ranking duel, a user requests a webpage by submitting a query and players output an or- dering over all possible webpages based on the submitted query. The user tends to choose the ordering which displays her requested webpage in a higher rank. The goal of both players is to maximize the probability that her ordering beats that of her opponent and gets the user’s at- tention. Immorlica et al. [19] show this game directs both players to provide suboptimal search results. However, they leave the following as their main open question: “does competition be- tween algorithms improve or degrade expected performance?” (see the introduction for more quotes) In this paper, we resolve this question for the ranking duel and a more general class of dueling games. More precisely, we study the quality of orderings in a competition between two players. This game is a zero-sum game, and thus any Nash equilibrium of the game can be described by minimax strategies. Let the value of the user for an ordering be a function of the position of her requested item in the corresponding ordering, and the social welfare for an ordering be the expected value of the corresponding ordering for the user.
    [Show full text]
  • Prophylaxy Copie.Pdf
    Social interactions and the prophylaxis of SI epidemics on networks Géraldine Bouveret, Antoine Mandel To cite this version: Géraldine Bouveret, Antoine Mandel. Social interactions and the prophylaxis of SI epi- demics on networks. Journal of Mathematical Economics, Elsevier, 2021, 93, pp.102486. 10.1016/j.jmateco.2021.102486. halshs-03165772 HAL Id: halshs-03165772 https://halshs.archives-ouvertes.fr/halshs-03165772 Submitted on 17 Mar 2021 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Social interactions and the prophylaxis of SI epidemics on networkssa G´eraldineBouveretb Antoine Mandel c March 17, 2021 Abstract We investigate the containment of epidemic spreading in networks from a nor- mative point of view. We consider a susceptible/infected model in which agents can invest in order to reduce the contagiousness of network links. In this setting, we study the relationships between social efficiency, individual behaviours and network structure. First, we characterise individual and socially efficient behaviour using the notions of communicability and exponential centrality. Second, we show, by computing the Price of Anarchy, that the level of inefficiency can scale up to lin- early with the number of agents.
    [Show full text]
  • Online Learning and Game Theory
    Online Learning and Game Theory Your guide: Avrim Blum Carnegie Mellon University [Algorithmic Economics Summer School 2012] Itinerary • Stop 1: Minimizing regret and combining advice. – Randomized Wtd Majority / Multiplicative Weights alg – Connections to minimax-optimality in zero-sum games • Stop 2: Bandits and Pricing – Online learning from limited feedback (bandit algs) – Use for online pricing/auction problems • Stop 3: Internal regret & Correlated equil – Internal regret minimization – Connections to correlated equilib. in general-sum games • Stop 4: Guiding dynamics to higher-quality equilibria – Helpful “nudging” of simple dynamics in potential games Stop 1: Minimizing regret and combining expert advice Consider the following setting… Each morning, you need to pick one of N possible routes to drive to work. Robots But traffic is different each day. R Us 32 min Not clear a priori which will be best. When you get there you find out how long your route took. (And maybe others too or maybe not.) Is there a strategy for picking routes so that in the long run, whatever the sequence of traffic patterns has been, you’ve done nearly as well as the best fixed route in hindsight? (In expectation, over internal randomness in the algorithm) Yes. “No-regret” algorithms for repeated decisions A bit more generally: Algorithm has N options. World chooses cost vector. Can view as matrix like this (maybe infinite # cols) World – life - fate Algorithm At each time step, algorithm picks row, life picks column. Alg pays cost for action chosen. Alg gets column as feedback (or just its own cost in the “bandit” model). Need to assume some bound on max cost.
    [Show full text]
  • Strategic Behavior in Queues
    Strategic behavior in queues Lecturer: Moshe Haviv1 Dates: 31 January – 4 February 2011 Abstract: The course will first introduce some concepts borrowed from non-cooperative game theory to the analysis of strategic behavior in queues. Among them: Nash equilibrium, socially optimal strategies, price of anarchy, evolutionarily stable strategies, avoid the crowd and follow the crowd. Various decision models will be considered. Among them: to join or not to join an M/M/1 or an M/G/1 queue, when to abandon the queue, when to arrive to a queue, and from which server to seek service (if at all). We will also look at the application of cooperative game theory concepts to queues. Among them: how to split the cost of waiting among customers and how to split the reward gained when servers pooled their resources. Program: 1. Basic concepts in strategic behavior in queues: Unobservable and observable queueing models, strategy profiles, to avoid or to follow the crowd, Nash equilibrium, evolutionarily stable strategy, social optimization, the price of anarchy. 2. Examples: to queue or not to queue, priority purchasing, retrials and abandonment, server selection. 3. Competition between servers. Examples: price war, capacity competition, discipline competition. 4. When to arrive to a queue so as to minimize waiting and tardiness costs? Examples: Poisson number of arrivals, fluid approximation. 5. Basic concepts in cooperative game theory: The Shapley value, the core, the Aumann-Shapley prices. Ex- amples: Cooperation among servers, charging customers based on the externalities they inflict on others. Bibliography: [1] M. Armony and M. Haviv, Price and delay competition between two service providers, European Journal of Operational Research 147 (2003) 32–50.
    [Show full text]
  • Pure and Bayes-Nash Price of Anarchy for Generalized Second Price Auction
    Pure and Bayes-Nash Price of Anarchy for Generalized Second Price Auction Renato Paes Leme Eva´ Tardos Department of Computer Science Department of Computer Science Cornell University, Ithaca, NY Cornell University, Ithaca, NY [email protected] [email protected] Abstract—The Generalized Second Price Auction has for advertisements and slots higher on the page are been the main mechanism used by search companies more valuable (clicked on by more users). The bids to auction positions for advertisements on search pages. are used to determine both the assignment of bidders In this paper we study the social welfare of the Nash equilibria of this game in various models. In the full to slots, and the fees charged. In the simplest model, information setting, socially optimal Nash equilibria are the bidders are assigned to slots in order of bids, and known to exist (i.e., the Price of Stability is 1). This paper the fee for each click is the bid occupying the next is the first to prove bounds on the price of anarchy, and slot. This auction is called the Generalized Second Price to give any bounds in the Bayesian setting. Auction (GSP). More generally, positions and payments Our main result is to show that the price of anarchy is small assuming that all bidders play un-dominated in the Generalized Second Price Auction depend also on strategies. In the full information setting we prove a bound the click-through rates associated with the bidders, the of 1.618 for the price of anarchy for pure Nash equilibria, probability that the advertisement will get clicked on by and a bound of 4 for mixed Nash equilibria.
    [Show full text]
  • Potential Games with Continuous Player Sets
    Potential Games with Continuous Player Sets William H. Sandholm* Department of Economics University of Wisconsin 1180 Observatory Drive Madison, WI 53706 [email protected] www.ssc.wisc.edu/~whs First Version: April 30, 1997 This Version: May 20, 2000 * This paper is based on Chapter 4 of my doctoral dissertation (Sandholm [29]). I thank an anonymous referee and Associate Editor, Josef Hofbauer, and seminar audiences at Caltech, Chicago, Columbia, Harvard (Kennedy), Michigan, Northwestern, Rochester, Washington University, and Wisconsin for their comments. I especially thank Eddie Dekel and Jeroen Swinkels for their advice and encouragement. Financial support from a State Farm Companies Foundation Dissertation Fellowship is gratefully acknowledged. Abstract We study potential games with continuous player sets, a class of games characterized by an externality symmetry condition. Examples of these games include random matching games with common payoffs and congestion games. We offer a simple description of equilibria which are locally stable under a broad class of evolutionary dynamics, and prove that behavior converges to Nash equilibrium from all initial conditions. W e consider a subclass of potential games in which evolution leads to efficient play. Finally, we show that the games studied here are the limits of convergent sequences of the finite player potential games studied by Monderer and Shapley [22]. JEL Classification Numbers: C72, C73, D62, R41. 1. Introduction Nash equilibrium is the cornerstone of non-cooperative game theory, providing a necessary condition for stable behavior among rational agents. Still, to justify the prediction of Nash equilibrium play, one must explain how players arrive at a Nash equilibrium; if equilibrium is not reached, the fact that it is self-sustaining becomes moot.
    [Show full text]
  • Noisy Directional Learning and the Logit Equilibrium*
    CSE: AR SJOE 011 Scand. J. of Economics 106(*), *–*, 2004 DOI: 10.1111/j.1467-9442.2004.000376.x Noisy Directional Learning and the Logit Equilibrium* Simon P. Anderson University of Virginia, Charlottesville, VA 22903-3328, USA [email protected] Jacob K. Goeree California Institute of Technology, Pasadena, CA 91125, USA [email protected] Charles A. Holt University of Virginia, Charlottesville, VA 22903-3328, USA [email protected] PROOF Abstract We specify a dynamic model in which agents adjust their decisions toward higher payoffs, subject to normal error. This process generates a probability distribution of players’ decisions that evolves over time according to the Fokker–Planck equation. The dynamic process is stable for all potential games, a class of payoff structures that includes several widely studied games. In equilibrium, the distributions that determine expected payoffs correspond to the distributions that arise from the logit function applied to those expected payoffs. This ‘‘logit equilibrium’’ forms a stochastic generalization of the Nash equilibrium and provides a possible explanation of anomalous laboratory data. ECTED Keywords: Bounded rationality; noisy directional learning; Fokker–Planck equation; potential games; logit equilibrium JEL classification: C62; C73 I. Introduction Small errors and shocks may have offsetting effects in some economic con- texts, in which case there is not much to be gained from an explicit analysis of stochasticNCORR elements. In other contexts, a small amount of randomness can * We gratefully acknowledge financial support from the National Science Foundation (SBR- 9818683U and SBR-0094800), the Alfred P. Sloan Foundation and the Dutch National Science Foundation (NWO-VICI 453.03.606).
    [Show full text]
  • Strong Nash Equilibria in Games with the Lexicographical Improvement
    Strong Nash Equilibria in Games with the Lexicographical Improvement Property Tobias Harks, Max Klimm, and Rolf H. M¨ohring Preprint 15/07/2009 Institut f¨ur Mathematik, Technische Universit¨at Berlin harks,klimm,moehring @math.tu-berlin.de { } Abstract. We introduce a class of finite strategic games with the property that every deviation of a coalition of players that is profitable to each of its members strictly decreases the lexicographical order of a certain function defined on the set of strategy profiles. We call this property the Lexicographical Improvement Property (LIP) and show that it implies the existence of a generalized strong ordinal potential function. We use this characterization to derive existence, efficiency and fairness properties of strong Nash equilibria. We then study a class of games that generalizes congestion games with bottleneck objectives that we call bot- tleneck congestion games. We show that these games possess the LIP and thus the above mentioned properties. For bottleneck congestion games in networks, we identify cases in which the potential function associated with the LIP leads to polynomial time algorithms computing a strong Nash equilibrium. Finally, we investigate the LIP for infinite games. We show that the LIP does not imply the existence of a generalized strong ordinal potential, thus, the existence of SNE does not follow. Assuming that the function associated with the LIP is continuous, however, we prove existence of SNE. As a consequence, we prove that bottleneck congestion games with infinite strategy spaces and continuous cost functions possess a strong Nash equilibrium. 1 Introduction The theory of non-cooperative games is used to study situations that involve rational and selfish agents who are motivated by optimizing their own utilities rather than reaching some social optimum.
    [Show full text]