CSC304 Lecture 4 Game Theory

Total Page:16

File Type:pdf, Size:1020Kb

CSC304 Lecture 4 Game Theory CSC304 Lecture 4 Game Theory (Cost sharing & congestion games, Potential function, Braess’ paradox) CSC304 - Nisarg Shah 1 Recap • Nash equilibria (NE) ➢ No agent wants to change their strategy ➢ Guaranteed to exist if mixed strategies are allowed ➢ Could be multiple • Pure NE through best-response diagrams • Mixed NE through the indifference principle CSC304 - Nisarg Shah 2 Worst and Best Nash Equilibria • What can we say after we identify all Nash equilibria? ➢ Compute how “good” they are in the best/worst case • How do we measure “social good”? ➢ Game with only rewards? Higher total reward of players = more social good ➢ Game with only penalties? Lower total penalty to players = more social good ➢ Game with rewards and penalties? No clear consensus… CSC304 - Nisarg Shah 3 Price of Anarchy and Stability • Price of Anarchy (PoA) • Price of Stability (PoS) “Worst NE vs optimum” “Best NE vs optimum” Max total reward Max total reward Min total reward in any NE Max total reward in any NE or or Max total cost in any NE Min total cost in any NE Min total cost Min total cost PoA ≥ PoS ≥ 1 CSC304 - Nisarg Shah 4 Revisiting Stag-Hunt Hunter 2 Stag Hare Hunter 1 Stag (4 , 4) (0 , 2) Hare (2 , 0) (1 , 1) • Max total reward = 4 + 4 = 8 • Three equilibria ➢ (Stag, Stag) : Total reward = 8 ➢ (Hare, Hare) : Total reward = 2 1 2 1 2 ➢ ( Τ3 Stag – Τ3 Hare, Τ3 Stag – Τ3 Hare) 1 1 1 1 o Total reward = ∗ ∗ 8 + 1 − ∗ ∗ 2 ∈ (2,8) 3 3 3 3 • Price of stability? Price of anarchy? CSC304 - Nisarg Shah 5 Revisiting Prisoner’s Dilemma John Stay Silent Betray Sam Stay Silent (-1 , -1) (-3 , 0) Betray (0 , -3) (-2 , -2) • Min total cost = 1 + 1 = 2 • Only equilibrium: ➢ (Betray, Betray) : Total cost = 2 + 2 = 4 • Price of stability? Price of anarchy? CSC304 - Nisarg Shah 6 Cost Sharing Game • 푛 players on directed weighted graph 퐺 • Player 푖 푠1 푠2 ➢ Wants to go from 푠푖 to 푡푖 1 1 ➢ Strategy set 푆푖 = {directed 푠푖 → 푡푖 paths} ➢ Denote his chosen path by 푃푖 ∈ 푆푖 10 10 10 • Each edge 푒 has cost 푐푒 (weight) ➢ Cost is split among all players taking edge 푒 1 1 ➢ That is, among all players 푖 with 푒 ∈ 푃푖 푡1 푡2 CSC304 - Nisarg Shah 7 Cost Sharing Game • Given strategy profile 푃, cost 푐푖 푃 to player 푖 is sum of his costs for edges 푒 ∈ 푃푖 • Social cost 퐶 푃 = σ 푐 푃 푠1 푠2 푖 푖 1 1 10 10 • Note: 퐶 푃 = σ푒∈퐸 푃 푐푒, where… 10 ➢ 퐸(푃)={edges taken in 푃 by at least one player} ➢ Why? 1 1 푡1 푡2 CSC304 - Nisarg Shah 8 Cost Sharing Game • In the example on the right: ➢ What if both players take direct paths? ➢ What if both take middle paths? ➢ What if one player takes direct path and the 푠1 푠2 other takes middle path? 1 1 • Pure Nash equilibria? 10 10 10 1 1 푡1 푡2 CSC304 - Nisarg Shah 9 Cost Sharing: Simple Example • Example on the right: 푛 players s • Two pure NE ➢ All taking the n-edge: social cost = 푛 ➢ All taking the 1-edge: social cost = 1 푛 1 o Also the social optimum • Price of stability: 1 • Price of anarchy: 푛 t ➢ We can show that price of anarchy ≤ 푛 in every cost-sharing game! CSC304 - Nisarg Shah 10 Cost Sharing: PoA • Theorem: The price of anarchy of a cost sharing game is at most 푛. • Proof: ∗ ∗ ∗ ➢ Suppose the social optimum is (푃1 , 푃2 , … , 푃푛 ), in which ∗ the cost to player 푖 is 푐푖 . ➢ Take any NE with cost 푐푖 to player 푖. ′ ∗ ➢ Let 푐푖 be his cost if he switches to 푃푖 . ′ ➢ NE ⇒ 푐푖 ≥ 푐푖 (Why?) ′ ∗ ➢ But : 푐푖 ≤ 푛 ⋅ 푐푖 (Why?) ∗ ➢ 푐 ≤ 푛 ⋅ 푐 for each 푖 ⇒ no worse than 푛 × optimum 푖 푖 ∎ CSC304 - Nisarg Shah 11 Cost Sharing • Price of anarchy ➢ Every cost-sharing game: PoA ≤ 푛 ➢ Example game with PoA = 푛 ➢ Bound of 푛 is tight. • Price of stability? ➢ In the previous game, it was 1. ➢ In general, it can be higher. How high? ➢ We’ll answer this after a short detour. CSC304 - Nisarg Shah 12 Cost Sharing • Nash’s theorem shows existence of a mixed NE. 10 B ➢ Pure NE may not always exist in 20 A 12 general. 7 C 60 32 • But in both cost-sharing games we E saw, there was a PNE. D ➢ What about a more complex 10 players: 퐸 → 퐶 game like the one on the right? 27 players: 퐵 → 퐷 19 players: 퐶 → 퐷 CSC304 - Nisarg Shah 13 Good News • Theorem: Every cost-sharing game have a pure Nash equilibrium. • Proof: ➢ Via “potential function” argument CSC304 - Nisarg Shah 14 Step 1: Define Potential Fn • Potential function: Φ ∶ ς푖 푆푖 → ℝ+ ➢ This is a function such that for every pure strategy profile ′ 푃 = 푃1, … , 푃푛 , player 푖, and strategy 푃푖 of 푖, ′ ′ 푐푖 푃푖 , 푃−푖 − 푐푖 푃 = Φ 푃푖 , 푃−푖 − Φ 푃 ➢ When a single player 푖 changes her strategy, the change in potential function equals the change in cost to 푖! ➢ In contrast, the change in the social cost 퐶 equals the total change in cost to all players. o Hence, the social cost will often not be a valid potential function. CSC304 - Nisarg Shah 15 Step 2: Potential Fn → pure Nash Eq • A potential function exists ⇒ a pure NE exists. ➢ Consider a 푃 that minimizes the potential function. ➢ Deviation by any single player 푖 can only (weakly) increase the potential function. ➢ But change in potential function = change in cost to 푖. ➢ Hence, there is no beneficial deviation for any player. • Hence, every pure strategy profile minimizing the potential function is a pure Nash equilibrium. CSC304 - Nisarg Shah 16 Step 3: Potential Fn for Cost-Sharing • Recall: 퐸(푃) = {edges taken in 푃 by at least one player} • Let 푛푒 (푃) be the number of players taking 푒 in 푃 푛 (푃) 푒 푐 Φ 푃 = ෍ ෍ 푒 푘 푒∈퐸(푃) 푘=1 • Note: The cost of edge 푒 to each player taking 푒 is 푐푒/푛푒(푃). But the potential function includes all fractions: 푐푒/1, 푐푒/2, …, 푐푒/푛푒 푃 . CSC304 - Nisarg Shah 17 Step 3: Potential Fn for Cost-Sharing 푛 (푃) 푒 푐 Φ 푃 = ෍ ෍ 푒 푘 푒∈퐸(푃) 푘=1 • Why is this a potential function? 푐푒 ➢ If a player changes path, he pays for each new 푛푒 푃 +1 푐 edge 푒, gets back 푓 for each old edge 푓. 푛푓 푃 ➢ This is precisely the change in the potential function too. ➢ So Δ푐푖 = ΔΦ. ∎ CSC304 - Nisarg Shah 18 Potential Minimizing Eq. • Minimizing the potential function gives some pure Nash equilibrium ➢ Is this equilibrium special? Yes! • Recall that the price of anarchy can be up to 푛. ➢ That is, the worst Nash equilibrium can be up to 푛 times worse than the social optimum. • A potential-minimizing pure Nash equilibrium is better! CSC304 - Nisarg Shah 19 Potential Minimizing Eq. 푛푒(푃) 푛 푐 1 ෍ 푐 ≤ Φ 푃 = ෍ ෍ 푒 ≤ ෍ 푐 ∗ ෍ 푒 푘 푒 푘 푒∈퐸(푃) 푒∈퐸(푃) 푘=1 푒∈퐸(푃) 푘=1 Social cost Harmonic function 퐻(푛) ∀푃, 퐶 푃 ≤ Φ 푃 ≤ 퐶 푃 ∗ 퐻 푛 푛 = σ푘=1 1/푛 = 푂(log 푛) 퐶 푃∗ ≤ Φ 푃∗ ≤ Φ 푂푃푇 ≤ 퐶 푂푃푇 ∗ 퐻(푛) Potential minimizing eq. Social optimum CSC304 - Nisarg Shah 20 Potential Minimizing Eq. • Potential-minimizing PNE is 푂(log 푛)-approximation to the social optimum. • Thus, in every cost-sharing game, the price of stability is 푂 log 푛 . ➢ Compare to the price of anarchy, which can be 푛 CSC304 - Nisarg Shah 21 Congestion Games • Generalize cost sharing games • 푛 players, 푚 resources (e.g., edges) • Each player 푖 chooses a set of resources 푃푖 (e.g., 푠푖 → 푡푖 paths) • When 푛푗 player use resource 푗, each of them get a cost 푓푗(푛푗) • Cost to player is the sum of costs of resources used CSC304 - Nisarg Shah 22 Congestion Games • Theorem [Rosenthal 1973]: Every congestion game is a potential game. • Potential function: 푛푗 푃 Φ 푃 = ෍ ෍ 푓푗 푘 푗∈퐸(푃) 푘=1 • Theorem [Monderer and Shapley 1996]: Every potential game is equivalent to a congestion game. CSC304 - Nisarg Shah 23 Potential Functions • Potential functions are useful for deriving various results ➢ E.g., used for analyzing amortized complexity of algorithms • Bad news: Finding a potential function that works may be hard. CSC304 - Nisarg Shah 24 The Braess’ Paradox • In cost sharing, 푓푗 is decreasing ➢ The more people use a resource, the less the cost to each. • 푓푗 can also be increasing ➢ Road network, each player going from home to work ➢ Uses a sequence of roads ➢ The more people on a road, the greater the congestion, the greater the delay (cost) • Can lead to unintuitive phenomena CSC304 - Nisarg Shah 25 The Braess’ Paradox • Parkes-Seuken Example: ➢ 2000 players want to go from 1 to 4 ➢ 1 → 2 and 3 → 4 are “congestible” roads ➢ 1 → 3 and 2 → 4 are “constant delay” roads 2 1 4 3 CSC304 - Nisarg Shah 26 The Braess’ Paradox • Pure Nash equilibrium? ➢ 1000 take 1 → 2 → 4, 1000 take 1 → 3 → 4 ➢ Each player has cost 10 + 25 = 35 ➢ Anyone switching to the other creates a greater congestion on it, and faces a higher cost 2 1 4 3 CSC304 - Nisarg Shah 27 The Braess’ Paradox • What if we add a zero-cost connection 2 → 3? ➢ Intuitively, adding more roads should only be helpful ➢ In reality, it leads to a greater delay for everyone in the unique equilibrium! 2 1 푐23 푛23 = 0 4 3 CSC304 - Nisarg Shah 28 The Braess’ Paradox • Nobody chooses 1 → 3 as 1 → 2 → 3 is better irrespective of how many other players take it • Similarly, nobody chooses 2 → 4 • Everyone takes 1 → 2 → 3 → 4, faces delay = 40! 2 1 푐23 푛23 = 0 4 3 CSC304 - Nisarg Shah 29 The Braess’ Paradox • In fact, what we showed is: ➢ In the new game, 1 → 2 → 3 → 4 is a strictly dominant strategy for each player! 2 1 푐23 푛23 = 0 4 3 CSC304 - Nisarg Shah 30.
Recommended publications
  • Potential Games. Congestion Games. Price of Anarchy and Price of Stability
    8803 Connections between Learning, Game Theory, and Optimization Maria-Florina Balcan Lecture 13: October 5, 2010 Reading: Algorithmic Game Theory book, Chapters 17, 18 and 19. Price of Anarchy and Price of Staility We assume a (finite) game with n players, where player i's set of possible strategies is Si. We let s = (s1; : : : ; sn) denote the (joint) vector of strategies selected by players in the space S = S1 × · · · × Sn of joint actions. The game assigns utilities ui : S ! R or costs ui : S ! R to any player i at any joint action s 2 S: any player maximizes his utility ui(s) or minimizes his cost ci(s). As we recall from the introductory lectures, any finite game has a mixed Nash equilibrium (NE), but a finite game may or may not have pure Nash equilibria. Today we focus on games with pure NE. Some NE are \better" than others, which we formalize via a social objective function f : S ! R. Two classic social objectives are: P sum social welfare f(s) = i ui(s) measures social welfare { we make sure that the av- erage satisfaction of the population is high maxmin social utility f(s) = mini ui(s) measures the satisfaction of the most unsatisfied player A social objective function quantifies the efficiency of each strategy profile. We can now measure how efficient a Nash equilibrium is in a specific game. Since a game may have many NE we have at least two natural measures, corresponding to the best and the worst NE. We first define the best possible solution in a game Definition 1.
    [Show full text]
  • Lecture Notes
    GRADUATE GAME THEORY LECTURE NOTES BY OMER TAMUZ California Institute of Technology 2018 Acknowledgments These lecture notes are partially adapted from Osborne and Rubinstein [29], Maschler, Solan and Zamir [23], lecture notes by Federico Echenique, and slides by Daron Acemoglu and Asu Ozdaglar. I am indebted to Seo Young (Silvia) Kim and Zhuofang Li for their help in finding and correcting many errors. Any comments or suggestions are welcome. 2 Contents 1 Extensive form games with perfect information 7 1.1 Tic-Tac-Toe ........................................ 7 1.2 The Sweet Fifteen Game ................................ 7 1.3 Chess ............................................ 7 1.4 Definition of extensive form games with perfect information ........... 10 1.5 The ultimatum game .................................. 10 1.6 Equilibria ......................................... 11 1.7 The centipede game ................................... 11 1.8 Subgames and subgame perfect equilibria ...................... 13 1.9 The dollar auction .................................... 14 1.10 Backward induction, Kuhn’s Theorem and a proof of Zermelo’s Theorem ... 15 2 Strategic form games 17 2.1 Definition ......................................... 17 2.2 Nash equilibria ...................................... 17 2.3 Classical examples .................................... 17 2.4 Dominated strategies .................................. 22 2.5 Repeated elimination of dominated strategies ................... 22 2.6 Dominant strategies ..................................
    [Show full text]
  • Potential Games and Competition in the Supply of Natural Resources By
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by ASU Digital Repository Potential Games and Competition in the Supply of Natural Resources by Robert H. Mamada A Dissertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy Approved March 2017 by the Graduate Supervisory Committee: Carlos Castillo-Chavez, Co-Chair Charles Perrings, Co-Chair Adam Lampert ARIZONA STATE UNIVERSITY May 2017 ABSTRACT This dissertation discusses the Cournot competition and competitions in the exploita- tion of common pool resources and its extension to the tragedy of the commons. I address these models by using potential games and inquire how these models reflect the real competitions for provisions of environmental resources. The Cournot models are dependent upon how many firms there are so that the resultant Cournot-Nash equilibrium is dependent upon the number of firms in oligopoly. But many studies do not take into account how the resultant Cournot-Nash equilibrium is sensitive to the change of the number of firms. Potential games can find out the outcome when the number of firms changes in addition to providing the \traditional" Cournot-Nash equilibrium when the number of firms is fixed. Hence, I use potential games to fill the gaps that exist in the studies of competitions in oligopoly and common pool resources and extend our knowledge in these topics. In specific, one of the rational conclusions from the Cournot model is that a firm’s best policy is to split into separate firms. In real life, we usually witness the other way around; i.e., several firms attempt to merge and enjoy the monopoly profit by restricting the amount of output and raising the price.
    [Show full text]
  • Online Learning and Game Theory
    Online Learning and Game Theory Your guide: Avrim Blum Carnegie Mellon University [Algorithmic Economics Summer School 2012] Itinerary • Stop 1: Minimizing regret and combining advice. – Randomized Wtd Majority / Multiplicative Weights alg – Connections to minimax-optimality in zero-sum games • Stop 2: Bandits and Pricing – Online learning from limited feedback (bandit algs) – Use for online pricing/auction problems • Stop 3: Internal regret & Correlated equil – Internal regret minimization – Connections to correlated equilib. in general-sum games • Stop 4: Guiding dynamics to higher-quality equilibria – Helpful “nudging” of simple dynamics in potential games Stop 1: Minimizing regret and combining expert advice Consider the following setting… Each morning, you need to pick one of N possible routes to drive to work. Robots But traffic is different each day. R Us 32 min Not clear a priori which will be best. When you get there you find out how long your route took. (And maybe others too or maybe not.) Is there a strategy for picking routes so that in the long run, whatever the sequence of traffic patterns has been, you’ve done nearly as well as the best fixed route in hindsight? (In expectation, over internal randomness in the algorithm) Yes. “No-regret” algorithms for repeated decisions A bit more generally: Algorithm has N options. World chooses cost vector. Can view as matrix like this (maybe infinite # cols) World – life - fate Algorithm At each time step, algorithm picks row, life picks column. Alg pays cost for action chosen. Alg gets column as feedback (or just its own cost in the “bandit” model). Need to assume some bound on max cost.
    [Show full text]
  • Potential Games with Continuous Player Sets
    Potential Games with Continuous Player Sets William H. Sandholm* Department of Economics University of Wisconsin 1180 Observatory Drive Madison, WI 53706 [email protected] www.ssc.wisc.edu/~whs First Version: April 30, 1997 This Version: May 20, 2000 * This paper is based on Chapter 4 of my doctoral dissertation (Sandholm [29]). I thank an anonymous referee and Associate Editor, Josef Hofbauer, and seminar audiences at Caltech, Chicago, Columbia, Harvard (Kennedy), Michigan, Northwestern, Rochester, Washington University, and Wisconsin for their comments. I especially thank Eddie Dekel and Jeroen Swinkels for their advice and encouragement. Financial support from a State Farm Companies Foundation Dissertation Fellowship is gratefully acknowledged. Abstract We study potential games with continuous player sets, a class of games characterized by an externality symmetry condition. Examples of these games include random matching games with common payoffs and congestion games. We offer a simple description of equilibria which are locally stable under a broad class of evolutionary dynamics, and prove that behavior converges to Nash equilibrium from all initial conditions. W e consider a subclass of potential games in which evolution leads to efficient play. Finally, we show that the games studied here are the limits of convergent sequences of the finite player potential games studied by Monderer and Shapley [22]. JEL Classification Numbers: C72, C73, D62, R41. 1. Introduction Nash equilibrium is the cornerstone of non-cooperative game theory, providing a necessary condition for stable behavior among rational agents. Still, to justify the prediction of Nash equilibrium play, one must explain how players arrive at a Nash equilibrium; if equilibrium is not reached, the fact that it is self-sustaining becomes moot.
    [Show full text]
  • Noisy Directional Learning and the Logit Equilibrium*
    CSE: AR SJOE 011 Scand. J. of Economics 106(*), *–*, 2004 DOI: 10.1111/j.1467-9442.2004.000376.x Noisy Directional Learning and the Logit Equilibrium* Simon P. Anderson University of Virginia, Charlottesville, VA 22903-3328, USA [email protected] Jacob K. Goeree California Institute of Technology, Pasadena, CA 91125, USA [email protected] Charles A. Holt University of Virginia, Charlottesville, VA 22903-3328, USA [email protected] PROOF Abstract We specify a dynamic model in which agents adjust their decisions toward higher payoffs, subject to normal error. This process generates a probability distribution of players’ decisions that evolves over time according to the Fokker–Planck equation. The dynamic process is stable for all potential games, a class of payoff structures that includes several widely studied games. In equilibrium, the distributions that determine expected payoffs correspond to the distributions that arise from the logit function applied to those expected payoffs. This ‘‘logit equilibrium’’ forms a stochastic generalization of the Nash equilibrium and provides a possible explanation of anomalous laboratory data. ECTED Keywords: Bounded rationality; noisy directional learning; Fokker–Planck equation; potential games; logit equilibrium JEL classification: C62; C73 I. Introduction Small errors and shocks may have offsetting effects in some economic con- texts, in which case there is not much to be gained from an explicit analysis of stochasticNCORR elements. In other contexts, a small amount of randomness can * We gratefully acknowledge financial support from the National Science Foundation (SBR- 9818683U and SBR-0094800), the Alfred P. Sloan Foundation and the Dutch National Science Foundation (NWO-VICI 453.03.606).
    [Show full text]
  • Strong Nash Equilibria in Games with the Lexicographical Improvement
    Strong Nash Equilibria in Games with the Lexicographical Improvement Property Tobias Harks, Max Klimm, and Rolf H. M¨ohring Preprint 15/07/2009 Institut f¨ur Mathematik, Technische Universit¨at Berlin harks,klimm,moehring @math.tu-berlin.de { } Abstract. We introduce a class of finite strategic games with the property that every deviation of a coalition of players that is profitable to each of its members strictly decreases the lexicographical order of a certain function defined on the set of strategy profiles. We call this property the Lexicographical Improvement Property (LIP) and show that it implies the existence of a generalized strong ordinal potential function. We use this characterization to derive existence, efficiency and fairness properties of strong Nash equilibria. We then study a class of games that generalizes congestion games with bottleneck objectives that we call bot- tleneck congestion games. We show that these games possess the LIP and thus the above mentioned properties. For bottleneck congestion games in networks, we identify cases in which the potential function associated with the LIP leads to polynomial time algorithms computing a strong Nash equilibrium. Finally, we investigate the LIP for infinite games. We show that the LIP does not imply the existence of a generalized strong ordinal potential, thus, the existence of SNE does not follow. Assuming that the function associated with the LIP is continuous, however, we prove existence of SNE. As a consequence, we prove that bottleneck congestion games with infinite strategy spaces and continuous cost functions possess a strong Nash equilibrium. 1 Introduction The theory of non-cooperative games is used to study situations that involve rational and selfish agents who are motivated by optimizing their own utilities rather than reaching some social optimum.
    [Show full text]
  • Dynamics of Profit-Sharing Games
    Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence Dynamics of Profit-Sharing Games John Augustine, Ning Chen, Edith Elkind, Angelo Fanelli, Nick Gravin, Dmitry Shiryaev School of Physical and Mathematical Sciences Nanyang Technological University, Singapore Abstract information about other players. Such agents may simply re- spond to their current environment without worrying about An important task in the analysis of multiagent sys- the subsequent reaction of other agents; such behavior is said tems is to understand how groups of selfish players to be myopic. Now, coalition formation by computationally can form coalitions, i.e., work together in teams. In limited agents has been studied by a number of researchers in this paper, we study the dynamics of coalition for- multi-agent systems, starting with the work of [Shehory and mation under bounded rationality. We consider set- Kraus, 1999] and [Sandholm and Lesser, 1997]. However, tings where each team’s profit is given by a concave myopic behavior in coalition formation received relatively lit- function, and propose three profit-sharing schemes, tle attention in the literature (for some exceptions, see [Dieck- each of which is based on the concept of marginal mann and Schwalbe, 2002; Chalkiadakis and Boutilier, 2004; utility. The agents are assumed to be myopic, i.e., Airiau and Sen, 2009]). In contrast, myopic dynamics of they keep changing teams as long as they can in- non-cooperative games is the subject of a growing body of crease their payoff by doing so. We study the prop- research (see, e.g. [Fabrikant et al., 2004; Awerbuch et al., erties (such as closeness to Nash equilibrium or to- 2008; Fanelli et al., 2008]).
    [Show full text]
  • Informational Incentives for Congestion Games
    Informational Incentives for Congestion Games Hamidreza Tavafoghi and Demosthenis Teneketzis network, leading to a decrease in social welfare. In addition Abstract— We investigate the problems of designing public to the above-mentioned theoretical and experimental works, and private information disclosure mechanisms by a principal there are empirical evidences that identify negative impacts in a transportation network so as to improve the overall conges- tion. We show that perfect disclosure of information about the of information provision on the network’s congestion [16], routes’ conditions is not optimal. The principal can improve the [26], [28], [15], [10]. Therefore, it is important to investigate congestion (i.e. social welfare) by providing coordinated routing how to design appropriate information provision mechanisms recommendation to drivers based on the routes’ conditions. in a manner that is socially beneficial and leads to a reduction When the uncertainty about the routes’ conditions is high in the overall congestion in the transportation network. relative to the ex-ante difference in the routes’ conditions (the value of information is high), we show that the socially efficient In this paper, we study the problem of designing socially routing outcome is achievable using a private information optimal information disclosure mechanisms. We consider a disclosure mechanism. Furthermore, we study the problem congestion game [25], [29] in a parallel two-link network. of optimal dynamic private information disclosure mechanism We consider an information provider (principal) who wants design in a dynamic two-time step setting. We consider different to disclose information about the condition of the network pieces of information that drivers may observe and learn from at t = 1 and investigate qualitative properties of an optimal to a fixed population of drivers (agents).
    [Show full text]
  • The Price of Stability for Network Design with Fair Cost Allocation∗
    THE PRICE OF STABILITY FOR NETWORK DESIGN WITH FAIR COST ALLOCATION¤ ELLIOT ANSHELEVICHy , ANIRBAN DASGUPTAz , JON KLEINBERG x , EVA¶ TARDOS{, TOM WEXLERk, AND TIM ROUGHGARDEN¤¤ Abstract. Network design is a fundamental problem for which it is important to understand the e®ects of strategic behavior. Given a collection of self-interested agents who want to form a network connecting certain endpoints, the set of stable solutions | the Nash equilibria | may look quite di®erent from the centrally enforced optimum. We study the quality of the best Nash equilibrium, and refer to the ratio of its cost to the optimum network cost as the price of stability. The best Nash equilibrium solution has a natural meaning of stability in this context | it is the optimal solution that can be proposed from which no user will defect. We consider the price of stability for network design with respect to one of the most widely-studied protocols for network cost allocation, in which the cost of each edge is divided equally between users whose connections make use of it; this fair-division scheme can be derived from the Shapley value, and has a number of basic economic motivations. We show that the price of stability for network design with respect to this fair cost allocation is O(log k), where k is the number of users, and that a good Nash equilibrium can be achieved via best-response dynamics in which users iteratively defect from a starting solution. This establishes that the fair cost allocation protocol is in fact a useful mechanism for inducing strategic behavior to form near-optimal equilibria.
    [Show full text]
  • Algorithms for Generalized Potential Games with Mixed-Integer Variables
    Noname manuscript No. (will be inserted by the editor) Algorithms for generalized potential games with mixed-integer variables Simone Sagratella Received: date / Accepted: date Abstract We consider generalized potential games, that constitute a funda- mental subclass of generalized Nash equilibrium problems. We propose differ- ent methods to compute solutions of generalized potential games with mixed- integer variables, i.e., games in which some variable are continuous while the others are discrete. We investigate which types of equilibria of the game can be computed by minimizing a potential function over the common feasible set. In particular, for a wide class of generalized potential games, we characterize those equilibria that can be computed by minimizing potential functions as Pareto solutions of a particular multi-objective problem, and we show how dif- ferent potential functions can be used to select equilibria. We propose a new Gauss-Southwell algorithm to compute approximate equilibria of any general- ized potential game with mixed-integer variables. We show that this method converges in a finite number of steps and we also give an upper bound on this number of steps. Moreover, we make a thorough analysis on the behaviour of approximate equilibria with respect to exact ones. Finally, we make many numerical experiments to show the viability of the proposed approaches. Keywords Generalized Nash equilibrium problem · Generalized potential game · Mixed-integer nonlinear problem · Parametric optimization 1 Introduction Generalized Nash equilibrium problems (GNEPs) are widely used tools to model multi agent systems in many fields [16]. In the past two decades, sev- The work of the author has been partially supported by Avvio alla Ricerca 2015 Sapienza University of Rome, under grant 488.
    [Show full text]
  • Learning with Bandit Feedback in Potential Games
    Learning with Bandit Feedback in Potential Games Johanne Cohen LRI-CNRS, Université Paris-Sud,Université Paris-Saclay, France [email protected] Amélie Héliou LIX, Ecole Polytechnique, CNRS, AMIBio, Inria, Université Paris-Saclay [email protected] Panayotis Mertikopoulos Univ. Grenoble Alpes, CNRS, Inria, LIG, F-38000, Grenoble, France [email protected] Abstract This paper examines the equilibrium convergence properties of no-regret learning with exponential weights in potential games. To establish convergence with mini- mal information requirements on the players’ side, we focus on two frameworks: the semi-bandit case (where players have access to a noisy estimate of their payoff vectors, including strategies they did not play), and the bandit case (where players are only able to observe their in-game, realized payoffs). In the semi-bandit case, we show that the induced sequence of play converges almost surely to a Nash equilibrium at a quasi-exponential rate. In the bandit case, the same result holds for "-approximations of Nash equilibria if we introduce an exploration factor " > 0 that guarantees that action choice probabilities never fall below ". In particular, if the algorithm is run with a suitably decreasing exploration factor, the sequence of play converges to a bona fide Nash equilibrium with probability 1. 1 Introduction Given the manifest complexity of computing Nash equilibria, a central question that arises is whether such outcomes could result from a dynamic process in which players act on empirical information on their strategies’ performance over time. This question becomes particularly important when the players’ view of the game is obstructed by situational uncertainty and the “fog of war”: for instance, when deciding which route to take to work each morning, a commuter is typically unaware of how many other commuters there are at any given moment, what their possible strategies are, how to best respond to their choices, etc.
    [Show full text]