Selfish Routing and the Price of Anarchy

The Price of Anarchy Tim Roughgarden UC Berkeley + Stanford University also featuring some slides contributed by Christos Papadimitriou (UC Berkeley) The Price of Anarchy OR, a rendezvouz of computer science and game theory Tim Roughgarden UC Berkeley + Stanford University also featuring some slides contributed by Christos Papadimitriou (UC Berkeley) The Price of Anarchy OR, a rendezvouz of computer science and game theory OR, how free food can lead to a thesis topic Tim Roughgarden UC Berkeley + Stanford University also featuring some slides contributed by Christos Papadimitriou (UC Berkeley) (Very) Brief Overview • basic object of study: noncooperative games • widespread phenomenon: selfish (or “rational”) behavior is inefficient – could improve natural objective functions by dictating behavior •price of anarchy-- “competitive analysis for noncooperative games” – when does noncooperative behavior lead to an approximately optimal outcome? Example: Load-Balancing • setup: 4 jobs self-schedule on 2 machines – each wants a lightly loaded machine • with coordination, can achieve the following “good” outcome (makespan = 5) 1 2 4 3 m1 m2 Example: Load-Balancing • if they cannot be centrally scheduled? Example: Load-Balancing • if they cannot be centrally scheduled? • e.g., the following outcome is “stable” – no job has incentive to switch machines 2 1 4 3 m1 m2 Example: Braess’s Paradox • setup: traffic (e.g., many cars or packets) pick s-t paths to minimize travel time • travel time increases with congestion c(x)=x c(x)=1 s t c(x)=1 c(x)=x Example: Braess’s Paradox • setup: traffic (e.g., many cars or packets) pick s-t paths to minimize travel time • travel time increases with congestion c(x)=x c(x)=1 ½ of traffic s t c(x)=1 c(x)=x ½ of traffic • at “equilibrium”: travel time = 3/2 for all Braess’s Paradox Initial Network: ½ ½ x1 s ½ ½ t 1 x Travel time = 1.5 Braess’s Paradox Initial Network: Augmented Network: ½ ½ ½ ½ x1x1 0 s ½ ½ t s ½ ½ t 1 x 1 x Travel time = 1.5 Now what? Braess’s Paradox Initial Network: Augmented Network: ½ ½ x1 x 1 s 0 t s ½ ½ t 1 x 1 x Travel time = 1.5 Travel time = 2 Braess’s Paradox Initial Network: Augmented Network: ½ ½ x1 x 1 s 0 t s ½ ½ t 1 x 1 x Travel time = 1.5 Travel time = 2 All traffic worse off! [Braess 68] • (with thanks to Leonard Schulman) The Price of Anarchy • both examples: selfish behavior inefficient – e.g., equilibria need not minimize avg/max cost vs. vs. The Price of Anarchy • both examples: selfish behavior inefficient – e.g., equilibria need not minimize avg/max cost vs. vs. • price of anarchy: worst-case ratio between “social cost” of equilibrium and of optimum – w.r.t. a game + a definition of social cost Agenda of Tutorial • small price of anarchy: – selfish behavior benign; little benefit of centralized control over “laissez-faire” • our focus: bounding the price of anarchy--- when, what, how, why? – want techniques that cut across applications Caveats • price of anarchy = active subfield bordering theoretical CS, game theory – we will only scratch the surface – will focus on the two most well-studied models – even for these models, will only discuss results that I can describe simply – see also bibliography to appear on my Web page • will not have many proofs or all the details – 3 hours only seems long to audience! Road Map: First Half • Part I: Basic Notions – games, strategies, Nash equilibria, the price of anarchy, examples next • Part II: The KP Model (load-balancing) – upper and lower bounds on the price of anarchy – extensions and open questions Road Map: Second Half • Part III: Selfish Routing – the potential function technique – tight bounds on the price of anarchy – extensions and open questions • Part IV: Other cool models: a brief tour – resource allocation, submodular, facility location, and network design games Part I: Basic Notions • finite (normal-form) games • strategies • Nash equilibria • nonatomic games + their equilibria – models large population of players •price of anarchy – pessimistic vs. optimistic Games In a (finite, normal-form) game there is: • a finite set of players • for each player, a finite set of strategies • for each strategy profile (each player picks a strategy), a payoff to each player Games In a (finite, normal-form) game there is: • a finite set of players • for each player, a finite set of strategies • for each strategy profile (each player picks a strategy), a payoff to each player Row 12 Column 12 player: 1 -2 -1 player: 1 -2 -1 2 -1 -2 2 -1 -2 Pure vs. Mixed Strategies Pure strategy: player picks single strategy Mixed strategy: player picks probability distribution over its strategies Pure vs. Mixed Strategies Pure strategy: player picks single strategy Mixed strategy: player picks probability distribution over its strategies • allowing randomization has pros and cons – permits universal existence of equilibria – seems to model some real behavior • e.g., bluffing in poker – still, conceptually problematic • how often to people really randomize? Example: Load-Balancing Example: 2 jobs scheduled on 2 machines • players = jobs; strategies = machines • payoff to job = -1 × #jobs on its machine j2 j1 j2 vs. j1 Example: Load-Balancing Example: 2 jobs scheduled on 2 machines • players = jobs; strategies = machines • payoff to job = -1 × #jobs on its machine j2 j1 j2 vs. j1 m1 m2 m1 m2 m1 -2 -1 m1 -2 -1 Job 1: Job 2: m2 -1 -2 m2 -1 -2 Nash Equilibria Defn: a set of mixed strategies (one per player) is a Nash equilibrium if no player has a unilateral incentive to change its strategy. – no player can increase expected payoff w/some other strategy, if strategies of others are fixed Nash Equilibria Defn: a set of mixed strategies (one per player) is a Nash equilibrium if no player has a unilateral incentive to change its strategy. – no player can increase expected payoff w/some other strategy, if strategies of others are fixed Fact 1: pure-strategy Nash eq need not exist. – “matching pennies” Fact 2: mixed-strategy Nash eq always exist. – Nash’s theorem (1950) Example: Load-Balancing Back to: two-machine, two-job example. Two pure-strategy Nash eq, with jobs on different machines. j1 j2 j2 j1 m1 m2 m1 m2 Example: Load-Balancing Back to: two-machine, two-job example. Two pure-strategy Nash eq, with jobs on different machines. j1 j2 j2 j1 m1 m2 m1 m2 One mixed-strategy Nash eq: j1 50% 50% j2 50% 50% m1 m2 Nonatomic Games • nonatomic game = model large population with infinitely many players • sounds scary, but can increase analytical tractability – (continuous math a good thing) • could define a payoff per strategy profile – but we will only care what fraction of population picks each strategy; we will only model this Nonatomic Games (con'd) Ingredients (abstract and in selfish routing): • finite # of player types [(si,ti) pairs] • population sizes [amt of traffic per pair] • finite strategy sets [(si,ti) paths] • for each distribution (fraction of population using each strategy), payoffs corresponding to each strategy – given traffic pattern, cost of each path Equilibria in Nonatomic Games • like Nash eq, though note that individual deviations do not affect payoffs – strategies used by a player type have equal cost x 1 x 1 s 0 t vs. s 0 t 1 x 1 x – cf., load-balancing: Equilibria in Nonatomic Games • like Nash eq, though note that individual deviations do not affect payoffs – strategies used by a player type have equal cost x 1 x 1 s 0 t vs. s 0 t 1 x 1 x – cf., load-balancing: • existence in general: [Schmeidler 73], etc. • WLOG, can focus on pure-strategy eq The Price of Anarchy Defn: price of obj fn value of a Nash eq = sup anarchy optimal obj fn value ⇒ with respect to a game + an objective fn ⇒ supremum is over the set of equilibria ⇒ price of anarchy of a collection C of games = take supremum over games in C Comparison to Other Notions • approximation ratio = quantifies cost of not having exponential computing time • competitive ratio = quantifies cost of not knowing the future •price of anarchy= quantifies cost of not being able to dictate behavior – also called coordination ratio [Koutsoupias/Papadimitriou 99] Optimistic Price of Anarchy • optimistic price of anarchy: compare best Nash equilibrium with the social optimum • interpretation: what is achievable with “single-shot” centralized control – e.g., for network design (later) • can often upper bound the optimistic p.o.a. but not the pessimistic p.o.a. Road Map: First Half • Part I: Basic Notions – games, strategies, Nash equilibria, the price of anarchy, examples next • Part II: The KP Model (load-balancing) – upper and lower bounds on the price of anarchy – extensions and open questions The KP Model: Overview • formal definition of model • special case: uniform machine speeds • non-uniform speeds • extensions + open problems The KP Model •nplayers/jobs, each with weight wj •mstrategies/machines, each with speed si • outcomes: assignment of jobs to machines The KP Model •nplayers/jobs, each with weight wj •mstrategies/machines, each with speed si • outcomes: assignment of jobs to machines • J(i) = jobs on machine i 2 • L(i) = load of i = 1 Σ 1/si j in J(i) wj 4 • [Koutsoupias/ 3 Papadimitriou 99] m1 m2 Costs with Mixed Strategies i • mixed strategy pj = Pr[job j on machine i] • expected load of machine i: i EL(i) = 1/si Σj wj pj • Cost of i from job j’s perspective: i Cj(i) = 1/si [(Σk not j wk pk ) + wj] j1 50% 50% j2 50% 50% m1 m2 What is a Nash equilibrium? i • no job j has incentive to change its pj ‘s • equivalently: for each job j, i pj > 0 ⇒ Cj(i) = min all machines

Selfish Routing and the Price of Anarchy

Details

Download

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

Support