Game Theory- Prisoners Dilemma Vs Battle of the Sexes EXCERPTS
Total Page:16
File Type:pdf, Size:1020Kb
Lesson 14. Game Theory 1 Lesson 14 Game Theory c 2010, 2011 ⃝ Roberto Serrano and Allan M. Feldman All rights reserved Version C 1. Introduction In the last lesson we discussed duopoly markets in which two firms compete to sell a product. In such markets, the firms behave strategically; each firm must think about what the other firm is doing in order to decide what it should do itself. The theory of duopoly was originally developed in the 19th century, but it led to the theory of games in the 20th century. The first major book in game theory, published in 1944, was Theory of Games and Economic Behavior,byJohnvon Neumann (1903-1957) and Oskar Morgenstern (1902-1977). We will return to the contributions of Von Neumann and Morgenstern in Lesson 19, on uncertainty and expected utility. Agroupofpeople(orteams,firms,armies,countries)areinagame if their decision problems are interdependent, in the sense that the actions that all of them take influence the outcomes for everyone. Game theory is the study of games; it can also be called interactive decision theory. Many real-life interactions can be viewed as games. Obviously football, soccer, and baseball games are games.Butsoaretheinteractionsofduopolists,thepoliticalcampaignsbetweenparties before an election, and the interactions of armed forces and countries. Even some interactions between animal or plant species in nature can be modeled as games. In fact, game theory has been used in many different fields in recent decades, including economics, political science, psychology, sociology, computer science, and biology. This brief lesson is not meant to replace a formal course in game theory; it is only an in- troduction. The general emphasis is on how strategic behavior affects the interactions among rational players in a game. We will provide some basic definitions, and we will discuss a number of well-known simple examples. We will start with a description of the prisoners’ dilemma,where we will introduce the idea of a dominant strategy equilibrium.Wewillbrieflydiscussrepeated games in the prisoners’ dilemma context, and tit for tat strategies.Thenwewilldescribethe Lesson 14. Game Theory 2 battle of the sexes,andintroducetheconceptofNash equilibrium.Wewilldiscussthepossibilities of there being multiple Nash equilibria, or no (pure strategy) Nash equilibria, and we discuss the idea of mixed strategy equilibria. We will then present an expanded battle of the sexes,andwe will see that in game theory, an expansion of choices may make players worse off instead of better off.Attheendofthelesson,wewilldescribesequential move games,andwewillbrieflydiscuss threats. 2. The Prisoners’ Dilemma, and the Idea of Dominant Strategy Equilibrium The most well-known example in game theory is the prisoners’ dilemma.(Itwasdeveloped around 1950 by Merrill M. Flood (1908-1991) and Melvin Dresher (1911-1992) of the RAND Cor- poration. It was so-named by Albert W. Tucker (1905-1995), a Princeton University mathematics professor.) Consider the following. A crime is committed. The police arrive at the scene and arrest two suspects. Each of the suspects is taken to the police station for interrogation, and they are placed in separate cells. The cells are cold and nasty. The police interrogate them separately, and without any lawyers present. A police officer tells each one: “You can keep your mouth shut and refuse to testify. Or, you can confess and testify at trial.” We use some special and potentially confusing terminology to describe this choice. If a suspect refuses to testify, we say that he has chosen to cooperate with his fellow suspect. If a suspect confesses and testifies at trial, we say that he has chosen to defect from his fellow suspect. The reader will need to remember that to “cooperate” means to cooperate with the other suspect, not with the police,andalsotorememberthatto“defect”meanstodefectfrom the other suspect. The officer goes on: “If both of you refuse to testify, we will only be able to convict you on aminorcharge,whichwillresultinasentenceof6monthsinprisonforeachofyou.Ifbothof you confess and testify, you will each get 5 years in prison. If one of you refuses to testify (i.e., “cooperates”) while the other confesses and testifies (i.e., “defects”), the one who testifies will go free, and the one who refuses to testify will get a full 10 years in prison.” The officer concludes: “That’s what we’re offering you, you lowlife hooligan. Think it over. We’ll be back tomorrow to hear what you have to say.” We now consider this question: given this information, how should a rational suspect behave? Lesson 14. Game Theory 3 Should the suspects “cooperate” with each other (and tell the police nothing) or should they “defect” from each other (and confess)? Table 14.1 below shows the prisoners’ dilemma game. In game theory, the people playing the game are called players, so we now refer to our suspects as players. Player 1 chooses the rows in the table, while player 2 chooses the columns. Each of them has two possible actions to choose: “Cooperate” or “Defect.” Each of the four action combinations results in payoffstoeachplayer, in the form of prison time to be served. The outcomes are shown as the vectors in the cells of Table 14.1. The first entry is always the outcome for player 1, and the second is always the outcome for player 2. For instance, if player 1 defects while player 2 cooperates (bottom row, left column of the table), prison time for player 1 is None, and prison time for player 2 is 10 years. Note that these outcomes are “bads” rather than “goods”; each player wants to minimize his outcome. Player 2 Cooperate Defect Cooperate 6months,6months 10 years, None Player 1 Defect None, 10 years 5years,5years Table 14.1: The prisoners’ dilemma. Each suspect wants to minimize his own jail time. But each must think about what the other suspect will do. Let us now analyze the problem carefully. Here’s how player 1 thinks about the game. He considers what player 2 might do. If player 2 cooperates, they are in the first column of the table. In this case, player 1 gets 6 months if he cooperates (first row), and no prison time if he defects (second row). Therefore, if player 2 cooperates, player 1 will defect. On the other hand, if player 2defects,theyareinthesecondcolumnofthetable.Inthiscase,player1gets10yearsifhe cooperates (first row), and 5 years if he defects (second row). Therefore if player 2 defects, player 1willdefect. We now realize that whatever action player 2 chooses, player 1 will want to defect. We leave it to the reader to do the same type of analysis for player 2, whose payoffsarethesecondentries Lesson 14. Game Theory 4 in each of the payoff vectors. When you do this, you will conclude that player 2 will want to defect, whatever action player 1 chooses. In a game like this, actions that players might take are called strategies.Adominant strategy is a strategy which is optimal for a player, no matter what strategy the other player is choosing. In the prisoners’ dilemma, the best thing for player 1 to do is to defect, no matter what player 2mightdo.Therefore“Defect”isadominantstrategyforplayer1.Similarly,“Defect”isa dominant strategy for player 2. When a pair of strategies are each dominant for the two players, the pair is called a dominant strategy equilibrium or a solution in dominant strategies.Wenow know that (Defect, Defect) is a dominant strategy equilibrium in the prisoners’ dilemma. Rational players should choose dominant strategies if they exist; they clearly make sense, since a dominant strategy is the best for a player no matter what the other player is doing. We conclude that the two suspects should both confess to the police, or defect from each other. Therefore they will each end up with a prison sentence of 5 years. Between the two of them, the total will be 10 years of prison. But this outcome is very peculiar,becauseiftheyhad both chosen to keep their mouths shut, or cooperate with each other, they would have ended up with prison sentences of only 6 months each, and a total of 1 year between the two of them. Back in Lesson 11 on perfectly competitive markets, we introduced the reader to Adam Smith’s free market philosophy—his invisible hand theory. In brief, this is the theory that if the market is allowed to operate freely, with each consumer seeking to maximize his own utility and each firm seeking to maximize its own profits, with each of the players in the grand market game ignoring the welfare of all the others and doing the best it can for itself, the outcome will actually be best for society. That is, self-interested consumers and firms in a competitive market will end up maximizing social surplus, the sum of consumers’ and producers’ surplus. But now note the dramatically different conclusion in the prisoners’ dilemma. In this game, where we are focusing on the outcomes for the two suspects and ignoring the welfare of the police officers, the victims of the original crime, and the rest of society, the obvious and simple measure of social welfare for our two suspects is 1timesthesumofthetwoprisonsentences.(Weneed − the 1toconvertacost—prison time—into a benefit.) But our analysis above indicates that − each player, pursuing his own self-interest, maximizing his own welfare by minimizing his years Lesson 14. Game Theory 7 here is that it may sometimes be in the interest of people to have reputations as being “crazy” or “tough,” in order to induce beneficial changes in the behavior of others. The moral of this story is that game theory can sometimes improve its predictions in explaining real-world phenomena by expanding its models. 4. The Battle of the Sexes, and the Idea of Nash Equilibrium Most games are not as simple to solve as the prisoners’ dilemma.