Games of Perfect Information and Backward Induction Economics 222 - Introduction to Game Theory

Games of Perfect Information and Backward Induction Economics 222 - Introduction to Game Theory Shih En Lu Simon Fraser University ECON 222 (SFU) Perfect Info and Backward Induction 1 / 14 Topics 1 Basic Setup for Games of Perfect Information 2 Going from the Game Tree to the Normal Form 3 Problems with Nash Equilibrium 4 Backward Induction and Subgame-Perfect Equilibrium ECON 222 (SFU) Perfect Info and Backward Induction 2 / 14 Introduction to Sequential Games Up to now: studied games where players move simultaneously. But often, people/firms observe what others do before acting. Does it make a difference? Example: Battle of the Sexes Guy Ballet Hockey If simultaneous-move: Girl Ballet 2,1 0,0 Hockey 0,0 1,2 What happens if Girl texts Guy: "I’mgoing to the ballet, and my phone is dying. See you there!" So if Girl has the opportunity to send that text (and, for whatever reason, Guy doesn’t),will she do it? ECON 222 (SFU) Perfect Info and Backward Induction 3 / 14 The Extensive Form A good way to represent a sequential game is the extensive form, often called a "game tree." Consider the Battle of the Sexes, with the girl (player 1) first deciding "Ballet" (B) or "Hockey" (H), and then becoming out of reach. Each branch is an action. Let’scall the guy’s(player 2’s)actions B’ and H’to distinguish them from the girl’s. Each non-terminal node is a place where the specified player has to make a decision. Each terminal node is an outcome: still a combination of actions (but, in general, no longer necessarily one per player since a player might move more than once). The numbers below each terminal node are the payoffs from the outcome corresponding to the node. As usual, the first number is player 1’spayoff, the second is player 2’s,etc. ECON 222 (SFU) Perfect Info and Backward Induction 4 / 14 Perfect Information We will first study games of perfect information: one player acts at a time, and each player sees all previous actions. Simultaneous-move games are NOT games of perfect information (when at least two players have at least two actions each). Later, we will look at games that do not have perfect information. Example of the latter: playing a prisoner’sdilemma more than once. Note: don’tconfuse perfect information with complete information! ECON 222 (SFU) Perfect Info and Backward Induction 5 / 14 Strategies in Sequential Games A player’s strategy specifies a probability distribution over her actions at each node where she plays, regardless of whether that node is reached. Nodes that are reached form what is called the path (of play). Thus, a strategy must specify how a player plays both on-path and off-path. In other words, a strategy is a player’s full contingency plan. In our example, the Guy’sstrategy must include what he would do if the Girl’schooses H, even if we don’texpect the Girl to choose H. Example 1: Guy chooses H’regardless of what Girl does. (B H’,H H’) ! ! Example 2: Guy chooses the same thing as the Girl. (B B’,H H’) ! ! Just like before, a strategy profile is a collection of strategies with exactly one from each player. ECON 222 (SFU) Perfect Info and Backward Induction 6 / 14 Nash Equilibrium in Sequential Games Let’sfind the pure-strategy NE in this game. As before, we use the normal form: (B B’,H B’) (B B’,H H’) (B H’,H B’) (B H’,H H’) B! 3,1! ! 3,1! ! 0,0! ! 0,0 ! H 0,0 1,4 0,0 1,4 Are these pure-strategy NE all realistic? ECON 222 (SFU) Perfect Info and Backward Induction 7 / 14 Backward Induction Idea: should require that players play a best response (given what they know) at all nodes, even those that are not reached. Strategy profiles satisfying the above are called subgame-perfect (Nash) equilibria (SPE or SPNE) in games of complete information. In games of perfect information, solving for SPE is particularly easy: just start at the terminal nodes to infer what players will do at the last step. Given that, figure out what happens at the second-to-last step, and so on. This procedure is called backward induction. Let’sdo this for the sequential battle of the sexes. ECON 222 (SFU) Perfect Info and Backward Induction 8 / 14 Exercise 1 Consider Rock-Paper-Scissors, but suppose player 2 sees what player 1 does before acting. Payoff is 1 for a win, -1 for a loss, and 0 for a tie. Draw this game in extensive form, and find its pure-strategy SPE(s) using backward induction. ECON 222 (SFU) Perfect Info and Backward Induction 9 / 14 Some Questions on SPE When is there a unique SPE in perfect information games? ECON 222 (SFU) Perfect Info and Backward Induction 10 / 14 Commitment versus Flexibility In Battle of the Sexes, players gain from committing to a course of action. As a result, there is a first-mover advantage: the Guy would like to threaten to go to the hockey game after the Girl has gone to the ballet dance, but cannot do so credibly. As we saw, NE allows for such non-credible threats, while SPE doesn’t. By contrast, in Rock-Paper-Scissors, flexibility creates a second-mover advantage. There are also games where neither is the case. ECON 222 (SFU) Perfect Info and Backward Induction 11 / 14 Exercise 2 (Ultimatum Game) Player 1 proposes a division of a payoff of 100 by offering x to player 2 and keeping 100 x. x must be an integer (whole number) from 0 to 100, so player 1 has 101 available actions. Player 2 then decides whether to accept the proposal. If player 2 accepts, the proposal is enacted: player 1’spayoff is 100 x and player 2’spayoff is x. If player 2 rejects, both players receive a payoff of zero. MOBLAB: indicate both how much you would offer as Player 1, and how you would play as Player 2. As player 2, MobLab limits you to (pure) strategies of the form "I accept if and only if x is at least _." Payoffs are determined by matching you to every respondent, taking averages of how you would have done as player 1 and as player 2, and adding up your two averages. The number of MobLab units you earn is your payoff divided by 10. ECON 222 (SFU) Perfect Info and Backward Induction 12 / 14 Problems with Backward Induction May not be reasonable when game is long and/or complicated, e.g. chess. This is a similar problem as in ISD: we assume that players can do long chains of reasoning, that they trust others to do so, that they trust others to trust others to do so, and so on... Even if you accept this assumption, you need a further assumption when a player has multiple best responses at a node: players correctly anticipate what others will do, even though this cannot be deduced by logic alone. This is an assumption we also made for NE. Philosophical aside: unexpected hanging paradox. ECON 222 (SFU) Perfect Info and Backward Induction 13 / 14 Recap We introduced games of perfect information, and solved them using backward induction. Perfect information: one player acts at a time, and each player sees all previous actions. We represented these games using the extensive form (game tree). Backward induction: start at the bottom of the game tree, figure out the best response(s) at each node, and work our way up the tree. We said that the resulting strategy profile(s) is/are SPE(s). ECON 222 (SFU) Perfect Info and Backward Induction 14 / 14.

Games of Perfect Information and Backward Induction Economics 222 - Introduction to Game Theory

Lecture 4 Rationalizability & Nash Equilibrium Road

1 Sequential Games

Minimax TD-Learning with Neural Nets in a Markov Game

Lecture Notes

SEQUENTIAL GAMES with PERFECT INFORMATION Example

1 Bertrand Model

Economics 201B Economic Theory (Spring 2021) Strategic Games

Chapter 16 Oligopoly and Game Theory Oligopoly Oligopoly

Dynamic Games Under Bounded Rationality

Pure and Bayes-Nash Price of Anarchy for Generalized Second Price Auction

Extensive Form Games with Perfect Information

UC Merced Proceedings of the Annual Meeting of the Cognitive Science Society