A Competitive Search Game with a Moving Target∗†
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Lecture 4 Rationalizability & Nash Equilibrium Road
Lecture 4 Rationalizability & Nash Equilibrium 14.12 Game Theory Muhamet Yildiz Road Map 1. Strategies – completed 2. Quiz 3. Dominance 4. Dominant-strategy equilibrium 5. Rationalizability 6. Nash Equilibrium 1 Strategy A strategy of a player is a complete contingent-plan, determining which action he will take at each information set he is to move (including the information sets that will not be reached according to this strategy). Matching pennies with perfect information 2’s Strategies: HH = Head if 1 plays Head, 1 Head if 1 plays Tail; HT = Head if 1 plays Head, Head Tail Tail if 1 plays Tail; 2 TH = Tail if 1 plays Head, 2 Head if 1 plays Tail; head tail head tail TT = Tail if 1 plays Head, Tail if 1 plays Tail. (-1,1) (1,-1) (1,-1) (-1,1) 2 Matching pennies with perfect information 2 1 HH HT TH TT Head Tail Matching pennies with Imperfect information 1 2 1 Head Tail Head Tail 2 Head (-1,1) (1,-1) head tail head tail Tail (1,-1) (-1,1) (-1,1) (1,-1) (1,-1) (-1,1) 3 A game with nature Left (5, 0) 1 Head 1/2 Right (2, 2) Nature (3, 3) 1/2 Left Tail 2 Right (0, -5) Mixed Strategy Definition: A mixed strategy of a player is a probability distribution over the set of his strategies. Pure strategies: Si = {si1,si2,…,sik} σ → A mixed strategy: i: S [0,1] s.t. σ σ σ i(si1) + i(si2) + … + i(sik) = 1. If the other players play s-i =(s1,…, si-1,si+1,…,sn), then σ the expected utility of playing i is σ σ σ i(si1)ui(si1,s-i) + i(si2)ui(si2,s-i) + … + i(sik)ui(sik,s-i). -
Frequently Asked Questions in Mathematics
Frequently Asked Questions in Mathematics The Sci.Math FAQ Team. Editor: Alex L´opez-Ortiz e-mail: [email protected] Contents 1 Introduction 4 1.1 Why a list of Frequently Asked Questions? . 4 1.2 Frequently Asked Questions in Mathematics? . 4 2 Fundamentals 5 2.1 Algebraic structures . 5 2.1.1 Monoids and Groups . 6 2.1.2 Rings . 7 2.1.3 Fields . 7 2.1.4 Ordering . 8 2.2 What are numbers? . 9 2.2.1 Introduction . 9 2.2.2 Construction of the Number System . 9 2.2.3 Construction of N ............................... 10 2.2.4 Construction of Z ................................ 10 2.2.5 Construction of Q ............................... 11 2.2.6 Construction of R ............................... 11 2.2.7 Construction of C ............................... 12 2.2.8 Rounding things up . 12 2.2.9 What’s next? . 12 3 Number Theory 14 3.1 Fermat’s Last Theorem . 14 3.1.1 History of Fermat’s Last Theorem . 14 3.1.2 What is the current status of FLT? . 14 3.1.3 Related Conjectures . 15 3.1.4 Did Fermat prove this theorem? . 16 3.2 Prime Numbers . 17 3.2.1 Largest known Mersenne prime . 17 3.2.2 Largest known prime . 17 3.2.3 Largest known twin primes . 18 3.2.4 Largest Fermat number with known factorization . 18 3.2.5 Algorithms to factor integer numbers . 18 3.2.6 Primality Testing . 19 3.2.7 List of record numbers . 20 3.2.8 What is the current status on Mersenne primes? . -
SEQUENTIAL GAMES with PERFECT INFORMATION Example
SEQUENTIAL GAMES WITH PERFECT INFORMATION Example 4.9 (page 105) Consider the sequential game given in Figure 4.9. We want to apply backward induction to the tree. 0 Vertex B is owned by player two, P2. The payoffs for P2 are 1 and 3, with 3 > 1, so the player picks R . Thus, the payoffs at B become (0, 3). 00 Next, vertex C is also owned by P2 with payoffs 1 and 0. Since 1 > 0, P2 picks L , and the payoffs are (4, 1). Player one, P1, owns A; the choice of L gives a payoff of 0 and R gives a payoff of 4; 4 > 0, so P1 chooses R. The final payoffs are (4, 1). 0 00 We claim that this strategy profile, { R } for P1 and { R ,L } is a Nash equilibrium. Notice that the 0 00 strategy profile gives a choice at each vertex. For the strategy { R ,L } fixed for P2, P1 has a maximal payoff by choosing { R }, ( 0 00 0 00 π1(R, { R ,L }) = 4 π1(R, { R ,L }) = 4 ≥ 0 00 π1(L, { R ,L }) = 0. 0 00 In the same way, for the strategy { R } fixed for P1, P2 has a maximal payoff by choosing { R ,L }, ( 00 0 00 π2(R, {∗,L }) = 1 π2(R, { R ,L }) = 1 ≥ 00 π2(R, {∗,R }) = 0, where ∗ means choose either L0 or R0. Since no change of choice by a player can increase that players own payoff, the strategy profile is called a Nash equilibrium. Notice that the above strategy profile is also a Nash equilibrium on each branch of the game tree, mainly starting at either B or starting at C. -
Subgame-Perfect Ε-Equilibria in Perfect Information Games With
Subgame-Perfect -Equilibria in Perfect Information Games with Common Preferences at the Limit Citation for published version (APA): Flesch, J., & Predtetchinski, A. (2016). Subgame-Perfect -Equilibria in Perfect Information Games with Common Preferences at the Limit. Mathematics of Operations Research, 41(4), 1208-1221. https://doi.org/10.1287/moor.2015.0774 Document status and date: Published: 01/11/2016 DOI: 10.1287/moor.2015.0774 Document Version: Publisher's PDF, also known as Version of record Document license: Taverne Please check the document version of this publication: • A submitted manuscript is the version of the article upon submission and before peer-review. There can be important differences between the submitted version and the official published version of record. People interested in the research are advised to contact the author for the final version of the publication, or visit the DOI to the publisher's website. • The final author version and the galley proof are versions of the publication after peer review. • The final published version features the final layout of the paper including the volume, issue and page numbers. Link to publication General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain • You may freely distribute the URL identifying the publication in the public portal. -
Economics 201B Economic Theory (Spring 2021) Strategic Games
Economics 201B Economic Theory (Spring 2021) Strategic Games Topics: terminology and notations (OR 1.7), games and solutions (OR 1.1-1.3), rationality and bounded rationality (OR 1.4-1.6), formalities (OR 2.1), best-response (OR 2.2), Nash equilibrium (OR 2.2), 2 2 examples × (OR 2.3), existence of Nash equilibrium (OR 2.4), mixed strategy Nash equilibrium (OR 3.1, 3.2), strictly competitive games (OR 2.5), evolution- ary stability (OR 3.4), rationalizability (OR 4.1), dominance (OR 4.2, 4.3), trembling hand perfection (OR 12.5). Terminology and notations (OR 1.7) Sets For R, ∈ ≥ ⇐⇒ ≥ for all . and ⇐⇒ ≥ for all and some . ⇐⇒ for all . Preferences is a binary relation on some set of alternatives R. % ⊆ From % we derive two other relations on : — strict performance relation and not  ⇐⇒ % % — indifference relation and ∼ ⇐⇒ % % Utility representation % is said to be — complete if , or . ∀ ∈ % % — transitive if , and then . ∀ ∈ % % % % can be presented by a utility function only if it is complete and transitive (rational). A function : R is a utility function representing if → % ∀ ∈ () () % ⇐⇒ ≥ % is said to be — continuous (preferences cannot jump...) if for any sequence of pairs () with ,and and , . { }∞=1 % → → % — (strictly) quasi-concave if for any the upper counter set ∈ { ∈ : is (strictly) convex. % } These guarantee the existence of continuous well-behaved utility function representation. Profiles Let be a the set of players. — () or simply () is a profile - a collection of values of some variable,∈ one for each player. — () or simply is the list of elements of the profile = ∈ { } − () for all players except . ∈ — ( ) is a list and an element ,whichistheprofile () . -
Pure and Bayes-Nash Price of Anarchy for Generalized Second Price Auction
Pure and Bayes-Nash Price of Anarchy for Generalized Second Price Auction Renato Paes Leme Eva´ Tardos Department of Computer Science Department of Computer Science Cornell University, Ithaca, NY Cornell University, Ithaca, NY [email protected] [email protected] Abstract—The Generalized Second Price Auction has for advertisements and slots higher on the page are been the main mechanism used by search companies more valuable (clicked on by more users). The bids to auction positions for advertisements on search pages. are used to determine both the assignment of bidders In this paper we study the social welfare of the Nash equilibria of this game in various models. In the full to slots, and the fees charged. In the simplest model, information setting, socially optimal Nash equilibria are the bidders are assigned to slots in order of bids, and known to exist (i.e., the Price of Stability is 1). This paper the fee for each click is the bid occupying the next is the first to prove bounds on the price of anarchy, and slot. This auction is called the Generalized Second Price to give any bounds in the Bayesian setting. Auction (GSP). More generally, positions and payments Our main result is to show that the price of anarchy is small assuming that all bidders play un-dominated in the Generalized Second Price Auction depend also on strategies. In the full information setting we prove a bound the click-through rates associated with the bidders, the of 1.618 for the price of anarchy for pure Nash equilibria, probability that the advertisement will get clicked on by and a bound of 4 for mixed Nash equilibria. -
Tic-Tac-Toe Is Not Very Interesting to Play, Because If Both Players Are Familiar with the Game the Result Is Always a Draw
CMPSCI 250: Introduction to Computation Lecture #27: Games and Adversary Search David Mix Barrington 4 November 2013 Games and Adversary Search • Review: A* Search • Modeling Two-Player Games • When There is a Game Tree • The Determinacy Theorem • Searching a Game Tree • Examples of Games Review: A* Search • The A* Search depends on a heuristic function, which h(y) = 6 2 is a lower bound on the 23 s distance to the goal. x 4 If x is a node, and g is the • h(z) = 3 nearest goal node to x, the admissibility condition p(y) = 23 + 2 + 6 = 31 on h is that 0 ≤ h(x) ≤ d(x, g). p(z) = 23 + 4 + 3 = 30 Review: A* Search • Suppose we have taken y off of the open list. The best-path distance from the start s to the goal g through y is d(s, y) + d(y, h(y) = 6 g), and this cannot be less than 2 23 d(s, y) + h(y). s x 4 • Thus when we find a path of length k from s to y, we put y h(z) = 3 onto the open list with priority k p(y) = 23 + 2 + 6 = 31 + h(y). We still record the p(z) = 23 + 4 + 3 = 30 distance d(s, y) when we take y off of the open list. Review: A* Search • The advantage of A* over uniform-cost search is that we do not consider entries x in the closed list for which d(s, x) + h(x) h(y) = 6 is greater than the actual best- 2 23 path distance from s to g. -
Adversarial Search (Game)
Adversarial Search CE417: Introduction to Artificial Intelligence Sharif University of Technology Spring 2019 Soleymani “Artificial Intelligence: A Modern Approach”, 3rd Edition, Chapter 5 Most slides have been adopted from Klein and Abdeel, CS188, UC Berkeley. Outline } Game as a search problem } Minimax algorithm } �-� Pruning: ignoring a portion of the search tree } Time limit problem } Cut off & Evaluation function 2 Games as search problems } Games } Adversarial search problems (goals are in conflict) } Competitive multi-agent environments } Games in AI are a specialized kind of games (in the game theory) 3 Adversarial Games 4 Types of Games } Many different kinds of games! } Axes: } Deterministic or stochastic? } One, two, or more players? } Zero sum? } Perfect information (can you see the state)? } Want algorithms for calculating a strategy (policy) which recommends a move from each state 5 Zero-Sum Games } Zero-Sum Games } General Games } Agents have opposite utilities } Agents have independent utilities (values on outcomes) (values on outcomes) } Lets us think of a single value that one } Cooperation, indifference, competition, maximizes and the other minimizes and more are all possible } Adversarial, pure competition } More later on non-zero-sum games 6 Primary assumptions } We start with these games: } Two -player } Tu r n taking } agents act alternately } Zero-sum } agents’ goals are in conflict: sum of utility values at the end of the game is zero or constant } Perfect information } fully observable Examples: Tic-tac-toe, -
Best-First Minimax Search Richard E
Artificial Intelligence ELSEVIER Artificial Intelligence 84 ( 1996) 299-337 Best-first minimax search Richard E. Korf *, David Maxwell Chickering Computer Science Department, University of California, Los Angeles, CA 90024, USA Received September 1994; revised May 1995 Abstract We describe a very simple selective search algorithm for two-player games, called best-first minimax. It always expands next the node at the end of the expected line of play, which determines the minimax value of the root. It uses the same information as alpha-beta minimax, and takes roughly the same time per node generation. We present an implementation of the algorithm that reduces its space complexity from exponential to linear in the search depth, but at significant time cost. Our actual implementation saves the subtree generated for one move that is still relevant after the player and opponent move, pruning subtrees below moves not chosen by either player. We also show how to efficiently generate a class of incremental random game trees. On uniform random game trees, best-first minimax outperforms alpha-beta, when both algorithms are given the same amount of computation. On random trees with random branching factors, best-first outperforms alpha-beta for shallow depths, but eventually loses at greater depths. We obtain similar results in the game of Othello. Finally, we present a hybrid best-first extension algorithm that combines alpha-beta with best-first minimax, and performs significantly better than alpha-beta in both domains, even at greater depths. In Othello, it beats alpha-beta in two out of three games. 1. Introduction and overview The best chess machines, such as Deep-Blue [lo], are competitive with the best humans, but generate billions of positions per move. -
Matsui 2005 IER-23Wvu3z
INTERNATIONAL ECONOMIC REVIEW Vol. 46, No. 1, February 2005 ∗ A THEORY OF MONEY AND MARKETPLACES BY AKIHIKO MATSUI AND TAKASHI SHIMIZU1 University of Tokyo, Japan; Kansai University, Japan This article considers an infinitely repeated economy with divisible fiat money. The economy has many marketplaces that agents choose to visit. In each mar- ketplace, agents are randomly matched to trade goods. There exist a variety of stationary equilibria. In some equilibrium, each good is traded at a single price, whereas in another, every good is traded at two different prices. There is a contin- uum of such equilibria, which differ from each other in price and welfare levels. However, it is shown that only the efficient single-price equilibrium is evolution- arily stable. 1. INTRODUCTION In a transaction, one needs to have what his trade partner wants. However, it is hard for, say, an economist who wants to have her hair cut to find a hairdresser who wishes to learn economics. In order to mitigate this problem of a lack of double coincidence of wants, money is often used as a medium of exchange. If there is a generally acceptable good called money, then the economist can divide the trading process into two: first, she teaches economics students to obtain money, and then finds a hairdresser to exchange money for a haircut. The hairdresser accepts the money since he can use it to obtain what he wants. Money is accepted by many people as it is believed to be accepted by many. Focusing on this function of money, Kiyotaki and Wright (1989) formalized the process of monetary exchange. -
SUPPLEMENT to “VERY SIMPLE MARKOV-PERFECT INDUSTRY DYNAMICS: THEORY” (Econometrica, Vol
Econometrica Supplementary Material SUPPLEMENT TO “VERY SIMPLE MARKOV-PERFECT INDUSTRY DYNAMICS: THEORY” (Econometrica, Vol. 86, No. 2, March 2018, 721–735) JAAP H. ABBRING CentER, Department of Econometrics & OR, Tilburg University and CEPR JEFFREY R. CAMPBELL Economic Research, Federal Reserve Bank of Chicago and CentER, Tilburg University JAN TILLY Department of Economics, University of Pennsylvania NAN YANG Department of Strategy & Policy and Department of Marketing, National University of Singapore KEYWORDS: Demand uncertainty, dynamic oligopoly, firm entry and exit, sunk costs, toughness of competition. THIS SUPPLEMENT to Abbring, Campbell, Tilly, and Yang (2018) (hereafter referred to as the “main text”) (i) proves a theorem we rely upon for the characterization of equilibria and (ii) develops results for an alternative specification of the model. Section S1’s Theorem S1 establishes that a strategy profile is subgame perfect if no player can benefit from deviating from it in one stage of the game and following it faith- fully thereafter. Our proof very closely follows that of the analogous Theorem 4.2 in Fudenberg and Tirole (1991). That theorem only applies to games that are “continuous at infinity” (Fudenberg and Tirole, p. 110), which our game is not. In particular, we only bound payoffs from above (Assumption A1 in the main text) and not from below, be- cause we want the model to encompass econometric specifications like Abbring, Camp- bell, Tilly, and Yang’s (2017) that feature arbitrarily large cost shocks. Instead, our proof leverages the presence of a repeatedly-available outside option with a fixed and bounded payoff, exit. Section S2 presents the primitive assumptions and analysis for an alternative model of Markov-perfect industry dynamics in which one potential entrant makes its entry decision at the same time as incumbents choose between continuation and exit. -
Constructing Stationary Sunspot Equilibria in a Continuous Time Model*
A Note on Woodford's Conjecture: Constructing Stationary Title Sunspot Equilibria in a Continuous Time Model(Nonlinear Analysis and Mathematical Economics) Author(s) Shigoka, Tadashi Citation 数理解析研究所講究録 (1994), 861: 51-66 Issue Date 1994-03 URL http://hdl.handle.net/2433/83844 Right Type Departmental Bulletin Paper Textversion publisher Kyoto University 数理解析研究所講究録 第 861 巻 1994 年 51-66 51 A Note on Woodford’s Conjecture: Constructing Stationary Sunspot Equilibria in a Continuous Time Model* Tadashi Shigoka Kyoto Institute of Economic Research, Kyoto University, Yoshidamachi Sakyoku Kyoto 606 Japan 京都大学 経済研究所 新後閑 禎 Abstract We show how to construct stationary sunspot equilibria in a continuous time model, where equilibrium is indeterninate near either a steady state or a closed orbit. Woodford’s conjecture that the indeterminacy of equilibrium implies the existence of stationary sunspot equilibria remains valid in a continuous time model. 52 Introduction If for given equilibrium dynamics there exist a continuum of non-stationary perfect foresight equilibria all converging asymptoticaUy to a steady state (a deterministic cycle resp.), we say the equilibrium dynamics is indeterminate near the steady state (the deterministic cycle resp.). Suppose that the fundamental characteristics of an economy are deterministic, but that economic agents believe nevertheless that equihibrium dynamics is affected by random factors apparently irrelevant to the fundamental characteristics (sunspots). This prophecy could be self-fulfilling, and one will get a sunspot equilibrium, if the resulting equilibrium dynamics is subject to a nontrivial stochastic process and confirns the agents’ belief. See Shell [19], and Cass-Shell [3]. Woodford [23] suggested that there exists a close relation between the indetenninacy of equilibrium near a deterministic steady state and the existence of stationary sunspot equilibria in the immediate vicinity of it.