The Moran Process As a Markov Chain on Leaf-Labeled Trees

Total Page:16

File Type:pdf, Size:1020Kb

The Moran Process As a Markov Chain on Leaf-Labeled Trees The Moran Process as a Markov Chain on Leaf-labeled Trees David J. Aldous∗ University of California Department of Statistics 367 Evans Hall # 3860 Berkeley CA 94720-3860 [email protected] http://www.stat.berkeley.edu/users/aldous March 29, 1999 Abstract The Moran process in population genetics may be reinterpreted as a Markov chain on a set of trees with leaves labeled by [n]. It provides an interesting worked example in the theory of mixing times and coupling from the past for Markov chains. Mixing times are shown to be of order n2, as anticipated by the theory surrounding Kingman’s coalescent. Incomplete draft -- do not circulate AMS 1991 subject classification: 05C05,60C05,60J10 Key words and phrases. Coupling from the past, Markov chain, mixing time, phylogenetic tree. ∗Research supported by N.S.F. Grant DMS96-22859 1 1 Introduction The study of mixing times for Markov chains on combinatorial sets has attracted considerable interest over the last ten years [3, 5, 7, 14, 16, 18]. This paper provides another worked example. We must at the outset admit that the mathematics is fairly straightforward, but we do find the example instructive. Its analysis provides a simple but not quite obvious illustration of coupling from the past, reminiscent of the elementary analysis ([1] section 4) of riffle shuffle, and of the analysis of move-to-front list algorithms [12] and move-to-root algorithms for maintaining search trees [9]. The main result, Proposition 2, implies that while most combinatorial chains exhibit the cut-off phenomenon [8], this particular example has the opposite diffusive behavior. Our precise motivation for studying this model was as a simpler version of certain Markov chains on phylogenetic trees: see section 3(b) for further discussion. The model also fits a general framework studied in [4]: see section 3(a). 1.1 The Moran chain The Moran model ([11] section 3.3) in population genetics models a popu- lation of constant size n. At each step, one randomly-chosen individual is killed and another randomly-chosen individual gives birth to a child. The feature of interest is the genealogy of the individuals alive at a given time, that is how they are related to each other by descent. In population genetics these “individuals” are in fact genes and there is also mutation and selection structure, but our interest goes in a different direction. There is some flexibility in how much information we choose to record in the genealogy of the current population, and we will make the choice that seems simplest from the combinatorial viewpoint. Label the individuals as [n] := {1, 2, . , n}, and declare that if individual k is killed then the new child is given label k. The left diagram in figure 1 shows a possible genealogy, in which we keep track of the order of the times at which all the splits in all the lines of descent occured, but not the absolute times of splits. 2 level 0 t t = f7(t, 4, 7) 7 6 ¨H ¨H ¨¨ HH ¨¨ HH 5 ¨ H ¨ H @ QQ A 4 @ Q A ¡A ¡@ @ 3 ¡ A ¡ @ @ ¡@ ¡@ ¡A 2 ¡ @ ¡ @ ¡ A ¡A ¡A ¡@ 1 ¡ A ¡ A @ ¡A ¡A ¡A 0 ¡ A ¡ A ¡ A 6 4 3 7 1 4 2 6 4 3 1 4 2 6 4 7 3 1 4 2 Figure 1. A transition t → t0 in the Moran chain. Precisely, the left diagram shows a tree t with leaf-set [n] and height n, where at each level one downward edge splits into two downward edges, and where we distinguish between left and right branches. Such a tree has n(n + 1)/2 edges of unit length. Write Tn for the set of such trees. The cardinality of this set is #Tn = n!(n − 1)! (1) We leave to the reader the (very easy) task of giving a bijective proof of (1); an inductive proof will fall out of the proof of Lemma 1 below. Interpreting the Moran model as a Tn-valued process gives a Markov chain on Tn which we call the Moran chain. Here is a careful definition. Take a tree t ∈ Tn and distinct ordered (j, k) from [n]. Delete leaf k from t, then insert leaf k into the edge incident at leaf j, placing it to the right of leaf j. This gives a new tree 0 t = fn(t, j, k). (2) Such a transition is illustrated in figure 1. Starting from the tree t in the left diagram, leaf 7 is deleted and the levels adjusted where necessary, to give the center diagram; then leaf 7 is inserted to the right of leaf 4, and levels adjusted, to give the tree t0 in the right diagram. The Moran chain is now defined to be the chain with transition probabilities 0 0 p(t, t ) = P (fn(t, J, K) = t ) where (J, K) is a uniform random distinct ordered pair from [n]. It is easy to check this defines an aperodic irreducible chain, which there- fore has some limiting stationary distribution. Our choice of Tn as the precise state-space was motivated by 3 Lemma 1 The stationary distribution of the Moran chain is the uniform distribution on Tn. Proof. From a tree t there are n × (n − 1) equally likely choices of (k, j) which define the possible transitions t → t0. To prove the lemma we need to show the chain is doubly stochastic, i.e. that for each tree t0 there are n(n − 1) such choices which get to t0 from some starting trees t. For a tree t0 (illustrated by the right diagram in figure 1) there is only one leaf k (leaf 7, in figure 1) which might have been inserted last, into a diagram like the tree t00 in the center diagram. The trees t such that deleting leaf k from t gives t00 are exactly the trees obtainable by attaching leaf k to any of the (n−1)n/2 edges of t00 and to the right or the left of the existing edge, giving a total of (n − 1)n/2 × 2 choices, as required. Remark. The final part of the argument says that the general element t ∈ Tn may be constructed from a tree in Tn−1 by attaching leaf n to one of the (n − 1)n/2 edges, to the right or the left of the existing edge. So #Tn = n(n − 1) #Tn−1, establishing (1) by induction. 1.2 Mixing time for the Moran chain Write (Xn(m), m = 0, 1, 2,...) for the Moran chain on Tn. One of the standard ways of studying mixing is via the maximal variation distance 1 X 0 0 dn(m) := max P (Xn(m) = t |Xn(0) = t) − πn(t ) (3) t∈T 2 n t0 0 where πn(t ) = 1/#Tn is the stationary probability. Our result is most easily expressed in terms of the following random variables. Let (ξi, 2 ≤ i < ∞) be 1 independent, and let ξi have the exponential distribution with mean i(i−1) ; then let ∞ X L = ξi. (4) i=2 2 Proposition 2 (a) lim supn dn(zn ) ≤ P (L > z) < 1 for each 0 < z < ∞ 2 (b) lim infn dn(zn ) ≥ φ(z), for some φ(z) ↑ 1 as z ↓ 0. Thus mixing time (as measured by variation distance) for the Moran chain is order n2 but variation distance does not exhibit the cut-off phenomenon of [8] which usually occurs with Markov chains on combinatorial sets. As briefly sketched in the next section, what’s really going on is that 2 dn(zn ) → d∞(z) 4 where d∞(·) is the maximal variation distance associated with a certain limit continuous-time continuous-space Markov process. We might call this dif- fusive behavior, from the case of simple random walks on the k-dimensional integers modulo n, whose n → ∞ limit behavior is of this form with the limit process being Brownian motion on [0, 1]k. 1.3 Remarks on a limit process We now outline why Proposition 2 is not unexpected. In the original Moran model for a population, one can look back from the present to determine the number of steps Ln back until the last common ancestor of the present pop- −2 ulation. It is standard (see section 2.3) that n Ln converges in distribution to L. Loosely speaking, this implies that the genealogy after n2L steps can- not depend on the initial genealogy, and Proposition 2(a) is a formalization of that idea. A more elaborate picture is given by the theory surrounding Kingman’s coalescent [13, 17], which portrays the rescaled n → ∞ limit of the genealogy of the current size-n population as a genealogy C of an infinite population. Informally, what’s really going on is that 2 dn(zn ) → d∞(z), 0 < z < ∞ where d∞(·) is the maximal variation distance associated with a certain continuous-time Markov process (Ct, 0 ≤ t < ∞) whose state space is a set of possible genealogies for an infinite population. However, defining the process (Ct) precisely and proving mixing time bounds via this weak convergence methodology is technically hard. It turns out to be fairly simple to give an analysis of Proposition 2 directly in the discrete setting, by combining the standard analysis of Ln with a coupling construction, so that is what we shall do. 2 Proof of Proposition 2 2.1 Coupling from the past The proof is based on a standard elementary idea. Suppose a Markov chain (X(s)) can be represented in the form X(s + 1) = f(X(s),Us+1) for some function f, where the (Us) are independent with common distribu- tion µ.
Recommended publications
  • A Stochastic Differential Equation Derived from Evolutionary Game Theory
    U.U.D.M. Project Report 2019:5 A stochastic differential equation derived from evolutionary game theory Brian Treacy Examensarbete i matematik, 15 hp Handledare: Ingemar Kaj Examinator: Denis Gaidashev Februari 2019 Department of Mathematics Uppsala University Abstract Evolutionary game theory models population dynamics of species when a concept of fit- ness is introduced. Initally ordinary differential equation models were the primary tool but these fail to capture inherent randomness in sufficiently small finite populations. There- fore stochastic models were created to capture the stochasticity in these populations. This thesis connects these two modeling paradigms. It does this by defining a stochastic process model for fitness-dependent population dynamics and deriving its associated in- finitesimal generator. This is followed by linking the finite population stochastic process model to the infinite population ordinary differential equation model by taking the limit of the total population which results in a stochastic differential equation. The behaviour of this stochastic differential equation is analyzed, showing how certain parameters af- fect how closely this stochastic differential equation mimics the behaviour of the ordinary differential equation. The thesis concludes by providing graphical evidence that the one- third rule holds with this stochastic differential equation. 3 Contents Introduction 5 1 Moran Process 6 1.1 Payoff ....................................6 1.2 Fitness . .7 1.3 Transition Probabilities . .7 1.4 Fixation Probabilities and Times . .8 2 Infinitesimal Generator 9 2.1 Infinitesimal Generator for alternative fitness function . 10 3 Stochastic Differential Equations 11 3.1 Heuristics . 11 3.2 Existence and Uniqueness . 11 3.3 Infinitesimal Generator . 12 4 Analysis 13 4.1 Analysis of deterministic component .
    [Show full text]
  • Populations in Environments with a Soft Carrying Capacity Are Eventually
    Populations in environments with a soft carrying capacity are eventually extinct † Peter Jagers∗ Sergei Zuyev∗ August 5, 2020 Abstract Consider a population whose size changes stepwise by its members reproducing or dying (disappearing), but is otherwise quite general. Denote the initial (non-random) size by Z0 and the size of the nth change by Cn, n = 1,2,.... Population sizes hence develop successively as Z1 = Z0 +C1, Z2 = Z1 +C2 and so on, indefinitely or until there are no further size changes, due to extinction. Extinction is thus assumed final, so that Zn = 0 implies that Zn+1 = 0, without there being any other finite absorbing class of population sizes. We make no assumptions about the time durations between the successive changes. In the real world, or more specific models, those may be of varying length, depending upon individual life span distributions and their interdepen- dencies, the age-distribution at hand and intervening circumstances. We could con- sider toy models of Galton-Watson type generation counting or of the birth-and-death type, with one individual acting per change, until extinction, or the most general multi- type CMJ branching processes with, say, population size dependence of reproduction. Changes may have quite varying distributions. The basic assumption is that there is a carrying capacity, i.e. a non-negative number K such that the conditional expectation of the change, given the complete past history, is non-positive whenever the population exceeds the carrying capacity. Further, to avoid unnecessary technicalities, we assume that the change Cn equals -1 (one individual dying) with a conditional (given the past) probability uniformly bounded away from 0.
    [Show full text]
  • An Ergodic Theorem for Fleming-Viot Models in Random Environments
    An Ergodic Theorem for Fleming-Viot Models in Random Environments 1 Arash Jamshidpey 2 2016 Abstract The Fleming-Viot (FV) process is a measure-valued diffusion that models the evolution of type frequencies in a countable population which evolves under resampling (genetic drift), mutation, and selection. In the classic FV model the fitness (strength) of types is given by a measurable function. In this paper, we introduce and study the Fleming-Viot process in random environment (FVRE), when by random environment we mean the fitness of types is a stochastic process with c`adl`ag paths. We identify FVRE as the unique solution to a so called quenched martingale problem and derive some of its properties via martingale and duality methods. We develop the duality methods for general time-inhomogeneous and quenched martingale problems. In fact, some important aspects of the duality relations only appears for time-inhomogeneous (and quenched) martingale problems. For example, we see that duals evolve backward in time with respect to the main Markov process whose evolution is forward in time. Using a family of function-valued dual processes for FVRE, we prove that, as the number eN of individuals N tends to ∞, the measure-valued Moran process µN (with fitness process eN ) e converges weakly in Skorokhod topology of c`adl`ag functions to the FVRE process µ (with fitness process e), if eN → e a.s. in Skorokhod topology of c`adl`ag functions. We also study e the long-time behaviour of FVRE process (µt )t≥0 joint with its fitness process e =(et)t≥0 and e prove that the joint FV-environment process (µt ,et)t≥0 is ergodic under the assumption of weak ergodicity of e.
    [Show full text]
  • An Extended Moran Process That Captures the Struggle for Fitness
    An extended Moran process that captures the struggle for fitness a, b c Marthe M˚aløy ⇤, Frode M˚aløy, Rafael Lahoz-Beltr´a , Juan Carlos Nu˜no , Antonio Brud aDepartment of Mathematics and Statistics, The Arctic University of Norway. bDepartment of Biodiversity, Ecology and Evolution, Complutense University of Madrid, Spain. cDepartamento de Matem´atica Aplicada a los Recursos Naturales, Universidad Polit´ecnica de Madrid, Spain dDepartment of Applied Mathematics, Complutense University of Madrid, Spain Abstract When a new type of individual appears in a stable population, the newcomer is typically not advantageous. Due to stochasticity, the new type can grow in numbers, but the newcomers can only become advantageous if they manage to change the environment in such a way that they increase their fitness. This dy- namics is observed in several situations in which a relatively stable population is invaded by an alternative strategy, for instance the evolution of cooperation among bacteria, the invasion of cancer in a multicellular organism and the evo- lution of ideas that contradict social norms. These examples also show that, by generating di↵erent versions of itself, the new type increases the probability of winning the struggle for fitness. Our model captures the imposed cooperation whereby the first generation of newcomers dies while changing the environment such that the next generations become more advantageous. Keywords: Evolutionary dynamics, Nonlinear dynamics, Mathematical modelling, Game theory, Cooperation 1. Introduction When unconditional cooperators appear in a large group of defectors, they are exploited until they become extinct. The best possible scenario for this type of cooperators is to change the environment such that another type of cooper- 5 ators that are regulated and only cooperate under certain conditions becomes advantageous.
    [Show full text]
  • Dynamics of Markov Chains for Undergraduates
    Dynamics of Markov Chains for Undergraduates Ursula Porod February 19, 2021 Contents Preface 6 1 Markov chains 8 1.1 Introduction . .8 1.2 Construction of a Markov chain . .9 1.2.1 Finite-length trajectories . .9 1.2.2 From finite to infinite-length trajectories . 11 1.3 Basic computations for Markov chains . 16 1.4 Strong Markov Property . 21 1.5 Examples of Markov Chains . 23 1.6 Functions of Markov chains . 35 1.7 Irreducibility and class structure of the state space . 42 1.8 Transience and Recurrence . 44 1.9 Absorbing chains . 53 1.9.1 First step analysis . 53 1.9.2 Finite number of transient states . 55 1.9.3 Infinite number of transient states . 61 1.10 Stationary distributions . 66 1.10.1 Existence and uniqueness of an invariant measure . 68 1.10.2 Positive recurrence versus null recurrence . 73 1.10.3 Stationary distributions for reducible chains . 74 1.10.4 Steady state distributions . 76 1.11 Periodicity . 77 1.12 Exercises . 85 2 Limit Theorems for Markov Chains 87 2.1 The Ergodic Theorem . 87 2.2 Convergence . 93 2.3 Long-run behavior of reducible chains . 98 1 CONTENTS 2 2.4 Exercises . 100 3 Random Walks on Z 101 3.1 Basics . 101 3.2 Wald's Equations . 103 3.3 Gambler's Ruin . 105 3.4 P´olya's Random Walk Theorem . 111 3.5 Reflection Principle and Duality . 115 3.5.1 The ballot problem . 120 3.5.2 Dual walks . 121 3.5.3 Maximum and minimum .
    [Show full text]
  • Scaling Limits of a Model for Selection at Two Scales
    Home Search Collections Journals About Contact us My IOPscience Scaling limits of a model for selection at two scales This content has been downloaded from IOPscience. Please scroll down to see the full text. 2017 Nonlinearity 30 1682 (http://iopscience.iop.org/0951-7715/30/4/1682) View the table of contents for this issue, or go to the journal homepage for more Download details: IP Address: 152.3.102.254 This content was downloaded on 01/06/2017 at 10:54 Please note that terms and conditions apply. You may also be interested in: Stochastic switching in biology: from genotype to phenotype Paul C Bressloff Ergodicity in randomly forced Rayleigh–Bénard convection J Földes, N E Glatt-Holtz, G Richards et al. Regularity of invariant densities for 1D systems with random switching Yuri Bakhtin, Tobias Hurth and Jonathan C Mattingly Cusping, transport and variance of solutions to generalized Fokker–Planck equations Sean Carnaffan and Reiichiro Kawai Anomalous diffusion for a class of systems with two conserved quantities Cédric Bernardin and Gabriel Stoltz Feynman–Kac equation for anomalous processes with space- and time-dependent forces Andrea Cairoli and Adrian Baule Mixing rates of particle systems with energy exchange A Grigo, K Khanin and D Szász Ergodicity of a collective random walk on a circle Michael Blank Stochastic partial differential equations with quadratic nonlinearities D Blömker, M Hairer and G A Pavliotis IOP Nonlinearity Nonlinearity London Mathematical Society Nonlinearity Nonlinearity 30 (2017) 1682–1707 https://doi.org/10.1088/1361-6544/aa5499
    [Show full text]
  • Stochastische Prozesse
    University of Vienna Stochastische Prozesse January 27, 2015 Contents 1 Random Walks 5 1.1 Heads or tails . 5 1.2 Probability of return . 6 1.3 Reflection principle . 7 1.4 Main lemma for symmetric random walks . 7 1.5 First return . 8 1.6 Last visit . 9 1.7 Sojourn times . 10 1.8 Position of maxima . 11 1.9 Changes of sign . 11 1.10 Return to the origin . 12 1.11 Random walks in the plane Z2 . 13 1.12 The ruin problem . 13 1.13 How to gamble if you must . 14 1.14 Expected duration of the game . 15 1.15 Generating function for the duration of the game . 15 1.16 Connection with the diffusion process . 17 2 Branching processes 18 2.1 Extinction or survival of family names . 18 2.2 Proof using generating functions . 19 2.3 Some facts about generating functions . 22 2.4 Moment and cumulant generating functions . 23 2.5 An example from genetics by Fischer (1930) . 24 2.6 Asymptotic behaviour . 26 3 Markov chains 27 3.1 Definition . 27 3.2 Examples of Markov chains . 27 3.3 Transition probabilities . 30 3.4 Invariant distributions . 31 3.5 Ergodic theorem for primitive Markov chains . 31 3.6 Examples for stationary distributions . 34 3.7 Birth-death chains . 34 3.8 Reversible Markov chains . 35 3.9 The Markov chain tree formula . 36 1 3.10 Mean recurrence time . 39 3.11 Recurrence vs. transience . 40 3.12 The renewal equation . 42 3.13 Positive vs. null-recurrence .
    [Show full text]
  • Allen&Tarnita
    J. Math. Biol. DOI 10.1007/s00285-012-0622-x Mathematical Biology Measures of success in a class of evolutionary models with fixed population size and structure Benjamin Allen Corina E. Tarnita · Received: 3 November 2011 / Revised: 24 October 2012 © Springer-Verlag Berlin Heidelberg 2012 Abstract We investigate a class of evolutionary models, encompassing many established models of well-mixed and spatially structured populations. Models in this class have fixed population size and structure. Evolution proceeds as a Markov chain, with birth and death probabilities dependent on the current population state. Starting from basic assumptions, we show how the asymptotic (long-term) behav- ior of the evolutionary process can be characterized by probability distributions over the set of possible states. We then define and compare three quantities characterizing evolutionary success: fixation probability, expected frequency, and expected change due to selection. We show that these quantities yield the same conditions for success in the limit of low mutation rate, but may disagree when mutation is present. As part of our analysis, we derive versions of the Price equation and the replicator equation that describe the asymptotic behavior of the entire evolutionary process, rather than the change from a single state. We illustrate our results using the frequency-dependent Moran process and the birth–death process on graphs as examples. Our broader aim is to spearhead a new approach to evolutionary theory, in which general principles of evolution are proven as mathematical theorems from axioms. B. Allen (B) Department of Mathematics, Emmanuel College, Boston, MA 02115, USA e-mail: [email protected] B.
    [Show full text]
  • Rates of Convergence of Some Multivariate Markov Chains With
    The Annals of Applied Probability 2009, Vol. 19, No. 2, 737–777 DOI: 10.1214/08-AAP562 c Institute of Mathematical Statistics, 2009 RATES OF CONVERGENCE OF SOME MULTIVARIATE MARKOV CHAINS WITH POLYNOMIAL EIGENFUNCTIONS By Kshitij Khare and Hua Zhou Stanford University and University of California, Los Angeles We provide a sharp nonasymptotic analysis of the rates of conver- gence for some standard multivariate Markov chains using spectral techniques. All chains under consideration have multivariate orthog- onal polynomial as eigenfunctions. Our examples include the Moran model in population genetics and its variants in community ecol- ogy, the Dirichlet-multinomial Gibbs sampler, a class of generalized Bernoulli–Laplace processes, a generalized Ehrenfest urn model and the multivariate normal autoregressive process. 1. Introduction. The theory of Markov chains is one of the most useful tools of applied probability and has numerous applications. Markov chains are used for modeling physical processes and evolution of a population in population genetics and community ecology. Another important use is sim- ulating from an intractable probability distribution. It is a well-known fact that under mild conditions discussed in [2], a Markov chain converges to its stationary distribution. In the applications mentioned above, often it is useful to know exactly how many steps it takes for a Markov chain to be reasonably close to its stationary distribution. Answering this question as accurately as possible is what finding “rates of convergence” of Markov chains is about. In the current paper, we provide a sharp nonasymptotic analysis of rates of convergence to stationarity for a variety of multivariate Markov chains.
    [Show full text]
  • Stochastic Evolution As a Generalized Moran Process
    Stochastic Evolution as a Generalized Moran Process Drew Fudenberg*, Lorens Imhof,** Martin A. Nowak***, and Christine Taylor**** First draft: August 29, 2003 This version: September 9, 2004 Abstract: This paper proposes and analyzes a model of stochastic evolution in finite populations. The expected motion in our model resembles the standard replicator dynamic when the population is large, but is qualitatively different when the population size is small, due to the difference between maximizing payoff and maximizing relative payoff. Moreover, even in large populations the asymptotic behavior of our system differs from that of the best-response and replicator dynamics due to its stochastic component. * Department of Economics, Harvard University, [email protected] ** Institute of Statistics, Aachen University, [email protected]. *** Program for Evolutionary Dynamics, Harvard University, [email protected] *** Department of Mathematics, MIT, [email protected] 1 1. Introduction This paper proposes and analyzes a model of stochastic evolution in finite populations. Our model is a generalization of the Moran process of evolutionary biology (Moran [1962], Ewens [2004]) to frequency-dependent fitness. In this process, one individual per period “dies” and is replaced by a newcomer. The newcomer’s strategy is a random variable whose distribution is given by an “updating function” that depends only on the current state, that is, the numbers of agents using each choice. The updating function we study has two parts, the “base rate” updating rule and a lower-frequency probability of “mutation.” In the base-rate or “unperturbed” process, each agent has a number of “offspring” that is equal to its payoff in the game, and the new agent is chosen at random from the pool of all the offspring.
    [Show full text]
  • Stochastic Models of Epithelial Cancer Initiation and Glioblastoma Recurrence
    Stochastic Models of Epithelial Cancer Initiation and Glioblastoma Recurrence A DISSERTATION SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Kathleen M. Storey IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY Jasmine Foo June, 2018 c Kathleen M. Storey 2018 ALL RIGHTS RESERVED Acknowledgements I am incredibly lucky to have such a wonderful support system, without whom this thesis would not have been possible. First, I would like to thank my advisor, Jasmine Foo, for introducing me to such a fascinating research area and for her guidance and mentorship during this journey. I would also like to thank Kevin Leder for his helpful insight and feedback throughout the last few years. Additionally I am grateful for the encouragement I have received from numerous teachers and mentors at Carleton College, SMP, and the Haverford school district. I owe many thanks to my family: Mom, Dad, Eileen, and Pat, thank you for your endless loving support and guidance and for always making Havertown feel like home, even when I am 1200 miles away. I am also eternally grateful for the inspiration of my grandparents and entire extended family. Thank you to my friends in the math department, especially Will, Danika, Sam, Daniel, Nicole, and honorary math department members Blake and Libby; some of my fondest grad school memories are of board games, spades, and Acadia Fridays with all of you. Will, in particular, thank you for leading the way through each graduate school hurdle and for your advice and encouragement as I followed each a year later { I cannot imagine completing this thesis without your partnership and constant support.
    [Show full text]
  • 5 Amplifiers for the Moran Process
    Amplifiers for the Moran Process ANDREAS GALANIS, ANDREAS GOBEL,¨ LESLIE ANN GOLDBERG, JOHN LAPINSKAS, and DAVID RICHERBY, University of Oxford The Moran process, as studied by Lieberman, Hauert, and Nowak, is a randomised algorithm modelling the spread of genetic mutations in populations. The algorithm runs on an underlying graph where individuals correspond to vertices. Initially, one vertex (chosen uniformly at random) possesses a mutation, with fitness r > 1. All other individuals have fitness 1. During each step of the algorithm, an individual is chosen with probability proportional to its fitness, and its state (mutant or nonmutant) is passed on to an out-neighbour which is chosen uniformly at random. If the underlying graph is strongly connected, then the algorithm will eventually reach fixation, in which all individuals are mutants, or extinction, in which no individuals are mutants. An infinite family of directed graphs is said to be strongly amplifying if, for every r > 1, the extinction probability tends to 0 as the number of vertices increases. A formal definition is provided in the article. Strong amplification is a rather surprising property—it means that in such graphs, the fixation probability of a uniformly placed initial mutant tends to 1 even though the initial mutant only has a fixed selective advantage of r > 1 (independently of n). The name “strongly amplifying” comes from the fact that this selective advantage is “amplified.” Strong amplifiers have received quite a bit of attention, and Lieberman et al. proposed two potentially strongly amplifying families—superstars and metafunnels. Heuristic arguments have been published, arguing that there are infinite families of superstars that are strongly amplifying.
    [Show full text]