Fisher Waves: an individual based stochastic model Bahram Houchmandzadeh, Marcel Vallade

To cite this version:

Bahram Houchmandzadeh, Marcel Vallade. Fisher Waves: an individual based stochastic model. Physical Review E , American Physical Society (APS), 2017, 96, pp.012414. ￿10.1103/Phys- RevE.96.012414￿. ￿hal-01483999￿

HAL Id: hal-01483999 https://hal.archives-ouvertes.fr/hal-01483999 Submitted on 6 Mar 2017

HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Fisher Waves: an individual based stochastic model.

B. Houchmandzadeh and M. Vallade CNRS, LIPHY, F-38000 Grenoble, France Univ. Grenoble Alpes, LIPHY, F-38000 Grenoble, France

The propagation of a beneficial in a spatially extended population is usually studied using the phenomenological stochastic Fisher-Kolmogorov (SFKPP) equation. We derive here an individual based, stochastic model founded on the spatial Moran process where fluctuations are treated exactly. At high selection pressure, the results of this model are different from the classical FKPP. At small selection pressure, the front behavior can be mapped into a Brownian motion with drift, the properties of which can be derived from microscopic parameters of the Moran model. Finally, we show that the diffusion coefficient and the noise amplitude of SFKPP are not independent parameters but are both determined by the dispersal kernel of individuals.

I. INTRODUCTION. investigated[10]. Specifically, this equation allows for traveling wave solutions and it is known that a stable so- One of the most fundamental questions in - lution of the FKPP is a wave front connecting the two re- ary is the spread of a mutant with fitness 1+ s gions u =1 and u =0 with velocity c = (d/dt) R udx = into a wild type population with fitness 1. In a non- 2√aD and width B = u(1 u)dx =2 D/a.´ R − structured population (i.e., for a population at dimen- ´ The FKPP equation however is not wellp adapted to sion d = 0), the answer to this question was found by evolutionary population dynamics at small selection pres- Kimura[1] nearly 50 years ago as a good approximate so- sure, which is one of the relevant limits of population lution of the Fisher-Wright or the Moran model of popu- genetics[11, 12]. The FKPP equation describes quanti- lation genetics, and better solutions of the Moran model ties (individuals, molecules,...) that at the fundamental have been proposed recently[2]. For geographically struc- level are discrete; the noise associated with this discrete- tured populations however, the question is far from set- ness can play an important role in the dynamics of the tled and only some specific information, such as the fix- front, specifically at small selection pressures. This prob- ation probability, has received partial answers in a field lem was tackled phenomenologically by adding either a that is now called evolutionary graph dynamics[3, 4]. For cutoff [13] or alternatively a noise term to the equation. geographically structured populations where the main in- The form of the noise in the SFKPP was proposed by gredients of the competing populations, i.e., the fitness, Doering et al.[14]. The SFKPP proposed by Doering et the carrying capacity and the diffusion of individuals, are al. has now become a major mathematical tool for the independent of the space, the evolutionary dynamics has investigation of Fisher Waves. It has specifically been been mostly investigated through the stochastic Fisher used by Hallatschek and Korolev[15] to investigate the Kolmogorov Petrovsky, Piscounov (SFKPP) equation properties of the front at small selection pressure, where they revealed the marked difference of the solutions with ∂u = D 2u + au(1 u)+ bu(1 u)η(x, t) (1) respect to the deterministic equation. ∂t ∇ − − p The SFKPP equation however is phenomenological where u(x, t) is the local relative density of the mutant and cannot be derived rigorously from a microscopic, in- with respect to the local carrying capacity, D is the dif- dividual based model of . Firstly, in- fusion coefficient of individuals, a is proportional to the dividual based models such as the Moran model are gov- relative excess fitness of mutants; the last term is a noise erned by discrete master equations and can be approxi- term that captures the local , where b is re- mated by a Fokker-Planck equation, or their equivalent lated to the local carrying capacity and η is a . stochastic differential equation, only in the limit of large The problem that has attracted most attention is that of system size, i.e., large local carrying capacity[2, 16]. The the front propagation : if at the initial time, one half of local carrying capacity however does not appear explic- space is filled only with the mutant type and the other itly in the SFKPP equation and it is difficult to assess half only with the wild type, then the dynamics of the the precision of the Focker-Planck approximation solely problem can be reduced to the dynamics of the front sep- from this equation. Secondly, and more importantly, the arating the two types. noise term bu(1 u)η(x, t) in SFKPP is purely local. The deterministic part of the equation (FKPP) This noise term is− rigorous only for a 0 dimensional sys- was proposed by Fisher[5] and Kolmogorov, Petrovsky, tem, wherep the equivalence between the Fokker-Planck Piscounov[6]; it has found applications in many ar- approximation and the stochastic differential equation eas of science ranging from ecology and epidemiology[7] can be shown. For a spatially extended system, the noise to chemical kinetics[8] and particle physics[9]. The term should also include fluctuations arising from adja- properties of the FKPP equation have been widely cent lattice cells. To our knowledge, however, a rigorous 2 derivation has not yet been achieved (see Mathematical Details V A). The problem of noise arising from adjacent cells was also noted by Korolev et al.[17]. Finally, in an evolutionary model, both the diffusion coefficient and the noise amplitude are the result of the same phenomenon of individuals replacing each other randomly and they should be linked through an Einstein like relation.

Figure 1. The spatial Moran model for geographically ex- The aim of the present article is to study the dynamics tended populations. of the front between mutant and wild type individuals di- rectly from the individual based, stochastic Moran model of population genetics. For this model, the Master equa- II. THE ISLAND MODEL AND MUTANTS tion can be stated without ambiguity or approximation. PROPAGATION IN 1 DIMENSION. We show that the mean field approximation of the Mas- ter equation gives rise to a partial differential equation The fundamental model of population genetics for non that differs from the FKPP equation and its predictions structured populations[18] was formulated by Fisher and at high selection pressure. Going beyond mean field, we Wright[19]. A continuous time version, which is also more then derive the exact equations for the evolution of the mathematically tractable, was proposed by Moran[20]. various moments of the front for a one dimensional sys- The extension of the Moran model to geographically ex- tem and solve it at small selection pressure. In this ap- tended populations was formulated by Kimura[21] and proach, the noise term is not restricted to be only local. Maruyama[22] and in more recent terminology is referred We show, in agreement with [15] that even for a neutral to as evolutionary dynamics on graphs [3]. The model is model (i.e., s =0), the front is well defined and the dis- also widely used in ecology, specifically in the framework placement of the front can be mapped into a Brownian of the neutral theory of biodiversity[23–25]. motion at large times, the convergence time to this state In this model, populations, formed of wild type in- is shown to be in 1/√t. The front drift and its veloc- dividuals with fitness 1 and mutants with fitness 1+ s ity can then be derived at small selection pressure by a are structured into cells (or demes or islands) each con- perturbatiion approach where we can show, in contrast to taining N individuals. When an individual dies in one the FKPP predictions, that the speed of the front is linear island, it is immediately replaced by the progeny of an- in the excess relative fitness s. Finally, we show that the other, therefore keeping the number of individuals in each effective local population size which controls the noise island always equal to N. The replacement probability amplitude, and the diffusion coefficient are both deter- is weighted by the fitness of the individuals; moreover, mined by the dispersal kernel of individuals and cannot the progeny stems from a local parent with probability be chosen as independent parameters. (1 m) or a parent from a neighboring island with prob- ability− m (figure 1). These three parameters, N, s and m are the only ingredients of this generic model. Let us consider an infinitely extended one-dimensional This article is organized as follow. Section II is devoted collection of islands and call n the number of mutant to the generalization of the Moran model to population i individuals on island i. We use the vector n as a geographically structured into demes/islands, where the shorthand notation for the collection of these numbers dynamics of the front can be deduced from the internal n = ..., n , ... . The transition probability densities for population dynamics of the islands and their exchanges. i the number{ of} mutants on island i to increase/decrease We demonstrate in subsection II A how an FKPP-like by one individual is[4]: equation emerges from the mean field approximation of the Master equation and show how it differs from the µ(1 + s) m W +(n)= (N n ) n + n′′ (2) classical FKPP equation. The following subsections of i N − i i 2 i section II are devoted to full stochastic treatments of the µ hm i W −(n)= n (N n ) n′′ (3) dynamics of the front. Section III goes beyond the is- i N i − i − 2 i land model and considers general migration kernels be- h i where µ is the death rate of individuals and tween individuals that are no longer grouped into demes. Solving the Master equation of the model shows how the ′′ ni = (ni−1 + ni+1 2ni). (4) island size and migration number between neighboring − islands of section II are related through the dispersal The rate of increase (2) for example is the probability kernel. The approach allows for the determination of density per unit of time that one wild type individual the effective population size and therefore the noise am- dies (µ(N ni) ) multiplied by the probability that it is plitude. The final section is devoted to discussion and replaced by− a mutant, either from a local parent ((1 − conclusions. m)ni/N) or a neighboring parent ((m/2N)(ni−1 + ni+1) 3

), and multiplied by the fitness of the mutant (1+s). The 10 1 fitness can be seen as an increase in the death rate of the wild type individuals, or a higher replacement probabil- ity/decreased death rate for the mutants. The probabil- ity P (n,t) of observing the state n at time t obeys the B Master equation

1 0.1 dP (n) c 0.1 1 10 = W +(a n)P (a n) W +(n)P (n) s dt i i i − i i X  + W −(a†n)P (a†n) W −(n)P (n) (5) i i i − i   †n n where ai and ai are shorthand notations for states ..., ni 1, ... . Without loss of generality (see Mathemat- 0.1 { ± } 0.1 1 10 ical Details V D), the initial condition we use throughout s this article is that of an initial sharp front Figure 2. The FKPP (black circle) and the spatial Moran ni = N if i 0; =0 otherwise (6) mean field (equation 7) (red squares) are solved numerically ≤ for µ = 1, D = 0.04 and the front speed is extracted for various values of the excess relative fitness s. Solid curves A. Mean field approximation. represent the theoretical values: FKPP c = 2√µDs; Moran c = 2pµDs(1 + s). Inset : Width of the front as a function of s. Black circle: FKPP; red square: Moran. Solid curves The mean field approximation for n , the average i represent, for FKPP (black) B = 2pD/µs; for Moran (red) number of individuals on island i, is obtainedh i by neglect- B = 2pD(1 + s)/µs κ(s) (eq. 51). ing fluctuations (i.e., by setting n n = n n ) h i ki h ii h ki d n h ii = W +(n) W −(n) dt i − i where κ(s) is a small correction (see Mathematical details µm sµ m V C). Specifically, for large s, the width converges to a = n′′ + (N n ) n + n′′ 2 h i i N − h ii h ii 2 h i i constant (15/8) D/µ.   A phenomenological argument can be used to under- Taking the space continuum limit by setting x = ℓi, stand these modificationsp to the FKPP equation. It is u(x)= n /N, we obtain the partial differential equation i well known[26] that the dynamics of a pulled front is gov- ∂u ∂2u erned by the behavior at small u. In these regions, the = D [1 + s(1 u)] 2 + µsu(1 u) (7) mean field equation (7) can indeed be approximated by ∂t − ∂x − an FKPP equation, with the effective diffusion coefficient where the length ℓ is the spatial extension of an island Deff = D(1 + s). and D = µmℓ2/2 is the diffusion coefficient. We observe We observe that the spatial Moran model differs sig- that the mean field equation of the spatial Moran model nificantly from the prediction of FKPP equation at high is different from the FKPP equation in the diffusion term. fitness s. We will show below that the same is true at Fisher himself, in his original article[5], had stressed that low fitness. This difference was first noted by Hallatschek using a simple diffusion term is an oversimplification of and Korolev[15] in their study of the SFKPP equation. basic population genetics processes. The modification of the diffusion term has important consequences both on the speed of the propagation front and on its width. The B. Stochastic characterization of the front. minimum speed of the propagating wave in the equation (7) is now (see Mathematical details V C) Let us now come back to the full stochastic treatment of the propagating front. The temporal evolution of local cmin =2 µDs(1 + s) (8) population moments can be extracted from the Master equation (5) (see Appendix V B): which scales as s for highp value of the excess fitness, in contrast to the scaling in √s in the FKPP equation. Nu- d ni + − h i = Wi Wi merical resolutions of eq.(7) (Figure 2) show that vmin dt − computed above is an excellent estimator of the speed of d ninj h i = n W + W − + n W + W − the front. Furthermore, the width of the front does not dt i j − j j i − i scale as D/µs as in the case of the FKPP equation, + − + δi,j Wi + Wi  .  but is well approximated by p The most important global quantities are the front dis- 2 D(1 + s)/µsκ(s) (9) placement U(t) and its width B(t). These global quanti- p 4 ties can be measured in terms of local populations by appears in the SFKPP equation ; the other term, ′′ +∞ m nini /N, is the demographic noise due to adjacent 1 cellsh andi cannot a priori be neglected. In the extreme U(t)= [ni(t) ni(0)] (10) N − case where N = 1 and therefore m =1, the local demo- i=−∞ X +∞ graphic noise is exactly zero, but the stochasticity of the 1 system remains the same, as we will see below. B(t)= n (t)[N n (t)] (11) N 2 i − i From now on, we will measure time in generation time i=−∞ X units, i.e., set t µt. By summing over the first mo- The width B(t) weights the region where the mutant ments, we find trivially← that the mean front position stays population is different from either 0 or N, and in the at its initial value continuous limit, can be expressed as B = I u(1 u)dx − d m ′′ where u(x) = ni/N. Note that these quantities´ are al- U(t) = n =0 dt h i 2 h i i ways finite, as the sums involve only a finite number of i non-zero terms. We restrict this paper to the compu- X tation of the first moments of these quantities, namely The second moments on the other hand obey a set of U(t) , Var(U(t)) and B(t) , where stands for ensem- linear differential equations h i h i hi ble average. The computation of these quantities implies 1 dZ 0 = 2(1 + α)Z + 2(1 β)Z + (1 β)C (15) the computation of the second moments m dt − 0 − 1 − 0 1 1 dZp Z (t)= n (t)n (t) n (0)n (0) (12) = 2Zp + Zp+1 + Zp−1 + Cp p> 0 (16) p N 2 {h i i+p i− i i+p } m dt − i X where α = (1 m)/(Nm), β =1/N and the coefficients − The width of the front is then Cp depend on the initial conditions :

B(t) = Z0(t)+ U(t) + B(0) (13) 1 h i − h i C = n (0)n′′ (0) p N 2 i p+i and the of its displacement is i X V (t)= U 2(t) U(t) 2 The parameters α and β measure the relative contribu- − h i 1 tion to the demographic noise of local versus adjacent = n n n n (14) N 2 h i j i − h ii h j i cells . The parameter β can be neglected with respect to i,j X α only in the limit of small migration probability m 1. For an initially sharp front (eq. 6), C = δ .≪ We For the neutral front (s =0), self consistent, exact equa- p − p,0 tions without any moment closure approximation can be stress that we can assume this condition without loss of derived directly for the global quantities. At small selec- generality (see Appendix V D) tion pressures Ns 1, they can be recovered through a The above system (15,16) can be solved[24] exactly. In ≪ ˆ first order perturbation analysis. the Laplace space where Zp(ω) = R+ exp( ωt)Zp(t)dt, the solution is particularly simple, ´ − (1 β)C z 1 C. Behavior of the neutral front s = 0. Zˆ (ω)= − 0 p ω z2 +2αz +2β 1 zp − For a one dimensional system where mutants and wild where ω = (z +1/z) 2. By taking the inverse Laplace − type have the same fitness (s =0), we will show that the transform the exact solution of Zp(t) can be found as front separating these two populations can be envisioned a combination of modified Bessel functions[24]. In this as a well defined object that performs a Brownian motion article, we are mostly concerned with the large time and whose width fluctuates around an equilibrium value: limit, which can be deduced from the expansion of Zˆp(ω) U(t) = 0 and for large times, V (t)= mt and B(t) = around ω =0 : hm(N i 1)/2. h i − m(N 1) 1 p K To obtain the above quantities, we sum over local fluc- Zˆ (ω)= − + − + (1) p − 2 ω √ω O tuations   m(N 1) p K 1 d ni m −1/2 ′′ Zp(t)= − 1+ − + o(t ) (17) h i = ni − 2 √πmt µ dt 2 h i   1 d ninj m ′′ ′′ where K = m(N 1)+2. The above approximation is h i = ni nj + ninj 2 − µ dt 2 h i valid for t p ; a uniform large time approximation for all p can also≫ be found in terms of combinations of erf 2  m ′′ m ′′ + δi,j ni(N ni) + ni nini functions[24], but is not needed here. N h − i 2 h i− N h i   As B(0) = 0 and U(t) =0, eq.(13) implies that h i There are different contributions to d nini /dt : one is the local demographic noise 2 n (N h n )i/N, which B(t) = Z (t). h i − i i h i − 0 5

4 0.25 (a) (a) 0.2

> i 0.15

B β 1 < 0.1

0.05 0.4 0 0 20000t 40000 60000 -200 -100 0i 100 200 1 0.25 (b) (b) 0.2 ) 0.01

B i 0.15 ( β p 0.0001 0.1

0.05 1e-06 0 5 10 15 20 25 30 0 -200 -100 0 100 200 B i/Beq 1 (c) Figure 3. Front’s width (eq. 11) computed from numerical 0.1 simulations of the master equation (5) by a Gillespie algo- i β 0.01 rithm comprising M = 2000 sites (islands) for four sets of parameters (N,m): Black circles (20, 0.05) (Beq = 0.475); 0.001 green squares (40, 0.05) (Beq = 0.975); red diamonds (20, 0.2) 0.0001 (Beq = 1.9); blue triangles (40, 0.2) (Beq = 3.9). (a) aver- 3 1e-05 age front B(t) computed over 10 stochastically generated 1 10 100 1000 10000 B(t), for th [0,i105]. Dotted black curves : numerical simula- i ∈ tions ; solid red curves : theoretical prediction B(t)= Z0(t) − (eq17). (b) probability distribution of the width B after equi- Figure 4. Mean front shape βi (eq. 20) as a function of dis- librium has been reached (t [1, 100] 105 , sampling time102 tance to the center of the front i, in the moving reference ) for the same parameters as∈ in panel× a. Symbols: numeri- frame. Numerical simulations schemes are the same as in cal simulations ; solid curves : exponential fits of the data figure 3, with M = 6000. In each sampled stochastic real- p(B) = A exp( aB) after the peak of the distribution has ization, the displacement is computed from the relation (10) been reached − and the mutant population numbers in each site i in a win- dow of 1000-2000 sites around this position are recorded. The mean front shape νi (eq.19) is computed on approximately 7 The front therefore reaches a finite width 10 samples. (a) βi as a function of i for various (N,m) parameters : (20, 0.05) black, (40, 0.05) green, (20, 0.2) red, Beq = m(N 1)/2 (18) (40,0.2) blue. (b) Same as in panel (a), but the x axis for each − curved is normalized by the corresponding equilibrium width Beq = (N 1)m/2. (c) The long tail of the front βi (solid and the equilibrium value is reached as 1/√mt. Figure − 3a shows the perfect agreement of these results with nu- curves), where the parameters are the same as in panel (a). The dashed lines represent the function y = B /2i as visual merical simulations. The above equilibrium value of the eq guides. width is also in agreement with the value found from the SFKPP equation[15] 4Db−1 if the amplitude of the noise term is interpreted as b =4µℓ/N. As noted by Hal- number νi and their weight βi in this frame latschek and Korolev[15], genetic drift alone can maintain a finite front width at s =0 in one dimension. Moreover, 1 ν = n (19) numerical simulations of the discrete model show that i N i+[U] the width distribution probability of the front has an ex- βi = νi(1 νi) (20) ponential tail (figure 3b) − Note that the width B defined above as where [U] is the integer part of the front displacement given by the relation (10). These quantities are difficult 1 B = b = n (N n ) analytically but are readily computed by numerical sim- h i h ii N 2 h i − i i i i ulation, as shown in Figure 4. As it can be observed, X X the mean front shape βi , which is a function of N and weights the regions with populations 0 < ni < N, but m (Figure 4a) is spatially extended and decreases slowly contains no information about their spatial distribution. as a function of i, the distance to the center of the front A spatially wide front composed for example of alternat- (Fig. 4c). The width Beq computed above remains how- ing n =0 and n = N islands will have B =0. ever a good indicator of the mean front shape, and all βi The shape of the front can be characterized more pre- curves can be superimposed when the normalized index cisely by using the moving frame of the front as the ref- i/Beq is used(Fig. 4b). erence frame and computing the mean relative mutant The variance of the position of the front V (t) = 6

0. 05 (a) 0 104 0. 10 10 0. 15

0. 25 10-1

3

10 c 10-2 ) t (

V 10-3 102

10-4 10-3 10-2 10-1 100 101

101 (b) 10 101 20 40 102 103 104 105 t 100 B 100 Figure 5. Solid curves: Variance of the position of the neutral front as a function of time, computed from numer- ical simulations of the master equation (5) by a Gillespie algorithm, for various values of the migration parameter 10-1 10-3 10-2 10-1 100 101 m=0.05,0.1,0.15,0.25 and N = 10. Dashed lines: the theo- s retical values of V (t) = mt (eq. 21) for corresponding m. The numerical simulations comprised M = 2000 sites and the 3 variance was computed over 6.4 10 stochastically generated Figure 6. Speed c (panel (a) ) and width B (panel B) of the 5 × U(t), for t [0, 10 ]. front as a function of excess relative fitness s for m = 0.05 and ∈ N = 10, 20,40, and 100. Solid lines in panel (a) represent the first order theoretical expression c = Nms/2 for Ns . 0.5. U 2(t) U(t) 2 can be extracted by similar methods Dashed green lines represent the mean field values cm.f = − h i from equation (14) : p2ms(s + 1) and Bm.f = p2m(s + 1)/s.

dV 2 m = n (N n ) n n′′ dt N 3 h i − i i− 2 h i i i i At small selection pressures Ns 1, on expanding the X n o 2 m above expression to the first order≪ of perturbation, we = B(t) m (Z (t) Z (t)) C . N h i− 1 − 0 − 2 0 find, in the limit of large time n o As Zp(t) can be computed exactly, the temporal evolu- d U mNs tion of the variance can also be computed exactly. The h i = + (1/√t) (22) result is particularly simple for large times t 1 dt 2 O ≫ V (t)= mt + (√mt) (21) Note that at small s, the front speed scales as the selec- O tion pressure Ns. Even for N =1 when the front width The surprising result is that for large times, the diffusion Beq =0, the front acquires a non-zero speed (figure 6). of the front is independent of its width. The figure 5 The above computation of the position of the front at shows the agreement of this expression with numerical s> 0, which is a first order moment, requires the knowl- solutions. edge of second order moments Zi at s = 0. The same line of argument shows that computing the variance of the position and width of the front for s> 0 , which are D. Behavior of the front at small s. second order moments, necessitates the computation of third order statistical quantities. Even though compu- For a non-zero excess relative fitness, the moment clo- tation of higher momenta is theoretically possible in the sure does not hold and the front characteristics can no neutral case s = 0, their effective computation remains longer be derived exactly. It is however possible to derive extremely tedious. the front speed to the first order of the perturbation s. Figure 6 shows the result of stochastic based numeri- For s> 0 the position of the front is given by cal simulations for a wide range of s and the agreement with expression (22) at small selection pressure. It can d U 1 h i = W +(n) W −(n) be observed that the mean field approximation becomes dt N i − i i correct only at very high excess relative fitness s and lo- X m cal population size N. Fluctuations modify significantly = s B(t) m (Z (t) Z (t)) C − 1 − 0 − 2 0 the prediction of the FKPP model. n o 7

dividual at site j. Solving exactly this strictly individual model then leads us to choose the effective population size of each deme, which is used in the stepping stone approach (figure 7b) and derive the exact relation be- tween the diffusion coefficient and the noise amplitude, j both of which are a function of the dispersal kernel mi . In this evolutionary graph approach, each site contains exactly one individual (either wild type or mutant) ni = 0, 1 ; the transition probability densities for the number of mutants on site i to increase/decrease by one individual is a simple generalization of equations (2,3) . Thus:

W +(n)=(1+ s)µ(1 n ) mj n (23) i − i i j j Figure 7. Neutral spatial Moran model with general migration X kernel. (a) Individuals are uniformly distributed in space ; − j W (n)= µni m (1 nj ) (24) when an individual at site i dies, it is replaced by the progeny i i − j of its k th neighbor with probability m(k). (b) Individuals X are grouped− into demes of population size N ⋆. In the classical j where mi is the probability that the progeny of an indi- scheme of SFKPP (figure 1), the detailed migration kernel vidual at site j replaces an individual at site i. In the lit- m(k) is replaced by a single number m⋆ of migration between erature of plants, the migration probability mj is known neighboring demes. i as the dispersal kernel and can be measured precisely in the field[27]. In the following, we will consider dispersal kernels that depend only on the distance between two III. MICROSCOPIC MODEL AND GENERAL sites, i.e., mj = m( j i ). MIGRATION KERNEL. i | − |

In the SFKPP description of the mutant wave (eq.1) A. Mean field approximation. the diffusion coefficient D and the amplitude of the noise b are considered independent parameters. The same is Following the same steps as in subsection II A, it is true for the island model of the preceding section where straightforward to deduce the mean field approximation the population size of the island N and the migration of the corresponding master equation. For a migra- probability m between neighboring islands were consid- tion probability that depends only on the distance be- ered independent. However, at the individual level of tween two sites, the mean field approximation is exactly evolution, both migration and genetic drift are the result the same as expression (7), where ℓ here is the inter- of the same phenomenon of individuals replacing each individual distance (lattice size), µ the death rate and other. These two parameters must therefore be linked the diffusion coefficient through an Einstein-like relation and cannot be indepen- µℓ2 dent. D = k2m(k) (25) 2 There is a level of arbitrariness in the island model k X in the manner in which individuals are grouped together and deme size N is chosen. As the amount of fluctua- is given in terms of mean dispersal distance. tions is critically controlled by N, the grouping process is From now on and to avoid confusion, we will refer to all quantities derived in the island approximation (the crucial. This arbitrariness also impacts on the migration ⋆ probability. The very existence of a unique migration macroscopic view) of section II by the super script . rate between nearest neighbor islands can be brought The diffusion coefficient of the mean field approximation into question. Consider for example the low migration derived in subsection II A (relation 7), for example, is limit (m 1) of the island model: two individuals phys- µℓ⋆2 ically far≪ apart from each other but grouped into the D⋆ = m⋆ (26) 2 same island will have a higher probability of replacing each other than two close individuals belonging to neigh- For a 1d system, the patch extension ℓ⋆ = N ⋆ℓ (figure boring islands. 7b) ; comparing expression (25) and (26) therefore leads A more rigorous approach to this problem would be to to consider a “microscopic” model where individuals are uni- N ⋆2m⋆ = k2m(k) (27) formly distributed in space and not arbitrarily grouped into demes/islands. In this microscopic model, migra- Xk tion/replacement is not restricted to nearest neighbors We observe here that the deme size N ⋆ and the migration (figure 7a): when an individual dies at site i, it has a probability between demes m⋆ are indeed linked through j probability mi of being replaced by the progeny of an in- equation (27). 8

0.0 B. Stochastic characterization of the neutral front 100 at s = 0.

−0.1 ) 101

The relation (27) is not sufficient to determine the ef- ∞ ⋆ ( fective population size N of islands. To address this Z −

issue, we need to solve exactly the full stochastic model. −0.2 ) t

( 102 ) We restrict the computation here to the neutral case p t Z ( s = 0, the derivations for s = 0 following precisely the p 6 Z steps developed previously. −0.3

103 The computational approach is similar to subsection 101 102 103 104 II B. As before, the displacement of the front is defined t as −0.4 +∞

U(t)= [ni(t) ni(0)] (28) −0.5 − 0 2000 4000 6000 8000 10000 i=−∞ X t and

d Figure 8. Numerical simulation of the second moment Zp U(t) = W +(n) W −(n) dt h i i − i (eq. 29) in the neutral Moran model with geometric seed i dispersal kernel (eq. 34) for p = 1, 2, 4, 8. (Inset) Same data X but convergence to the limiting value Z( ) is shown ; the Therefore, for the neutral front s = 0, d U = 0 and t gray dashed line is 1/√t. Time is measured∞ in units of U(t) =0. h i ∝ h i generation time (1/µ). For the numerical simulation, a lattice The second order moments are also defined as before of 4096 individuals is used and the result is averaged over 8 105 trials. Z (t)= n (t)n (t) n (0)n (0) (29) × p h i i+p i− i i+p i X The case N ⋆ = 1, m⋆ = 1 of the preceding section is and we note that Zp = Z−p. The equations governing Zp for the rescaled time t µt are obtained when λ 0. Note that in the framework of ← the Moran model used→ here, a dead individual cannot re- d Z = 2Z (30) place itself, hence m(0) = 0. Moreover, for the geometric dt 0 − 0 dispersal kernel, relation (33) becomes ∞ d 1 Zp =2 m(k) (Zp+k Zp)+ Cp p =0 (31) C = − λ|p| dt − 6 p 1 λ k=X−∞ − where the Cp are defined by the initial condition It is straightforward to check that again all Zp converge as 1/√t to the same value ∝ Cp =2 m(k) ni(0) (ni+p+k(0) ni+p(0)) − λ 2 2 k i Z ( )= = θ κ (36) X X p ∞ −(1 λ)2 − − Note that equation (30) implies that Z0(t)=0. This is 2 where κ2 is defined in (35) and θ = 2 km(k)= due to the fact that ni = ni and therefore Z0(t)= U(t) . k>0 For an initially sharp front h i 1/(1 λ). We observe that the stationary value of Zp is given− by a quantity similar to the varianceP of the disper-− ni =1 if i 0; =0 otherwise (32) sal kernel. Figure 8 shows the agreement between these ≤ results and individual based numerical simulation of the which will be used here, same system. The width of the front can no longer be measured as Cp =2 m(k) ( p k) (33) | |− in relation (11) by B = i ni(1 ni) which is always 0. k>|p| − X Other analog metrics such as P For simplicity, we further restrict the solution of equa- Y = (n n )2 (37) tions (30-31) to the generic geometric dispersal kernel p i − i+p i X 1 λ |k| can be used to characterize the front. It is straightfor- m(k)=(1 δk,0) − λ (34) − 2λ ward to show that where the parameter λ controls the dispersal length κ: Y = p 2Z (38) p − p 1+ λ κ2 =2 k2m(k)= (35) Figure 9 shows the excellent agreement between the the- (1 λ)2 oretical results and the numerical simulations. Xk>0 − 9

C. Grouping into islands.

20 (a) We now group individuals virtually into islands of size ⋆ 10 N and establish the condition under which the results obtained by the SFKPP/islands model are valid. ⋆ )

t A patch of population size N is a virtual packing of ( 1 5 individuals into a deme, which we refer to by its index q Y (Figure 7b). The number of mutants in patch q is

bq 2 ⋆ nq = ni (40) aq 1 X 0 200 400 600 800 1000 ⋆ t where bq = aq + N 1 and aq+1 = bq +1. Note that we ⋆ − 102 have N possible choices for grouping individuals, as we ⋆ ⋆ can set aq = N q + r, where r =0, 1, ...,N 1. We de- (b) fine the displacement and the width of the (macroscopic)−

101 front as in subsection II B (definitions 10,11) )

t +∞ (

1 ⋆ 1 ⋆ ⋆ Y U (t)= ⋆ nq (t) nq (0) (41) 0 N − − 10 q=−∞ ) X  

∞ +∞

( 1 1 B⋆(t)= n⋆(t) N ⋆ n⋆(t) (42) Y ⋆2 q q 10-1 N − q=−∞ X   We compute the statistical properties of these quantities

10-2 by taking into account the detailed migration kernel (sub- 101 102 103 t section III B) and compare them to the results obtained in subsection II B where migrations were approximated by a single migration probability m⋆ between neighbor- Figure 9. (a) Numerical simulation of the front width ing demes. ⋆ Y1(t) (eq. 37) in the neutral Moran model with geomet- The macroscopic displacement U (relation 41) is eas- ric seed dispersal kernel (eq. 34) for increasing value of ily related to the microscopic displacement U (relation λ =0.25,0.33,0.4,0.5,0.55,0.6,0.66 and 0.75. (b) Same data 28) : but convergence to the limiting value Y1( ) is shown ; the ∞ +∞ bq gray dashed line is 1/√t. Time is measured in units of 1 generation time (1/µ∝). The numerical simulation parameters U ⋆ = [n (t) n (0)] N ⋆ i − i are identical to figure 8. q=−∞ a X Xq 1 ∞ = [n (t) n (0)] N ⋆ i − i i=−∞ 2 X The variance of the front displacement V = U 1 2 − = U U can be computed by methods analogous to the pre- N ⋆ vioush i section: and therefore V ⋆ = Var(U ⋆) = Var(U)/N ⋆2. The vari- ance of the microscopic displacement V is given by rela- dV tion (39), and hence, for long times, = W + + W − dt i i i κ2 X  V ∗ = t (43) = 2 m(k)Z C N ⋆2 − k − 0 Xk Comparing this expression to the relation V ⋆ = m⋆t of the island model (section II) where m⋆ is the migration

± probability between demes, we see that we must have where the Wi are the transition rates (23,24) and C0 is defined by relation 33). For the geometric kernel m(k) m⋆ = κ2/N ⋆2 (44) (eq. 34), using the long term solution (36) leads to The above relation is a confirmation of relation (27), which we obtained by a mean field approximation. We V (t)= κ2t + (√t) (39) can also compute, at small selection pressure Ns, the O 10 speed of the front that we find to be c = sκ2/2. Compar- selection pressure. For large non-structured (d = 0) pop- ing this result to the speed c⋆ = c/N ⋆ = sN ⋆m⋆/2 (eq. ulations, Kimura tackled this problem by using a Fokker- 22) of the island model leads again to the same relation Planck approximation of the Master equations govern- between m⋆ and N ⋆ as relation (44). ing the WF or Moran models. The stochastic differ- The macroscopic width of the front (relation 42) can ential equation associated with the Kimura equation is also be computed in terms of microscopic quantities (see dtu = su(1 u)+ u(1 u)/Nη(t). It then seemed nat- Appendix V E) ural to unite− the two approaches− and propose the SFKPP equation (1). p N−1 We see here that many assumptions were made in this ⋆ 1 ⋆ B = (N p)Yp process : (i) the form of the diffusion term may be dif- N ⋆3 − p=1 X ferent ; (ii) u(x, t) is a local relative density and the noise term of SFKPP would be a good approximation where Yp are the microscopic front width (defined by re- only if the number of individuals in each patch where u lation 37). For large times, relations (38) and (36) lead has been computed is large enough ; (iii) the noise term to u(1 u)/N itself was obtained for a non-structured − N ⋆ 1 1 population and it is far from obvious that it should be B⋆ = − κ2 θ2 + (N ⋆ + 1) (45) pthe same for an extended population and not involve the N ⋆2 − 6   spatial derivative of u. The above relation is in perfect agreement with numerical The individual based approached that we develop in simulations. Comparing the above relation to the width this article is intended to overcome these problems and of the front B⋆ = m⋆(N ⋆ 1)/2 of the island model to ground the SFKPP approach on a firmer basis. Us- (relation 18) and using relation− (44) for m⋆, we find that ing an explicit spatial island model, we have shown first we must have that the diffusion term is indeed different from the FKPP equation (relation 7) and this difference has important N ⋆ =3θ 1 (46) consequences on the speed and width of the front for − large selection pressures (relation 8,9). The above result determines the effective population size For small selection pressures, we derive the parame- in terms of the dispersal kernel. More precisely, ters of the front (speed, diffusion coefficient and width) without any assumption on the size of the local popula- 2+ λ tion and without neglecting the non-local noise. These N ⋆ = 1 λ results are in agreement with the predictions of SFKPP 1+− λ equation at small selection pressure as developed by [28]. m∗ = (2 + λ)2 Finally, by taking into account the explicit migration kernel, we establish the relation between the amplitude We note that m⋆ is weakly dependent on λ over its whole of the diffusion and that of the noise ; this approach also range of variation [0, 1], whereas N ⋆ diverges as the dis- allows us to define the effective size of the local popu- persion characteristic length when λ 1. lation N, which is the crucial parameter controlling the → noise as a function of the dispersal length. Individual based models have the same level of com- IV. DISCUSSION AND CONCLUSION. plexity as their equivalent stochastic differential equation approach. We believe that the formalism developed in In this article, we have used the formalism of the spa- this article is a step forward in the search for a better understanding of natural populations and the dynamics tial Moran model to study the propagation of mutants in a one dimensional geographically extended population. of mutant waves. The propagation of the mutant wave has usually been studied in the framework of the SFKPP equation (1). ACKNOWLEDGMENTS The SFKPP equation however is a phenomenological ap- proach and its derivation from the fundamental models of population genetics such as Wright-Fisher or Moran We are grateful to Erik Geissler for the critical reading is not obvious. The deterministic Fisher equation for of the manuscript and fruitful discussions. the dynamics of the proportion u(t) of a mutant in a non-structured population is dtu = su(1 u). For geo- graphically structured populations, it seemed− natural[5] V. MATHEMATICAL DETAILS. to add a spatial diffusion term and generalize simply A. The noise term in SFKPP this equation to ∂tu = D∆u + su(1 u), where u(x, t) is the local proportion of the mutant.− On the other hand, since the time of Fisher and Wright, it was ob- The argument for the phenomenological noise term vious that genetic drift is an important factor at small used by Doering et al. can be rephrased as follows in the 11 framework of population genetics. For a non-structured After replacing ∂tP by its value from equation 48, the population (a population at d = 0), in an ecosystem with first term on the r.h.s. of the above equation reads: carrying capacity of N individuals formed of wild type in- dividuals with fitness 1 and mutants with fitness 1+ s, I = n n W +(a n)P (a n) (...) 1 i j k k k − the transition rates for the one-step Moran process is [2] n k X X µ Changing the variable n a† n implies n n + δ W (n m)= δ (N n)n(1 + δ s) k i i i,k → |m−n|,1 N − m−n,1 and n n + δ and → → j → j j,k where n is the number of mutants. The probability I = (n + δ ) (n + δ ) W +(n)P (n) (...) 1 i i,k j j,k k − P (n,t) of observing n mutants at time t is governed by n k the Master equation associated with these rates X X+ + + = niWj + njWi + δi,j Wi ∂P (n,t) = W (m n)P (m,t) W (n m)P (n,t) Computing now the second term and grouping all the ∂t → − → m terms, we get X d n n For a large ecosystem (N 1) at small selection pressure i j + − + − + h i = ni Wj Wj + nj Wi Wi +δi,j Wi + W (Ns 1), the above Master≫ equation can be approx- dt − − ≪ imated by a Fokker-Plank equation called the Kimura For the neutral case s =0 we have  equation[1, 2, 18] + − m ′′ Wi Wi = ni ∂P (u,t) ∂ [u(1 u)P ] 1 ∂2 [u(1 u)P ] − 2 = s − + − ∂t − ∂u N ∂u2 and where u = n/N and time is measured in units of 1/µ. + − 2 m ′′ m ′′ Wi + Wi = ni(N ni)+ ni nini The above diffusion equation is equivalent to a stochastic N − 2 − N differential equation for the density u[29]: So finally

du d ninj m ′′ ′′ = su(1 u)+ u(1 u)/Nη(x, t) (47) h i = nin + nj n dt − − dt 2 j i p 2 m m The origin of the noise is the genetic drift due to the size + δ n (N n ) + n′′ n n′′ i,j N h i − i i 2 h i i− N h i i i of the system.   For a spatially extended system, the Doering et al. phenomenological approach to derive the SFKPP con- C. Front speed and width in the mean field 2 sists in adding a (spatial) diffusion term D u to the approximation. stochastic equation 47, but conserving the∇ same local noise term, and neglecting fluctuations from adjacent The minimum speed of propagating front of the equa- cells. tion ∂u ∂2u = D [1 + s(1 u)] + µsu(1 u) (49) B. Moment computation algebra. ∂t − ∂x2 − is obtained by following the original Fisher[5] approach. The rules of moment computations are fairly standard For a propagating front, on setting ∂tu = v∂xu and (see for example [30]), but we give them here for self- then setting g = du/dx, we get the equation− for g(u) : consistency. Various moments can be extracted directly − from the Master Equation dg D [1 + s(1 u)] g vg + µsu(1 u)=0. (50) n − du − − ∂P ( ,t) + + = W (akn)P (akn) W (n)P (n) ∂t − Setting p = dg/du u=0 as the slope of the curve at the k | X  origin u =0, the equation for p is − † n † n − n n + W (ak )P (ak ) W ( )P ( ) (48) − D(1 + s)p2 vp + µs =0 Xk n o − by multiplying it by some operator and then making the which has a solution only for † change of variable n akn or a n. Consider for example → k v v =2 µDs(1 + s) ≥ min d n n ∂ h i j i = n n P (n,t) The width B = u(1 pu)dx of the front can be dt ∂t i j R − n computed following´ the same approach. Setting v = X{ } 12

∂t R udx, exchanging the derivation on t and integra- E. Relation between microscopic and macroscopic tion´ on x, and performing integration by parts on the front width. propagating front, we get By definition, du 2 v = µsB + Ds dx N ⋆2B⋆ = n⋆(N ⋆ n⋆) ˆR dx q − q   q 1 X = µsB + Ds gdu bq ˆ0 = n (1 n ) i − j q i,j=a The shape of the front g(u) is not known. However, g(u) X Xq is a smooth function, g(0) = g(1) = 0 and its slope at bq bq −1 bq = n (1 n )+ (n + n 2n n ) both ends, p = dg/du u=0 and q = dg/du u=1 are known  i − i i j − i j  | | q i=a i=a j=i+1 and determined from equation (50). Approximating then X  Xq Xq X  g(u) by a third order polynomial that respects these con- ⋆ ⋆ where n is defined by relation (40), bq = aq + N 1 and  straints ⋆ − aq+1 = bq +1. Note that we have N possible choices for ⋆ grouping individuals, as we can set aq = N q + r, where g(u)= u(1 u)(p (p + q)u) ⋆ − − r = 0, 1, ...,N 1. As ni = 0, 1, the first term in the − 2 above sum is zero and ni = ni. Rearranging the indices leads to: in the second term, we have

⋆ D(1 + s) N −1 bq −k B =2 κ(s) (51) N ⋆2B⋆ = (n + n 2n n ) (52) s µs i i+k − i i+k q i=a X Xk=1 Xq where Note that for an unrestricted sum ∞ s 2+ s n + n 2n n = Y κ(s)=1 1 (s +1)+1 i i+p − i i+p p − 24(1 + s) 1+ s − i=−∞ r ! ! X 15 13 = + + (s−2) for s 1 However, the problem with expression (52) is that we 16 192s O ≫ are missing the terms nini+k , where i is in one cell and i + k in another one. In fact, for each cell, we are missing k terms of the form nini+k connecting two neighboring D. Choice of initial conditions. cells. We now use our freedom to choose the grouping r: we use N ⋆ different choices of r and sum all of them. The definition of various moments we use in this article A term missing in one choice of r will be recovered in such as relations (10-12) ensures that the infinite sums another. As the result must not depend on the choice of over sites contain only a finite number of non-zero terms; r, we have it avoids the problem of spurious effects due to manipu- N ⋆−1 lation of divergent series. However, the initial front may N ⋆(N ⋆2B⋆)= (N ⋆ p)Y − p not need to be sharp, but only finite. p=1 Consider the discrete function X The index manipulation is clearer when performed man- ually on a few simple examples such as N ⋆ =2 or 3. f = N if i 0; =0 otherwise i ≤ where the position 0 corresponds the to middle of the F. Numerical simulations. initial front. We can redefine the moments as All numerical simulations are written in C++, and U = n (t) f i − i data analysis is performed by the high level language i X Julia[31]. Numerical simulation of the island model (sec- 1 Z = n (t)n (t) f f tion II) is performed by a Gillespie algorithm by com- p N 2 {h i i+p i− i i+p} i puting the jump probabilities from the transition rates X (2,3). For the generalized migration kernel of section III, The differential equations we derived throughout this ar- the Gillespie approach is too cumbersome and a direct ticle remain invariant under this definition of the mo- approach has been used: the index of an individual is ments, the only difference being that the initial values of chosen at random and it is replaced by the value of an- these moments are non-zero. other individual chosen according to the kernel m(k). 13

[1] M Kimura. On the probability of fixation of mutant Proc Nat Acad Sci USA, 74:5096–5098, 1977. in a population. Genetics, 47:713–719, 1962. [17] K S Korolev, Mikkel Avlund, Oskar Hallatschek, and [2] B Houchmandzadeh and M Vallade. Alternative to the David R Nelson. Genetic demixing and evolution in lin- diffusion equation in population genetics. Phys Rev E ear stepping stone models. Reviews of modern physics, Stat Nonlin Soft Matter Phys, 82(5 Pt 1):51913, 2010. 82(2):1691–1718, jun 2010. [3] Erez Lieberman, Christoph Hauert, and Martin A [18] W J Ewens. Mathematical Population Genetics. Nowak. Evolutionary dynamics on graphs. Nature, Springer-Verlag, 2004. 433(7023):312–316, 2005. [19] R A Fisher. The genetical theory of , [4] Bahram Houchmandzadeh and Marcel Vallade. The fix- a complete variorum edition. Oxford University Press, ation probability of a beneficial mutation in a geograph- 1999. ically structured population. New Journal of Physics, [20] P Moran. Statistical Processes of Evolutionary Theory. 13(7):073020, jul 2011. Clarendon Press, 1962. [5] R.A. Fisher. The wave of advance of advantageous genes. [21] M Kimura and G H Weiss. The Stepping Stone Model of Annals of Eugenics, 7:355–369, 1937. Population Structure and the Decrease of Genetic Corre- [6] A. N. Kolmogorov. A study of diffusion equation with lation with Distance. Genetics, 49(4):561–76, apr 1964. increase in the amount of substance. In Selected work [22] T Maruyama. A Markov process of frequency of N.A. Kolmogorov, chapter 38, pages 242–271. Kluwer change in a geographically structured population. Ge- Academic Publishers, 1991. netics, 76(2):367–377, 1974. [7] James D. Murray. Mathematical Biology: I. An Intro- [23] Stephen P. Hubbell. The unified neutral theory of Bio- duction (Interdisciplinary Applied Mathematics) (Pt. 1). diversity and Biogeography. Princeton University Press, Springer, 2007. 2001. [8] H.P. Beuer, W. Huber, and F. Petruccione. The Macro- [24] B Houchmandzadeh and M Vallade. Clustering in neutral scopic Limit in a Stochastic Reaction-Diffusion Process. ecology. Phys Rev E Stat Nonlin Soft Matter Phys, 68(6 Europhysics Letter, 30(April):69–74, 1995. Pt 1):61912, 2003. [9] S. Munier and R. Peschanski. Geometric Scaling as Trav- [25] M Vallade and B Houchmandzadeh. Analytical solution eling Waves. Physical Review Letters, 91(23):232001, dec of a neutral model of biodiversity. Phys Rev E Stat Nonlin 2003. Soft Matter Phys, 68:61902, 2003. [10] W Vansaarloos. Front propagation into unstable states. [26] Debabrata Panja. Asymptotic scaling of the diffusion Physics Reports, 386(2-6):29–222, nov 2003. coefficient of fluctuating pulled fronts. Physical Review [11] J G Kingsolver, H E Hoekstra, J M Hoekstra, D Berri- E, 68(6):065202, dec 2003. gan, S N Vignieri, C E Hill, a Hoang, P Gibert, and [27] R Nathan and Hc Muller-Landau. Spatial patterns of P Beerli. The strength of phenotypic selection in natu- seed dispersal, their determinants and consequences for ral populations. The American naturalist, 157(3):245–61, recruitment. Trends in ecology & evolution, 15(7):278– mar 2001. 285, jul 2000. [12] Masatoshi Nei. Selectionism and neutralism in molecular [28] Oskar Hallatschek. The noisy edge of traveling waves. evolution. and evolution, 22(12):2318– Proceedings of the National Academy of Sciences of the 42, dec 2005. of America, 108(5):1783–7, feb 2011. [13] Eric Brunet and Bernard Derrida. Shift in the velocity [29] C Gardiner. Handbook of Stochastic Methods: for of a front due to a cutoff. Physical Review E, 56(3):2597– Physics, Chemistry and the Natural Sciences. Springer, 2604, sep 1997. 2004. [14] Charles R. Doering, Carl Mueller, and Peter Smereka. [30] Leonid Pechenik and Herbert Levine. Interfacial velocity Interacting particles, the stochastic Fisher-Kolmogorov- corrections due to multiplicative noise. Physical Review Petrovsky-Piscounov equation, and duality. Physica A: E, 59(4):3893–3900, apr 1999. Statistical Mechanics and its Applications, 325(1-2):243– [31] Jeff Bezanson, Alan Edelman, Stefan Karpinski, and Vi- 259, jul 2003. ral B. Shah. Julia: A Fresh Approach to Numerical Com- [15] Oskar Hallatschek and K. Korolev. Fisher Waves puting. arxiv, page 1411.1607, nov 2014. in the Strong Noise Limit. Physical Review Letters, 103(10):108103, sep 2009. [16] S. N. Ethier and M. F. Norman. Error estimate for the diffusion approximation of the Wright–Fisher model.