<<

arXiv:1612.00825v2 [q-bio.PE] 8 Dec 2017 r bet pnaeul sebeit ii vesicles lipid into assemble spontaneously to These abundantly able [19–23]. Earth meteorites likely are prebiotic were chondrite the molecules by on such present earth Hence, could to lipids prebioti- [16–18]. imported Alternatively, a been in [15]. produced have manner be plausible can membrane, cally blocks lipid building the are amphiphilic that that for acids shown fatty been simple availability like has molecules, prebiotic It the membranes. of lipid support of in evidence el- is occurred. genetic have There self-replicating must compartmentalization of dur- such emergence before, ements, the either after life, at or of development Hence, ing, the researchers. in world point to RNA some by akin after molecules sought those enzymatic and re- information-carrying has world) RNA the 14]. of [13, grail challenge a holy mained the effi- called (some- building replicases times 12]), plausible [11, prebiotically and stable, e.g. cient, prebiotic progress, in exciting some efforts (and of [10]. decades despite challenge synthesis Indeed, prebiotic a spontaneous creates their catalysts explaining sequences efficient for RNA as long serve of could complexity that enzymes the cooperative Moreover, for parasitism [6–9]. and catastrophe [5] error replicases the suffer for including often pitfalls, bio- calamitous similar theoreti- from or Both RNA of life. well-mixed that populations of show investigations origin experimental the and complete cal of a theory in into narrative rigorous role this and difficul- developing central formidable for a remain But played ties [2–4]. have self-replication must as initiating well one, bio-, as enzymatic similar , a an information-carrying or an RNA origin both that the being in postulates RNA It of life. role of en- the framework about leading plausi- theories a find is compassing self-replication. hypothesis” to of world capable therefore “RNA are The is of that life objective systems of major chemical A ble origin the [1]. on evolve research and replicate to ability nmdr el,lpdmmrnscompartmentalize membranes lipid cells, modern In their is organisms living of characteristic defining A 1 h iet rdc neovbepooelsae algebraic scales protocell evolvable an when produce that to show We time produced. the is protocell evolvable minimal a cln nteasneo uin edsusteimplication the discuss processes. We biological fusion. other of as absence the in scaling rgno el.I h otx foii flf,w sueth assume we life, of origin of context requires the components, functional In cells. of origin rga o vltoayDnmc,HradUiest,Ca University, Harvard , Evolutionary for Program oprmnsaeuiutu hogotbooy e their yet biology, throughout ubiquitous are Compartments 4 eateto ahmtc,HradUiest,Cambridge University, Harvard Mathematics, of Department a Sinai Sam 3 eateto hsc,HradUiest,Cmrde Mas Cambridge, University, Harvard , of Department rmrilSxFclttsteEegneo of Emergence the Facilitates Sex Primordial 1 , 2 avr nvriy abig,Msahsts018 USA 02138, Massachusetts Cambridge, University, Harvard , ∗ ao Olejarz Jason , 2 eateto raimcadEouinr Biology, Evolutionary and Organismic of Department N opnnst eeovbe ecluaetetmsaei wh in timescale the calculate We evolvable. be to components Dtd eebr1,2017) 11, December (Dated: 1 , ∗ ui .Neagu A. Iulia , hc nti otx r nw as compartments, known are forming context 24], this in [23, which conditions aqueous in asg rcs 3,3] easm hti re obe number, certain to a order contain in N to that needs assume protocell We a evolvable, 37]. [36, process passage they because miracle. assumption, probabilistic origin this a no on it an Almost consider operate which models large. from life as- be molecules of to could of unreasonable emerges set be cell starting not evolvable the would it that by cases sume cells such vastly complicated In have producing luck. could of replication, chances possibly the of membranes, improved advent of the presence before Hence, early even protocell. that evolvable for propose an ability the we produce the changes to that categorically required suggest merge time we to study, compartments this outlined these of sex”, reasons For rest “primordial the of attention. in much form received a not is have which protocells, among sex. of considered property is defining individuals in- a two sharing thereby biology, between In content merge, formation 33–35]. to [25, able contents are their informa- sharing protocells enclosing dividing, to addition and particular to In tion in them 28–32]. polymers, 7, enable cooperative [6, protocells replicases contents for of selection their in properties of help These parts divide inherit also 27]. that can [26, protocells and Protocells any start new [25]. for to into reactions metabolism required side a be of may maintain effects that the polymer. the cycles contiguous auto-catalytic dampen within one also information in can stored house the It be can not protocell need i.e. the protocell genome, that segmented means the a also en- within It outside interactions the same with vironment. increased interactions the in decreased share and results and protocell of This contents other The each near fate. life. held to are prelife protocells from transition the pede fcmoettps(.. itntmlclso var- of molecules distinct (i.e., types component of , ots hshptei,w netgt ipefirst- simple a investigate we hypothesis, this test To ability information-sharing this of implications The im- an that pitfalls the of some alleviate Protocells 1 lyin ally tapooel oprmn enclosing compartment a protocell, a at fti eutfroiiso ie swell as life, of origins for result this of s , 3 n atnA Nowak A. Martin and , rtclsfs n hr information, share and fuse protocells big,Msahsts018 USA 02138, Massachusetts mbridge, motnesrthsbc othe to back stretches importance ascuet 23,USA 02138, Massachusetts , N ncnrs oa exponential an to contrast in , ahsts018 USA 02138, sachusetts 1 , 2 , protocells 4 , † ∗ ich . 2 ious complexity) [8, 38–42]. In early life, these could be molecules as simple as , activated monomers, molecules that stabilize the membrane, or more compli- cated polymers, like oligo-peptides, and even elementary ribozymes and simple unlinked genes [12, 25, 29, 30, 43– 48]. More precisely, the target set should result in an auto-catalytic network that results in a evolvable cell with non-negligible probability. Such a scheme has been proposed since Oparin, and has been defended more re- cently [48]. We term the smallest set of necessary and sufficient components from which an evolvable protocell can be made a minimal evolvable protocell. We can accordingly represent the functional (or ge- netic) content of each protocell as a binary string of length N. For simplicity, we ignore the redundancy (or dose) of each component in the protocell, and are only concerned with each component’s presence. If a proto- cell contains a particular component i, then the string will have a value of 1 at the ith position and 0 otherwise. Whenever a protocell randomly assembles, we assume that it contains each of the N component types indepen- dently (components do not compete for positions) with FIG. 1. Merging occurs between randomly assembled proto- probability p. I.e. protocell assembly uniformly samples cells. (A) Each color (and a “1” bit at each corresponding each type (with sufficient abundance) from the environ- position on a protocell’s representative binary string) indi- cates presence of one of the four components needed for the ment with probability p. Whenever two protocells merge, protocell to be evolvable (here, N = 4). Randomly assembled the value of the resulting string at every position i is sim- lipid membranes form around the components. (B) When- ply determined by a bitwise OR operation on the ith bits ever two protocells merge, they share their contents. Sharing of the two parent protocells (i.e. if either of the origi- of contents is computed as a bitwise OR operation between nal cells contain a component, the resulting cell will also each of the two parent strings of length N. contain it). This is shown schematically in Figure 1. The dynamical process is as follows. On the first step, the accumulator—the object of our attention—consists number of random assembly and merging events in the of a randomly assembled protocell. If less than N com- accumulation process. ponents are enclosed, then one of two things can happen: The time, Z, needed to form a minimal evolvable pro- With probability δ, the accumulator loses its contents, tocell is thus a random variable that depends on the par- and on the second step, the accumulator consists of a ticular accumulator being tracked. If we track many such new randomly assembled protocell, with the accumula- accumulators, then what is the mean first-passage time, tion process starting over. The accumulator can lose its E[Z], for an accumulator to achieve all N components contents if, for example, its membrane’s integrity is lost, necessary for evolvability? it is infected by a parasite, or it divides, and the param- Begin by considering the simple case δ = 1 (no merg- eter δ accounts for all such possibilities. Or with proba- ing occurs). If the accumulator consists of a randomly bility 1 δ, on the second step, the accumulator merges assembled protocell that has all N components, then the with a randomly− assembled protocell from the environ- minimal evolvable protocell has been achieved. But if ment, possibly gaining additional components. In this there are less than N components, then the accumulator case, if the accumulator still has less than N components is reset without merging. Thus, the expected number of after merging, then one of two things can happen: With such random assembly events required to accumulate all probability δ, the accumulator loses its contents, and on N components necessary for evolvability, Eδ=1[Z], grows the third step, the accumulator consists of a new ran- exponentially with N, i.e., domly assembled protocell, with the accumulation pro- cess starting over. Or with probability 1 δ, on the third 1 N − E [Z]= step, the accumulator merges with another randomly as- δ=1 p sembled protocell from the environment, possibly gaining   additional components. This process continues until the For large values of N, the spontaneous generation of a accumulator gains all N components necessary for evolv- minimal evolvable protocell would be a probabilistic mir- ability. The total number of steps (or time units), Z, acle. We now focus our attention on understanding how needed to gain all N components is equal to the total E[Z] grows with N when 0 <δ< 1. 3

In what follows, it is convenient to use the param- To extract the large-N behavior of E[Z] from Eq. (5), eter q 1 p. Denote by S(q,δ,N) the probabil- we simplify the summation in Eq. (1) for large N using ity that,≡ starting− from a randomly assembled proto- the following procedure. For a smooth function f(x), we cell, the accumulator achieves all N components be- use the notation f (i)(x) = dif(x)/dxi. We can express fore being reset. We determine S(q,δ,N) as follows. an integration of f (i)(x) with respect to x from 0 to First, assume that there is no death of the accumula- as ∞ z tor. Then 1 q is the probability that, after z steps, ∞ − ∞ 1 1 z + y the accumulator has achieved a component. Therefore, dx f (i)(x)= dy f (i) 1 (1 qz)N is the probability that the accumulator N N Z0 z=0 Z0   has− not− achieved all N components after z steps. It fol- X lows that (1 qz)N (1 qz−1)N is the probability that Next, we write a Taylor expansion of f (i)((z + y)/N) in − − − the accumulator achieves all N components in exactly z powers of y/N and perform the integration over y. We steps. Then, considering death of the accumulator, since have the probability that the accumulator survives for z steps z−1 ∞ ∞ ∞ without being reset is simply (1 δ) , we have (i) 1 1 (i+m) z − dx f (x)= m f 0 (m + 1)!N "N N # Z m=0 z=0   ∞ X X (6) − N − N S(q,δ,N)= (1 δ)z 1 (1 qz) 1 qz 1 Substituting Eq. (6) into Eq. (1) to express the sum- − − − − z=1 X h i mation as an integration, substituting the integral form  1 x−1 y−1 This can be simplified as of the Beta function, B(x, y)= 0 dt t (1 t) , and using to denote asymptotic equivalence as−N , we ∞ ∼ R → ∞ δ obtain S(q,δ,N)= (1 δ)z (1 qz)N (1) 1 δ − − − z=1 δ log(1 δ) X S − B − ,N +1 (7) ∼ (1 δ) log(q) log(q) Denote by T (z; q,δ,N) the probability mass function for −   the number of steps, z, needed for the accumulator to Substituting Eq. (7) into Eq. (5), expressing the gain all N components (i.e., reach its target) when start- Beta function using Gamma components, B(x, y) = ing from a randomly assembled protocell, given that all Γ(x)Γ(y)/Γ(x + y), using Stirling’s formula for the N components are accumulated before being reset. We Gamma function, Γ(x) xxe−x 2π/x, and simplify- have ing for large N, we find∼ that E[Z] grows asymptotically N p (1 δ)z−1 (1 qz)N 1 qz−1 as T (z; q,δ,N)= − − − − h S(q,δ,N)  i E[Z] αN k, (8) (2) ∼ Denote by R(z; q,δ,N) the probability mass function for where the number of steps, z, taken before the accumulator is (1 δ) log(1 p) reset when starting from a randomly assembled protocell, α = − − − given that the accumulator is reset before gaining all N δ2Γ(k) components. We have and z−1 z N δ(1 δ) 1 (1 q ) log(1 δ) R(z; q,δ,N)= − − − (3) k = − . 1 Sh(q,δ,N) i log(1 p) − − In what follows, we omit explicitly writing the functional The time complexity of concurrence of components for dependencies on q, δ, and N for notational convenience. the problem of is thus fundamentally altered: For all 0 <δ< 1, the mean first-passage time, E[Z], For any slight amount of merging, i.e., for any value 0 < needed to form a minimal evolvable protocell is calculated δ < 1, E[Z] grows algebraically with N. Intriguingly, for directly from many values of p and δ, E[Z] grows only as a small power of N, and for many other values of p and δ, E[Z] grows ∞ z [ST (z)+(1 S)R(z)] E[Z]= z=1 − (4) only sublinearly with N (Figure 2). S P For the particular case in which δ 1, p 1, and ≪ ≪ Substituting Eqs. (1), (2), and (3) into Eq. (4) and δ is not too large relative to p, Eq. (8) admits a simple simplifying, we obtain approximation: 1 1 δ 1 E[Z]= − (5) E[Z] N δ/p (9) Sδ − δ ≈ δ 4

as a randomly assembled protocell, and it is never reset. For this case, the mean first-passage time, Eδ=0[Z], grows logarithmically with N, i.e. [49],

N N ( 1)i+1 log(N) E [Z]= − δ=0 i 1 (1 p)i ∼ log(1 p) i=1 X   − − − − Also of interest for the biologically realistic case 0 < δ < 1 is the probability mass function, P (Z = z), for the number of steps needed to achieve a minimal evolvable protocell. P (Z = z) is given by

z P (Z = z)= S (1 S)i−1 T (z ) R(z ) − 1 j i=1 i z =z j=16 X Pj=1Xj Y (10) If N is small, then there is typically a small number of resets before the accumulator gains all components, which corresponds to each zj being comparable in mag- nitude to z in the summations in Eq. (10). But if N is large, then there is typically a large number of resets before the accumulator gains all components, which cor- responds to having z z for all j in the summations in FIG. 2. Minimal evolvable protocells are achieved in poly- j ≪ nomial time for the vast majority of parameter space. For Eq. (10). In this case, the total number of steps, Z, is N = 10, N = 25, and N = 100, we perform Monte Carlo the sum of many independent and identically distributed simulations of the accumulation of components, and we plot random variables. E Z N logN ( [ ]) as a function of p and δ. For N → ∞, we plot k To provide a sense of how well of an estimator E[Z] N as a function of p and δ. is for the variable Z we look at its concentration Z˜ = Z/E[Z]. Denote µ as the average number of steps be- Comparison of the model, approximation, and simulation 20 data for =001 and varying fore an accumulator resets given that the accumulator 2 2 19 resets before gaining all N components. Denote σ as ] 2 Model

18 [ 2 Simulation =0015 17 the variance in the number of steps before an accumula- 2 Eq. (8) 216 tor resets given that the accumulator resets before gain- 15 Eq. (9) ∞ 2 =001 ing all N components. We have µ = zR(z) 1/δ 14 z=1 2 2 ∞ 2 2 2 ∼ 213 and σ = z=1 z R(z) µ (1 δ)/δ . Since both 12 2 − ∼ −P 2 =0005 µ and σ are finite, the central limit theorem enables a 211 P  210 simplification of Eq. (10) for large values of N: we obtain 29 the probability density function for Z˜: 8

Mean first-passage time first-passage Mean 2 27 ∞ 21 22 23 24 25 26 27 28 S(1 S)i (E[Z]˜z iµ)2 Number of components N P (Z˜ =z ˜) E[Z] − exp − − ∼ √ 2 2iσ2 i=1 2πiσ X   FIG. 3. For p = 0.01 and δ = 0.005, δ = 0.01, and δ = (11) 0.015, we plot the exact form of E[Z] for any value of N The moments of Z˜ are directly computed from Equation given by Eq. (5) (solid lines), E[Z] measured using a Monte (11): Carlo simulation of the accumulation of components (×), the  ∞ asymptotically exact form of E[Z] given by Eq. (8) ( ), and m m ◦ E[Z˜ ] dz˜ z˜ P (Z˜ =z ˜) m! the approximation for E[Z] given by Eq. (9) ( ). ∼ ∼ Z0 We immediately see that Z is exponentially distributed The exact form of E[Z] for all values of N given by Eq. about E[Z]: (5), E[Z] measured using a Monte Carlo simulation of 1 z the accumulation of components, the exact asymptotic P (Z = z) exp ∼ αN k −αN k result for E[Z] given by Eq. (8), and the approximation   for E[Z] given by Eq. (9) are plotted in Figure 3 for For large N, the natural production of minimal evolvable several values of p, δ, and N. protocells via random assembly and repeated fusion is For the particular case δ = 0, the accumulator begins therefore simply a Poisson process. 5

It is noteworthy that E[Z], Eq. (5), provides an upper host [56, 57]. The occurrence of this type of combi- bound on the time to construct a minimal evolvable pro- natorial reproduction in many RNA , which are tocell for many natural variations of this process. There thought to be ancient, is consistent with the thesis that are many ways in which the first-passage time can be primordial sex played an integral role in early life [58, 59]. shortened. For instance, the expected time to reach the Research into minimal synthetic cells has shown that target set of N components is reduced if cells divide (and cells with few hundred genes are able to self-sustain in retain some components) instead of losing all components complex media [60–62]. This suggests that even for low through death. Redundancy in components, where a pro- values of p, in this case the probability of required genes tocell might have one or more backup copies of each com- vs random -coding genes, novel self-sustaining ponent, can have a similar effect. Moreover, our simple cells (and possibly viruses) could be produced, either in model is specified by only three parameters. Our model lab or in early life, by a feasible number of fusions. The is therefore robust for exploring the time complexity of feasibility of finding novel viable combinations through myriad of compartmentalization scenarios by simply tun- a merging process in lab should also be of help in order ing the values of p, δ, and N. to understand the density of viable solutions within the Doing so will help in understanding several biological fitness landscape [51]. questions, and relates our work to other studies that are We may never know with certainty what path has interested in the timescale of evolutionary events. For resulted in the emergence of life on earth. There are instance, Wilf and Ewens [50] arrive at exactly the same likely many possible paths to evolvability, none of which formula for δ = 0 when looking for the time it takes have been fully delineated to this date. So far, virtually for evolution (on a smooth landscape with a single peak, all models of protocells assumed a small initial set size, hence δ = 0) to reach a target set of genes. This anal- precisely because co-occurrence of many components to- ysis is also favorable to viewing sex as the default bi- gether is unlikely. We show that even if the number of ological state. Computational analyses of sex suggest required components N is large, there are tenable paths that it makes evolutionary search over a landscape more to construct such an assembly. The merging mechanism efficient[51–53]. Our analysis adds that this advantage is not as critical if N is small, but in the presence of could be present, and aid, in starting cellular replication merging compartments we are no longer restricted to this itself. While biologists have considered the possibility of scenario. Here, we have devised and analyzed a model early sex before [34, 54], it was soon observed that par- that captures a general set of possibilities for an evolv- asitism could be a serious problem [25]. However, our able protocell to emerge. It is noteworthy that our model exact asymptotic analysis instead suggests that sex is a remains agnostic about whether template-directed repli- good strategy, even in the presence of parasites. cation or metabolism emerged first and it can apply in Oceanic currents in early earth could have brought to- both scenarios as well as different levels of complexity gether primitive protocells with disparate components, (from chemicals to enzymes and genes) in the underlying which subsequently merged and eventually spawned an components. evolvable protocell. In this scenario, protocell forma- To the best of our knowledge, our study is the first tion, convection, and merging act as a necessary bridge to provide a rigorous and quantitative blueprint for com- between physically and chemically heterogeneous prebi- paring the plausibility of a subset of paths to life: those otic environments for biological construction. Indeed, that involve compartmentalization. there is exciting, ongoing experimental work on creating “self-sustaining” protocells, which can divide and subse- The authors thank Krishnendu Chatterjee for com- quently restore their viable composition via fusion for a ments about the manuscript. We also thank Leslie few generations [55]. Valiant and Scott Linderman for helpful comments in Our mathematical model is similarly well suited for the initial phases of this project. We thank Robert Israel investigating the biological activation of modern viruses. for pointing us to related literature. We thank Michael In particular, our model captures a process known as Nicholson and Nicolas Fraiman for helpful discussions. multiplicity reactivation. In this process multiple non- We thank Artem Kaznatcheev for a great discussion of functional, mutant viruses of the same strain combine, our preprint on his blog. This work was supported by thereby “covering” each other’s loss-of-function muta- the John Templeton Foundation and in part by a grant tions and producing a functioning . Our analysis from B. Wu and Eric Larson. readily provides the number of such viral particles re- quired (in expectation) that would re-activate a virus. In a similar scenario, in multi-compartment viruses, multi- ple distinct components need to co-infect the same host ∗ S. S. and J. O. contributed equally to this work.; in order to produce a new virion. In many plant viruses, † martin [email protected] such as the genus Tymovirus, the occurs when [1] M. A. Nowak, Evolutionary dynamics (Harvard Univer- two or more functionally distinct virions infect the same sity Press, 2006). 6

[2] F. H. Crick et al., Journal of 38, 367 bridge University Press, 2001). (1968). [37] T. Chou and M. R. D’Orsogna, “First passage problems [3] L. E. Orgel, Journal of molecular biology 38, 381 (1968). in biology,” (World Scientific, 2014) Chap. 13, pp. 306– [4] C. Woese, The genetic code (Harper and Row, New York, 345. 1967). [38] E. Szathm´ary and L. Demeter, Journal of theoretical bi- [5] M. Eigen, Naturwissenschaften 58, 465 (1971). ology 128, 463 (1987). [6] Y. Bansho, T. Furubayashi, N. Ichihashi, and T. Yomo, [39] D. Segr´e, D. Ben-Eli, and D. Lancet, Proceedings of the Proceedings of the National Academy of Sciences , National Academy of Sciences 97, 4112 (2000). 201524404 (2016). [40] T. G´anti, BioSystems 7, 15 (1975). [7] A. J. Markvoort, S. Sinai, and M. A. Nowak, Journal of [41] S. A. Kauffman, Journal of theoretical biology 119, 1 theoretical biology 357, 123 (2014). (1986). [8] J. F. Fontanari, M. Santos, and E. Szathm´ary, Journal [42] N. Vaidya, M. L. Manapat, I. A. Chen, R. Xulvi-Brunet, of theoretical biology 239, 247 (2006). E. J. Hayden, and N. Lehman, 491, 72 (2012). [9] P. Hogeweg and N. Takeuchi, Origins of Life and Evolu- [43] S. S. Mansy, J. P. Schrum, M. Krishnamurthy, S. Tob´e, tion of the Biosphere 33, 375 (2003). D. A. Treco, and J. W. Szostak, Nature 454, 122 (2008). [10] H. S. Bernhardt, Biology Direct 7 (2012), 10.1186/1745-6150-7-23.[44] M. Fishkis, Origins of Life and Evolution of Biospheres [11] J. Attwater, A. Wochner, and P. Holliger, Nature chem- 37, 537 (2007). istry 5, 1011 (2013). [45] I. A. Chen and M. A. Nowak, Accounts of chemical re- [12] T. A. Lincoln and G. F. Joyce, 323, 1229 (2009). search 45, 2088 (2012). [13] A. Pross and R. Pascal, [46] R. A. Black, M. C. Blosser, B. L. Stottrup, Open Biology 3 (2013), 10.1098/rsob.120190. R. Tavakley, D. W. Deamer, and S. L. Keller, [14] P. G. Higgs and N. Lehman, Nature Reviews Proceedings of the National Academy of Sciences of the United States of America 110, 13272 (2013). (2014). [47] Y. Bar-Yam, Dynamics of complex systems, Vol. 213 [15] T. M. McCollom, G. Ritter, and B. R. Simoneit, Origins (Addison-Wesley Reading, MA, 1997). of Life and Evolution of the Biosphere 29, 153 (1999). [48] V. Vasas, C. Fernando, M. Santos, S. Kauffman, and [16] G. Yuen, N. Blair, D. J. Des Marais, and S. Chang, E. Szathm´ary, Biology Direct 7, 1 (2012). Nature (1984). [49] A reader with interest in algorithms may recognize this [17] J. G. Lawless and G. U. Yuen, Nature (1979). result as the expected number of levels in a skip list with [18] D. W. Deamer, Nature (1985). N elements, where 1-p is the probability of each element’s [19] D. Segr´e, D. Ben-Eli, D. W. Deamer, and D. Lancet, appearance in the next level of the skip list [63, 64]. Origins of Life and Evolution of the Biosphere 31, 119 [50] H. S. Wilf and W. J. Ewens, Proceedings of the National (2001). Academy of Sciences 107, 22454 (2010). [20] N. Lane and W. F. Martin, Cell 151, 1406 (2012). [51] K. Chatterjee, A. Pavlogiannis, B. Adlam, and M. A. [21] P. L. Luisi, P. Walde, and T. Oberholzer, Current opin- Nowak, PLoS Comput Biol 10, e1003818 (2014). in colloid & interface science 4, 33 (1999). [52] A. Livnat, C. Papadimitriou, J. Dushoff, and M. W. [22] C. Paleos, Trends in Biochemical Sciences 40, 487 (2015). Feldman, Proceedings of the National Academy of Sci- [23] D. W. Deamer, Origins of Life and Evolution of the Bio- ences 105, 19803 (2008). sphere 17, 3 (1986). [53] A. Livnat and C. Papadimitriou, Communications of the [24] S. Yamamoto, Y. Maruyama, and S.-a. Hyodo, The Jour- ACM 59, 84 (2016). nal of Chemical Physics 116, 5842 (2002). [54] J. B. S. Haldane, Rationalist Annual 148, 3 (1929). [25] E. Szathm´ary, M. Santos, and C. Fernando, in Prebiotic [55] K. Kurihara, Y. Okura, M. Matsuo, T. Toyota, Chemistry (Springer, 2005) pp. 167–211. K. Suzuki, and T. Sugawara, Nature communications [26] A. Markvoort, A. Smeijers, K. Pieterse, R. van Santen, 6 (2015). and P. Hilbers, The Journal of B 111, [56] J. M. Smith, Evolutionary genetics. (Oxford University 5719 (2007). Press, 1989). [27] D. Fanelli and A. J. McKane, [57] A. Rao, Annu. Rev. Phytopathol. 44, 61 (2006). Physical Review E 78 (2008), 10.1103/PhysRevE.78.051406.[58] A. Nasir and G. Caetano-Anoll´es, Science Advances 1, [28] G. Bianconi, K. Zhao, I. A. Chen, and M. A. Nowak, e1500527 (2015). PLoS Comput Biol 9, e1003051 (2013). [59] E. V. Koonin, T. G. Senkevich, and V. V. Dolja, Biol [29] T. F. Zhu and J. W. Szostak, Journal of the American Direct 1, 29 (2006). Chemical Society 131, 5705 (2009). [60] A. R. Mushegian and E. V. Koonin, [30] I. Budin, A. Debnath, and J. W. Szostak, Journal of the Proceedings of the National Academy of Sciences of the United States of America 93, 10268 (1996). American Chemical Society 134, 20812 (2012). [61] R. Gil, F. J. Silva, J. Pereto, and A. Moya, [31] K. Adamala and J. W. Szostak, Science 342, 1098 (2013). Microbiology and Molecular Biology Reviews 68, 518 (2004). [32] M. M. Hanczyc, S. M. Fujikawa, and J. W. Szostak, [62] C. A. Hutchison III, R.-Y. Chuang, V. N. Noskov, Science 302, 618 (2003). N. Assad-Garcia, T. J. Deerinck, M. H. Ellisman, [33] P. L. Krapivsky, S. Redner, and E. Ben-Naim, A Kinetic J. Gill, K. Kannan, B. J. Karas, L. Ma, J. F. Pel- View of (Cambridge University Press, letier, Z.-Q. Qi, A. Richter, E. A. Strychalski, L. Sun, 2010). Y. Suzuki, B. Tsvetanova, K. S. Wise, H. O. Smith, J. I. [34] H. Bernstein, H. C. Byerly, F. A. Hopf, and R. E. Mi- Glass, C. Merryman, D. G. Gibson, and J. C. Venter, chod, Journal of Theoretical Biology 110, 323 (1984). Science 351 (2016), 10.1126/science.aad6253. [35] M. Santos, E. Zintzaras, and E. Szathm´ary, Origins of [63] W. Pugh, Communications of the ACM 33, 668 (1990). Life and Evolution of the Biosphere 33, 405 (2003). [64] P. Kirschenhofer, C. Mart´ınez, and H. Prodinger, The- [36] S. Redner, A Guide to First-Passage Processes (Cam- oretical Computer Science 144, 199 (1995).