Universal motifs and the diversity of autocatalytic systems

Alex Blokhuisa,b,c,d,1, David Lacostea, and Philippe Ngheb,1

aGulliver Laboratory, UMR CNRS 7083, Paris Sciences et Lettres University, Paris F-75231, France; bLaboratoire de Biochimie, UMR CNRS 8231, Chimie Biologie et Innovation, Ecole Superieure´ de Physique et de Chimie Industrielles de la ville de Paris, PSL University, Paris F-75231, France; cGroningen Institute for Evolutionary Life Sciences, University of Groningen, 9747 AG Groningen, The Netherlands; and dCentre for Systems Chemistry, Stratingh Institute, University of Groningen, 9747 AG Groningen, The Netherlands

Edited by Peter Schuster, University of Vienna, Vienna, Austria, and approved September 1, 2020 (received for review June 30, 2020)

Autocatalysis is essential for the origin of life and chemical evo- From this definition, we derive conditions to determine lution. However, the lack of a unified framework so far prevents whether a subnetwork embedded in a larger chemical network a systematic study of autocatalysis. Here, we derive, from basic can be catalytic or autocatalytic. These conditions provide a principles, general stoichiometric conditions for and auto- mathematical basis to identify minimal motifs, called autocat- catalysis in networks. This allows for a classifica- alytic cores. We found that cores have five fundamental cat- tion of minimal autocatalytic motifs called cores. While all known egories of motifs. They allow classification of all previously autocatalytic systems indeed contain minimal motifs, the classifica- described forms of autocatalysis, and also reveal hitherto uniden- tion also reveals hitherto unidentified motifs. We further examine tified autocatalytic schemes. We then study the kinetic condi- conditions for kinetic viability of such networks, which depends tions, which we call viability conditions, under which autocat- on the autocatalytic motifs they contain and is notably increased alytic networks can appear and be maintained on long timescales. by internal catalytic cycles. Finally, we show how this frame- We find that networks have different viabilities depending on work extends the range of conceivable autocatalytic systems, by their core structure, and, notably, that viability is increased applying our stoichiometric and kinetic analysis to autocatalysis by internal catalytic cycles. Finally, we expand the repertoire emerging from coupled compartments. The unified approach to of autocatalytic systems, by demonstrating a general mecha- autocatalysis presented in this work lays a foundation toward nism for their in multicompartment systems (e.g., the building of a systems-level theory of chemical evolution. porous media, vesicles, multiphasic systems). This mechanism strongly relaxes chemical requirements for autocatalysis, making autocatalysis | origin of life | chemical reaction networks the phenomenon much more diverse than previously thought. Examples, Definitions, and Conventions he capacity of living systems to replicate themselves is rooted Tin a chemistry that makes more of itself, that is, an auto- Catalysis and Autocatalysis. The following reactions have the catalytic system. Autocatalysis appears to be ubiquitous in liv- same net mass balance but a different status regarding catalysis: ing systems from molecules to ecosystems (1). It is also likely (I) (II) (III) to have been continually present since the beginning of life A B, A + E B + E, A + B 2B. [1] and is invoked as a key element in prebiotic scenarios (2–5). Surprisingly, autocatalysis is considered to be a rarity in chem- Since no species is both a reactant and product in reaction I, it istry (6). Developments in systems chemistry are changing this should be regarded as uncatalyzed. Reactions II and III instead view, with an increasing number of autocatalytic systems synthe- contain species which are both a reactant and a product, species sized de novo (7–9). Chemical replicators have been endowed E in reaction II and species B in reaction III, and, following the with biomimetic properties such as protein-like folding (10) and parasitism (11). Autocatalysis has also found technologi- Significance cal applications, for example, enantiomer enrichment and acid amplification (12–14). Autocatalysis, the ability of chemical systems to make more Understanding autocatalysis represents a primary challenge of themselves, is a hallmark of living systems, as it underlies for theory. Models based on autocatalysis were first built to metabolism, reproduction, and evolution. Here, we present explain a diversity of dynamical behaviors in so-called dissipative a unified theory of autocatalysis based on . structures, such as bistable reactions (15), oscillating reactions, This allows us to identify essential motifs of autocatalytic and chemical waves (16). Autocatalysis then became a central networks, namely, autocatalytic cores, which come in five cat- topic in the study of self-replication dynamics in biological and egories. In these networks, internal catalytic cycles are found prebiotic systems (3, 17–19) (see refs. 20–22 for recent reviews). to favor growth. The stoichiometry approach furthermore Despite this history, a unified theory of autocatalysis is still reveals that diverse autocatalytic networks can be formed lacking. Such a theory is needed to understand the origins, diver- with multiple compartments. Overall, these findings suggest sity, and plausibility of autocatalysis. It would also provide design that autocatalysis is a richer and more abundant phenomenon principles for artificial autocatalytic systems. Here, we present a than previously thought. framework that unifies the different descriptions of autocatalysis and is based on reaction network stoichiometry (23–27). Author contributions: A.B., D.L., and P.N. designed research, performed research, and Let us start from basic definitions in chemistry as established wrote the paper.y by the International Union of Pure and Applied Chemistry (28) The authors declare no competing interest.y (see SI Appendix, section I for full definitions), where autocataly- This article is a PNAS Direct Submission.y sis is a particular form of catalysis: “A substance that increases Published under the PNAS license.y the rate of a reaction without modifying the overall standard 1 ◦ To whom correspondence may be addressed. Email: [email protected] or philippe. Gibbs energy change (∆G ) in the reaction; the process is called [email protected] catalysis. The catalyst is both a reactant and product of the reac- This article contains supporting information online at https://www.pnas.org/lookup/suppl/ tion. Catalysis brought about by one of the products of a (net) doi:10.1073/pnas.2013527117/-/DCSupplemental.y reaction is called autocatalysis.” First published September 28, 2020.

25230–25236 | PNAS | October 13, 2020 | vol. 117 | no. 41 www.pnas.org/cgi/doi/10.1073/pnas.2013527117 Downloaded by guest on September 24, 2021 Downloaded by guest on September 24, 2021 lkuse al. et Blokhuis obnn ecin 1 reactions Combining (A) catalysis. 1. Fig. pro- species every vector for reaction positive A negative and duced. is consumed coefficient species the every reaction; the for of that coefficients in stoichiometric participating the are species column a species. in to entries correspond The rows matrix and reactions stoichiometric to correspond columns a as represented Vectors. Reaction and Matrix Stoichiometric stoichiometric the generalize ones. reac- kinetic first net examine will the then we conditions, of following, rate the the In accelerate tion. also according should which, definition, catalysis, the characterize to to sufficient of not but case sary the with to it both. contrast propose to to common we allocatalysis being literature, catalysis of autocatalysis, example chemistry an the it in call to catalyzed represents referred simply III usually experi- is reaction as II B reason, reaction species this Although the autocatalysis. in For genuine where production. unchanged, III, net remains reaction a E of catalysts. ences species case as of the to amount considered contrast the be II, can reaction species In these above, definition ersnaino h uoaayi subnetwork. autocatalytic the of submatrix representation (E autocatalytic B. the to encloses square A converting 1 cycle Combining alytic (D) subnetwork. allocatalytic the submatrix allocatalytic ( the B. to A converts either is species avoided steps participating is reactions a product, a This that of or so I. reactant sequence leading a emerges, reaction a it side, for which through each as from catalysis on vector cancel describing column catalysts by same III, the and to II instance, reactions For matrix. in stoichiometric the in ones 1A, uncatalyzed from Fig. in example, For compounds). law. (total conservation are mass-like quantities conserved a coefficients as all to of case, case that The in quantity. because, laws, vation Vectors nonzero a ν is cycle reaction A vector coordinates. reaction nonzero its of set the is numbers species AB D A AB · eepaieta tihoerccnieain r neces- are considerations stoichiometric that emphasize We aty aaye ecin a o lasb distinguished be always not may reactions catalyzed Lastly, c + + = B2B AB AE EA E r equivalently, or, 0, ifrn ersnain o (A–C for representations Different b T 1’ 2’ 2’’ 1’’ eogn otelf ulsaeof space null left the to belonging EA AB tihoercmti of matrix Stoichiometric B) c uhta ontseisnme hneoccurs: change number species net no that such ∆ + n B A A = E + + ν ν ¯ 0 n AB · B E EA E B A B A c g o network for E g h upr of support The . IIIa

IIa eog otergtnl pc of space null right the to belongs [g = + tihoercmti of matrix Stoichiometric ) EA n AB 0 -1 -1 -1 -1 EA 2 + 1 1’’ 0 1 1-1 1’ ,

.., IIIb IIb

0 ctlss and (catalysts) fod nalctltcccethat cycle allocatalytic an affords b g E -1 2B. 2’’ · (C B. b 0 r loaayi n (D–F and allocatalysis ) 2’ 2 0 1 1 ν ¯ h ahdsur encloses square dashed the A; n k ] + 00 T 00 ongtv sreferred is nonnegative ersnsaconserved a represents B, o network for rp ersnainof representation Graph ) ecinntok are networks Reaction eut nacag of change a in results ν 2 + g 2,2) nwhich in 26), (23, 00 eoe supp( g denoted , ν fod nautocat- an affords F C nueconser- induce n 1’’ 1’ A h dashed the D; ( F E. + AB EA n B E graph A ) EA auto- ) + [3] [2] 2’ n 2’’ ν ), B . nkntc,nro hc pce r aayi rnt matrix A not. of or catalytic restriction are a species assumptions is nor which coefficients on priori vector nor reaction a kinetics, of on signs makes and autocatal- values identification or the on such allocatalysis neither that of definitions Note motif, the ysis. a with called compatible subnetwork, is corresponding the whether the ask of and network, stoichiometry reaction a submatrix of possible matrix any stoichiometric consider will we section, this In Matrices Stoichiometric in Autocatalysis and Catalysis respected. is it convention that this call We hc l nraei ubrcniinlo pce nteset the in species on conditional number species of in set increase a involves all reactions of which combination a which by cess Autocatalysis. for Criterion submatrix the matrix of allo- larger property being the that a Note not (26). driven is species state external catalytic steady the of nonequilibrium and turnover a emergent the establish by as can it it qualified that have shown the authors Since some matrix, reaction. original net the cycle expresses reaction 4 condition and catalysts, reactions the in of catalysts the of columns of involvement the matrix. of allocatalysts: called columns in the participate which in of included cycle tion is the supp(c) vector to 2) reaction nonzero leads a and This S, numbers. conserved their remain undergo species changes species below. of which conditions external set other turnover while a a catalysts) which (the by number in reactions of ble Allocatalysis. for motif Criterion the outside reactions that motif. forbid the of not species does (30), produce it ( Nets that Petri signs in Note reaction concept 1). on siphon assumption the without to catalysis. similar but is of and autocatal- definition (24), for conditions ysis the former than to restrictive less is opposition Autonomy in Otherwise, already substance, an motif. on present the unconditionally on allowed of be conditional species would is ensures acceleration chemical motif rate it other the of of Indeed, presence species catalysis. any the for of production condition the a that as autonomy pose and respective their submatrices with 1 respectively, Fig autocatalysis, and yellow. allocatalysis in have species boxed submatrices external point stoichiometric 1, been while the Fig. blue, In from in motif. colored waste the been of and have species (29), internal set of view food com- of the feedstock called represent also and (26), pounds, chemostatted cases, some in of of rows (rows motif (remaining the species of species internal into arated under motif the of reactions and consideration. species the to correspond tively aho t ecin a tlatoeratn n tlatone least at and reactant one that of least such called at motif is A has it. product reactions of its outside of occur which each those and motif the odto a endsusdaoe odto expresses 2 Condition above. discussed been has 1 Condition hr xssasto species of set a exists There of species the that means rows the of restriction The etito fclmssprtsratoswihaepr of part are which reactions separates columns of Restriction c ν ¯ F nov aayt.Cniin3epesstecnevto of conservation the expresses 3 Condition catalysts. involve otisapstv n eaiecefiin.Blw we Below, coefficient. negative a and positive a contains . ν ¯ ν ¯ r ozr u ocniin1 ota l reactions all that so 1, condition to due nonzero are ν ¯ 0 c and ( PNAS sacceo h eue arxbtnto the of not but matrix reduced the of cycle a is ν ν ¯ ν hsmasta vr column every that means This autonomous. · sipsdb odto 4. condition by imposed as nalctltccceand cycle allocatalytic an c, ν ¯ c c ocranrw n oun,wihrespec- which columns, and rows certain to 00 = ie,ta r osmdadpoue)are produced) and consumed are that (i.e., | n yegahrpeettosFg 1 Fig. representations hypergraph and , ;ad4) and 0); coe 3 2020 13, October ydfiiin loaayi sa ensem- an is allocatalysis definition, By nonambiguity ydfiiin uoaayi stepro- the is autocatalysis definition, By ν .Teeetra pce ol be, could species external These ). A submatrix a S, and c uhta 1) that such ν · D c | 0 6= n suehenceforth assume and ersnseape of examples represents o.117 vol. h ebr of members The . ν section Appendix, SI ¯ ν ν ¯ ¯ ln u involves but alone 3) ; ν ν ¯ ¯ of nallocatalytic an , ν | ¯ sautonomous; is ν n external and ) o 41 no. c hr all where c, etitdto restricted ν ¯ sareac- a is ν of r sep- are | ν 25231 the , C ν ¯ S

APPLIED PHYSICAL SCIENCES itself (the autocatalysts), while other species undergo a turnover. Type I Type II Type III Type IV Type V This leads to the following conditions. A There exists a set of species S, a submatrix ν¯ of ν restricted to S, and a reaction vector g such that 1) ν¯ is autonomous and 2) all coordinates of ∆n = ν¯ · g are strictly positive, or equivalently, ν¯ has no mass-like conservation laws. The members of S consumed (and produced) by g are called autocatalysts, g is an autocatalytic mode, and ν¯ is an autocatalytic matrix. C3 Condition 1 ensures the conditionality of the reactions on B C2 C S C2 C3’ autocatalysts, as it forbids cases where species of are pro- C3 duced from external reactants only, thus playing the role of C4 C4’ conditions 1 and 2 in the definition of allocatalysis. Condition 2 C5 C4’ C3’ expresses the increase in autocatalyst number. The equivalence Type I Type II C5’ between the two formulations of condition 2 is an immedi- OH ate consequence of Gordan’s theorem (31). Importantly, the O O HO O second formulation of condition 2 does not involve an autocat- HO H H H H O O O alytic mode g, so that conditions 1 and 2 can be expressed as HO H H OH HO OH O properties of a matrix itself, in contrast with allocatalysis. This O OH allows us to look for minimal autocatalytic motifs, which we do HO HO O OH O

next. Note that external species must feed the autocatalytic sys- OH O OH HO O OH OH tem in order to guarantee the net mass increase imposed by O OH HO OH OH OH condition 2. HO O OH O O H H Autocatalytic Cores. An autocatalytic core is an autocatalytic H H HO HO motif which is minimal because it does not contain any smaller OH OH autocatalytic motif. Consequently, an autocatalytic system is Fig. 2. (A) Five minimal motifs. Orange squares indicate where further either a core or it contains one or several cores. The stoichio- nodes and reactions may be added, provided this preserves the motif type (I, metric conditions show that characterizing cores is equivalent to II, III, IV, V) and minimality. (B and C) Examples of chemical networks, along finding all autonomous matrices whose images contains vectors with their autocatalytic cores. Blue, external species; yellow, autocatalysts. with only strictly positive components. This well-posed formula- (B) Type I: Breslow’s 1959 mechanism for the formose reaction (32). (C) Type tion allowed us to show that the stoichiometric matrix ν¯ of an II: Another autocatalytic cycle in the formose reaction. Species denoted as autocatalytic core must verify a number of nonobvious proper- Cx inside the nodes refer to molecules containing x carbon atoms, which are ties reported below and demonstrated in SI Appendix, sections 2 shown below in standard chemical representation. and 3. First, ν¯ must be square (the number of species equals the number of reactions) and invertible. The inverse has a chemi- a same chemical species. It is obligatory for types II and III to cal interpretation. By definition of the inverse, the kth column contain one fork with two distinct products, for type IV to con- of ν¯−1 is a reaction vector such that species k increases by tain two such forks, and for type V to contain three. In Fig. 2A, one unit, making it an elementary mode of production. Like- the orange squares on the links between the nodes indicate that wise, the reaction vector obtained by summing the columns of these links could contain further nodes and reactions in series, ν¯−1 leads to a net increase by one unit of every autocata- provided certain rules on cycles given in the next paragraph. lyst, which thus represents an elementary mode of autocatalysis. The five types differ in their number of graph cycles and the ∗ This shows how stoichiometry informs on fundamental modes of way these cycles overlap. Type I consists of a single graph cycle autocatalysis (27). that is weight asymmetric, defined as the product of the stoichio- Second, every forward reaction of a core involves only one core metric coefficients of its reaction products being different than species as a reactant. While this excludes reactions between two that of its reactants. Types II and III comprise two distinct but different core species, a single core species may react with itself. overlapping graph cycles, Type IV comprises three, and Type V As ν¯ is square, this also implies that every species of a core is more than three, where any such graph cycle involving a strict consumed (none is only produced), and thus is an autocatalyst. subset of the core species must be an allocatalytic cycle, that is, Furthermore, every species is the reactant of a single reaction. weight symmetric (it would otherwise be of type I, contradicting Overall, every species is uniquely associated with a reaction as minimality). being its reactant, so that ν¯ admits a representation with a nega- tive diagonal and zero or positive coefficients elsewhere, at least Unification of Autocatalytic Schemes. The stoichiometric charac- one coefficient of each column being strictly positive to ensure terization of autocatalysis provides a unified approach to auto- autonomy. catalytic networks reported in the literature. The examples below These properties are constraining enough to allow an exhaus- are further detailed in SI Appendix, section 4. The formose reac- tive enumeration of reaction graphs that are cores. Autocatalytic tion is a classic example of autocatalysis known to contain many cores are found to belong to five categories, denoted as type I to autocatalytic cycles (33). Fig. 2 B and C shows types I and III type V. Fig. 2A represents typical members of each category as cores both found in the formose reaction. Similarly, autocat- reaction hypergraphs (see SI Appendix, Fig. S1 for general cases). alytic cores of types I and III can be found in the Calvin cycle As can be seen in these graphs, all minimal motifs contain a fork, and reverse Krebs cycle (SI Appendix, Fig. S4). Some reaction which is a reaction with a single reactant and two products. The steps in Fig. 2B may be catalyzed externally (e.g., by enzymes, presence of at least one fork is consistent with the intuition that base, ions), but external catalysis, in general, does not alter the autocatalysis requires reaction steps that amplify the amount of core. By the same token, proposed examples of autoinduction autocatalysts. In type I cores, the fork ends with two copies of the same compound, whereas, in types II to V, forks end with differ- ent compounds. Note that type I cores can be seen as a particular *Graph cycles are closed paths in the reaction hypergraph and should not be confounded case of type II if one formally allows different nodes to represent with reaction cycles which are right null vectors of the stoichiometric matrix.

25232 | www.pnas.org/cgi/doi/10.1073/pnas.2013527117 Blokhuis et al. Downloaded by guest on September 24, 2021 Downloaded by guest on September 24, 2021 ouin eie in derived solution, in steps. detailed intermediate trials, representation. simulated by RAF 1,000 connected first-order after cycles a allocatalytic with II, GARD in type node. motif dashed common the a cycles, at starting size, cycle. similar of networks catalytic lkuse al. et Blokhuis 3. Fig. only networks, step order reaction first-order forward Among each is, autocatalyst. that one 38); auto- dilute involves 22, first-order highly (17, require the rates in catalysis that, appreciable noted regime, have autocatalyst authors several reactants, of function a steps. as reaction motifs, of autocatalytic probabilities transition within species (17, of problem fixation) probability fixation the examined so-called kinetic we this on 22), depends address degrades To or considerations. amplifies Whether autocatalyst amplify. initial motifs within an autocatalysts that guarantee not Networks. Autocatalytic IV of types Viability of examples found yet not have We V. III. and (SI and II, core I, III types type a to corresponds S5). Fig. described and thus be Appendix, species, can framework process caged our this the that within find with We exchange release. that its amplifying species react can two counterpart form free whose to cage, reactive molecular a involves a mechanism in The compound (37). cavitands to due cation” suggested models. recently these the between establishes 36) (5, formally irreducible link This form (35). they sets and where Autocatalytic RAF (Reflexively (29), RAF framework the in Eq. Food-generated) treated in be as can reactions built they single cycles as 1, written II compactly (cross- are type cycles and cycles catalytic fork allocatalytic as Appendix one such nonoverlapping (SI with incorporation, sequential cycles assemblies described from I is amphiphile type up autocatalysis by of S6) underlying Fig. all growth 5), (SI self-enhancing (4, cores for III and model I type contain 34 S3). and Fig. 21 Appendix, refs. in introduced B A N P 1 osdrn ooeeu ytmwt tayspl of supply steady a with system homogeneous a Considering comprise schemes autocatalytic described previously Overall, amplifi- “chemical is autocatalysis of form reported Another Domain) Replication Autocatalysis (Graded GARD the In ex N 2 (A) yeI ihoefork. one with II type : P ex N sfnto of function as 2 eto 6. section Appendix, SI N N 3 ζ 3 nFg ) oegnrly hnsuch when generally, More 3). Fig. in N (legend: 3 yeI,tonnvrapn allocatalytic nonoverlapping two II, type : ie r exact are lines 8 ; section Appendix, SI P P ex tihoerccniin do conditions Stoichiometric N ex 4 (ζ fetnto (or extinction of for ) P ex < v auto- five (B) for 1) N P 5 ex N 5 yeV. type : 1 − N 1 yeI type : P ex N P of ex 4 , : erdto rbblt) xto spsil o obigprob- probability degradation. doubling a success ability of for possible a is result step fixation has probability), reaction a degradation cycle every as the if disappear that, of or found species (38) irreversibly King proceed (e.g., next either networks may the species I to given type a step, on transition a focused far 2 so Fig. have models fixation tp ntentok elcdby replaced yielding network, pro- the stochastic in these steps X In species (40). autocatalytic an processes using cesses, branching I type of beyond theory networks the and reactions reversible include to in networks contexts. such various for statistics time-dependent (n detailed species reported 39). one (19, derive containing to threshold loop processes decay birth–death autocatalytic used the (17) called al. et is Bagley possible is fixation which l akadfrhtaetre rmX from trajectories back-and-forth all where X have to We revert to probability X sition From cycle. the along reactions net- backward I type for p observation (17) al.’s to works et Bagley generalizes This its that probability The with pnst h ipettp fbacigpoes birth–death a process: branching of Eq. type process. simplest the to sponds yield Eq. simplifies and which converted prematurely, successfully X be species either with step as first such steps, reaction Cycles. I Type Reversible being structures. networks other the for pare solutions in networks, known I generalizing detailed by type method for this results exemplify net- reaction we from Below, constructed works. are processes branching where 5, derive to is bilities here difficulty main The 2 ee eetn h ramn ftefiainpolms as so problem fixation the of treatment the extend we Here, sfudb osdrn l osbesqecso owr and forward of sequences possible all considering by found is p ,wihhv igegahccecontaining cycle graph single a have which B), 0 p Γ Π k X + 2 k k n k s ecnat neednl oextinct, go independently descendants Γ P P = p rcdr o hsi ie in given is this for procedure A . oishpe ihaprobability a with happen copies −→ ∅, eusvl (Γ recursively k > 2 ex ex p +1 Q ete pl tt com- to it apply then We 5. section Appendix, SI 0 1 = 5 1 = fatctltcmtf hc ifri hi core their in differ which motifs autocatalytic of k n hnbcmsaqartceuto htyields that equation quadratic a becomes then = n eesberatos o eesbereactions, reversible For reactions. reversible and =1 p h vrl ucm ecie yEq. by described outcome overall The . P PNAS X 0 s ∞ ex =0 Π P + X k + ex htX that p (Π s | ≥ 1 −→ = ∅ p P k − coe 3 2020 13, October 1 p hsmnmmvleof value minimum This 1/2. osdratp yl ossigof consisting cycle I type a Consider N ←− +1  ex 1 1 2 p 1 1 = 0 X 1, = p mre oe.Utmtl,X Ultimately, node). (marked Γ + s 1 2 s n e ssata the at start us let and 3B, Fig. in k osetnti hnteprobability the then is extinct goes X , k Y p − Π onstesaitclwih of weight statistical the counts ) =1 n 1 2 k k + 1, P −→ . . . −1 s p Π ) ex 4 2 s fe euneo reaction of sequence a after is, 2 k s k + and , to = + 2X ois ecinsequences Reaction copies. Γ p p k p X k . . . 1 | , 2 2 k 1 oisl,i em of terms in itself, to s − , Π < ≥ Π −→ o.117 vol. rmtasto proba- transition from = p Π k + k + k 2 2 1 1 p k 2X X section Appendix, SI k k − let , . , k ∞ =0 1 = (1 ocnett X to convert to +1 k 1 uhthat such , k 1 X − Γ p rb degraded be or .Shse (22) Schuster ). s Π among | k k , Π P Π k − o 41 no. k + n ex k k + etetran- the be . . . . pce.In species. . P , en the being p ex 2 6 n | corre- o an for above 1 steps 25233 k+1 will Π [4] [7] [9] [8] [6] [5] k − n .

APPLIED PHYSICAL SCIENCES and Π+. In the irreversible reaction limit Π− → 0, Γ → 1, King’s k k k A B expression for p2 is recovered (38).

Viability of Autocatalytic Cores. To investigate how autocatalytic motif structure affects survival, we calculated Pex for five differ- ent cores (N1 to N5; Fig. 3): They are of equal size (six reaction steps, six species), and all reactions proceed irreversibly with the C same success probability ζ, which plays a similar role as the tran- + sition probability Πk in the example above and is sometimes called specificity (19, 38, 39).† Fig. 3 highlights how Pex depends on ζ for each core structure. The highest ζ for extinction (Pex = 1) is observed for the type I cycle N1, and progressively lower values are found for N2 to N4, which are all of type II. Type V network N5 tolerates the lowest specificity ζ before extinction, sustaining almost 3 times higher failure rates 1 − ζ than N1. These differences can be qualitatively understood by counting the minimum number of steps needed to produce more autocatalysts. In respective order, networks N1 to D N5 in Fig. 3B do so in six, four, three, three, and two steps. In particular, given their symmetries, the Pex of N3 and N5 have the same dependence on ζ as a three- and two-membered type I cycles, respectively. It has been suggested that large networks are disfavored in general (38). The examples of Fig. 3 indicate that this can be counterbalanced by the presence of more allocatalytic cycles in the network. This is, in particular, the case for autocatalytic sets, as every net reaction must participate in an allocatalytic cycle. Extensions: Multicompartment Autocatalysis We finally show how stoichiometric criteria allow the identifica- tion of autocatalysis that emerges from compartments coupled via selective exchange, as found in systems comprising vesicles, pores, emulsions, and complex coacervates (41). The reaction Fig. 4. Multicompartment autocatalysis. (A) Reaction network with two network in Fig. 4A is incapable of autocatalysis, as it does not reactions and five species in a single compartment. The network does not contain any autocatalytic core. However, when we place this contain any autocatalytic core, and thus cannot perform autocatalysis. (B) network in two compartments α and β coupled by a mem- Same reaction network as in A, but duplicated in two compartments α brane permeable to A and A2B only, a type II core emerges and β, coupled by the selective exchange of species A and A2B. A type II (Fig. 4B). core, highlighted in orange, emerges. (C) Open reactor with two compart- The core identification indicates a possible setting for auto- ments, a semipermeable membrane, degradation, and exchange reactions. catalysis: U and V are chemostatted in α, and AB is chemostatted Chemostatted species have a lighter background: U and V in α and AB in β.(D) Extinction probability Pex for multicompartment autocatalysis in in β (Fig. 4C). The reaction involving U and V may, in principle, ex C, starting from a single Aα, as a function of exchange rate k and also take place in β, but it is not required for autocatalysis, as it is degradation rate kd, relative to other relevant reaction rates fixed at k. not part of the type II core. In Fig. 4 C and D, it is assumed that d Slanting asymptote: exchange-limited√ survival kex = 2k . Vertical asymptote: U and V are absent. reaction-limited survival kd/k = 13 − 3/2. Dashed white line: transition We now apply our viability analysis to this autocatalytic net- between extinction and potential fixation (Pex < 1). Expressions for Pex and work in the presence of degradation reactions (r6 and r7 in Fig. asymptotes are derived in SI Appendix, section 7. 4C). Let us introduce a characteristic rate k for reactions r2, r4 d and r5, a degradation rate k for reactions r6 and r7, and an ex single reaction A B + C can give rise to type III motifs, given exchange rate k for reactions r1 and r3. Fig. 4D shows the d three compartments coupled by selective exchange as detailed in extinction probability as a function of the degradation rate k ex SI Appendix, section 9 and Fig. S9. and exchange rate k , both normalized by k. To overcome the d degradation threshold (Pex < 1), the ratio k /k must lie above Discussion a certain threshold (vertical black dotted line in Fig. 4D), and We presented a theoretical framework for autocatalysis based k ex the rate of exchange should outpace the rate of degradation on stoichiometry, which allows a precise identification of the (black slanting dotted line in Fig. 4D). Stochastic simulations (SI different forms of autocatalysis. Starting with a large stoichio- Appendix, section 8 and Fig. S9) confirm autocatalytic growth in metric matrix, we provide criteria for reaction network motifs these conditions. that allow allocatalysis and autocatalysis. A detailed analysis of In this example of coupled compartments, compounds are no the graph structure contained in these reduced stoichiometric α longer restricted to one role: AB is an autocatalyst in and a matrices reveals that they contain only five possible recurrent β feedstock in . Chemical reactions are no longer restricted to motifs, which are minimal in the sense that they do not contain α one direction: The reaction used for reproduction in is reused smaller motifs. Fundamental modes of production of minimal β in to provide the missing step to close the cycle. Such multicom- autocatalytic cores are encoded in the column vectors of the partment autocatalysis is, however, more general. For instance, a inverse of the autocatalytic core submatrix. Autocatalytic cores are found to have a single reactant species for each reaction. This means that autocatalytic networks require the availability † Note that, in general, the calculation of Pex may involve reactions that are not in the of certain chemical species in their cores to operate properly, core. Here, we consider cases where this is not necessary. but also implies that the proper functioning of an autocatalytic

25234 | www.pnas.org/cgi/doi/10.1073/pnas.2013527117 Blokhuis et al. Downloaded by guest on September 24, 2021 Downloaded by guest on September 24, 2021 8 .Segr D. 18. 7 .J aly .J amr .Fnaa Eouino eaoim in metabolism” a of “Evolution Fontana, W. Farmer, J. D. Bagley, J. R. 17. molecules. self-replicating of behavior Parasitic Otto, S. Altay, Y. Altay, M. 11. Liu B. 10. 6 .Ncls .Prigogine, I. Nicolis, G. 16. Schl F. 15. Autocatalytic proliferation. acid of concept novel A Kudo, K. Arimitsu, K. Ichimura, K. enan- an 14. to route ripening Viedma A Noorduin, W. Kellogg, M. purity Leeman, M. chiral Baglai, Complete I. crystallization: 13. during breaking symmetry Chiral Viedma, C. 12. lkuse al. et Blokhuis simple semipermeability as certain schemes provided reaction, reaction way, bimolecular this from a In to emerge as compartment. access can single allows In environ- a which autocatalysis in reactions. the networks, unattainable new reaction on conditions of requiring rely coupling without cycles mental species autocatalysis, of multicompartment number enlarge effectively the compartments Coupled problem: this around leading different reactions, extinction. side many to but favors with reactions, together components different networks many are many large putting in motifs in engaging Autocatalytic found components be chemical 39): to to (19, likely related specificity” a intimately more of is from This “paradox and degradation. results the as autocatalysts Viability such produce reactions framework. that side unified reactions between a highlighting competition of cores, their interest of the identification defined upon auto- to appropriately models applicable an generally catalytic is in approach This probability be process. can survival branching Viability a viability. of as notion computed the using evaluated we which evolution chemical of sys- 47). modes a prebiotic (46, possible to in the way of the evolution exploration paves of tematic identification block their thus building auto- (35); the chemistries Additionally, are into (45). cores networks structures chemical catalytic large manageable of more has complexity may the as smaller, we break way, 43), this to 42, In able (44). (24, be networks auto- regulatory networks gene recurring for chemical these done large been identify in to motifs algorithms metabolic catalytic for also basis central cores Autocatalytic a sets. reaction, provide RAF and formose model, GARD the the cycles, as such or autocatalysis enzymes key are a products, products certain these metabolites. of when supply advantage stable definitive the guarantee will network .H .Miras N. H. 9. Semenov N. S. 8. earth. prebiotic the on cycles metabolic of lipid implausibility in The life Orgel, of E. Origin L. protobiology: 6. Systems Markovitch, O. Zidovetzki, R. Lancet, D. 5. proteins. Segr of D. sets Autocatalytic 4. Kauffman, A. S. 3. Oparin, economy. I. the A. to life 2. of origin the From sets: Autocatalytic Hordijk, W. 1. .A iste .Oel .Fece,Pyia uoaayi rvnb odfrigthiol– bond-forming a by driven autocatalysis Physical Fletcher, S. Odell, B. Bissette, A. 7. utcmatetatctlssitoue eeofr way a offers here introduced autocatalysis Multicompartment robustness, of degrees different provide motifs Autocatalytic of examples known in motifs minimal these identified We GR) iei nlsso efrpiaini uulyatctltcsets. autocatalytic mutually in Biosph. self-replication Evol. of analysis 141– Kinetic pp. (GARD): 1991), (Westview, Eds. Rasmussen, S. Farmer, D. J. 158. Taylor, C. Langton, G. C. Soc. Chem. Am. clusters. oxide metal inorganic reactions. organic relevant networks. catalytic (2001). 119–145 31, (2013). 877–881 tutrst re hog Fluctuations through Order to Structures Ed. Int. Chem. reaction. ene (2008). e18 4–6 (1972). 147–161 253, amplifier. acid an as derivative (1995). 551–552 acetoacetate an of fragmentation Brivaracetam. and (2019). Levetiracetam 35–38 for block building tiopure recycling. and autocatalysis nonlinear by induced g,Ceia ecinmdl o o-qiiru hs transitions. phase non-equilibrium for models reaction Chemical ogl, ope oeue htfl iepoen a mrespontaneously. emerge can proteins like fold that molecules Complex al., et ¨ ,D e-l,D .Dae,D act h ii world. lipid The Lancet, D. Deamer, W. D. Ben-eli, D. e, ´ ,D act .Kdm .Ple,Gae uoaayi elcto domain replication autocatalysis Graded Pilpel, Y. Kedem, O. Lancet, D. e, ´ pnaeu omto fatctltcst ihself-replicating with sets autocatalytic of formation Spontaneous al., et rgno Life of Origin 0–1 (1998). 501–514 28, a.Commun. Nat. 06–06 (2018). 10564–10568 57, uoaayi,bsal,oclaoyntok fbiologically of networks oscillatory bistable, Autocatalytic, al., et 6518 (2019). 1685–1689 141, .R o.Interface Soc. R. J. efOgnzto nNnqiiru ytm:Fo Dissipative From Systems: Nonequilibrium in Self-Organization Dvr 1952). (Dover, Nature 67(2014). 4607 5, rc al cd c.U.S.A. Sci. Acad. Natl. Proc. 5–6 (2016). 656–660 537, 0819(2018). 20180159 15, Wly e ok Y 1977). NY, York, New (Wiley, .Ter Biol. Theor. J. hs e.Lett. Rev. Phys. 09–00 (2020). 10699–10705 117, –4(1986). 1–24 119, r.Boo.Chem. Biomol. Org. rg ieEo.Biosph. Evol. Life Orig. 654(2005). 065504 94, hm Lett. Chem. rica ieII Life Artificial BioScience lsBiol. Plos rg Life Orig. Angew. .Phys. Z. 63, 24, 17, 6, J. , 5 .Vss .Frad,M ats .Kufa,E Szathm E. Kauffman, S. Santos, M. Fernando, C. Vasas, V. 35. autocatalysis?. Feinberg, M. about special 23. is chemistry What the in Schuster, cycles autocatalytic P. of role 22. the of examination An Blackmond, G. D. reaction chemical in 21. sets self-sustaining autocatalytic, Detecting Steel, M. Hordijk, W. 20. Szathm E. 19. 4 .J .Teunissen P. J. A. 34. Ricardo A. reaction. formose 33. the of mechanism the On Breslow, R. 32. Lewis, S. A. Borwein, J. networks. 31. reaction in Catalysis Gopalkrishnan, M. clarified. confusion 30. Autocatalytic Hordijk, W. 29. Wilkinson, A. McNaught, D. A. 28. in functioning of modes “Elementary Fell, A. D. Woods, H. J. Hilgetag, C. Schuster, S. networks. 27. chemical open of thermodynamics Irreversible Esposito, M. Polettini, M. 26. networks. reaction in Autocatalysis Gopalkrishnan, M. Deshpande, A. 25. Barenholz U. 24. h oaino oahnRtbr o eerho rica life. artificial on acknowledges research A.B. for Rothberg manuscript. Jonathan the of of donation pour reading the Avanc Franco-Indien careful Centre Recherche for and Griffiths the la Krishna, Andrew from S. de funding Allamigeon, Promotion and X. la discussions acknowledges for P.N. Flamm Vie. C. Strat la et de d’Apparition Interdisciplinaires (Initiatives from A.B., IRIS Recherches Lettres support and Sciences Paris acknowledge de ANR-10-IDEX-0001-02) program D.L. the (Grant acknowledge and P.N. Recherche and A.B. D.L., la Zeravcic. de Z. Nationale and Agence Weem de van D. ACKNOWLEDGMENTS. Availability. Data compartments. three and reaction bimolecular one inter- of chemical tion derivation cycles, their of process branching metabolic determination 3) 5) and autoinduction, GARD; and theory; formose, sets, RAF to graph amplification, chemical application from 4) cores and pretation autocatalytic deriva- 2) of definitions; in and detailed tion terminology 1) are sections: results following the of comprising derivation and methods Theoretical Methods and Materials how (50). life understanding toward for complexify may vistas chemistry new than of provides extensions widespread further which for more autocatalysis, search a be invites must This thought. chemistry previously indicating in ways, unexpected autocatalysis in emerge that can and forms of such diversity explore further to places promising interfaces. principles. are (49) multiphase (41) and solid–gas organization coacervates and cellular (48) in interfaces liquid–solid Liquid–liquid reactions, in com- different observed with this to to as phases how- conducive coupling reactions 4), general, from conditions more emerge and physical–chemical (Fig. also is compounds may principle Autocatalysis here The the ever: cycles. shown between reuse autocatalytic compounds to plete example of us the exchange allowed the In for compartments. met are conditions vrl,orfaeoksosta uoaayi oe na in comes autocatalysis that shows framework our Overall, il Direct Biol. (2019). reactions. primordial proposed of systems. Sci. Biol. ainlcnrloe ecinkinetics. reaction over control rational reaction. formose the of analysis (2006). the to Application (1959). 2006). NY, York, New (2011). Book”) Gold pp. 1996), Singapore, Scientific, (World Eds. Paton, 151–165. R. Holcombe, M. Cuthbertson, R. in networks” laws. biochemical conservation broken and cycles (2014). Emergent . I (2014). 2570–2595 points. 76, branch flux at saturation substrate low (2017). force and kinetics 2019). NY, P ex .Ter Biol. Theor. J. 7117 (2006). 1761–1776 361, r,Teoii frpiaosadreproducors. and replicators of origin The ary, ´ )sohsi iuain;ad9 uoaayi from autocatalysis 9) and simulations; stochastic 8) 4D; Fig. for -yrxmtybrnt saraett eetcarbohydrates: detect to reagent a as 2-Hydroxymethylboronate al., et (2012). 1 7, BakelSinic xod ntdKndm d ,1997). 2, ed. Kingdom, United Oxford, Scientific, (Blackwell onain fCeia ecinNtokTheory Network Reaction Chemical of Foundations einpicpe fatctltccce osri enzyme constrain cycles autocatalytic of principles Design al., et urmlclritrcin ewe aayi pce allow species catalytic between interactions Supramolecular al., et hr r odt neligti work. this underlying data no are There PNAS 5–6 (2004). 451–461 227, P ovxAayi n olna Optimization Nonlinear and Analysis Convex ex ..akolde tmltn icsin with discussions stimulating acknowledges A.B. optto nClua n oeua ilgclSystems, Biological Molecular and Cellular in Computation )dtriainof determination 6) ; | coe 3 2020 13, October UA opnimo hmclTriooy(“The Terminology Chemical of Compendium IUPAC ne.Ce.It Ed. Int. Chem. Angew. hm Sci. Chem. e .. .. n ..acknowledge P.N. and D.L., A.B., ee. ´ .Ter Biol. Theor. J. gqe)Oiie tConditions et Origines egiques) ´ 1592 (2019). 9115–9124 10, | P ul ah Biol. Math. Bull. oas.Chem. Monatsh. o.117 vol. ex .Og Chem. Org. J. .Ce.Phys. Chem. J. hls rn.R o.Ln.B. Lond. Soc. R. Trans. Philos. o i.3 )determina- 7) 3; Fig. for r,Eouinbfr genes. before Evolution ary, erhdo Lett. Tetrahedron ´ 8–9 (2009). 386–390 48, 22 (2017). 22–28 435, | Srne,NwYork, New (Springer, o 41 no. (Springer-Verlag, eLife ul ah Biol. Math. Bull. 2962–2982 73, 9503–9505 71, IAppendix SI 763–775 150, 024117 141, e20667 6, 22–26 21, | 25235

APPLIED PHYSICAL SCIENCES 36. S. A. Kauffman, A World beyond Physics: The Emergence & Evolution of Life (Oxford 44. R. Milo et al., Network motifs: Simple building blocks of complex networks. Science University Press, 2019). 298, 824–827 (2002). 37. J. Chen et al., Chemical amplification with encapsulated reagents. Proc. Natl. Acad. 45. P. Dittrich, P. Speroni di Fenizio, Chemical organisation theory. Bull. Math. Biol. 69, Sci. U.S.A. 99, 2593–2596 (2002). 1199–1231 (2007). 38. G. King, Recycling, reproduction, and life’s origins. BioSystems 15, 89–97 (1982). 46. Z. Peng, A. Plum, P. Gagrani, D. A. Baum, An ecological framework for the analysis 39. E. Szathmary,´ The evolution of replicators. Philos. Trans. R. Soc. B Biol. Sci. 355, 1669– of prebiotic chemical reaction networks and their dynamical behavior. J. Theor. Biol. 1676 (2000). 507, 11045 (2020). 40. S. Karlin, H. M. Taylor, A First Course in Stochastic Processes (Academic, New York, NY, 47. C. Fernando, J. Rowe, The origin of autonomous agents by . 1975). BioSystems 91, 335–373 (2008). 41. T. Lu, E. Spruijt, Multiphase complex coacervate droplets. J. Am. Chem. Soc. 142, 2905– 48. J. Halpern, Kinetics of the dissolution of copper in aqueous ammonia. J. Electrochem. 2914 (2020). Soc. 100, 421 (1953). 42. A.´ Kun, B. Papp, E. Szathmary,´ Computational identification of obligatorily autocat- 49. P. Grosfils, P. Gaspard, T. V. de Bocarme,´ The role of fluctuations in bistability and alytic replicators embedded in metabolic networks. Genome Biol. 9, R51 (2008). oscillations during the H2 + O2 reaction on nanosized rhodium crystals. J. Chem. Phys. 43. J. L. Andersen, C. Flamm, D. Merkle, P. F. Stadler, Chemical transformation motifs— 143, 064705 (2015). Modelling pathways as integer hyperflows. IEEE/ACM Trans. Comput. Biol. Bioinform. 50. R. Krishnamurthy, Life’s biological chemistry: A destiny or destination starting from 16, 510–523 (2017). prebiotic chemistry?. Chem. Eur. J. 24, 16708–16715 (2018).

25236 | www.pnas.org/cgi/doi/10.1073/pnas.2013527117 Blokhuis et al. Downloaded by guest on September 24, 2021