Article CORE . ; ), 2006 . brought to you by by you to brought ), it can be ons, please Goethals et al RERO DOC Digital Library Digital DOC RERO by provided Young et al , though “small” plasmids ). ), bacteriocin production ), and antibiotic resistance Tazzyman and Bonhoeffer Anderson 1968 ; es and plasmids, and the 2009 Dowling and Broughton 1986 1975 has been suggested that there . . ), gall formation ( ), from hereon referred to as “es- y located on plasmids. We model Thomas 2000 e transfer, antibiotic resistance. nvestigate whether selection favors a Eberhard 1990 Evans et al ). Additionally, there are examples of plasmids Nogueira et al Eberhard 1990 € urich, Switzerland ; Jacob and Hobbs 1974 ), plant root nodulation ( ), all seem to be functions coded for by genes commonly There are several examples of such differences in genetic One particular element of plasmid genetics that has been nce Access publication December 23, 2014 3079 Gonzalez and Kunka 1987 Koch 1981 Eberhard 1990 ral, chromosomal locations for essential genes are indeed , it has been suggested that essential genes, such as those often lack this feature).exploit Because the there act are of phages conjugation that ( can 2001 composition. Naturally, there are athat suite plasmids of will phenotypic benefit traits fromto possessing, cause such conjugation as ( the ability virulence factors ( costly for a plasmid’s bacterialvious host, benefit without (as gaining the any donorchromosomes ob- receives are nothing). probably Consequently, lessgenes likely coding than for plasmids conjugation. to However,notypic bear there traits are other that phe- biased also genes. seem For to reasons(though that be have see coded not for been( fully by explained plasmid- genes ( observed is the factcode that for plasmids structural seldom( carry proteins genes or that basic metabolic functions bearing genescell that they are inhabit essential (either for all the the growth time; of the 2014 found on plasmids ( sential genes,” However,tween because of bacterial ratespopulation of sizes chromosom gene of bacterial flow populations,over be- evolutionary it time, seems most likely or that at all some bacterial time located genes on have both been ( plasmids and on chromosomes e evolution of prokaryotes. It wo degradation rates are equal, or if plasmid genes degrade mids can be a location for functioning essential genes is if etabolic functions, are rarel s and those in the “core” genome and that these differences € urich, CH 8092, Z ty for Molecular Biology and Evolution. All rights reserved. For permissi ). ,1 Jain ; 2010 . ntial genes will be found only on chromosomes. ile genetic elements, horizontal gen Goldenfeld ; Hacker and Hacker and Sørensen et al. Tazzyman and ; ; ng bacteria within a single population to i Rankin et al ; ). At the very least, the Ochman et al. 2000 be horizontally transferred ; e average genetic composi- Young et al. 2006 Integrative Biology (IBZ), ETH Z ; ). There are many mechanisms ssory” genome, which contains ories and interests of plasmids and Sebastian Bonhoeffer* 2010 ) and specifically in the case of plas- E-mail: [email protected]. . 1 Werren 2011 n, and transformation ), allowing for transfer of genes across Eberhard 1990 Koonin and Wolf 2008 ; Manolo Gouy ). A distinction has been drawn between 2009 . 32(12):3079–3088 doi:10.1093/molbev/msu293 Adva Rankin et al ), but conjugative plasmids have been cited as ; plasmids, genetic architecture, mob Bordenstein and Reznikoff 2005 Bergstrom et al. 2000 ;
Thomas and Nielsen 2005 ;
It has been suggested that there are functional differences The Author 2014. Published by Oxford University Press on behalf of the Socie Theoretical Biology, Institute of Norman et al Hacker and Carniel 2001 Bonhoeffer 2013 the “core” genome, consisting oftally genes transferred, and that the are “acce nothorizontally horizon- transferableCarniel 2001 genetic elements ( Plasmids replicate independently fromtheir the host and chromosome can of Consequently, be transferred their horizontally evolutionary andthose vertically: interests of may differ thegive from rise chromosome. to These conflictchromosomal or divergent to genes cooperation interests ( between can plasmid and large taxonomic distances. between genes that are likely to mid-biased genes ( implicated in horizontal transfer (broadlyjugation, grouped into transductio con- Carniel 2001 a particularly common( and successful vector of transfer and those that are( part of the core genome, both generally different evolutionary traject and chromosomes mean thattion th of each is likely to differ. and Woese 2007 *Corresponding author: Associate editor: ß Mol. Biol. Evol. et al. 2003 e-mail: [email protected] Mobile genetic elements play an importantevolution role in ( prokaryote Introduction Mobile genetic elements such as plasmids are important for th Abstract Samuel J. Tazzyman 1 Why There Are No Essential Genes on Plasmids competition between genotypically varyi 2005 are differences between functions codedcan for be by seen mobile between gene plasmidsinvolved and in chromosomes. the formation In of particular structural proteins or in basic m chromosomal location for essential genes.favored. We This find is that in becausedation” gene the rate inheritance as of the chromosomesessential is rate genes more at nonfunctioning. stable which The than chance only that genetic way for processes, in plasmids. for which We plas example, define mutation, the deletion, “degra- or translocation, render chromosomal genes degrade faster than plasmid genes. If the t faster than chromosomal genes, functioningKey esse words: View metadata, citation and similar papers at core.ac.uk at papers similar and citation metadata, View Tazzyman and Bonhoeffer . doi:10.1093/molbev/msu293 MBE
Dziewitetal. 2014; or under specific, regularly encountered on plasmids. In either setting, the gene can undergo “degra- environments, e.g., in the dark, Nagarajan et al. 2014,orunder dation.” By degradation, we mean any genetic process that anaerobic conditions, Ebertetal. 2013). Thus, because essen- makes the gene no longer functional, including, for example, tial genes are in fact sometimes found on plasmids (and must mutation, deletion, and translocation events, as long as they have been found there at times throughout evolutionary his- sufficiently disrupt the gene, so that it no longer works. For tory), why are they not found there more often? simplicity, we group all such events under degradation and The first possibility is that this is a semantic problem: assume that they occur at a rate c per individual for chro- Plasmids are often loosely defined as being replicons lacking mosomal locations and p per plasmid for plasmid locations. in essential genes, and consequently, no essential genes can be Thus, every generation functioning essential genes on chro- found on plasmids. As we are interested in what forces make mosomes and on plasmids lose their functionality at these essential genes less likely to be horizontally transferred, and rates. because (as noted above) there are in fact instances of essen- tial genes on plasmids, we can safely disregard this semantic Six Types of Individuals solution. Each individual in our population has one of two possible Another possibility is that there is only finite space on chromosome states: Either “wildtype” (i.e., with a functioning plasmids, and this space is taken up by the kinds of genes essential gene) or “mutant” (i.e., with a degraded essential that are favored on plasmids, as noted above. For example, gene). We denote these states as w and m, respectively. there have been various hypotheses to explain why plasmids Wild-type individuals have a functioning essential gene on would be an ideal locations for antibiotic resistance genes their chromosome, but mutant individuals do not. (Eberhard 1990; Rankin et al. 2010; Tazzyman and Therefore, to avoid death due to lacking the essential gene, Bonhoeffer 2014). We feel, however, that this explanation is a mutant individual must have a functioning essential gene begging the question somewhat: Even if these other kinds of on a plasmid. genes are favored in a plasmid context, we need to show that Each individual also has one of three possible plasmid essential genes are not also favored. Ideally, we would like to states: Either no plasmid (denoted with the “empty set" establish some sort of advantage accruing to essential genes symbol ;), a wild-type plasmid (i.e., with a functioning essen- from a chromosomal location. tial gene, denoted p), or a mutant plasmid (i.e., with a de- A final possibility is given by the “complexity hypothesis.” graded essential gene, denoted q). This suggests that the connectivity of a protein in the cellular There are thus six types of individual in total: Wild-type network, rather than the function of the protein, has a large chromosome with no plasmid (which we denote w;), wild- influence over how easily it can be horizontally transferred type chromosome with wild-type plasmid (wp), wild-type (Jain et al. 2003; Cohen et al. 2011). Genes whose products chromosome with mutant plasmid (w ), mutant chromo- have a high degree of protein–protein interaction will likely q some with no plasmid (m;), mutant chromosome with disrupt the metabolism of a cell into which they are trans- wild-type plasmid (mp), and mutant chromosome with ferred and consequently are unlikely candidates for horizontal mutant plasmid (m ). The relative frequency of a type x is transfer (Baltrus 2013). Essential genes by their very nature q y then denoted fðxyÞ, and the relative frequency after t gener- have been suggested to have high connectivity (Jeong et al. ations will be denoted ftðxyÞ. To simplify notation, we will 2001; Zotenko et al. 2008) and so by the complexity hypoth- denote the equilibrium relative frequency of a type xy as xy esis would be unlikely to be transferred horizontally. This is a rather than the more cumbersome f ðxyÞ. compelling explanation. However, our model below considers a much simpler case, attempting to explain the situation Fitness Scheme without reference to the complexities of protein–protein All fitnesses are defined relative to the fitness of type w;, interactions. which has a wild-type chromosome and no plasmid, and Understanding what forces bias essential gene location which we define to have fitness 1. There are then two ele- away from plasmids and on to chromosomes will help us ments to the fitness scheme. First, any individual lacking at to better understand the evolutionary processes shaping least one functioning copy of the essential gene dies and thus plasmid genomes. To investigate this, we here construct a has fitness zero. Therefore, the fitnesses of type m; and mq competition model, incorporating essential genes on both are zero, and no individuals of these types in fact exist in our plasmids and chromosomes and allowing for some degrada- population. tion of these genes. We can therefore see under which con- The second element in our scheme is the presence of ditions a chromosomal location is favored for such crucial the plasmid, which confers a small fitness cost, 0 < c 1. genes. The relative fitness of plasmid bearing types wp, wq,andmp are then all 1 c. The Model Interactions between the Types The Essential Gene In addition to the relative fitnesses, the frequencies of each Our model features a single, idealized essential gene. type are also determined by interactions between the types. Individuals require at least one working copy of this gene to There are three types of interactions: Mutation, conjugation, survive. The gene is found both on the chromosome and and segregation. We describe the details below; a diagram
3080 Plasmids and Essential Genes . doi:10.1093/molbev/msu293 MBE showing all interactions between the four surviving types is that our individuals are in a well-mixed, constant volume and shown in figure 1. that the process of horizontal transfer can therefore be rea- sonably well approximated by mass action-type equations Degradation. As mentioned above, degradation occurs on both chromosomes and plasmids, at rates and , respec- (Ross 1915). c p Conjugation can occur between any pair of individuals tively. Thus, in every generation, a proportion c of individ- uals with wild-type chromosomes become individuals with consisting of one plasmid bearer and one nonplasmid bearer. However, because in our system m; and mq individuals mutant chromosomes. In the case of w; and wq individuals, the chromosomal degradation immediately results in immediately die, we can ignore conjugation events featuring death, because it leaves the individual without a functioning these types. Conjugation therefore occurs between w; indi- viduals and wp, wq,ormq individuals. It transforms housekeeping gene. In the case of a wp individual, however, the nonplasmid-bearing w; into a plasmid-bearing transcon- the chromosomal degradation results in an mp individual, whichis“rescued”fromdeathbythefunctioning jugant. Thus, conjugation between w; and wp or mp individ- essential gene on the plasmid. Similarly, in every generation, uals results in the w; individual becoming the transconjugant wp, whereas conjugation between w; and wq results in the w; a proportion p of individuals with wild-type plasmids become individuals with mutant plasmids. In the case of individual becoming the transconjugant wq. Note that the an m individual, the plasmid degradation immediately chromosomal state is unchanged by conjugation. p Because we use mass action equations, the proportion of resultsindeath,asitmeansbecominganmq individual with no functioning essential gene. However, in the case of w; individuals that become transconjugants every generation is proportional to the relative frequencies of w; individuals, a wp individual, the plasmid degradation results in a wq indi- vidual, which still has a functioning essential gene on its and of the plasmid bearers, and is controlled by a parameter chromosome. . Given frequencies fðw;Þ; fðwpÞ; fðwqÞ,and fðmpÞ at a given generation, we then have fðw;Þ fðwpÞþfðmpÞ Conjugation. Conjugation is the process through which transconjugant wp individuals and fðw;ÞfðwqÞ transconju- plasmids are transferred horizontally in the population, gant wq individuals created, in that generation. from a plasmid bearer to a nonplasmid bearer. We assume Segregation. Segregation is the process by which plasmid- bearing individuals lose their plasmids, usually during replica- tion. It is generally believed to be rare because of a variety of phenotypic adaptations that plasmids have evolved to pre- vent it from occurring (e.g., postsegregational killing [Cooper and Heinemann 2000]; or methods to ensure even splitting of plasmids during cell division [Ebersbach and Gerdes 2005]). We model it by assuming that some small proportion of plasmid bearers lose their plasmids per generation. Thus, wp and wq individuals become w; individuals, whereas mp indi- viduals die because they become m; individuals and thus lack a functioning essential gene.
The Model Weusetheaboveelementstoconstructthemodelasfollows. We census the population at every generation. We concern FIG.1. Diagram of the interactions between the four surviving types. ourselves with relative frequencies and assume that the total Each box represents one of the six types, with the two nonsurviving population size is a (very large) constant. Therefore at any types marked with dashed lines and the label “(Dead).” Within each box, generation, the frequencies of each type add up to one: a cartoon shows the chromosomal status (squiggly line) and the plasmid fðw;ÞþfðwpÞþfðwqÞþfðmpÞ¼1. Given frequencies status (circles), marked with a tick for functioning essential gene and a ftðw;Þ; ftðwpÞ; ftðwqÞ,andftðmpÞ after t generations, we cross for degraded essential gene. Each arrow shows a transformation as wish to know the frequencies after t +1generations. a result of some interaction type. The labels for the arrows show the proportion of the type which the arrow leaves that are transformed into To simplify the mathematics, we first take fitness into ac- the type the arrow leads to. Some transformations result in the forma- count and then the interactions. The fitness of type w; is 1, tion of the nonviable types m; or mq; these transformations are shown whereas the fitness of types wp, wq,andmp is 1 c. here to lead to boxes marked “Dead.” The lines marked gwq and gwp gw; ¼ ftðw;Þ þ gmp are conjugation reactions, where is the conjugation rate, and the values gi are as given by equation (1). The lines marked c j gwp ¼ð1 cÞftðwpÞ are degradation events taking place on chromosomes, and the lines ð1Þ marked p are degradation events taking place on plasmids. The lines gwq ¼ð1 cÞftðwqÞ marked are segregation events. For full details of each reaction, see the ¼ð Þ ð Þ main text. gmp 1 c ft mp
3081 Tazzyman and Bonhoeffer . doi:10.1093/molbev/msu293 MBE
Then, the frequencies of each genotype in the next genera- Equilibrium with Only w; and wq Individuals tion are This equilibrium point corresponds to the case where all the essential genes on plasmids have degraded, whereas all those !ftþ1ðw;Þ¼gw; ðÞ 1 c gw; gw þ gw þ gm p q p on chromosomes are still functional. The equilibrium fre- þ gw þ gm ; quencies are p p
!ftþ1ðwpÞ¼gwp 1 c p þ gw; gwp þ gmp ; ð1 cÞ w ¼ ; ð5Þ ; ð1 cÞ cð1 Þ !ftþ1ðwqÞ¼gw ðÞþ1 c gw; gw þ pgw ; c q q p !ftþ1ðmpÞ¼gm 1 p þ cgw : p p and wq ¼ 1 w;. This solution is stable if c p,thatis,if ð2Þ plasmid genes degrade more rapidly than chromosomal genes, and unstable in the contrary case where 4 , where ! is the average fitness for generation t,definedas c p where plasmid genes degrade more slowly than chromosomal genes (Appendix). ! ¼ ftðÞw; ðÞþ1 ðÞ1 c ft wp c þ ðÞ1 c ft wq ðÞ1 ð3Þ c Equilibrium with All Four Types þ ðÞ1 c ft mp 1 p ; The exact location of the equilibrium is a lengthy expres- sion for each genotype frequency. We have reproduced so that the new genotype frequencies add to one. We can it in the Appendix, but here we instead make the substitu- combine the sets of equations (1) and (2) with equation (3) to tion ¼ , and note that is likely to be much find equilibrium frequencies w ; w ; w ,andm . p c c ; p q p smaller than the other expressions, and so we can let Results it tend to zero and see that the global equilibrium is approx- imately There are four different equilibria possible. Inspection of figure 1 reveals that equilibria are possible in which there ð1 cÞ w;& ; are only w; types, in which there are only mp types, and in ð1 cÞ c which there are only w; and wq types. There is also an equi- librium in which all four types are present. We now go þ cð1 þ Þ w &ð1 Þ ; through the stability of these equilibria in turn p ð1 cÞ c ð6Þ Equilibrium with Only w Individuals þ cð1 þ Þ ; w &