<<

Copyright Ó 2007 by the Genetics Society of America DOI: 10.1534/genetics.107.075812

Evolution Can Favor Antagonistic

Michael M. Desai,*,1 Daniel Weissman† and Marcus W. Feldman† *Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544 and †Department of Biological Sciences, Stanford University, Stanford, California 94305 Manuscript received May 10, 2007 Accepted for publication August 8, 2007

ABSTRACT The accumulation of deleterious plays a major role in evolution, and key to this are the interactions between their fitness effects, known as epistasis. Whether mutations tend to interact syner- gistically (with multiple mutations being more deleterious than would be expected from their individual fitness effects) or antagonistically is important for a variety of evolutionary questions, particularly the evolution of sex. Unfortunately, the experimental evidence on the prevalence and strength of epistasis is mixed and inconclusive. Here we study theoretically whether synergistic or antagonistic epistasis is likely to be favored by evolution and by how much. We find that in the presence of recombination, evolution favors less synergistic or more antagonistic epistasis whenever mutations that change the epistasis in this direction are possible. This is because evolution favors increased buffering against the effects of deleterious mu- tations. This suggests that we should not expect synergistic epistasis to be widespread in nature and hence that the mutational deterministic hypothesis for the advantage of sex may not apply widely.

HE interactions between deleterious mutations, and Bourguet 2000), some have found no epistasis T known as epistasis, can have important effects on (Elena 1999; Wloch et al. 2001), some have found many evolutionary processes. Epistasis affects the muta- antagonistic epistasis (Bonhoeffer et al. 2004; Burch tion load of a sexual population (Kimura 1961; Kimura and Chao 2004; Jasnos and Korona 2007), and others and Maruyama 1966), the evolution of ploidy have found that some mutations interact synergistically (Kondrashov and Crow 1991; Jenkins and Kirkpatrick and others antagonistically, with no clear bias toward 1995), speciation (Wagner et al. 1994), and Muller’s one or the other type of interaction (deVisser et al. ratchet (Felsenstein 1974; Kondrashov 1994; Muller 1997b; Elena and Lenski 1997). Since the experimen- 1964). Epistasis also affects the linkage disequilibria be- tal evidence is unclear, in this article we focus instead on tween deleterious mutations (Barton 1995; Peters and a theoretical approach, asking under which circum- Lively 2000). This is the basis of the mutational deter- stances synergistic or antagonistic epistasis would tend ministic theory for the evolution of sex, which finds that to evolve. By understanding whether evolution favors sex can provide an advantage when interactions between one type of epistasis, we can shed light on whether or deleterious mutations are synergistic (i.e., mutations have not it is plausible that this type of epistasis is common. a stronger effect in combination than would be expected Analyzing the evolution of epistasis can also help us from their individual effects) (Kimura and Maruyama understand the forces shaping patterns of genetic 1966; Kondrashov 1982, 1988, 1993; Charlesworth architecture. We can view epistasis as a property of the 1990). types of genetic networks in an organism: for example, This has led to a number of experimental studies to more modular genetic pathways will tend to favor dif- quantify the type and strength of epistasis between ferent types of epistasis than less modular ones (Wagner deleterious mutations in a variety of organisms (de- and Altenberg 1996; Liberman and Feldman 2005; Visser et al. 1996, 1997a,b; Elena and Lenski 1997; Hansen et al. 2006). Alternatively, we can think of epis- Elena 1999; Whitlock and Bourguet 2000; Wloch tasis as a way of looking at general properties of fitness et al. 2001; Burch and Chao 2004). Unfortunately, landscapes: for example, rougher and smoother land- epistasis is difficult to measure (West et al. 1998). The scapes correspond to different patterns of epistasis be- measurements that exist show no general patterns. Rather, tween (Whitlock et al. 1995). Thus by studying some studies have shown mostly synergistic epistasis the evolution of epistasis between deleterious muta- (Mukai 1969; deVisser et al. 1996, 1997a; Whitlock tions, we are studying whether certain types of fitness landscapes or types of genetic networks are likely to be favored. 1Corresponding author: Lewis-Sigler Institute for Integrative Genomics, Carl Icahn Laboratory, Princeton University, Princeton, NJ 08544. Epistasis is caused by a variety of different mecha- E-mail: [email protected] nisms. This makes it unclear precisely what the evolution

Genetics 177: 1001–1010 (October 2007) 1002 M. M. Desai, D. Weissman and M. W. Feldman of epistasis means. Epistasis between many loci could be determined in a complex way by the states of these and possibly multiple other loci. In this article, we do not attempt to model all possibilities. Rather, we use a sim- ple modifier locus approach, in which there are a num- ber of ‘‘major’’ loci at which deleterious mutations can occur, and another ‘‘modifier’’ locus that controls the epistatic interactions between the major loci. This ap- proach may be realistic in certain situations. For exam- ple, one gene may code for an enzyme that catalyzes the reaction between several others; mutations in this mod- ifier gene make mutations in certain residues in the other interacting proteins interact more synergistically or more Figure 1.—An example of fitness as a function of the num- antagonistically. Alternatively, mutations that affect cross- ber of deleterious mutations for synergistic epistasis (dotted line), no epistasis (solid line), and antagonistic epistasis talk between distinct functional modules, or that cause (dashed line). duplications or eliminate redundancy in a genetic net- work, could be modifier loci that affect the epistatic in- teractions between many other loci (Gu et al. 2003; at rate m per individual per generation. In isolation, Wagner 2005; Sanjuan and Elena 2006). However, our each allele ai carries a fitness cost s, and we assume s>1. main goal with this particularly simple model is to focus That is, an individual with one deleterious mutation has on whether generally more synergistic or more antago- fitness es 1 s. nistic epistasis is favored by evolution, and by how much, We introduce epistasis by assuming that the fitness of rather than to accurately account for all the details of an individual with k deleterious mutations is how epistasis is determined. sk11e Our approach is related to the modifier approach of wk ¼ e : ð1Þ Liberman and Feldman (2005, 2006; Liberman et al. The parameter e describes the sign and strength of the 2007). These authors considered two loci that control epistasis. When e , 0, the mutations in combination fitness in diploids, coupled to a third modifier locus that have less effect than the product of their separate ef- affects the epistatic interactions between them. They fects; this corresponds to antagonistic (positive) epista- analyzed the fate of a rare allele at the modifier locus sis. When e . 0, the mutations in combination have a that changes the epistasis between the other two. This larger effect than the product of their separate effects; work considered more general types of epistasis than we this is synergistic (negative) epistasis. This is illustrated study, but only within the context of a simple two-locus in Figure 1. This model is identical to that of Lenski et al. system. Others have used an alternative approach based (1999), with their b equal to our 1 1 e. It differs from on the theory of quantitative inheritance, which views earlier models where fitness is a quadratic function of k epistasis as a contribution to variance in phenotype and (Charlesworth 1990; Elena and Lenski 1997); these asks about the evolution of this epistatic variance models present a less clear distinction between syner- (Hansen and Wagner 2001a,b; Hermisson et al. gistic and antagonistic epistasis and can introduce 2003; Carter et al. 2005; Hansen et al. 2006). artifacts for large k (Wilke and Adami 2001). After defining our model more precisely in the next We assume that selection is strong compared to section, we begin our analysis by studying the develop- mutation, such that Lm=s>1. In this regime, haplotypes ment of mutation–selection balance in a population with more deleterious mutations are always rare com- with a given type and strength of epistasis between the pared to haplotypes with less, so that we can neglect major loci, initially in an asexual population and then back mutations from a to A. In addition, because with varying levels of recombination. We next study the k11es>1 for essentially all individuals in the population, evolution of epistasis by considering whether a mutation multiplicative and additive epistasis are equivalent: at the modifier locus that changes the epistasis between the major loci can invade a population that is initially sk11e 11e wk ¼ e 1 k s: ð2Þ homogeneous at the modifier locus. Finally, assuming Lm=s>1 means that stochastic effects and Muller’s ratchet can be ignored even in moderate- MODEL sized populations, and hence we can use deterministic We consider a haploid population evolving under the rather than stochastic equations to describe the evolu- pressure of mutation to alleles at many loci which may tion of the population. This focus on moderate to larger interact epistatically. Suppose that there are L loci, each populations makes sense, since this is where the mu- with two possible alleles. At each locus i, allele Ai is tational deterministic hypothesis for the advantage of assumed to be more fit and mutates to the less fit allele ai sex and other important effects of epistasis apply. Evolution Can Favor Antagonistic Epistasis 1003

Figure 3.—Diagram of the assumed arrangement of the major loci Ai and the modifier locus M. The recombination rate between adjacent major loci is r, while that between the set of major loci and the modifier locus is R.

P ^ L k 11e 1 where F0 k¼0ððLm=sÞ =k! Þ . Note that for e ¼ 0, this reduces to the classical result in the absence of epistasis (Haigh 1978). ^ It is easy to calculate w from Equation 7 to demon- Figure 2.—The steady-state mutation–selection balance Fk in populations at two different values of e. The solid line strate that Equation 6 is satisfied, which confirms the shows e ¼ 0.8 (synergistic epistasis) and the dashed line consistency of our solution. Note that while w^ and the 1 ^ ^ ^ shows e2 ¼0.8 (antagonistic epistasis). These two distribu- ratio F1=F0 do not depend on e, the full distribution Fk tions have identical mean fitness given the respective values does (Figure 2). More antagonistic epistasis leads to the of e. The effect of recombination in a population that con- maintenance of more multiple deleterious mutants in tains individuals with both values of e is to bring the two dis- tributions closer together. This corresponds to a net transfer the population, and vice versa. However, these different distributions are such that the mean fitnesses of pop- of deleterious mutations from individuals with e ¼ e2 to indi- viduals with e ¼ e1, increasing the mean fitness of the e2 indi- ulations at different values of e are identical. viduals and decreasing the mean fitness of the e1 individuals. Recombination: To study the effects of recombina- Thus smaller (or more negative) e is favored by recombination. tion, we must assume an arrangement for the location of the various loci. Suppose that each individual has a single linear chromosome and that all of the sites Ai, THE EPISTATIC MUTATION–SELECTION BALANCE which we refer to as the ‘‘majorloci,’’are arranged along this chromosome, with a recombination rate r between We begin by studying the evolution under mutation each pair of adjacent loci. There is an additional locus and selection of a population that is homogeneous at the M, which we refer to as the ‘‘modifier locus,’’ at which modifier locus, with epistasis e. We start with the asexual different alleles confer different values of e. This locus is case and then consider the effects of recombination. assumed to be at the end of the chromosome, with a An asexual population: Denote by F the frequency in k recombination rate R between M and the rest of the the population of individuals with k deleterious muta- major loci (Figure 3). tions. The frequency F 9 in the next generation can then k Recombination between the modifier locus and the be written in terms of the frequency in the previous major loci clearly matters only when the modifier locus generation, giving the recursions is polymorphic, but recombination among the major

wF 09 ¼ð1 LmÞF0 loci will have an effect on the distribution of deleterious 11e 11e alleles even when the population is homogeneous for a wF 9 ¼½1 ðL kÞmesk F 1 ðL 1 1 kÞmesðk1Þ F ; k k k1 single value of e. In fact, when r 6¼ 0(i.e., there is ð3Þ recombination between the major loci), it is no longer P sk11e true that all major loci are equivalent, and the frequen- where w ¼ k e Fk is the mean fitness of the pop- ulation, measured after mutation. cies of deleterious alleles at different loci will not be Since we assume Lm>s>1, we can neglect terms in equal. Instead, deleterious alleles will either tend to the product of s and m in Equation 3 and neglect k com- cluster together along the chromosome or tend to pared to L. This produces the simplified recursions spread themselves apart. By doing so, they will either minimize or maximize the effective recombination rate wF 09 ¼ð1 LmÞF0 ð4Þ among themselves. To analyze this effect, we use an iterative approach,

11e formally valid for small recombination rates r. We begin wF k9 ¼½1 k sFk LmFk 1 LmFk1: ð5Þ with the solution to the r ¼ 0 case, where the frequencies of deleterious alleles and the linkage disequilibria are At steady state, this yields uniform across the chromosome. We write down the w^ ¼ 1 Lm ð6Þ recursions describing the dynamics of the frequencies of these alleles for nonzero r. These are impossible to solve exactly, but we solve for the steady-state (now k ^ ðLm=sÞ ^ position-dependent) frequencies of the deleterious Fk F0; ð7Þ k!11e alleles by assuming the linkage disequilibria are still 1004 M. M. Desai, D. Weissman and M. W. Feldman given by their r ¼ 0 values. From these approximate These recursions cannot be solved exactly. However, as frequencies, we can calculate a better approximation for noted above, for small r we can find an approximate the position-dependent linkage disequilibria. We can equilibrium by assuming that D takes its r ¼ 0 value given then plug this new formula for the disequilibria into the in Equation 9. We detail this in appendix a and find recursions for the frequencies of the alleles and solve these again to give a more accurate expression for the m 1Lr Lm 2i 2 F^ ðiÞ F^ 1 F^ ð1 2eÞ 11 1 ; frequencies of the deleterious alleles. We can repeat this 1 s 0 4 s s 0 L iterative process; for small r each successive iteration ð13Þ produces a smaller correction, so we can iterate as far as needed to get the desired accuracy. Fortunately, for small Lm/s, the recombination rate can be quite large m 2 F^ 1 Lr j j i j F^ ði; jÞ 0 1 1 F^ ð1 2eÞ : ð14Þ before this approach fails; the precise conditions are 2 s 2e 2 s 0 L described below. To begin this analysis, we define Fk(i1, ..., ik) to be the ^ ^ We can use these expressions for F1ðiÞ and F2ði; jÞ frequency of individuals with a total of k mutations at ^ ^ ^ ^ P (and the values of F0 that they imply, F0 ¼ 1 F1 F2)to loci i1, ..., ik 2 {1, ..., L}. Note that Fk ¼ Fkði1; ...; ikÞ, calculate a better approximation for D^ði; jÞ, which will where the sum is over all possible configurations of i1 , now be position dependent. We find i2 , ... , ik. We define the between two loci i, j as m 2 Lr 1 j j i j D^ði; jÞ¼ ð2e 1ÞF^2 1 F^ s 0 s 2 0 L Dði; jÞ [ F0F2ði; jÞF1ðiÞF1ðjÞ: ð8Þ Lm 2i 2 2j 2 1 2 1 1 1 1 : s L L ð15Þ Note that this definition of D neglects terms involving mutations at other sites; these will all be higher order in Lm/s. Without recombination (i.e., for r ¼ 0), we find We see that jDj is equal to its r ¼ 0 value minus a position- from Equation 7 that to lowest order in m/s, the linkage dependent correction due to recombination that is disequilibrium is a constant, small when Lr=s>1. As claimed, the effect of recombi- nation is to reduce the absolute value of D without m 2 D^ ¼ F^2 ð2e 1Þ: ð9Þ changing its sign. We can insert this corrected D back 0 s into our recursions to calculate a better approximation for F^ ðiÞ and F^ ði; jÞ and repeat this iterative process Note that D^ has the opposite sign to e, in agreement 1 2 until the desired accuracy is reached. The above for- with earlier work (Feldman et al. 1980; Liberman and mulas make clear that the condition required for this Feldman 2005). We expect that recombination will act iterative approach to converge is Lr=s>1; this is what is to reduce the absolute value of the steady-state linkage meant by low recombination rates. In fact, whenever disequilibrium for all pairs of loci, but will not change Lr=s>1, the above formulas are already a good approx- the sign (this is confirmed by simulations, not shown). imation to the disequilibria and the frequencies of the Note that this means that in steady state, we expect D to deleterious alleles F^ . A comparison of these results with be at most of order (m/s)2, regardless of r. k simulations is shown in Figure 4. We assume a model of recombination where recom- Some of the results above are valid even when Lr=s is bination acts separately from selection (i.e., selection not small compared to 1. We see from Equation 13 that acts only on nonrecombining sequences; our model for ðLr=sÞðLm=sÞ>1, the correction to F^ ðiÞ due to also describes the case where recombination acts jointly 1 nonzero r and the position dependence of the frequen- with selection but r is small enough that we can neglect cies of these single mutants are small compared to the products of s and r). The recursions for the frequencies r ¼ 0 value F^ ðiÞ¼ðm=sÞF^ . For Lm=s small, any bi- of the deleterious alleles thus become, to order (m/s)2 1 0 ological values of r (0 , Lr , 0.5) satisfy this small-r (and ignoring products of s and m), condition. Because recombination can only reduce jDj, X this means that even this zeroth order iterative result wF 09 ¼ð1 LmÞF0 r ð j iÞDði; jÞð10Þ ^ ^ gives accurate results for the average value of F1=F0, and , i j hence also for the mean fitness of the population, for any biological r. On the other hand, the position XL dependence in Equation 14 will be small compared to wF 1ðiÞ9 ¼ð1 s LmÞF1ðiÞ 1 mF0 1 r j j i j Dði; jÞ ^ the average F2ði; jÞ only when Lr=s>1. If this condition ¼ j 1 is not satisfied, then the number of double mutants ð11Þ formed by recombination of single mutants will be com- parable to the number of double mutants formed by wF ði; jÞ9 ¼ð1 211es LmÞF ði; jÞ 2 2 mutation of single mutants. In this case, the average 1 m½F1ðiÞ 1 F1ð jÞ r j j i j Dði; jÞ: ð12Þ ^ ^ value of F2ði; jÞ½and therefore Dði; jÞ may differ Evolution Can Favor Antagonistic Epistasis 1005

This analytical result for dw^ agrees with simulations (data not shown). Note that dw^ has the opposite sign to D and therefore the same sign as e. Thus, recombination increases the equilibrium mean fitness of populations with synergistic epistasis and decreases the mean fitness of populations with antagonistic epistasis. In this sense, synergistic epistasis favors recombination. This agrees with earlier work on the mutational deterministic hypothesis for the advantage of sex (Kimura and Maruyama 1966; Kondrashov 1982, 1988, 1993; Charlesworth 1990). This result also indicates that if a population consists of separate subpopulations with different values of e and there is recombination within but not between subpo- pulations, then the subpopulation with the highest value of e will be favored over the others. We can also see from Equations 13 and 14 that the position dependence in the distribution of deleterious alleles depends on the sign of e. For synergistic epistasis, recombination will increase the relative frequency of single mutants near the center of the chromosome and decrease the relative frequency of double mutants with large separation between their two mutations, and vice versa for antagonistic epistasis. This is confirmed by simulations, as shown in Figure 4, and makes intuitive sense. When there is synergistic epistasis, bringing together multiple mutations into the same individual decreases mean fitness, while breaking up combinations igure F 4.—The effect of recombination on the distribution of deleterious alleles (dispersing them into separate of single mutations. The theoretical predictions (solid lines) and simulation results (crosses) for the frequency of individ- individuals) increases it. The population therefore de- uals that have a single deleterious mutation at each of the creases the effective recombination rate between single major loci, as a function of the position of the major loci along mutants by clustering their mutations together in the the chromosome (horizontal axis), are shown for (a) e ¼0.2 middle of the chromosome and increases the effective and (b) e ¼ 0.2. Insets are smaller-scale graphs that reveal the recombination rate for double mutants by clustering position dependence of the mutation frequencies. The other parameters are L ¼ 200, m ¼ 2.5 3 106, s ¼ 0.01, r ¼ 2.5 3 their mutations at opposite ends of the chromosome. 106. Simulation methods are described in appendix b. For antagonistic epistasis, mean fitness decreases when combinations of deleterious alleles are broken up and increases when they are brought together, so we see the opposite effect. substantially from its r ¼ 0 value, and there may be significant position dependence in both quantities. THE EVOLUTION OF EPISTASIS We can also calculate how recombination affects the When mutations that change the sign or strength of mean fitness of the population. From the recursion epistasis are possible, epistasis can evolve. A mutation at Equation 10, we find that with recombination w^ ¼ locus M may produce an allele whose effect is to change 1 Lm 1 dw^, where the difference in equilibrium mean e, and the subpopulation with this different e may be fitness from the r ¼ 0 value is favored or disfavored by evolution. In the long term, the X D^ði; jÞ population will tend to evolve toward those values of e dw^ ¼r ðj iÞ : ð16Þ that are most favored. Our analysis is done within the F^ i , j 0 context of this modifier locus approach, where e is controlled by the locus M, which gives us a well-defined 2 ^ This correction is at most order ðÞLm=s ,sow 1 Lm mutation and recombination structure in which to for any biological value of r. For Lr=s>1 we can make concrete calculations. However, the basic idea is ^ approximate Dði; jÞ by its r ¼ 0 value to obtain more general: we ask whether or not more synergistic or more antagonistic epistasis will tend to be favored, and m 2 Lr L e ^ dw^ ¼ ð1 2 ÞF0: ð17Þ by how much, regardless of the cause of the change in 6 s the epistasis. Naturally the details of the evolution of 1006 M. M. Desai, D. Weissman and M. W. Feldman epistasis in any particular system will depend on whether antagonistic epistasis, e1. We denote by Fk(i1, ..., ik)the or not mutations changing e in a given way are possible. frequency of individuals with e0 and k deleterious Our analysis demonstrates when such mutations will be mutations at major loci i1, ..., ik and by Gk(i1, ..., ik) favored by evolution, given that they have occurred. the frequency of individuals with e1 and k deleterious We have seen above that, as measured by mean fitness, mutations at i1, ..., ik.Fand G denote the total fraction synergistic epistasis favors recombination. Surprisingly, of the population with epistasis e0 and e1, respectively. we find below that the converse is not true. Rather, When recombination is rare ðÞR=s>1; Lr=s>1 , assum- recombination favors the evolution of antagonistic (or ing as before that recombination acts separately from less synergistic) epistasis. Thus, although synergistic selection, the recursions for Gk are given to leading epistasis will tend to favor recombination, this re- order in Lm=s, R=s, and Lr=s by combination will, if possible, eliminate the synergistic X epistasis. wG 09 ¼ð1 LmÞG0 RðFG0 GF0Þr ð j iÞDG ði; jÞ Asexual populations: In the absence of recombina- X i , j tion the mean fitness in steady state does not depend on r ðL iÞðG0F1ðiÞF0G1ðiÞÞ e. This means that if two subpopulations with different Xi values of e are placed in competition, neither will be r ½ðL iÞG0F2ði; jÞðL jÞF0G2ði; jÞ favored over the other in the long term. Depending i , j on the initial mean fitness of the two subpopulations ðj iÞG1ðiÞF1ð jÞ ð18Þ (which may depend, for example, on the genotype of the individual in which the mutation changing e ð Þ9 ¼ð m Þ ð Þ 1 m ð 1 ð ÞÞ occurs), one or the other may initially become more wG1 i 1 L s G1 i GX0 R r L i common. However, they will both eventually reach their 3 ðFG1ðiÞGF1ðiÞÞ 1 r j j i jDG ði; jÞ mutation–selection steady state, in which they will both " j have the same mean fitness. This is confirmed with Xi simulations (not shown). Once this happens, the frac- 1 r ði jÞ½F0G2ð j; iÞG1ðiÞF1ð jÞ tion of the population with epistasis e will drift n j¼1 # neutrally. XL Recombination: This simple result no longer holds 1 ð j iÞ½G0F2ði; jÞF1ðiÞG1ð jÞ when recombination is possible. With more than one j¼i11 allele possible at the modifier locus, there are two types ð19Þ of recombination events: those between the modifier and the set of all major alleles and recombination be- tween the major alleles. Consider a population in which 11e1 wG 2ði; jÞ9 ¼ð1 Lm 2 sÞG2ði; jÞ1mðG1ðiÞ1G1ðjÞÞ there are two alleles M and M producing different 0 1 ðR 1ðL iÞrÞðFG ði; jÞGF ði; jÞÞ values of e segregating at the modifier locus. If these two 2 2 subpopulations are initially at their individual muta- rðj iÞDG ði; jÞ ^ ^ tion–selection balances, described by Fk and Gk, re- 1rðj iÞðF1ðiÞG1ð jÞG0F2ði; jÞÞ; spectively, they have equal mean fitness and neither is ð20Þ favored. However, recombination brings the frequency distributions Fk and Gk of these two subpopulations where we have defined DG(i, j) ¼ G0G2(i, j) G1(i)G1(j) closer together, as it converts individuals of type Fk to and in the last recursion we have assumed i , j. The F type Gk and vice versa. This increases the mean fitness of recursions are the same, with all G’s replaced by F ’s and the subpopulation with the smaller (or more negative) vice versa. Note that we can neglect Gk and Fk for k $ 3 value of e, because in this subpopulation the number of because these terms are all higher order in Lm=s, R=s, individuals with deleterious mutations is reduced. This and Lr=s. is illustrated in Figure 2. In contrast, the mean fitness of When M1 is rare, we can ignore the terms containing 2 the subpopulation with the larger value of e is de- Gk in the Fk recursions and the terms proportional to Gk creased. This means that the subpopulation with more in the Gk recursions. This is because when M1 is rare, negative e is favored by selection. Thus, recombination recombination events of e1 haplotypes are almost always favors the development of more antagonistic (or less with e0 haplotypes, and recombination events of e0 hap- synergistic) epistasis between deleterious mutations. lotypes are also almost always with e0 haplotypes. With When recombination is rare, we can estimate the this simplification, the recursions for Fk become simply strength of the selection for lower e. We imagine a those given in Equations 10–12 and can be solved to give population dominated by one value of epistasis, e0.We the steady-state distributions, Equations 13 and 14. The introduce, at low frequency, individuals with a different e0 subpopulation remains in this steady-state distribu- value of epistasis e1, where e1 , e0, and calculate the tion as long as M1 is rare. In this steady state, the mean strength of the selection for individuals with the more fitness w0 of the e0 individuals is, using Equation 17, Evolution Can Favor Antagonistic Epistasis 1007 Lr Lm 2 w ¼ 1 Lm ð1 2e0 ÞF^ : ð21Þ 0 6 s 0

We can now substitute this steady-state Fk distribution into our recursions for Gk. This leads to a quasi-steady- state Gk distribution, because, as we shall see, the distribution of Gk (i.e., the values of Gk/G) quickly reaches its steady state relative to the timescale on which G or F change. For simplicity, we describe this distribu- tion by g1(i) [ G1(i)/G0 and g2(i, j) [ G2(i, j)/G0. Upon substituting the steady-state Fk distributions into the recursions for Gk, and solving for the quasi-steady-state Gk, we find that in this quasi-steady state, m m 2 1 L Lr e0 2i g1ðiÞ¼ 1 F ð1 2 Þ 1 1 1 s 4 s s L i 2 1 2ð2e0 2e1 Þ L 1 R Lm 2 1 F ð2e0 2e1 Þ 2 s s m 2 1 1 Lr j i j i g ði; jÞ¼ 1 1 F 1 2e0 1 2e1 1 2 e1 s 2 2 s L L L R 1 ð2e0 2e1 Þ ; s ð22Þ where i , j in the second equation. Note that the effect of R on g1(i) is higher order in Lm=s than that of r.We can find the strength of selection for the M1 allele by calculating the mean fitness w1 of the G individuals and igure comparing it to the mean fitness w of the F individuals F 5.—Analytic and simulation results for the strength 0 of selection for antagonistic epistasis. Simulation methods found in Equation 21, are described in appendix b. (a) Simulation results for t !(in millions of generations, vertical axis) as a function of X X 1 G ðÞðLr=3Þ 1 R for R ¼ 2 3 103 and r varying (dots), for 0 11e1 w ¼ 1 1 ð1 sÞ g ðiÞ 1 ð1 2 Þ g ði; jÞ 3 3 1 G 1 2 R ¼ 10 and r varying (crosses), and for Lr ¼ 2.5 3 10 i i , j and R varying (squares), with the other parameters held con- 2 De 5 Lm Lr 1 2 stant at L ¼ 25, m ¼ 5 3 10 , s ¼ 0.05, e0 ¼ 0.4, e1 ¼0.4. (b) ¼ w 1 1 R F ; 0 11e1 Simulation results for t (in millions of generations, vertical s 3 2 11e1 De axis) as a function of 2 =ð1 2 Þ for e1 ¼0.4 and De ð23Þ varying (crosses) and for De ¼0.2 and e1 varying (squares), with the other parameters held constant at L ¼ 1000, m ¼ 6 3 where De ¼ e1 e0. This means that when M1 is rare 310 , s ¼ 0.03, r ¼ 0, R ¼ 10 . In both plots, the solid line (F 1), its frequency increases roughly exponentially, is the theoretical prediction, Equation 24. at a rate evolution. In Figure 5, we compare these analytical pre- 1 Lm 2 Lr 1 2De dictions for the strength of selection to simulations. 1 R ; ð24Þ t s 3 211e1 DISCUSSION which is the strength of selection for the more an- tagonistic epistasis. Note that this selection pressure The patterns of epistasis in natural populations is weak compared to s, justifying our quasi-steady-state contain information about the way in which genetic assumption. and regulatory networks are organized. This epistasis, in As the frequency of e1 changes, the distribution gk also turn, has important implications for the future evolu- slowly changes. Eventually, the e1 individuals become tion of the population, such as the forces acting on common and the e0 individuals become rare. In this recombination rate and mutational load and driving opposite limit, the distribution of the Gk is given by the speciation. The aspects of the biology that determine steady-state result, Equations 13 and 14, and we can find epistatic interactions are also themselves under evolu- a quasi-steady-state distribution for the e0 individuals. tionary pressures, both for unrelated pleiotropic effects This yields a roughly constant selection pressure for they have on fitness and because of their impact on the e1 individuals throughout most of the course of the epistasis itself. We have studied this latter effect. 1008 M. M. Desai, D. Weissman and M. W. Feldman

A variety of types of epistasis are important for these We have found that the strength of selection against questions. Epistasis between beneficial mutations, or synergistic epistasis and in favor of antagonistic epistasis the possibility that some mutations may be neutral or increases with the recombination rate. Thus in mostly deleterious by themselves but beneficial in the right asexual populations, synergistic epistasis is more likely combinations (‘‘sign epistasis’’), may be very impor- to persist. This poses a problem for the mutational tant to adaptation and speciation (Iwasa et al. 2004; deterministic hypothesis for the advantage of recombi- Weinreich and Chao 2005). However, combinations of nation, which posits that sex be advantageous because it mutations involved in sign epistasis are likely to be rare increases fitness in the presence of synergistic epistasis compared to deleterious mutations. We have focused between deleterious mutations. While this remains true, exclusively on the latter case of the tendency of dele- recombination will create selection pressure against the terious mutations to interact either synergistically or synergistic epistasis and thus tend to eliminate the antagonistically. source of its own evolutionary advantage. The epistasis that exists among these deleterious The analytical work in this article is strictly valid for mutations is determined by several factors. First, there R; Lr; Lm>s>1, and L?1, that is, for a large number are presumably only a limited number of ways in which of loci with potential epistatic interactions and subject biological networks can change that affect epistasis. It to rare slightly deleterious mutations. It is only in this may be that there are fewer ways to change existing parameter range that Equation 24 accurately describes genetic networks to make epistasis more antagonistic the strength of selection for antagonistic epistasis. How- than to make it more synergistic, or vice versa. Second, ever, the more general result that recombination gives mutations that change epistasis may also tend to have a selective advantage to more antagonistic epistasis by direct fitness impacts, which could be stronger than the breaking up associations between antagonistic epistasis selection due to their effect on epistasis. These direct and deleterious mutations should not depend strongly fitness effects might ‘‘average out,’’ in the sense that the on the values of the parameters in the model. In direct fitness effects of mutations that make epistasis particular, we expect it to hold for R; Lr; s & 1, and more synergistic are on average the same as those that L?/ 1. In fact, the effect should be stronger for larger make epistasis more antagonistic. But this does not have values of R, r, and s. to be the case; it could be that the evolution of epistasis is In a computational study of the evolution of artificial simply a by-product of these other direct impacts on gene networks, Azevedo et al. (2006) recently came to fitness. Third, mutations that change epistatic interac- the opposite conclusion that evolution favors synergistic tions are selected because of this change, independent epistasis. They found that selection acts to increase the of any direct effects on fitness. This final effect has been mutational robustness in sexual populations, which the focus of this article. tends to be correlated with synergistic epistasis. Their The patterns of epistasis in natural populations are work is not necessarily inconsistent with ours, because in ultimately an empirical question. Given particular ob- their simulations synergistic epistasis evolves only due to served patterns of epistasis, an understanding of the other fitness effects of the changes that create this type direction and strength of one of the evolutionary forces of epistasis. that may have acted to generate this epistasis can help us Mutational robustness is closely related to genetic understand what the observed epistasis might mean. If, canalization, which refers to the extent to which a for example, we observe an organism where epistasis genotype is buffered against mutations (Wagner et al. between deleterious mutations is predominantly syner- 1997). Clearly canalization is also related to epistasis. gistic, our analysis has shown that this cannot be because Synergistic epistasis can be considered to represent evolution favors such synergistic interactions. Rather, canalization, as it means that one or a few mutations there must be some other reason that we can look for, have a relatively small effect on fitness, compared to the such as direct advantages to the types of networks that larger effects of multiple mutations that take a genotype happen to produce synergistic epistasis. away from the buffered state (Burch and Chao 2004). Since experimental evidence on the structure of Since the only selection on canalization is through its epistasis in natural populations is at present unclear, effects on fitness through the epistasis that it creates, our analysis is also useful in showing that there is no our analysis thus shows that selection should tend to act theoretical reason to believe that synergistic epistasis is against canalization. likely to be widespread in nature. Rather, we expect However, this conclusion is sensitive to definitions— evolution to have favored the development of more specifically, to what extent we distinguish the effects of antagonistic epistasis—‘‘buffering’’ between the effects single mutations from their epistatic interactions. In our of deleterious mutations. While synergistic epistasis model, the effects of single mutations are fixed and the may indeed be present when it happens to be a by- epistasis between them is allowed to evolve. If we instead product of other selectively advantageous traits, we can were to allow the effects of single mutations to change, expect that it has been eliminated by evolution where while fixing the fitness of double and triple mutants, possible. we would find that reducing the fitness cost of single Evolution Can Favor Antagonistic Epistasis 1009 mutations would be favored, consistent with the work of deVisser, J. A. G. M., R. F. Hoekstra and H. Van den Ende, Gardner and Kalinka (2006). This would then have 1997b Test of interaction between genetic markers that affect fitness in aspergillis niger. Evolution 51: 1499–1505. the side effect of producing synergistic epistasis and Elena, S. F., 1999 Little evidence for synergism among deleterious hence canalization. In our view, however, it would be mutations in a nonsegmented rna virus. J. Mol. Evol. 49: 703–707. lena enski misleading to classify this as the evolution of synergistic E , S. F., and R. E. L , 1997 Test of synergistic interactions among deleterious mutations in bacteria. Nature 390: 395–398. epistasis. Rather, it is selection in favor of direct fitness Feldman, M., F. Christiansen and L. Brooks, 1980 Evolution of impacts that happen to also be correlated with epistasis. recombination in a constant environment. Proc. Natl. Acad. When attention is focused specifically on the selection Sci. USA 77: 4838–4841. Felsenstein, J., 1974 The evolutionary advantage of recombina- on epistasis, as in our model, more antagonistic epistasis tion. Genetics 78: 737–756. is instead favored. Gardner, A., and A. T. Kalinka, 2006 Recombination and the evo- Regardless of definitions, we see that buffering of lution of mutational robustness. J. Theor. Biol. 241: 707–715. Gu, Z., L. M. Steinmetz,X.Gu,C.Scharfe,R.W.Davis et al., genotypes is favored. When the fitness effects of single 2003 Role of duplicate genes in genetic robustness against null mutations are allowed to evolve, evolution favors the mutations. Nature 421: 63–66. buffering of genotypes against their effects—in other Haigh, J., 1978 The accumulation of deleterious genes in a popu- lation—Muller’s ratchet. Theor. Popul. Biol. 14: 251–267. words, the reduction of their fitness cost. If we assume Hansen,T.F.,andG.P.Wagner, 2001a Epistasis and the mutation that the fitness of double mutants is not correspond- load: a measurement-theoretical approach. Genetics 158: 177–185. ingly reduced, this leads to synergistic epistasis and Hansen,T.F.,andG.P.Wagner, 2001b Modeling genetic architecture: a multilinear theory of gene interactions. Theor. Popul. Biol. 59: 61–86. hence canalization or ‘‘mutational robustness.’’ On the Hansen, T. F., J. M. Alvarez-Castro,A.J.Carter,J.Hermisson and other hand, if the fitness effects of single mutations are G. P. Wagner, 2006 Evolution of genetic architecture under fixed and their epistatic interactions are allowed to directional selection. Evolution 60: 1523–1536. Hermisson, J., T. F. Hansen and G. P. Wagner, 2003 Epistasis in evolve, selection favors an increase in the buffering polygenic traits and the evolution of genetic architecture under between the effects of the mutations—that is, more stabilizing selection. Am. Nat. 161: 708–734. antagonistic epistasis. Iwasa, Y., F. Michor and M. A. Nowak, 2004 Stochastic tunnels in We can view our results as a specific example of a more evolutionary dynamics. Genetics 166: 1571–1579. Jasnos, L., and R. Korona, 2007 Epistatic buffering of fitness loss in general process by which recombination favors genetic yeast double deletion strains. Nat. Genet. 39: 550–554. robustness. In a population with multiple segregating Jenkins, C., and M. Kirkpatrick, 1995 Deleterious mutation and modifier alleles affecting the degree of robustness to the evolution of genetic life cycles. Evolution 49: 512–520. Kimura, M., 1961 Some calculations on the mutation load. Jpn. J. deleterious mutations, mutations will accumulate in indi- Genet. 36: 179–190. viduals carrying the modifier that confers the most ro- Kimura, M., and T. Maruyama, 1966 The mutational load with bustness. Recombination will break up the associations epistatic gene interactions in fitness. Genetics 54: 1337–1351. Kondrashov, A., 1982 Selection against harmful mutations in large between this modifier and deleterious mutations, trans- sexual and asexual populations. Genet. Res. 40: 325–332. ferring them to less robust individuals for whom they Kondrashov, A., 1988 Deleterious mutations and the evolution of will cause a greater loss of fitness. Thus, recombination sexual reproduction. Nature 336: 435–440. Kondrashov, A., 1993 Classification of hypotheses on the advan- favors the evolution of increased robustness to mutation. tage of amphimixis. J. Hered. 84: 372–387. ondrashov We thank Daniel Fisher for many useful ideas and discussions. This K , A., 1994 Muller’s ratchet under epistatic selection. work was supported by National Institutes of Health grant GM 28016. Genetics 136: 1469–1473. Kondrashov, A., and J. Crow, 1991 Haploidy or diploidy: which is D.W. is supported by a National Science Foundation graduate research better. Nature 351: 314–315. fellowship. Lenski, R. E., C. Ofria,T.C.Collier and C. Adami, 1999 Genome complexity, robustness and genetic interactions in digital organ- LITERATURE CITED isms. Nature 400: 661–664. Liberman, U., and M. W. Feldman, 2005 On the evolution of epis- Azevedo, R. B. R., R. Lohaus,S.Srinivasan,K.K.Dang and C. L. tasis i: diploids under selection. Theor. Popul. Biol. 67: 141–160. Burch, 2006 Sexual reproduction selects for robustness and Liberman, U., and M. W. Feldman, 2006 Evolutionary theory for negative epistasis in artificial gene networks. Nature 440: 87–90. modifiers of epistasis using a general symmetric model. Proc. Barton, N. H., 1995 A general model for the evolution of recom- Natl. Acad. Sci. USA 103: 19402–19406. bination. Genet. Res. 65: 123–144. Liberman, U., A. Puniyani and M. W. Feldman, 2007 On the evo- Bonhoeffer, S., C. Chappey,N.T.Parkin,J.M.Whitcomb and C. J. lution of epistasis ii: a generalized Wright-Kimura framework. Petropoulos, 2004 Evidence for positive epistasis in HIV-1. Theor. Popul. Biol. 71: 230–238. Science 306: 1547–1550. Mukai, T., 1969 The genetic structure of natural populations of Burch, C. L., and L. Chao, 2004 Epistasis and its relationship to Drosophila melanogaster. vii. Synergistic interaction of spontaneous canalization in the RNA virus phi6. Genetics 167: 559–567. mutant polygenes controlling viability. Genetics 61: 749–761. Carter, A. J., J. Hermisson and T. F. Hansen, 2005 The role of Muller, H. J., 1964 The relation of recombination to mutational epistatic gene interactions in the response to selection and the advantage. Mutat. Res. 1: 2–9. evolution of . Theor. Popul. Biol. 68: 179–196. Peters,A.D.,andC.M.Lively, 2000 Epistasis and the maintenance of Charlesworth, B., 1990 Mutation-selection balance and the evolu- sex, pp. 99–112 in Epistasis and the Evolutionary Process,editedbyJ.B. tionary advantage of sex and recombination. Genet. Res. 55: 199– Wolf,E.D.Brodie andM.J.Wade, Oxford University Press, Oxford. 221. Sanjuan, R., and S. F. Elena, 2006 Epistasis correlates to genomic deVisser, J. A. G. M., R. F. Hoekstra and H. Van den Ende, complexity. Proc. Natl. Acad. Sci. USA 103: 14402–14405. 1996 The effect of sex and deleterious mutations on fitness Wagner, A., 2005 Distributed robustness versus redundancy as in chlamydomonas. Proc. R. Soc. Lond. Ser. B 263: 193–200. causes of mutational robustness. BioEssays 27: 176–188. deVisser, J. A. G. M., R. F. Hoekstra and H. Van den Ende, Wagner, A., G. Wagner and P. Similion, 1994 Epistasis can facili- 1997a An experimental test for synergistic epistasis and its tate the evolution of reproductive isolation by peak shifts: a application to chlamydomonas. Genetics 145: 815–819. two-locus two-allele model. Genetics 138: 533–545. 1010 M. M. Desai, D. Weissman and M. W. Feldman

Wagner, G. P., and L. Altenberg, 1996 Complex adaptations and Whitlock, M. C., P. C. Phillips,F.Moore and S. J. Tonsor, the evolution of evolvability. Evolution 50: 967–976. 1995 Multiple fitness peaks and epistasis. Annu. Rev. Ecol. Syst. Wagner, G. P., G. Booth and H. Bagheri-Chaichian, 1997 A pop- 26: 601–629. ulation genetic theory of canalization. Evolution 51: 329–347. Wilke, C. O., and C. Adami, 2001 Interaction between directional Weinreich, D. M., and L. Chao, 2005 Rapid evolutionary escape by epistasis and average mutational effects. Proc. R. Soc. Lond. large populations from local fitness peaks is likely in nature. Ser. B 268: 1469–1474. Evolution 59: 1175–1182. Wloch, D., R. Borts and R. Korona, 2001 Epistatic interactions of West, S. A., A. D. Peters and N. H. Barton, 1998 Testing for epis- spontaneous mutations in haploid strains of the yeast saccharo- tasis between deleterious mutations. Genetics 149: 435–444. myces cerevisiae. J. Evol. Biol. 14: 310–316. Whitlock, M. C., and D. Bourguet, 2000 Factors affecting the genetic load in drosophila: synergistic epistasis and correlations among fitness components. Evolution 54: 1654–1660. Communicating editor: N. Takahata

APPENDIX A: STEADY-STATE FREQUENCIES OF DELETERIOUS ALLELES WITH RECOMBINATION In this section we outline the iterative approach that leads to an approximate steady-state solution, valid for small r, to the recursion equations for the position-dependent frequencies of the deleterious alleles in the presence of recombination, Equations 10–12. Since D(i, j) appears only in the recursions multiplied by r, to leading order in r we 2 2 e ^ can use the r ¼ 0 value, D ¼ F0 ðÞm=s ðÞ2 1 . The equation for F1ðiÞ then becomes XL m 2 ^ ^ ^ ^ ^2 e ð1 Lm 1 dwÞF1ðiÞð1 Lm sÞF1ðiÞ 1 mF0 1 rF0 ð2 1Þ j j i j ; ðA1Þ s j¼1 where w [ 1 Lm 1 dw^. Note that XL L2 2i 2 j j i j 1 1 1 ; ðA2Þ j¼1 4 L which gives m 1 Lr Lm 2i 2 F^ ðiÞ¼ F^ 1 F^ ð1 2eÞ 1 1 1 : ðA3Þ 1 s 1 dw^ 0 4 s s 0 L Since dw^ LrðÞ Lm=s 2 (see Equation 16 and the surrounding discussion), the dw^ in the denominator does not affect the leading term in ðÞLm=s , and we can drop it, giving Equation 13. For F2(i, j), we have m 2 ð1 Lm 1 dw^ÞF^ ði; jÞð1 Lm 211esÞF^ ði; jÞ 1 mðF ðiÞ 1 F ðjÞÞ rF^2 ð2e 1Þjj i j : ðA4Þ 2 2 1 1 0 s ^ ^ From (13), we can see that the contribution of the position dependence of F1ðiÞ to F2ði; jÞ will be higher order in ^ ^ ðÞLm=s than the contribution of the rj j ijD term, so we can just use the r ¼ 0valueF1ðiÞðm=sÞF0. Plugging this in gives 1 m2 Lr j j i j F^ ði; jÞ¼ F^ 2 F^ ð2e 1Þ : ðA5Þ 2 211es 1 dw^ s 0 s 0 L As before, dw^ is higher order in ðÞLm=s and can be ignored, giving Equation 14. If desired, we could now iterate this process to find the correction to second order in r. We recalculate D using Equations 13 and 14, substitute this into the recursion equations, Equations 10–12, and repeat the process outlined above.

APPENDIX B: SIMULATION METHODS We tested our analytical results with both deterministic (infinite population) and stochastic (finite population) simulations. In the deterministic simulations, individuals with zero, one, or two mutations were assumed to evolve according to the recursion equations defined in the text, while individuals with more than two mutations were assumed to be nonviable (including higher-order mutants that had a negligible effect on the results.) The simulation results shown in Figures 4 and 5 are from these deterministic simulations. We also checked the results for the average frequencies of deleterious alleles and the strength of selection for antagonistic epistasis using fully stochastic finite- population simulations. For these, we assumed that the population evolved according to a Wright–Fisher model with recombination, mutation, and selection (occurring in that order) and again assumed that individuals with more than two mutations were nonviable. To find the strength of selection, we first ran the simulations until they approached quasi-steady-state distributions and then recorded the difference in mean fitness of the populations with different values of e. The results of these finite-population simulations (not shown) are consistent with our analytical predictions.