arXiv:1210.8423v2 [q-bio.PE] 22 Jan 2013 ten Rajon Etienne traits architectures quantitative genetic underlying of The akatertclfaeokfrforming stati for these framework from theoretical distinct a Nevertheless, lack 2010). stud actively al., already et are Yang trait kn a of with determinants molecular human ar of i however, number them traits, large detect Other the to including 2010). power – lack al., loci et studies Yang QTL 2012; that Rockman, fro small contributions These so with often 2012). architectures, are al., Fisherian et 2005 on Ehrenreich al., rely et 2008; to Brem al., 2009; et ta al., Emilsson diverse et 2010; in Manolio (QTL) 2008; loci Visscher, trait 2009; quantitative an map agricultural of to promise efforts the dous as well as technologies, cent (Fier speciation and isolation reproductive (Azev even recombination and of evolution 2007) timesc longer the Over for 2006). consequences Hansen, tant 2005; t al., adapt to et capacity Carter R population’s 2004; 2006; a (Hans affects al., trait therefore et and a Carlborg 2009) of 2005; con Mitchell-Olds, architecture others’ and genetic (Kroymann each resulting modulate The to traits. epistatically interact value, 2 1 ri RcmnadKula,20;Hne,20) o insta For 2006). Hansen, 2006; Kruglyak, and (Rockman trait eateto ilg,Uiest fPnslai,Philad [email protected] Pennsylvania, E-mail: of University Biology, of Department h utesaitclise fdsgigadinterpretin and designing of issues statistical subtle The pheno of basis genetic the studied have scientists Although whose loci genetic of set a by encoded is trait quantitative A h iest fosre eei rhtcue,rnigfo Me from introd ranging it architectures, and provide genetic traits, analysis observed of Our of basis diversity genetic yeast. the the in in relative levels patterns by hum empirical expression of broad encoded genetics of be the v in genetics should highly trends the selection qualitative with to strong loci predictions many or t theoretical by weak the architectur encoded either on be genetic pressures under should of selection selection evolution the moderate t the under on how for depends understood model it poorly population-genetic how remains or consider it evolve, these t traits, should Despite begun of diver blueprint. determinants have across Fisher’s genetic mapping phenotypes with QTL contrast of in sometimes numbers advances vast Recent underlying chitectures effects. additive small, ntecascve nrdcdb ..Fse,aqatttv ri sen is trait quantitative a Fisher, R.A. by introduced view classic the In 1 , 2 ohaB Plotkin B. Joshua , 1 priori a Abstract xettosaottegntcacietr neliga underlying architecture genetic the about expectations 1 tclise fifrne rmeprcldt,we data, empirical from inferences of issues stical aylc Or 05,woeadtv effects additive whose 2005), (Orr, loci many m eia plctos aesiuae tremen- stimulated have applications, medical d e Be n rgyk 05 oka,2012; Rockman, 2005; Kruglyak, and (Brem ied tadHne,2010). Hansen, and st ls eei rhtcue ftat aeimpor- have traits of architectures genetic ales, properties variational its influences 2006) en, T tde nodrt cuaeyifrthe infer accurately to order in studies QTL g d ta. 06,o e d isradElena, and Visser (de sex of 2006), al., et edo rbtos n osbycnrbt oother to contribute possibly and tributions, e niomna odtos(oe tal., et (Jones conditions environmental new o rmadKula,20;Rcmne al., et Rockman 2005; Kruglyak, and Brem ; tde aervae aytat htseem that traits many revealed have studies lha A114 USA 19104, PA elphia, cmnadKula,20;Mca tal., et Mackay 2006; Kruglyak, and ockman a(nee ta. 02 ln n Mackay, and Flint 2002; al., et (Ungerer xa yi aito o oeta etr,re- century, a than more for variation typic noe yarltvl ml ubrof number small relatively a by encoded e c,wa ye ftat hudw expect we should traits of types what nce, w edla inheritance. Mendelian own dla oFisherian. to ndelian etx,pouigosrain that observations producing taxa, se dvdal Be n rgyk 2005; Kruglyak, and (Brem ndividually ntat,adt ytmtcdt on data systematic to and traits, an s u oe rdcsta traits that predicts model Our es. csasnl rmwr htunifies that framework single a uces beeprcleot ompthe map to efforts empirical able egntcacietr fatrait a of architecture genetic he neouinr xlnto for explanation evolutionary an s lee otiuedrcl h trait the directly contribute alleles at eew eeo simple, a develop we Here rait. yfwlc.W opr these compare We loci. few ly ral ffcs hra traits whereas effects, ariable lcdt h eei ar- genetic the elucidate o oe ymn oiwith loci many by coded to be monogenic, and what traits should be highly polygenic? More generally, how does the genetic architecture underlying a trait evolve, and what features of a trait shape the evolution of its architecture? To address these questions we developed a mathematical model for the evolution of genetic architectures, and we compared its predictions to a large body of empirical data on quantitative traits.

Results and Discussion

Genetic architectures predicted by a population-genetic model Our approach to understanding the evolution of genetic architectures combines standard models from (Lande, 1976) with the Wright-Fisher model from population genetics (Ewens, 2004). In its simplest version, our model considers a continuous trait whose value, x, is influenced by L loci. Each locus i contributes additively an amount αi, so that the trait value is defined as the mean of the αi values across contributing loci. This trait definition means that a gene’s contribution to a trait is diluted when L is large, which prevents direct selection on gene copy numbers when have similar contributions (Proulx and Phillips, 2006; Proulx, 2012). We discuss this definition below, along with alternatives such as the sum. The fitness of an individual with trait value x is assumed Gaussian with mean 0 and standard deviation σf , so that smaller values of σf correspond to stronger stabilizing selection on the trait (Lande, 1976). Individuals in a population of size N replicate according to their relative fitnesses. Upon replication, an offspring may acquire a point that alters the direct effect of one locus, i, perturbing the value of αi for the offspring by a normal deviate; or the offspring may experience a duplication or a deletion in a contributing locus, which changes the number of loci L that control the trait value in that individual (see Methods). Point , duplications, and deletions occur at rates µ, rdup, rdel, which have comparable magnitudes in nature (table S1; Lynch et al., 2008; Watanabe et al., 2009; Lipinski et al., 2011; van Ommen, 2005). Finally, an offspring may also increase the number of loci that contribute to its trait value by recruitment – that is, by acquiring a recruitment mutation, with probability µ × rrec, in some gene that did not previously contribute to the trait value (see Methods). Over successive generations in our model, the genetic architecture underlying the trait – that is, how many loci contribute to the trait’s value, and the extent of their contributions – varies among the individuals in the population, and evolves. The genetic architectures that evolve in our model represent the complete genetic determinants of a trait, which may include – but do not correspond precisely to – the genetic loci that would be detected based on polymorphisms segregating in a sample of individuals in a QTL study. We discuss this important distinction below, when we compare the predictions of our model to empirical QTL data. We studied the evolution of genetic architectures in sets of 500 replicate populations, simulated by Monte Carlo, with different amounts of selection on the trait. We ran each of these simulations for 50 mil- lion generations, in order to model the extensive evolutionary divergence over which genetic architectures are assembled in nature. The form of the genetic architecture that evolves in our model depends critically on the strength of selection on the trait. In particular, we found a striking non-monotonic pattern: the equilibrium number of loci that influence a trait is greatest when the strength of selection on the trait is intermediate (Fig. 1). Moreover, the variability in the contributions of loci to the trait value (Fig. S1) and the effects of deleting or duplicating genes (Fig. S2) are also greatest for a trait under intermediate selection. In other words, our model predicts that traits under moderate selection will be encoded by many loci with highly divergent effects; whereas traits under strong or weak selection will be encoded by relatively few loci. We also studied how epistatic interactions among loci influence the evolution of genetic architecture. To incorporate the influence of locus j on the contribution of locus i we introduced parameters

2 Figure 1. The genetic architecture underlying a trait de- pends on the strength of selection on the trait, in a population- genetic model. Traits subject to intermediate selection (inter-

number of loci number mediate values of σf ) evolve genetic architectures with the greatest number of controlling loci. Dots denote the mean number of loci in the architecture underlying a trait, among 500 replicate Wright-Fisher simulations, for each value of the selection pressure σf . The rectangular areas represent the dis- tribution of the number of loci in the architecture. The neutral 1 5 10 15 20 25 30 35 40 expectations for the equilibrium number of loci (see Methods) −3.5 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 are represented as grey lines, when recruitment events are neu- tral (top line) or not (bottom line). Parameters are set to their (σ ) strong selection log10 f weak selection default values (table S2).

βji so that the trait value is now given by

L L 1 x = αi × fβ βji , (1) L    Xi=1 Xj=1 where fβ is a standard sigmoidal filter function (Azevedo et al., 2006, see Methods and Fig. S4). As with the direct effects of loci, the epistatic effects were allowed to mutate and vary within the population, and evolve. Although significant epistatic interactions emerge in the evolved populations (Fig. S3B), the presence of epistasis does not strongly affect the average number of loci that control a trait (Figs. S3A and S4). Epistasis is not required for the evolution of large L, nor does it change the shape of its dependence on the strength of selection.

Intuition for the results There is an intuitive explanation for the non-monotonic relationship between the selection pressure on a trait and the number of loci that control it. For a trait under weak selection (high σf ), changes in the trait value have little effect on fitness. Thus, even if deletions, recruitments and duplications change the trait value, these changes are nearly neutral (Fig. 2). As a result, the number of loci controlling the trait evolves to its neutral equilibrium, which is small because deletions are more frequent than duplications and recruitments (see Methods, Figs. 1 and S3). On the other hand, when selection on a trait is very strong (low σf ), few point mutations, and only those with small effects on the trait, will fix in the population. As a result, all loci have similar contributions to the trait value (Fig. 2 – row 1), and so duplications or deletions again have little effect on the trait or on fitness (Fig. 2 – rows 2 and 3). In this case, the equilibrium number of loci is given by the value expected when deletions and duplications, but not recruitments, are neutral (Figs. 1 and S3). Only when selection on a trait is moderate can variation in the contributions across loci accrue and impact the fixation of deletions and duplications (Fig. 2 – row 4), by a process called compensation: a slightly deleterious point mutation at one locus, which perturbs the trait value, segregates long enough to be compensated by point mutations at other loci (Rokyta et al., 2002; Meer et al., 2010; Kimura, 1985; Poon and Otto, 2000). Compensation increases the variance in the contributions among loci (Fig. 2, row 1), as has been observed for many phenotypes in plants and animals (Rieseberg et al., 1999). Finally, even though duplications and deletions are mildly deleterious in this regime, there is a bias favoring duplications over deletions (Fig. 2 – row 3). This bias arises because

3 duplications increase the number of loci in the architecture, which attenuates the effect of each locus on the trait (Fig. 2 – row 2). Thus when selection is moderate, duplications and recruitments fix more often than deletions and drive the number of contributing loci above its neutral expectation (Fig. 2 – rows 4 and 5). As the number of loci increases the bias is reduced (Fig. 2 – rows 4 and 5), and so L equilibrates at a predictable value (Figs. 1 and S3). Duplications and recruitments might also be slightly favored over deletions under intermediate selection, because architectures with more loci also have reduced genetic variation (Wagner et al., 1997). This effect – which would positively select for an increase in gene copy numbers – is likely weak in our model, as duplications and recruitments are deleterious on average under intermediate selection, only less so than deletions (Fig. 2 – rows 4 and 5).

Robustness of results to model assumptions The predictions of our model – notably, that the number of loci in a genetic architecture is greatest for traits under intermediate selection – are robust to choices of population-genetic parameters. The non- monotonic relation between selection pressure on a trait and the size of its genetic architecture, L, holds regardless of population size; but the location of maximum L is shifted towards weaker selection in larger populations (Fig. S5). This result is compatible with our explanation involving compensatory evolution: selection is more efficient in large populations, and so compensatory evolution occurs at smaller selection coefficients. Likewise, when the mutation rate is smaller the resulting equilibrium number of controlling loci is reduced (Fig. S6). This result is again compatible with the explanation of compensatory evolution, which requires frequent mutations. Increasing the rate of deletions relative to duplications also reduces the equilibrium number of loci in the genetic architecture, but our qualitative results are not affected even when rdel is twice as large as rdup (Fig. S7). Finally, increasing the rate of recruitment rrec (or the genome size) increases the number of loci contributing to all traits except those under very strong selection, as expected from Fig. 2. Our prediction that traits under intermediate selection are encoded by the richest genetic architectures is insensitive to changes in this parameter, and it holds even in the absence of recruitment (Fig. S8). Our analysis has relied on several quantitative-genetic assumptions, which can be relaxed. First, we assumed that all effects of locus i (i.e. αi and all βij and βji) are simultaneously perturbed by a point mutation. Relaxing this assumption, so that a subset of the effects are perturbed, does not change our results qualitatively (Fig. S9). Second, we assumed that point mutations have unbounded effects so that variation across loci can increase indefinitely. To relax this assumption we made mutations less perturbative to loci with large effects (see Methods). Even a strong mutation bias of this type led to very small changes in the equilibrium behavior (Fig. S10). Third, we assumed no metabolic cost of additional loci, even though additional genes in Saccharomyces cerevisiae are known to decrease fitness slightly (Wagner, 2005, 2007). Nonetheless, including a metabolic cost proportional to L does not alter our qualitative predictions (Fig. S11). Finally, we defined the trait value as the average of the contributions αi across loci, as opposed to their sum. This definition reflects the intuitive notion that a gene product’s contribution to a trait will generally depend on its abundance relative to all other contributing gene products. Moreover, this assumption that increasing the number of loci influencing a trait attenuates the effect of each one is supported by empirical data: changing a gene’s copy number is known to have milder phenotypic effects when the gene has many duplicates (Gu et al., 2003; Conant and Wagner, 2004). Nonetheless, alternative definitions of the trait value, which span from the sum to the average of contributions across loci, generically exhibit the same qualitative results (text S1 and Fig. S12). Although robust to model formulation and parameter values, our results do depend in part on initial conditions. When selection is strong, the initial genetic architecture can affect the evolutionary dynamics of the number of loci (Fig. S14). This occurs because the initial architecture may set dependencies among loci that prevent a reduction of their number. This result indicates that only those architectures of traits under very strong selection should depend on historical contingencies. We have also studied a multitrait version of our model, where genes participating in other traits can be recruited or lost through

4 L=1 L=2 L=5 L=8 L=12 L=30

indirect effects, Σj(βji) direct effects, αi variation across loci variation 0.00 0.06 0.12 0.18

deletions duplications recruitments effect on the trait effect 0.00 0.01 0.02 0.03 selection coef. 0.00 0.01 0.02 0.03 ) 9 − x10 ( 0 0.5 1 fixation rate more duplications o+o−o / recruitments more deletions

−3 −2 −1 0 1 −3 −2 −1 0 1 −3 −2 −1 0 1 −3 −2 −1 0 1 −3 −2 −1 0 1 −3 −2 −1 0 1

selection pressure, log10(σf)

Figure 2. The consequences of gene duplications, recruitments and deletions in a population-genetic model. Populations were initially evolved with a fixed number of controlling loci L (line 1), and we then measured the effects of recruitments, deletions and duplications on the trait value (line 2) and on fitness (line 3). From the latter, we calculated the rate at which deletions, recruitment and duplications enter and fix in the population (line 4), and the resulting rate of change in the number of loci contributing to the trait (line 5). Line 1: For L > 1, the variation in direct effects (αi) and indirect effects among controlling loci (Pj (βji)) increases as selection on the trait is relaxed. Line 2: As a consequence of this variation among loci, the average change in the trait value following a duplication or a deletion also increases as selection on the trait is relaxed. Line 3: Changes in the trait value are not directly proportional to fitness costs, because the same change in x has milder fitness consequences when selection is weaker (larger σf ). As a result, the average fitness detriment of duplications and deletions is highest for traits under intermediate selection. Line 4: Consequently, the fixation rates of duplications and deletions are smallest under intermediate selection. Line 5: The equilibrium number of loci controlling a trait under a given strength of selection is determined by that value of L for which duplications and recruitments on one side, and deletions on −1.5 the other, enter and fix in the population at the same rate. For example, when σf = 10 these rates are equal when L is close to 12 (black arrow), so that the equilibrium genetic architecture contains ≈ 12 loci on average (compare Fig. S3 black arrow).

5 mutation. Even though this model features , and the effects of recruitments evolve neutrally, our qualitative results remained unaffected (text S3 and Fig. S15).

The dynamics of copy number Previous models related to genetic architecture have been used to study the evolutionary fate of gene duplicates. These models typically assume that a gene has several sub-functions, which can be gained (neo-functionalization; Ohno, 1970) or lost (sub-functionalization; Force et al., 1999; Lynch and Force, 2000) in one of two copies of a gene. Such “fate-determining mutations” (Innan and Kondrashov, 2010) stabilize the two copies, as they make subsequent deletions deleterious. Such models complement our approach, by providing insight into the evolution of discrete, as opposed to continuous or quantitative, phenotypes. Yet there are several qualitative differences between our analysis and previous studies of gene duplication. Most important, our model considers the dynamics of both duplications and deletions, in the presence of point mutations that perturb the contributions of loci to a trait. This co-incidence of timescales is important in the light of empirical data (Lynch et al., 2008; Watanabe et al., 2009; Lipinski et al., 2011; van Ommen, 2005) showing that changes in copy numbers occur at similar rates as point mutations (table S1). Under these circumstances, a gene may be deleted or acquire a loss-of-function mutation before a new function is gained or lost. Our model includes these realistic rates, and accordingly we find that duplicates are very rarely stabilized by subsequent point mutations. Instead, the number of loci in a genetic architecture may increase, in our model, because compensatory point mutations introduce a bias towards the fixation of duplications as opposed to deletions.

Comparison to empirical eQTL data Like most evolutionary models, our analysis greatly simplifies the mechanistic details of how specific traits influence fitness in specific organisms. As a result, our analysis explains only the broadest, qualitative features of how genetic architectures vary among phenotypic traits, leaving a large amount of variation unexplained. This remaining variation may be partly random (as predicted by the distributions of the number of evolving loci, see e.g. Fig. 1), and partly due to ecological and developmental details that our model neglects. Due to this variation, a quantitative comparison between our model and empirical data would require information about the genetic architectures for at least hundreds of traits (see below, for our analysis of expression QTLs). Nevertheless, the qualitative, non-monotonic predictions of our model (Fig. 1) may help to explain some well-known trends in the genetics of human traits. For instance, in accordance with our predictions, human traits under moderate selection, such as stature or susceptibility to mid- life diseases like diabetes, cancer, or heart-disease, are typically complex and highly polygenic; whereas traits under very strong selection, such as those (e.g. mucus composition or blood clotting) affected by childhood-lethal disease like Cystic fibrosis or Haemophilias are often Mendelian; and so too traits under very weak selection (such as handedness, bitter taste, or hitchhiker’s thumb) are often Mendelian. Our analysis provides an evolutionary explanation for these differences, and it delineates the selective conditions under which we may expect a Mendelian, as opposed to Fisherian, architecture. We tested our evolutionary model of genetic architectures by comparison with empirical data on a large number of traits. Such a comparison must, of course, account for the fact that our model describes the true genetic architecture underlying a trait, whereas any QTL study has limited power and describes only the associations detected from polymorphisms segregrating in a particular sample of individuals. Accounting for this discrepancy (see below), we compared our model to data from the study of Brem et al. (2005), who measured mRNA expression levels and genetic markers in 112 recombinant strains produced from two divergent lines of S. cerevisiae. For each yeast transcript we computed the number of non-contiguous markers associated with transcript level, at a given false discovery rate (see Methods). We also calculated the codon adaptation index (CAI) of each transcript – an index that correlates with the gene’s wildtype expression level and with its overall importance to cellular fitness (Sharp and Li, 1987). We found a

6 striking, non-monotonic relationship between the CAI of a transcript and the number of loci linked to variation in its abundance (Fig. 3A). Thus, assuming that CAI correlates with the strength of selection on a transcript, Brem et al. (2005) detected more loci regulating yeast transcripts under intermediate selection than transcripts under either strong or weak selection. We compared the empirical data on yeast eQTLs (Fig. 3A) to the predictions of our evolutionary model. In order to make this comparison, we first evolved genetic architectures for traits under various amounts of selection (Fig. S3), and for each architecture we then simulated a QTL study of the exact same type and power as the yeast eQTL study: that is, we generated 112 crosses from two divergent lines using the yeast genetic map (text S2). As expected, the simulated QTL studies based these 112 segregants detected many fewer loci linked to a trait than in fact contribute to the trait in the true, underlying genetic architecture (Fig. 3B versus Fig. 1). This result is consistent with previous interpretations of empirical eQTL studies (Brem and Kruglyak, 2005). The simulated QTL studies revealed another important bias: a locus that contributes to a trait under weak selection is more likely to be correctly identified in a QTL study than a locus that contributes to a trait under strong selection (Fig. S16). Furthermore, our simulations demonstrate that the number of associations detected in such a QTL study depends on the divergence time between the parental strains used to generate recombinant lines (Fig. S17). Finally, traits under weaker selection may be more prone to measurement noise, which we also simulated (Fig. S18). Despite these detection biases, which we have quantified, the relationship between the selection pressure on a trait and the number of detected QTLs in our model (Fig. 3B and Figs. S18 and S19) agrees with the relationship observed in the yeast eQTL data (Fig. 3A). Importantly, both of these relationships exhibit the same qualitative trend: traits under intermediate selection are encoded by the richest genetic architectures.

Conclusion

Many interesting developments lie ahead. Our model is far too simple to account for tissue- and time- specific gene expression, , context-dependent effects, etc (Mackay et al., 2009; Ala-Korpela et al., 2011). How these complexities will change predictions for the evolution of genetic architectures remains an open question. Nonetheless, our analysis shows that it is possible to study the evolution of genetic architecture from first principles, to form a priori expectations for the architectures underlying different traits, and to reconcile these theories with the expanding body of QTL studies on molecular, cellular, and organismal phenotypes.

Methods

Model We described the evolution of genetic architectures using the Wright-Fisher model of a replicating popu- lation of size N, in which haploid individuals are chosen to reproduce each generation according to their relative fitnesses. The fitness of an individual with L loci encoding trait value x is

ωk = G(x, 0,σf ) × (1 − L × c) (2) where G denotes the density at x of a Gaussian distribution with mean 0 and standard deviation σf , and the second term denotes the metabolic cost of harboring L loci, which depends on a parameter c. The trait value of such an individual, given the direct contributions αi and epistatic terms βji is described by Eq. (1) where 2 fβ(y)= − (3) 1+ e sβy is a sigmoidal curve, so that the epistatic interactions either diminish or augment the direct contribution of locus i depending on whether j βji is positive or negative (Fig. S4). In general, loci do not influence P 7 Experiment Simulations

A B 0 1 2 3 4 5 6 7 8 9 number of detected loci (experiment) number number of detected loci (simulations) number 0 1 2 3 4 5 6 7 8 9

0.0 −0.6 −1.2 −1.8 −2.4 −3.5 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 log (σ ) log(CAI) strong selection 10 f weak selection

Figure 3. The number of genetic loci controlling a trait inferred from real S. cerevisiae populations (panel A) and from simulated populations (B) has a non-monotonic relationship with the strength of selection on the trait. A: In the yeast data of Brem Brem et al. (2005), the largest number of eQTLs were detected for those transcripts (i.e, traits) under intermediate levels of selection (intermediate CAI), whereas fewer eQTLs were detected for transcripts under either weak or strong selection. Transcripts were binned according to their log CAI values. Squares represent the distribution of the number of one-way eQTLs identified from the study of Brem et al. (2005), for traits within each bin of CAI. Greyscale indicate the number of transcripts in each bin (darker means more data). Mean numbers of detected eQTLs are represented by circles. B: For the simulated experiment, we evolved 100 populations of genetic architectures, using the parameters corresponding to Fig. S3. From each such population, we then evolved two lines independently for 25, 000 generations in the absence of deletions, duplications and recruitment, to mimic the divergent strains used in the yeast cross of Brem et al. (2005). From these two divergent genotypes we then created 112 recombinant lines following the genetic map from Brem et al. (2005). We then analyzed the resulting simulated data with R/qtl in the same way as we had analyzed the yeast data (text S2). The distribution of QTLs detected and their means are represented as in Fig. 1, for each value of selection strength σf .

themselves (βii ≡ 0) and, in the model without epistasis, all βji ≡ 0 and fβ ≡ 1. If an individual chosen to reproduce experiences a duplication at locus i then the new duplicate, labelled k, inherits its direct effect (αk = αi) and all interaction terms (βkj = βij and βjk = βji for all j =6 i, k), with the interaction terms βik and βki initially set to zero. Recruitment occurs with probability rrec per mutation of one of the 6, 000 genes not contributing to the trait. The initial direct contribution αi of recruited locus i is drawn from a normal distribution with mean zero and standard deviation σm; its interaction terms with other loci (k), βik and βki, are initially set to zero. Note that this assumption is relaxed in the multilocus version of our model, where the direct and indirect effects of recruitments evolve neutrally (text S3 and Fig. S15). In general a point mutation at locus i changes its contribution to the trait, αi, and all its epistatic interactions, βij and βji, each by an independent amount drawn from a normal distribution with mean zero and standard deviation σm. The normal distribution satisfies the assumptions that small mutations are more frequent than large ones (Orr, 1999; Eyre-Walker and Keightley, 2007), and that there is no mutation pressure on the trait (Lande, 1976). We relaxed the former assumption by drawing mutational effects from a uniform distribution without qualitative changes to our results (Fig. S13). In order to relax the latter assumption we included a bias towards smaller mutations in loci with large effects, so that the mean effect of a mutation at locus i now equals −bα × αi and −bβ × βij, respectively for αi and βij (Rajon and Masel, 2011). We also considered a model in which a mutation at locus i affects only a proportion pem of the values αi, βij, and βji. By default, simulations were initialized with L = 1 and α1 = 0; alternative initial conditions were also studied, as shown in Fig. S14.

8 Markov chain for neutral changes in copy number When deletions and duplications are neutral, and recruitments strongly deleterious, the evolution of the number of loci L in the genetic architecture is described by a Markov-chain on the positive integers. The probability of a transition from L = i to L = i + 1 equals rdup × i, and that of a transition from i to i − 1 is rdel × i. We disallow transitions to L = 0, assuming that some regulation of the trait is required. We obtained the stationary distribution of L by setting the density of d1 of individuals in stage 1 to 1 and calculating the density di of individuals in the following stages as

rdup × (i − 1) di = di−1 (4) rdel × i The equilibrium probability of being in state i was calculated as

di pi = ∞ (5) i=1 di P ∞ −6 −6 and the expected value of L was calculated as i=1 pi × i. With rdup = 10 and rdel = 1.25 × 10 , we found an equilibrium expected L of 2.485. P When deletions, duplications and recruitments are all neutral, equation (4) can be replaced by:

rdup × (i − 1) + 6000 × µ × rrec di = di−1 (6) rdel × i This equation illustrates the fact that the rates of deletions (which include loss of function mutations) and duplication depend on the number of loci in the architecture, whereas the rate of recruitments does −6 −5 not. With µ = 3 × 10 and rrec = 5 × 10 , we found an equilibrium expected L of 4.705.

Calculation of s and pfix We first evolved populations to equilibrium with a fixed number of controlling loci L, and we then measured the effects of deletions, duplications or recruitments introduced randomly into the population. We simulated the evolution of the genetic architecture with L fixed in 500 replicate populations, over 8 × 106 generations for deletions and 10 × 106 generations for duplications, reflecting the unequal waiting time before the two kinds of events. We used 10×106 generations for recruitment as well, although different durations did not affect our results. For each genotype k in each evolved population, we calculated the fitness ωk(i) of mutants with locus i deleted or duplicated. We calculated the corresponding selection coefficients as: ω (i) s (i)= k − 1 (7) k <ω> where <ω> denotes mean fitness in the population. We calculated s as the mean across loci and geno- types of sk(i), weighted by the number of individuals with each genotype. We calculated the probability of fixation of a duplication, deletion or recruitment as

− 1 − e 2sk(i) pfix(sk(i)) = − , (8) 1 − e 2Nsk(i) and obtained the mean pfix using the same method as for s. Rates of deletions and duplications fixing were calculated per locus (Fig. 2) as rdel or rdup times pfix. The total probability of a duplication or a deletion entering the population and fixing is, of course, also multiplied by L. However, recruitment rates remain constant as L changes. Therefore, we divided the rate of recruitments by L in Fig. 2, for comparison to the per-locus duplication and deletion rates.

9 Number of loci influencing yeast transcript abundance We used the R/qtl (Broman et al., 2003; R Development Core Team, 2011) package to calculate LOD scores for a set of 1226 observed markers and 3223 uniformly distributed pseudomarkers separated by 2 cM, by Haley-Knott regression. We calculated the LOD significance threshold for a false discovery rate (FDR) of 0.2 as the corresponding quantile in the distribution of the maximum LOD after 500 permutations (a FDR of 0.01 and a fixed LOD threshold of 3 produced qualitatively similar results). The number of detected loci linked to the expression of a transcript was calculated as the number of non-consecutive genomic regions with a LOD score above the threshold. We downloaded S. cerevisiae coding sequences from the Ensembl database (EF3 release), and calculated CAI values with the seqinr (Charif and Lobry, 2007) package, using codon weights from a set of 134 ribosomal genes.

Literature Cited

Ala-Korpela, M., A. J. Kangas, and I. Michaek, 2011. Genome-wide association studies and systems biology: together at last. Trends Genet. .

Azevedo, R. B. R., R. Lohaus, S. Srinivasan, K. K. Dang, and C. L. Burch, 2006. Sexual reproduction selects for and negative epistasis in artificial gene networks. Nature 440:87–90.

Brem, R. B. and L. Kruglyak, 2005. The landscape of genetic complexity across 5,700 gene expression traits in yeast. Proc. Natl. Acad. Sci. U. S. A. 102:1572–1577.

Brem, R. B., J. D. Storey, J. Whittle, and L. Kruglyak, 2005. Genetic interactions between polymorphisms that affect gene expression in yeast. Nature 436:701–3.

Broman, K. W., H. Wu, S. Sen, and G. A. Churchill, 2003. R/qtl: QTL mapping in experimental crosses. Bioinformatics 19:889–890.

Carlborg, O., L. Jacobsson, P. Ahgren, P. Siegel, and L. Andersson, 2006. Epistasis and the release of genetic variation during long-term selection. Nat. Genet. 38:418–420.

Carter, A. J. R., J. Hermisson, and T. F. Hansen, 2005. The role of epistatic gene interactions in the response to selection and the evolution of . Theor Popul Biol 68:179–96.

Charif, D. and J. R. Lobry, 2007. SeqinR 1.0-2: a contributed package to the R project for statistical computing devoted to biological sequences retrieval and analysis. Pp. 207–232, in U. Bastolla, M. Porto, H. Roman, and M. Vendruscolo, eds. Structural approaches to sequence evolution: Molecules, networks, populations, Biological and Medical Physics, Biomedical Engineering. Springer Verlag, New York. ISBN : 978-3-540-35305-8.

Conant, G. C. and A. Wagner, 2004. Duplicate genes and robustness to transient gene knock-downs in caenorhabditis elegans. Proc. Biol. Sci. 271:89–96.

Ehrenreich, I. M., J. Bloom, N. Torabi, X. Wang, Y. Jia, and L. Kruglyak, 2012. Genetic architecture of highly complex chemical resistance traits across four yeast strains. PLoS Genet 8:e1002570.

Emilsson, V., G. Thorleifsson, B. Zhang, A. S. Leonardson, F. Zink, J. Zhu, S. Carlson, A. Helgason, G. B. Walters, S. Gunnarsdottir, M. Mouy, V. Steinthorsdottir, G. H. Eiriksdottir, G. Bjornsdottir, I. Reynisdottir, D. Gudbjartsson, A. Helgadottir, A. Jonasdottir, A. Jonasdottir, U. Styrkarsdottir, S. Gretarsdottir, K. P. Magnusson, H. Stefansson, R. Fossdal, K. Kristjansson, H. G. Gislason, T. Ste- fansson, B. G. Leifsson, U. Thorsteinsdottir, J. R. Lamb, J. R. Gulcher, M. L. Reitman, A. Kong, E. E. Schadt, and K. Stefansson, 2008. Genetics of gene expression and its effect on disease. Nature 452:423–8.

10 Ewens, W., 2004. Mathematical population genetics: theoretical introduction. Springer Verlag.

Eyre-Walker, A. and P. D. Keightley, 2007. The distribution of fitness effects of new mutations. Nat. Rev. Genet. 8:610–618.

Fierst, J. L. and T. F. Hansen, 2010. Genetic architecture and postzygotic reproductive isolation: evolu- tion of bateson-dobzhansky-muller incompatibilities in a polygenic model. Evolution 64:675–93.

Flint, J. and T. F. C. Mackay, 2009. Genetic architecture of quantitative traits in mice, flies, and humans. Genome Res. 19:723.

Force, A., M. Lynch, F. B. Pickett, A. Amores, Y. Yan, and J. Postlethwait, 1999. Preservation of duplicate genes by complementary, degenerative mutations. Genetics 151:1531–1545.

Gu, Z., L. M. Steinmetz, X. Gu, C. Scharfe, R. W. Davis, and W.-H. Li, 2003. Role of duplicate genes in genetic robustness against null mutations. Nature 421:63–66.

Hansen, T. F., 2006. The evolution of genetic architecture. Annu. Rev. Ecol. Evol. Syst. 37:123–157.

Innan, H. and F. Kondrashov, 2010. The evolution of gene duplications: classifying and distinguishing between models. Nat Rev Genet 11:97–108.

Jones, A., S. Arnold, and R. B¨urger, 2004. Evolution and stability of the g-matrix on a landscape with a moving optimum. Evolution 58:1639–1654.

Kimura, M., 1985. The role of compensatory neutral mutations in molecular evolution. J. Genet. 64:7–19.

Kroymann, J. and T. Mitchell-Olds, 2005. Epistasis and balanced polymorphism influencing complex trait variation. Nature 435:95–98.

Lande, R., 1976. The maintenance of genetic variability by mutation in a polygenic character with linked loci. Genet. Res. 26:221–235.

Lipinski, K. J., J. C. Farslow, K. A. Fitzpatrick, M. Lynch, V. Katju, and U. Bergthorsson, 2011. High spontaneous rate of gene duplication in caenorhabditis elegans. Curr Biol 21:306–10.

Lynch, M. and A. Force, 2000. The probability of duplicate gene preservation by subfunctionalization. Genetics 154:459–473.

Lynch, M., W. Sung, K. Morris, N. Coffey, C. R. Landry, E. B. Dopman, W. J. Dickinson, K. Okamoto, S. Kulkarni, D. L. Hartl, and T. W. K., 2008. A genome-wide view of the spectrum of spontaneous mutations in yeast. Proc. Natl. Acad. Sci. U. S. A. 105:9272–9277.

Mackay, T. F. C., E. A. Stone, and J. F. Ayroles, 2009. The genetics of quantitative traits: challenges and prospects. Nat. Rev. Genet. 10:565–77.

Manolio, T. A., F. S. Collins, N. J. Cox, D. B. Goldstein, L. A. Hindorff, D. J. Hunter, M. I. McCarthy, E. M. Ramos, L. R. Cardon, A. Chakravarti, J. H. Cho, A. E. Guttmacher, A. Kong, L. Kruglyak, E. Mardis, C. N. Rotimi, M. Slatkin, D. Valle, A. S. Whittemore, M. Boehnke, A. G. Clark, E. E. Eichler, G. Gibson, J. L. Haines, T. F. C. Mackay, S. A. McCarroll, and P. M. Visscher, 2009. Finding the missing of complex diseases. Nature 461:747–53.

Meer, M. V., A. S. Kondrashov, Y. Artzy-Randrup, and F. A. Kondrashov, 2010. Compensatory evolution in mitochondrial tRNAs navigates valleys of low fitness. Nature 464:279–283.

Ohno, S., 1970. Evolution by gene duplication. London: George Alien & Unwin Ltd. Berlin, Heidelberg and New York: Springer-Verlag.

11 van Ommen, G.-J. B., 2005. Frequency of new copy number variation in humans. Nat. Genet. 37:333–334. Orr, H. A., 1999. The evolutionary genetics of adaptation: a simulation study. Genet. Res. 74:207–14. ———, 2005. The genetic theory of adaptation: a brief history. Nat. Rev. Genet. 6:119–27. Poon, A. and S. Otto, 2000. Compensating for our load of mutations: freezing the meltdown of small populations. Evolution 54:1467–1479. Proulx, S. R., 2012. Multiple routes to subfunctionalization and gene duplicate specialization. Genetics 190:737–51. Proulx, S. R. and P. C. Phillips, 2006. Allelic divergence precedes and promotes gene duplication. Evo- lution 60:881–892. R Development Core Team, 2011. R: A Language and Environment for Statistical Computing. R Foun- dation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0. Rajon, E. and J. Masel, 2011. Evolution of molecular error rates and the consequences for evolvability. Proc. Natl. Acad. Sci. USA 108:1082–1087. Rieseberg, L. H., M. A. Archer, and R. K. Wayne, 1999. Transgressive segregation, adaptation and speciation. Heredity 83:363–372. Rockman, M. V., 2012. The QTN program and the alleles that matter for evolution: all that’s gold does not glitter. Evolution 66:1–17. Rockman, M. V. and L. Kruglyak, 2006. Genetics of global gene expression. Nat. Rev. Genet. 7:862–72. Rockman, M. V., S. S. Skrovanek, and L. Kruglyak, 2010. Selection at linked sites shapes heritable phenotypic variation in c. elegans. Science 330:372–6. Rokyta, D., M. R. Badgett, I. J. Molineux, and J. J. Bull, 2002. Experimental genomic evolution: extensive compensation for loss of DNA ligase activity in a virus. Mol. Biol. Evol. 19:230. Sharp, P. and W. Li, 1987. The codon adaptation index-a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res. 15:1281. Ungerer, M. C., S. S. Halldorsdottir, J. L. Modliszewski, T. F. C. Mackay, and M. D. Purugganan, 2002. Quantitative trait loci for inflorescence development in Arabidopsis thaliana. Genetics 160:1133–1151. Visscher, P. M., 2008. Sizing up human height variation. Nat. Genet. 40:489–490. de Visser, J. A. G. M. and S. F. Elena, 2007. The evolution of sex: empirical insights into the roles of epistasis and drift. Nat Rev Genet 8:139–149. Wagner, A., 2005. Energy constraints on the evolution of gene expression. Mol. Biol. Evol. 22:1365–74. ———, 2007. Energy costs constrain the evolution of gene expression. J. Exp. Zool. 308B:322–4. Wagner, G., G. Booth, and H. Bagheri-Cahichian, 1997. A population genetic theory of canalization. Evolution 51:329–347. Watanabe, Y., A. Takahashi, M. Itoh, and T. Takano-Shimizu, 2009. Molecular spectrum of spontaneous de novo mutations in male and female germline cells of . Genetics 181:1035–43. Yang, J., B. Benyamin, B. P. McEvoy, S. Gordon, A. K. Henders, D. R. Nyholt, P. A. Madden, A. C. Heath, N. G. Martin, G. W. Montgomery, M. E. Goddard, and P. M. Visscher, 2010. Common SNPs explain a large proportion of the heritability for human height. Nat Genet 42:565–9.

12 Supplementary material

Text S1: Alternative definitions of the trait

As described in the main text, the non-monotonic relationship between the strength of selection on a trait and the number of loci in its underlying genetic architecture depends on the unequal fitness consequences of deletions and duplications. This behavior should therefore be absent when the trait value x is the sum, rather than the average, of the contributions across loci. To explore this issue, we generalized our definition of the trait by introducing an additional parameter ǫ:

L L 1 x = αi × fβ βji . (S9) 1+ ǫ(L − 1)    Xi=1 Xj=1

As the parameter ǫ ranges from 0 to 1 the trait definition ranges from a sum to an average. As expected when ǫ = 0, the number of loci in the equilibrium genetic architecture is greatly reduced under intermediate selection as compared to the results in the main text (Fig. S15). Interestingly, the mean number of loci also shows a non-monotonic trend in this situation, with a small peak at log10(σf ) = −0.5. This trend is likely driven by the fixation rate of recruitment mutations (as seen in Fig. 2). This explains why it is much less pronounced than those in Figs. 1 and S3-A, which involve a higher fixation rate of duplications over deletions. For any other value of ǫ < 1, we found a non-monotonic relationship similar to the one reported in the main text. Thus, our qualitative results hold for all models provided the trait is not defined strictly as the sum of contributions across loci. When the trait value equals the sum of the contributions of all locus (ǫ = 0), the effect of a gene deletion, knock-out or knock-down is independent of the number of copies of the gene. Conversely when the trait value is the mean of the contributions (ǫ = 1), or is some function between the mean and the sum (0 <ǫ< 1), the effect of a deletion decreases with the number of loci in the genetic architecture. As shown by Conant and Wagner (2004) in C.elegans, the number of detectable knock-down phenotypes decreases with the number of copies of genes in a gene family, suggesting that ǫ does indeed exceed 0 in this species. A similar stronger effect of the deletion of a singleton compared to that of a duplicate has also been observed in S. cerevisiae (Gu et al., 2003).

Text S2: QTL detection in simulated populations

We analyzed the genetic architectures that evolved under our population-genetic model using a simulated QTL study of the exact same type and power as the yeast eQTL study (Brem et al., 2005). Specifically, 100 evolved populations were taken from simulations with parameters corresponding to Fig. S2 for the model with epistasis. From each population, we evolved two lines independently for T generations in the absence of deletions, duplications and recruitment. We then used the most abundant genotype from each line to create parental strains, mimicking the diverged BY and RM parental strains in Brem et al. (2005). A few populations were polymorphic for the number of loci initially, sometimes resulting in two lines with different values of L, which we discarded. In each parent, we assigned the L contributing loci randomly among 1226 simulated marker sites, and also assigned their associated αi values and the interactions βij between loci. We constructed 112 recombinant haploid offspring by mating these two parents according to the genetic map inferred from Brem et al. Each offspring inherited each αi value, and the set of interactions towards other loci (βij ∀j), from either one or the other parent. The trait value in each offspring was calculated as in eq. (1) and then was perturbed by adding a small amount of noise (normally distributed with mean 0 and standard deviation σn), to simulate measurement noise. We then analyzed these artificial genotype and data

13 following the same protocol we used for the real yeast eQTLs data (i.e. using Rqtl). We repeated this entire process with 100 different pairs of parents for each value of σf . Fig. S18 shows the relationship between the selection pressure σf and the number of linked loci detected in this simulated QTL study, for different divergence times between the two lines and different values of σn. In Fig. S19, we increased σn proportionally to log10(σf ), from 0.0001 to 0.001. We also calculated the probability that a locus known to influence the trait in the true architecture (Fig. S3) is in fact detected in the QTL study. This probability is plotted as a function of σf for different values of the noise σn (Fig. S16) and of the time of divergence (Fig. S17).

Text S3: Multitrait model

We simulated the evolution of the genetic architecture underlying multiple traits with a model slightly modified from the single-trait version. In this model, the phenotype consists of 10 traits, each trait k under a different selection pressure σf (k) (the values of σf (k) are those used in independent simulations of the single-locus model; see the x-axis of Fig. S15). In the multiple traits version, L denotes the total number of loci forming the architecture of the 10 traits. L can change when loci are duplicated at rate rdup and deleted at rate rdel. Each locus participates to a set of traits. The direct effect of locus i on trait t is now denoted αit and the indirect effect of locus i on the part of locus j that contributes to t is denoted βijt. To allow for partial gains and losses of function, we define two new matrices A and B, which have the same dimensions as α and β. The functions corresponding to αit and βijt are ‘on’ when Ait = 1 or Bijt = 1, respectively, and are ‘off’ otherwise. Similarly to eq. (1) in our single-trait model, we calculate the trait value t as: L L L xt = Aitαit × fβ Bjitβjit / Ait (S10)    Xi=1 Xj=1 Xi=1 where fβ is the sigmoidal function defined in eq (3). Point mutations of locus i alter all αit and βijt by a normal deviate. Moreover, a mutation can change Ait and Bijt to 0 with probability 0.1 and to 1 with probability 0.005. Over successive generations, the genetic architecture underlying each trait evolves through gene deletions and duplications, and through recruitments and losses of new functions. In this model, only the L genes in the simulated architecture can be recruited – i.e. we do not assume a fixed number of genes that can be recruited at any time. Therefore, the phenotypic effects of recruitment evolve during our simulation, instead of being sampled from a given distribution. If i Ait = 0 for any trait t, the individual is considered non-viable and fitness ωk equals 0. Otherwise, fitnessP is the product of Gaussian functions for each trait times the cost associated to the number of loci, as follows: 10 ωk = G(xk, 0,σf (k)) × (1 − L × c) (S11) kY=1 We simulated the evolution of the genetic architecture through a Wright Fisher process, with population genetics parameters identical to the default values in table S2, except c = 10−4.5 (Wagner, 2005, 2007)). The results of 200 simulations are represented in Fig. S15.

Additional reference

Xu, L, et al, 2006. Average gene length is highly conserved in prokaryotes and eukaryotes and diverges only between the two kingdoms. Mol Biol Evol 23:1107–8.

14 Table S1. Estimates of rates of mutations µ, gene duplications rdup and deletions rdel. All rates are per gene per generation. µ is the rate of non-silent mutations (Lynch et al., 2008) (0.75× the per-nucleotide mutation rate). When the mutation rate was given per nucleotide, we multiplied it by the average gene length in Eukaryotes (Xu et al., 2006) (1346 bp). For D. melanogaster (Watanabe et al., 2009), the rate of detectable mutations was used, after correcting for the length of the 3 loci in the study. The scale of analysis can be the whole genome (WG), or a specific set of loci, in which case the number of loci is denoted in the table.

Species µ rdup rdel Scale Refs S. cerevisiae 3.33 × 10−7 3.4 × 10−6 2.1 × 10−6 WG (Lynch et al., 2008) D. melanogaster 9.18 × 10−7 4 × 10−7 4 × 10−7 3 (Watanabe et al., 2009) C. elegans 2.02 × 10−6 1.25 × 10−7 1.36 × 10−7 WG (Lipinski et al., 2011) − − − H. sapiens 1.5 × 10 5 10 5 6.67 × 10 5 1 (van Ommen, 2005)

Table S2. Definition of parameters, their default values, and range of values examined in the corresponding figures.

Parameter name Definition Default value Values used Fig.

sβ Slope of fβ() 4 {1, 4, 16} S7 N Population size 1000 {100, 1000, 10000} S8 − − µ Mutation rate 3 × 10 6 {0.33, 1, 3, 6}× 10 6 S9 −6 rdup Duplication rate - 10 - −6 −6 rdel Deletion rate 1.25 × 10 {1.25, 1.5, 2}× 10 S10 −5 −5 rrec Probability of recruitment after the 5 × 10 {2.5, 5, 10}× 10 S11 mutation of a non-contributing lo- cus −3.5 σf SD of the fitness function [10 − 10] - All

σm SD of mutation effect function 0.01 - -

pem Probability that a subfunction is 1 {0.25, 0.5, 1} S12 changed by a mutation

bα Mutation bias on α 0 {0, 0.2, 0.4} S13

bβ Mutation bias on β 0 {0, 0.2, 0.4} S13 −4.5 −3.5 cL Metabolic cost of L loci 0 {0, 10 , 10 } S14

15 iueS4. Figure oe ihu psai.A risudritreit sele intermediate under Traits A: epistasis. without model e oterdfutvle tbeS) :Ti ieec acr difference This B: mean S2). the (table reduce values strongly default to 1) their to Fig. set in goe variance distributions this the va but (see selection, loci, weak multiple Under mu includes mutations. because architecture other low by is compensated loci be across variance the selection, strong ntedsrbto of distribution the in ekslcinaedmntdb oiwt ml niiule individual small with contributions. loci diverse by dominated are selection weak A variation across loci h itiuino ieteet flc ntetrait, the on loci of effects direct of distribution The 0.00 0.02 0.04 0.06 0.08 strong selection 35− 25− 15− 050051 0.5 0 −0.5 −1 −1.5 −2 −2.5 −3 −3.5 α i h eei rhtcue ftat ne togselectio strong under traits of architectures genetic The . log 10 ( σ f ) weak selection o0we h ubro oirahs1 hsocr fe enou often occurs This 1. reaches loci of number the when 0 to s to itreit ausof values (intermediate ction ain hnigtetatvleaeeiiae hrl and shortly eliminated are value trait the changing tations et.Tat ne nemdaeslcinrl nlc wit loci on rely selection intermediate under Traits ffects. 16 s risudrvrossrntso eeto sas appa also is selection of strengths various under traits oss aineo h hntpceet coslc.Prmtr a Parameters loci. across effects phenotypic the of variance inecnices hog opnaoyeouinwe th when evolution compensatory through increase can riance B αi α i

eed ntesrnt fslcino h ri,i the in trait, the on selection of strength the on depends , −0.50 −0.25 0.00 0.25 0.50 strong selection 35− 25− 15− 050051 0.5 0 −0.5 −1 −1.5 −2 −2.5 −3 −3.5 ,adt esretn ftat under traits of extent lesser a to and n, σ f log aemr aibeeet.Under effects. variable more have ) 10 ( σ f ) weak selection more h cannot rent gh re e sas o-oooi ucino h tegho selectio with of continuously strength decreases the of fitness function trait on the non-monotonic on a selection also of is strength the of function monotonic uain fec yeadcluae h enaslt effec absolute mean the calculated and type each of mutations iueS5. Figure h ffc fpitmttos ulctos eein n r and deletions duplications, mutations, point of effect The

negative selection coefficient deviation of the trait value 0.00 0.04 0.08 0.12 0.16 0.20 0.000 0.002 0.004 0.006 0.008 σ f strong selection 35−. 15−. . 1 0.5 0 −0.5 −1.5 −2.5 −3.5 o ahidvda nteeovdppltoso i.S3-A Fig. of populations evolved the in individual each For . h ffc fdltosaddpiain nfins bto p (bottom fitness on duplications and deletions of effect The . ntetat u h ffc fpitmttosadrecruitme and mutations point of effect the but trait, the on n 17 so h ri n nfitness. on and trait the on ts log 10 ( σ f ) cutet ntetatvle(o ae)i non- a is panel) (top value trait the on ecruitments deletions duplications recruitments point mutations weak selection eitoue 20 introduced we , anel) nts A number of loci / trait number 1 5 10 15 20 25 30 35 40

B

Σj(βji)

αi variation across loci variation 0.0 0.1 0.2 0.3 −3.5 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 (σ ) strong selection log10 f weak selection

Figure S6. The genetic architecture underlying a trait depends on the strength of selection on the trait, in the presence of epistasis. A: Traits under intermediate selection (intermediate values of σf ) evolve genetic architectures with the greatest number of controlling loci. The rectangle areas are proportional to the number of Wright-Fisher simulations (among 500 per value of σf ) in which the number of loci on the y-axis evolved. Dots denote the ensemble mean of each distribution. The neutral expectations for the equilibrium number of loci (see methods) are represented as grey lines, when recruitment events are neutral (top line) or not (bottom line; deletions and duplications are neutral in both cases). The black arrow represents the number of loci for which the number of deletions fixing approximately equals that of duplications or recruitments for −1.5 σf = 10 (Fig. 2), where the mode of the distribution is expected. B: Standard deviations of αi (direct effects) and

Pj (βji) (indirect) are maximum under intermediate selection. Parameters are set to their default values (table S2).

18 sβ = 1 sβ = 4 sβ = 16

sβ = 1 )) β = ji s 4 β ( j

sβ = 16 Σ ( β f 0.0 0.5 1.0 1.5 2.0

−2 −1 0 1 2

∑ βji number of loci / trait number 1 10 20 30 40 50 60

−3.5 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 (σ ) strong selection log10 f weak selection

Figure S7. Impact of parameter sβ on the evolution of genetic architecture. The insert on the top-right corner represents function fβ for different values of sβ .

19 N = 100 N = 1000 N = 10000 number of loci / trait number 1 10 20 30 40 50

−3.5 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 (σ ) strong selection log10 f weak selection

Figure S8. Increasing the population size increases the value of σf at which the expected number of loci L is maximum. All values represent the ensemble average of 500 replicate simulations run for 5 × 107 generations. All other parameters are set to their default values (table S2).

20 − µ = 0.33 × 10 6 − µ = 1 × 10 6 − µ = 3 × 10 6 − µ = 6 × 10 6 number of loci / trait number 1 10 20 30 40 50 60 70

−3.5 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 (σ ) strong selection log10 f weak selection

Figure S9. The expected number of loci L increases with the mutation rate µ. All values represent the ensemble average of 500 replicate simulations run for 5 × 107 generations. All other parameters are set to their default values (table S2). A − mutation rate of 3 × 10 6 was used to sample recruitment events, so the overall probability of recruitment remains constant.

21 −6 rdel = 1.25 × 10 −6 rdel = 1.5 × 10 −6 rdel = 2 × 10 number of loci / trait number 1 10 20 30 40

−3.5 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 (σ ) strong selection log10 f weak selection

Figure S10. The expected number of loci L decreases as deletions become more frequent (i.e. rdel increases). All values represent the ensemble average of 500 replicate simulations run for 5 × 107 generations. All other parameters are set to their default values (table S2).

22 rrec = 0 −5 rrec = 2.5 × 10 −5 rrec = 5 × 10 −5 rrec = 10 × 10 number of loci / trait number 1 10 20 30 40 50 60 70

−3.5 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 (σ ) strong selection log10 f weak selection

Figure S11. The expected number of loci L increases as the probability of recruitment, rrec, increases. All values represent the ensemble average of 500 replicate simulations run for 5 × 107 generations. All other parameters are set to their default values (table S2).

23 pem = 0.25 pem = 0.5 pem = 1 number of loci / trait number 1 10 20 30 40 50

−3.5 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 (σ ) strong selection log10 f weak selection

Figure S12. Decreasing the probability that a given subfunction is affected by a point mutation, pem, has similar effects as decreasing µ. All values represent the ensemble average of 500 replicate simulations run for 5 × 107 generations. All other parameters are set to their default values (table S2).

24 bias = 0 bias = 0.2 bias = 0.4 number of loci / trait number 1 10 20 30 40 50

−3.5 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 (σ ) strong selection log10 f weak selection

Figure S13. The evolution of L is not strongly affected by mutation biases in α or β. A strong bias (bα = bβ = 0.4) reduces the maximum variation across loci and therefore reduces L when log10(σf ) > −2.5. All values represent the ensemble average of 500 replicate simulations run for 5 × 107 generations. All other parameters are set to their default values (Table S2). One data point was omitted: L ≈ 61 at log10(σf )= −1.5 and bα = bβ = 0.2.

25 c = 0 − c = 10 4.5 − c = 10 3.5 number of loci / trait number 1 10 20 30 40

−3.5 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 (σ ) strong selection log10 f weak selection

Figure S14. The expected value of L decreases with the metabolic cost c. All values represent the ensemble average of 500 replicate simulations run for 5 × 107 generations. All other parameters are set to their default values (table S2).

ε 0 0.2 0.4 0.6 0.8 1 mean(L) 2 4 6 8 10 12 14

−3 −2 −1 0 1

log10(σf)

Figure S15. The number of loci contributing to a trait is a non-monotonic function of σf whenever ǫ is higher than 0 (Text S1). All values represent the ensemble average of 500 replicate simulations run for 5 × 107 generations. All other parameters are set to their default values (table S2).

26 normal distribution uniform distribution number of loci / trait number 1 10 20 30 40

−3.5 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 (σ ) strong selection log10 f weak selection

Figure S16. The expected value of L is not affected by the form of the distribution of mutation effects. All values represent the ensemble average of 500 replicate simulations run for 5 × 107 generations. All other parameters are set to their default values (table S2).

27 number of loci number 1 5 10 15 20 25 30 35 40

−3.5 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 (σ ) strong selection log10 f weak selection

Figure S17. Evolution of the genetic architecture in simulations initiated with 5 loci and variable effects (see description in the methods section). Compared to Fig. S3-A, only the architectures of traits under strong selection have changed. The mean number of loci under strong selection would be expected at the bottom grey line if deletions and duplications had neutral effects. Instead, this initial variation across loci prevents deletion or duplication, so the mean number of loci remains close to its initial value. Architectures of traits under intermediate and weak selection are not affected. number of loci / trait number 1 5 10 15 20 25 30

−3.5 −2.5 −1.5 −0.5 0 0.5 1 (σ ) strong selection log10 f weak selection

Figure S18. Evolution of genetic architectures in a multi-trait version of our model (see text S3). In this model the overall rate of recruitments of new loci is reduced, and therefore so too the equilibrium number of loci per trait. Nevertheless, the qualitative relationship between selection pressure and number of loci is similar to that in the single-locus version of our model.

28 noise = 0.0001 noise = 0.001 noise = 0.01 detection p 0.00 0.05 0.10 0.15 0.20

−3 −2 −1 0 1

log10(σf)

Figure S19. The probability to detect a locus in the true architecture increases as selection becomes weaker (σf increase). Detection is more accurate as the noise decreases. Error bars represent the mean ± one standard error, calculated over 100 replicate QTL simulations.

divergence time = 25000 gen. divergence time = 50000 gen. divergence time = 100000 gen. detection p 0.0 0.1 0.2 0.3 0.4 0.5

−3 −2 −1 0 1

log10(σf)

Figure S20. The probability to detect a locus in the true architecture increases as selection becomes weaker (σf increase). Detection is more accurate as the divergence time increases. Error bars represent the mean ± one standard error, calculated over 100 replicate QTL simulations.

29 noise = 0.0001 t = 25000 gen. t = 50000 gen. noise = 0.001 noise = 0.01 number of loci / trait number 0 2 4 6 8 10 12 14 16 18 20

t = 100000 gen. t = 200000 gen. number of loci / trait number 0 2 4 6 8 10 12 14 16 18 20 (σ ) (σ ) strong selection log10 f weak selection strong selection log10 f weak selection

Figure S21. Number of QTL detected in a simulated study. The analysis is similar to Fig. 3B in the main text, but we changed the time of divergence between the two lines in the experiment (indicated in the top right corner of each panel) and the noise in traits measurements.

30 t = 25000 t = 50000 t = 100000 t = 200000 number of loci / trait number 0 2 4 6 8 10 12 14 16 18 20

−3.5 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 (σ ) strong selection log10 f weak selection

Figure S22. Number of QTL detected in a simulated study. The analysis is similar to Fig. 3B in the main text, but we changed the time of divergence between the two lines in the experiment. The noise in traits measurement increases proportionally to log10(σf ), from 0.0001 to 0.001.

31