A Rigorous Measure of Genome-Wide Genetic Shuffling That Takes Into

Downloaded by guest on September 24, 2021 ftelvl fsufln ihndfeetcrmsms(50–52), chromosomes different same within shuffling the made of been by levels have produced the comparisons gametes Finally, of 45–49). across 39, and (37, 44] individual ref. in reviewed www.pnas.org/cgi/doi/10.1073/pnas.1817482116 and flies (37–42), in [e.g., humans sex (35–37), same the mice of differ- (30–34), individuals exist also across can shuffling There in (26). informative ences shuffling of be value concomitantly adaptive 27), could the differ- which of 26, these 26–29), explain (14, 14, (13, to shuffling ences theories in evolutionary has several differences the literature prompting testing male/female large to A studied 17). 23–25) distinguishing (15, also 15, domestication from (1, of sex effects ranging (reviewed genomic of implications species advantages across with evolutionary the out 22–24), carried refs. been in have shuffling interest. considerable wide empirical of much of quantity occurs subject the a that genetic been therefore, has shuffling selfish and is, of importance of production amount invasion gamete harmful total in the The from (11–14). par- and complexes evolving separately 10) been rapidly (9, act further from asites has populations to shuffling protecting selection in Genetic implicated (3–8). natural efficiency mutations long-term allowing distinct the on by improves adaptation it envi- addition, of changing or In novel (2). to and adaptation ronments gametes aids among thus diversity and genetic (1), offspring to leads It populations. ual T recombination independent crossovers. is to of entirely which that almost of (ii due (i contribution that is and the that 1/2 shuffling assortment, reveals of of value level female possible high maximum and this its male to analysis close human our is from of humans data Application such sequencing. DNA to either by positions or crossover advantage determining cytologically for selective methods of a emergent Utilization ful suggests interference. which crossover spaced, of evenly more that fact are the implica- include metric Intrinsic this assortment. of independent tions number from crossover and be position from can and measure contributions This separate production. into gamete decomposed during shuffled are loci a Veller Carl law and second positions Mendel’s crossover account into shuffling takes genetic that genome-wide of measure rigorous A osntsfe rmteelmttos edfieteparame- the define We that limitations. shuffling these genome-wide from ter of suffer assort- measure not rigorous independent does Here, a law). from present second (Mendel’s shuffling we chromosomes homologous consider traditional of Moreover, to ment tip. the criti- fail more toward also causes measures crossover chromosome are a a of chromosome than middle shuffling a the toward along crossover a because crossovers cal: inadequate, of qualitatively positions is the This of meiosis. number average per the crossovers counting measures simply Existing by quantity. this crossing-over occurs by of consider measure hampered that direct been a shuffling of have absence on genetic studies the Such of critically production. amount gamete rely Singh) total during Nadia genetics and the Przeworski evolutionary of Molly by evaluation reviewed in 2018; 11, studies October review Comparative for (sent 2018 27, November Kleckner, Nancy by Contributed 02138 MA Cambridge, University, Harvard 02138; MA Cambridge, eateto raimcadEouinr ilg,HradUiest,Cmrde A02138; MA Cambridge, University, Harvard Biology, Evolutionary and Organismic of Department orsodnl,cmaaiesuis(,1–1 fgenome- of 15–21) (1, studies comparative Correspondingly, r uiggmt rdcini nipratpoesi sex- in process genes important inherited an paternally is and production gamete maternally during of shuffling he stepoaiiyta h lee ttornol chosen randomly two at alleles the that probability the as a,b | ac Kleckner Nancy , crossing-over c eateto oeua n ellrBooy avr nvriy abig,M 23;and 02138; MA Cambridge, University, Harvard Biology, Cellular and Molecular of Department | sex | c,1 meiosis n atnA Nowak A. Martin and , r slre hncrossovers when larger is 0tmsgetrthan greater times ∼30 r sealdb power- by enabled is Arabidopsis ) a,b,d (43); r in ) aeit con hfigcue yI.Afut esr of measure fourth A measures IA. above by supposed caused the this shuffling beyond of account shuffling None into to take minimum. contribute num- required the that structurally is COs frequency” to CO of chromosomes “excess each ber its the Since for (55), CO (15). properly one number segregate least COs at chromosome of requires haploid number usually the the bivalent distance is (54). of map gametes used excess of sometimes the 1% in is being in that shuffled cM measure are that Another 1 loci CO either cM, linked average measured two 50 is between length as by of Map multiplied prophase number data. frequency the sequence meiotic from simply during or is cytologically occur frequency CO that only fre- length. COs CO considered map specifically, have more or and shuffling quency crossing-over of of contribution measures the used widely most respectively. shuffling, chromo- chromosomal” homologous of (IA), assortment somes crossovers independent by by both and caused (COs) is Shuffling shuffling. genome-wide of sex new as used be 53). to (26, or chromosomes (13) complexes to genetic susceptible selfish most harboring are chromosomes which for implications with 1073/pnas.1817482116/-/DCSupplemental. y at online information supporting contains article This 1 aadpsto:MTA oefralcluain saalbeo iHb( GitHub on available is calculations all for code MATLAB deposition: Data the under Published interest.y of conflict no declare authors Oregon.y The of University N.S., paper.y and and the University; wrote N.K., Columbia M.A.N. C.V., M.P., and Reviewers: research; N.K., C.V., performed and C.V. data; analyzed research; M.A.N. designed C.V. contributions: Author github.com/cveller/rbar † * owo orsodnesol eadesd mi:[email protected]. y Email: addressed. be should correspondence whom To h nmiuu em“eei hfig o ut“hfig)throughout. “shuffling”) use just we (or confusion, shuffling” avoid “genetic To biology. term molecular unambiguous molecules—in the DNA break- of meaning—the rejoining different and a has age recombination but literature, genetics population below. hfigcue yete rsigoe rI srfre oa rcmiain nthe in “recombination” as to referred is IA or crossing-over either by caused discussed Shuffling as contribution negligible a makes but shuffling causes also conversion Gene ihvlei agl u oidpnetassortment. this independent that to and due one-half largely of is value value high possible maximum its to close htarnol hsnpi flc hfe hi lee na in alleles their shuffles loci of Measuring pair gamete. chosen randomly shuf- features: a these total that account of into measure takes a that the develop fling importantly, We crossovers. and, of assortment positions independent take to account failing crossovers, into of shuffling number genetic average of the count amount simply total chromosomes. the homologous of measures over of Traditional crossing assortment both of by independent shuffling gametes the and in DNA is paternal organisms and sexual maternal in process important An Significance uniaiecmaioslk hs eur rprmeasure proper a require these like comparisons Quantitative ∗ hc opie“nrcrmsml n “inter- and “intrachromosomal” comprise which NSlicense.y PNAS b rga o vltoayDnmc,HradUniversity, Harvard Dynamics, Evolutionary for Program ). y r nhmn,w n htttlsufln is shuffling total that find we humans, in d eateto Mathematics, of Department † www.pnas.org/lookup/suppl/doi:10. NSLts Articles Latest PNAS npeiu tde,the studies, previous In r h probability the , | f10 of 1 https://

EVOLUTION aggregate shuffling, which does take into account IA, is Dar- incorporates CO position in the measurement of genome-wide lington’s “recombination index” (RI) (1, 56, 57), defined as the shuffling has been developed and implemented. sum of the haploid chromosome number and CO frequency. The Here, we present a simple intuitive measure of the genome- rationale for this measure derives from the fact that, given no wide level of shuffling. We define r¯ as the probability that a chromatid interference in meiosis (54, 58), two linked loci sep- randomly chosen pair of loci shuffles their alleles in meiosis, arated by one or more COs shuffle their alleles with probability taking into account CO number and position as well as the con- 1/2 in the formation of a gamete, as if the loci were on sepa- tribution of IA. We have chosen the name “r¯” to echo classic rate chromosomes. The RI is, therefore, the average number of population genetics terminology, where the parameter r for a “freely recombining” segments per meiosis. given pair of loci is the probability that they shuffle their alleles Importantly, none of these existing measures take into account in a gamete. Our parameter r¯ is simply this quantity averaged the specific positions of COs on the chromosomes. Intuitively, across all locus pairs. however, CO position is a critical parameter. For example, a CO r¯ is the most direct metric of the primary effect of genetic at the far tip of a pair of homologous chromosomes does little shuffling, the diversification of gametes, and thus the transmis- work in shuffling the genetic material of those chromosomes, sion of new genetic combinations to offspring (1). r¯ should be while a CO in the middle causes much shuffling (Fig. 1). Addi- used when such diversification is the quantity of interest: for tionally, two COs close together may cancel each other’s effect example, in judging the ability of a population to respond to a (except for the few loci that are between them), and thus result sudden change (or continual fluctuations) in its environment (2). in less allelic shuffling than two COs spaced farther apart. Genetic shuffling also has indirect effects, such as the gradual Therefore, existing measures do not actually define the total reduction of allelic correlations (linkage disequilibrium) within genome-wide amount of shuffling, instead serving only as prox- populations over time (6, 7) and the prevention of invasion of ies for this critical parameter. This is not a trivial concern. There selfish genetic complexes (11). Here, CO positions and IA again is significant heterogeneity in the chromosomal positioning of matter, so standard metrics that simply count COs remain inade- COs at all levels of comparison—between species (16, 59, 60), quate. Therefore, r¯ is likely a better measure in these cases as populations within species (61–65), the sexes (14, 26, 66–69), well, although we suggest (Discussion) modifications of r¯ that individuals (42, 70–73), and different chromosomes (50, 69, 74, more directly measure these quantities of interest. 75), all of which will impact the level of shuffling that these COs In this work, we define the quantity r¯, develop it mathemat- cause. Moreover, there can be major differences in chromosome ically and statistically, and document its intrinsic implications. number and size across species (1, 76, 77), which will seriously We show how it can be decomposed into separate components influence the total amount of shuffling due to IA. deriving from COs and from IA. We then discuss the approaches The fact that the positions of COs matter for the total amount now available to allow chromosome-specific and/or genome-wide of shuffling has been recognized for many years (1, 66, 76, 78–80). measurement of CO positions. With these developments in hand, The need for a measure of total shuffling that accounts for CO we present an application of r¯ to quantitative evaluation of positions has also previously been recognized. Indeed, Burt et al. shuffling in human males and females. (27) give an explicit formula, identical to Eq. 3 below, for “the proportion of the genome which recombines” (in the population Derivation of ¯r genetics sense†). Colombo (81) also gives an explicit character- r¯ is the probability that the alleles at two randomly chosen loci ization of what we call r¯ phrased as a generalization of the RI. are shuffled in the production of a gamete. It can be calculated Finally, Haag et al. (82), after noting that terminal COs cause with knowledge of CO positions on each chromosome in con- little shuffling and that map length is, therefore, an imperfect junction with knowledge of the fraction of the total genomic measure of genome-wide shuffling, suggest a better measure to length (in base pairs) accounted for by each chromosome (chro- be “the average likelihood that a CO occurs between two ran- mosome lengths influence shuffling caused by both IA and COs). domly chosen genes.” However, no mathematical expression that In what follows, we always assume that there are sufficiently many loci so that the difference between sampling them with or without replacement is negligible.

AB Formulas for r. In the ideal situation, CO positions are defined for individual gametes, and the parental origins of the chromosome segments delimited by these COs are known. In this case, the proportion p of the gamete’s genome that is paternal can be determined, and the probability that the alleles of a randomly chosen pair of loci were shuffled during formation of the gamete (equivalently, the proportion of locus pairs at which the alleles are shuffled), either by crossing-over or by IA, is simply the product of the probabilities that one locus is of paternal origin (probability p) and the other is of maternal origin (probability 1 − p) in each of two possible combinations:

r¯ = 2p(1 − p). [1]

Such data can emerge directly from sequencing of single gametes (Fig. 2B) or of a diploid offspring in which the haploid contribution from a single gamete can be identified (Fig. 2C). Fig. 1. The position of a CO affects the amount of genetic shuffling that In some analyses, while CO positions for a gamete are known, it causes. The figure shows the number of shuffled locus pairs that result the specific parental identities of the chromosome segments that from crossing-over between two chromatids in a one-step meiosis. The chro- these COs delimit are not known, because the gamete producer mosome is arbitrarily divided into eight loci. (A) A CO at the tip of the has been sequenced but its parents have not. In this case, we can chromosome between the seventh and eighth loci causes 7 of 28 locus pairs nonetheless calculate an expected value of r¯ for the gamete. The to be shuffled in each resulting gamete. (B) A CO in the middle of the chro- parental origin of segments will alternate across COs on each mosome between the fourth and fifth loci causes 16 of 28 locus pairs to be chromosome in the gamete, and therefore, we can still define, shuffled in each resulting gamete. The central CO thus causes more shuffling for any given chromosome k, the proportions that originate from than the terminal CO. the two different parents (pk and 1 − pk ). However, it is not

2 of 10 | www.pnas.org/cgi/doi/10.1073/pnas.1817482116 Veller et al. Downloaded by guest on September 24, 2021 Downloaded by guest on September 24, 2021 oiaesufldi eutn aeei / utpidb the by multiplied 1/2 segments: of separate is pair on are gamete chosen loci resulting randomly two the a a that at in probability meiosis alleles shuffled at the are COs that loci of probability configuration the interference the chromatid I, given no Therefore, assumes 57)]. [this (54, alle- 1/2 seg- their probability shuffle different they with on segments, les different situated is on be situated which indeed to of are need probability production loci the the two in ments, alleles the its gamete, shuffle a to of pair locus selected domly oa eoelnt is length segment genome that suppose total and order, (arbitrary) some of into number bivalents haploid Burt the the by divide If given 2A). (Fig. formulas (81) is the chromosomes Colombo and following (27) case al. this et in calculated be 1 elre al. et Veller their way. about quantitative assertions rigorous a previous shuffling), in evaluate contributions (interchromosomal to relative IA us allowing from thereby deriving component and a shuffling) (intrachromosomal COs from deriving component (Discussion). of species 86) of number 85, a formulation the (1, Our frequency for increase CO to way than is rather effective chromosomes shuffling IA, most genome-wide is increase the species to sexual that in correspondingly, shuffling of or, source predominant of the Components that Intrachromosomal and Inter- in especially diversity, of measure (84). used ecology commonly a (83), index E[¯ for value expected An I. prophase meiotic of stage chromosomes bivalent that along at positions CO define to allows somes gamete. the in realized actually value the Eq. by so multiplied probabilistic, chromosome), is that chromosomes, same term different the probability on on the are are is loci term chosen 1 second randomly The two the origin. parental ent k Eq. mosomes in term first The proportion a are there is chromosome fraction and If shuffled a set alle- Law. been haploid the the Second have in Mendel’s chromosomes that chromosomes with separate probability consistent on the loci one-half, that two address at We assuming sets parents. les which by different next or ambiguity same the the this to from chromosome are one segments of from know to possible ‡ fpsil atrso hoai novmn nterslto fCsa eoi and I meiosis at COs of thereafter. distribution resolution chromatids the the of to in patterns respect involvement segregation with chromatid time of this patterns but possible along value of expected COs its observe calculate we we therefore, when I, Similarly, meiosis to at assortments. respect chromosomes possible with bivalent expectation these its of calculate distribution we parental Therefore, the possible segments. the various of assortments the different of for origins then If different known, being “parameter”). value a not realized is are its it with origins here, paternal (i.e., and known maternal precisely the shuffled—is been have alleles of which value realized the segments, chromosome ie aeesqec n nweg fteptra n aenloiiso all of origins maternal and paternal the of knowledge and sequence gamete a Given − − r ial,ctlgclaayi fmitcpcyeechromo- pachytene meiotic of analysis cytological Finally, utpidb h rbblt ht fs,te r fdiffer- of are they so, if that, probability the by multiplied ] p P sdfie yEq. by defined as , k k n so h te,then other, the of is =1 E[¯ k L L r k 2 k httornol hsnlc r nchromosome on are loci chosen randomly two that = ] p ie,1 (i.e., ftegnm’ eghadi edtriethat determine we if and length genome’s the of k n X k fchromosome of =1 n n hr r oa of total a are there and r ¯ 2p E[¯ − 2 losu opriinttlsufln noa into shuffling total partition to us allows r k h rbblt htarno aro loci of pair random a that probability the stepoaiiysme vralchro- all over summed probability the is l = ] (1 i spootoa oteGini–Simpson the to proportional is 3, n with , − ¯ r + 1 2 narsliggmt saanarno variable; random a again is gamete resulting a in 2 p I

k sa xetto of expectation an is )L emns ae h emnsin segments the Label segments. 1 l 1 − k k 2 + ¯ r bigtepooto flcspisat pairs locus of proportion the —being + so n aetloii and origin parental one of is n X 1 i · · · =1 +I − 1 2 l P +

i 2 ‡ ! I 1 l i n ¯ r n =1 +I . − O,te hs COs these then COs, +I eoe admvariable, random a becomes r . X k l ti fe argued often is It 1 = =1 n i 2 hssecond This 1/2. ftetoloci two the If . L o n ran- any For . k i r k 2 ¯ sfato of fraction ’s consfor accounts ! ahrthan rather . r ¯ can [2] [3] n L is length genomic total of length tion genomic total of fractions with is chromosomes lent of number haploid the that pose Eq. as same the is which Eq. in term first the then is retain partition we therefore, and 4 which), is which and origin parental we proportion then a delimit, the calculate they not still that but can segments complement the haploid of origin a parental in specific realizations CO know we If i.2. Fig. (B nagmt rhpodcmlmn fa fsrn.I chromo- If is offspring. decomposition an for appropriate of decomposition complement this and haploid some chromosome or present same gamete first a the We in on alleles. are and is their loci component chromosomes shuffle two intrachromosomal that separate the probability on while the are alleles, their loci shuffle chosen randomly two C B A 1 u oeifrainaottescn em h appropriate The term. second the about information lose but and enwpeettedcmoiinfor decomposition the present now We h necrmsmlcmoetof component interchromosomal The + k DNA ingamete ofpaternal proportion total genomiclength of length asafraction segment bivalent chromosome crossover positionon L C k exhibits 2 acltn h xetdvleof value expected the Calculating (A) Calculating ) Paternal gamete + otisaproportion a contains E[¯ r ¯ · · · = i r ’s genomic = ] X | k + =1 n nrcrm component Intrachrom. I k a | X L k

=1 2p n nrcrm component Intrachrom. O,dvdn tit segments into it dividing COs, n ¯ r 1 = k ngmtso fsrn sn Eq. using offspring or gametes in 2p 1 (1 {z − k hn Eq. Then, . − (1 {z p 2. chromosome 1 p k − k fteohr(lhuhntknowing not (although other the of )L p k L chromosome 2 k 2 } )L p k p + k k = k 2 } fchromosome of | X 3 fptra otn,te the then content, paternal of i l + 6=j k spriinda follows: as partitioned is necrm component Interchrom. ,1 l | 2 1 2p k necrm component Interchrom. ¯ + r ,i NSLts Articles Latest PNAS ,

E[ i r ¯ Chromosome . l chromosome 1 (1 k 1 E[¯ ¯ r ,2 stepoaiiythat probability the is ] {z − tmissIuigEq. using I meiosis at , − r + Other gamete Other {z ] X k chromosome 2 p tmissI Sup- I. meiosis at =1 · · · n Expected valueof Expected j i Maternal gamete 1. )L n 1, = L + k n htbiva- that and i k 2 L ob fone of be to ! l k j } . . . } ,I . , k k +1 | , sfrac- ’s I f10 of 3 with , k

1 + [5] [4] 3. r ¯

EVOLUTION n Ik +1 ! n ! 1 X 2 X 2 1 X 2 Properties and Intrinsic Implications of r [¯r] = L − l + 1 − L . [6] E 2 k k,i 2 k Properties. We note three properties of r¯. First, its minimum k=1 i=1 k=1 value is zero, and its maximum value is one-half, the latter rely- | {z } | {z } Intrachrom. component Interchrom. component ing on our assumption of many loci. The maximum value of r¯ in a gamete can occur by chance equal segregation of mater- Averaging r. The above measures can be aggregated to obtain nal and paternal DNA to the gamete. The maximum value of the average value of r¯ across many gametes (or meiocytes). We E[¯r] at meiosis I requires, unrealistically, that every pair of loci denote the average value of a variable x by hxi. To average r¯ either is on separate chromosomes or experiences at least one given sequence data from many gametes and supposing that, for CO between them in every meiosis. The minimum value of r¯ in each gamete, we can distinguish which sequences are paternal a gamete could result from chance segregation of only CO-less and which are maternal, we can take the average value of Eq. 4, chromatids of one parental origin to the gamete. The minimum noting that the Lk are constants: value of E[¯r] at meiosis I requires crossing-over to be absent [as in Drosophila males and Lepidoptera females (76)] and either a n karyotype of one chromosome or a meiotic process that causes X 2 X hr¯i = h2pk (1 − pk )i Lk + h2pi (1 − pj )i Li Lj . [7] multiple chromosomes to segregate as a single linkage group [as k=1 i=6 j in some species in the evening primrose genus, Oenothera (87)]. | {z } | {z } Second, r¯ satisfies the intuitive property that a CO at the Intrachrom. component Interchrom. component tip of a chromosome causes less shuffling than a CO in the middle (some example calculations of r¯ are given in Fig. 3). If segregation is Mendelian, then in a large sample of gametes, This is easily seen in Eq. 3. Consider a bivalent on which a h2p (1 − p )i ≈ 1/2 i j , and therefore, the interchromosomal com- single CO will be placed. This will divide the chromosome 1 1 − Pn L2 ponent in Eq. 7 will be close to 2 k=1 k . into two segments of length l1 and l2, where l1 + l2 = L is If we have sequence data from many gametes and can deter- the chromosome’s fraction of total genomic length. The con- mine CO positions but not the parental origin of the sequences tribution of this CO to E[¯r] is seen from Eq. 3 to be 1/2 − 5 2 2 that these COs delimit, then we take the average of Eq. , not- (l + l )/2, which simplifies to l1l2 under the constraint l1 + l2 = L 1 2 ing again that the k are constants (which here means that the L. l1l2 is maximized when l1 = l2 = L/2 (i.e., when the CO is interchromosomal component is constant): placed in the middle of the bivalent). It is minimized when l1 = 0 or l2 = 0 (i.e., when the CO is placed at one of the far ends of the n n ! X 2 1 X 2 bivalent). In general, as the CO is moved farther from the mid- h [¯r]i = h2p (1 − p )i L + 1 − L . [8] E k k k 2 k dle of the bivalent, E[¯r] decreases. This is true regardless of the k=1 k=1 positions of the COs on other chromosomes. Importantly, this | {z } | {z } Intrachrom. component Interchrom. component relationship also holds if, instead of considering where to place a single CO on a whole bivalent, we consider where to place a Given data of CO positions along bivalent chromosomes in new CO on a segment already delimited by two COs (or by a CO many meiocytes, we take the average of Eq. 6: and a chromosome end). In general, if we place a fixed number of COs along a single chromosome, E[¯r] is increased if they are n *Ik +1 +! n ! evenly spaced (SI Appendix). 1 X 2 X 2 1 X 2 h [¯r]i = L − l + 1 − L . [9] Third, r¯ will tend to increase sublinearly with both the num- E 2 k k,i 2 k k=1 i=1 k=1 ber of COs and the number of chromosomes (Figs. 3 D–F). | {z } | {z } For example, increasing the number of COs by some factor will Intrachrom. component Interchrom. component increase the intrachromosomal component of r¯ by a lesser factor (and r¯ as a whole by a yet lesser factor, because its interchromo- Eqs. 7–9 mean that we can estimate the average intrachro- somal component is unaffected). Similarly, doubling the number mosomal contribution to r¯ separately for each chromosome, of chromosomes in the haploid set will cause a less than twofold possibly from different sets of data, and then combine these aver- increase in the interchromosomal component of r¯. That is, there ages into a final measure of r¯. This is useful, because often in are “diminishing returns” to genome-wide shuffling from having sequencing or cytological studies of large numbers of gametes increasingly more COs and more chromosomes. These effects or meiosis I nuclei, it is possible (or desired) to obtain accurate are described in mathematical detail in SI Appendix. measurements for only a subset of the chromosomes in each cell so that the sets of cells from which measurements are taken for Intrinsic Implications. We have noted above that, given some two chromosomes will not overlap. This is the case, for exam- number of COs along a chromosome at meiosis I, E[¯r] is max- ple, in the rich cytological data from human pachytene nuclei imized if they are evenly spaced. This observation carries the in ref. 48. interesting implication that positive CO interference—a classical Unrelated to averaging r¯ across multiple measurements of phenomenon (88, 89) where the presence of a CO at some point individual gametes or meiocytes, an average value for r¯ can also along a bivalent chromosome inhibits the formation of nearby be calculated directly given pairwise average rates of shuffling COs—will tend to increase r¯. It thus suggests a possible selective for all loci. These can be estimated from linkage maps generated advantage for this phenomenon (Discussion). from pooled sequence data (see below). Suppose that we have Also, the general diminishing returns to r¯ of having more COs measured, for each locus pair (i, j ), their rate of shuffling rij . hr¯i (above) carry particular implications for differences in total auto- is then simply the average value of rij across all locus pairs (i, j ). somal shuffling in the two sexes of a species. When the number If Λ is the total number of loci, then of COs and/or their localization do not differ grossly between the ! ! ! sexes, then the amount of shuffling will be similar in both sexes X Λ X Λ X Λ hr¯i = rij = rij + rij , (e.g., for male and female humans as discussed below). 2 2 2 i

4 of 10 | www.pnas.org/cgi/doi/10.1073/pnas.1817482116 Veller et al. Downloaded by guest on September 24, 2021 Downloaded by guest on September 24, 2021 h hoai fec hoai sarne nalna ra of array linear a in arranged is chromatid each of chromatin the chromosome the and along (below of distance dis- axes physical genomic use to meiosis, of proportional advanta- stage The is this tance particularly at pairs). because is regard, this base in 4) geous (in (Fig. distances chromosomes measure- pachytene-stage genomic the into so data, distances cytological of from inferred ment directly be can species 106). many ref. for in generated species been bovid have 28 and (e.g., com- literature now the are in cases positions mon CO favorable for immunostained in nuclei pachytene or be arrays), can fluores- [e.g., system (e.g., chromosomes tags repressor–operator individual fluorescent cent locus-specific and FISH, pro- CENP-A, by centromeric identified for as staining by such visualized be teins, be can centromeres recombination can axes of CO positions the axes the along to desired, the If specific MLH1). central along (e.g., molecules complexes complex positions staining synaptonemal by CO the visualized SYCP1–3). or (e.g., axes region the either staining used. The commonly (104). most organisms now most is in method 103) majority ref. vast latter the in are (reviewed which COs 4), (interfering) (99–102)] (Fig. I” tech- “type COs staining detect immunofluorescent to of all using niques visualization microscopy mark light direct [which with or by nodules” microscopy section- recombination serial electron “late from with reconstructions either 3D ing) in spread (or in reliably nuclei determined pachytene be can chromosomes pachytene of Analysis. Cytological maximize to as so sized evenly are chromosomes when that even Notice number, chromosome. one only with organism diploid a in positions 3. Fig. elre al. et Veller of advances estimation and technological simple 95), These allow (94, 96–98). gametes (42, individual 93), triads/tetrads 92, anal- meiotic 41, sequence/marker (40, using pedigrees scale of genomic ysis fine a at increases. determined number CO average as rate decreasing a at component increases interchromosomal component to the intrachromosomal small, contribution the not five), intrachromosomal is (here, chromosomes total of chromosomes average number of the the When along calculate nuclei). spaced across then evenly (E constant dominates. be We is to (which chromosome. contribution always that interchromosomal assumed the for are with where COs COs indicates and of size, isocline number equal “45” of total the be example, to For assumed the constant. are of is Chromosomes of case do. ratio combination which COs this each than in For which shuffling chromosomes. female, across to more combinations human contributions times number 45 a of chromosome contributes ratio in IA and The meiosis CO (D) CO-less indicate chromosomes. a Isoclines multiple shuffling). represents with organism 4 diploid Column a one-half. in of value maximum its maximize to h hsclsrcueo hs hoooe shgl regular: highly is chromosomes these of structure physical The chromosomes along micrometers) (in distances physical Only by achieved be can axes chromosome the of Visualization n Arabidopsis (A–C iaetcrmsmsi ulu,w rwavlefo oso itiuinwt parameter with distribution Poisson a from value a draw we nucleus, a in chromosomes bivalent r ¯ CEFB AD xml auso h xetdvleof value expected the of values Example ) E[ h necrmsmlcmoeticessa eraigrt scrmsm ubricess (F increases. number chromosome as rate decreasing a at increases component interchromosomal The ) rmsc aarqie a ocnetphysical convert to way a requires data such from ¯ r ] aiu hoooenmesfradpodogns ihu rsigoe.Ntc that Notice crossing-over. without organism diploid a for numbers chromosome Various (B) . IAppendix). SI 15] eaiecrmsm egh Spread length. chromosome relative (105)], h hsclpstoso O ln h axes the along COs of positions physical The r ¯ . n ≥ hoooe ntehpodstand set haploid the in chromosomes 2 ¯ r , E[ ¯ r tmissI nec case, each In I. meiosis at ], E[ ¯ E[ r oietota,we h ubro hoooe sntsmall, not is chromosomes of number the when that, too Notice ]. ¯ r ] nrae ulnal ihC ubr vnwe O r vnysae oas so spaced evenly are COs when even number, CO with sublinearly increases ysqecn niiuldpodofpigpoie htthe that offspring—provided genomes diploid gametic inferring individual by indi- sequencing or individual sequencing bodies by the polar by or sequencing directly gametes meioses—either by vidual those inferred of be products can meioses vidual’s Analysis. Sequence above derived formulas the into for substituted cytolog- be from measured can (easily spreads) length Therefore, ical axis (micrometers). total length of fractions axis their physical total of cal- fractions for required as of pairs), (base culation length genomic total of segments’ packing fractions chromosome the average approximately considerations, is the these nucleus Given that a constant. within infer chromosomes different we of which ratio from S2 species), Fig. Appendix, diverse (SI axis pachytene physical relative at genomic their lengths relative match closely the chromosomes Second, of 101). vary lengths ref. to in 107 appear reviewed (refs. 111–114; nuclei species, within and of chromosomes variety across and a along in minimally and, directly measured of be analysis for required rel- distances the r inter-CO at of least scale at chromosomes crude nucleus, different atively each Moreover, across within constant 111). and approximately along 110, is ratio 108, axis dis- packing 106, bivalent local (101, genomic pachytene the ratio”) a the “packing along define the local distance Therefore, (the directly physical 107–109). DNA unit loops of per (101, little the all tance very bases of Almost with loop lengths 108). loops, between genomic and the axis com- in 107, a the accommodated along 101, in is intervals refs. regular DNA in at the (reviewed are axis which of mon bases the loops, ¯ § hsi o eea,i uhseisaporaeajsmnswudb eddt convert to needed lengths. be would genomic adjustments to appropriate lengths species physical such in general, in not data is the this While euchromatin. in than erochromatin nsm pce eg,tmt 15] thsbe eotdta op r ogri het- in longer are loops that reported been has it (115)], tomato [e.g., species some In sidctdb w rtra is,pyia oplntscan lengths loop physical First, criteria. two by indicated as , I E[¯ ≥ E[ r ¯ O e hoooeo vrg,w iuae10 simulate we average, on chromosome per COs 1 r ] ]. rmI itrhoooa hfig n O (intrachromosomal COs and shuffling) (interchromosomal IA from E[ § ¯ r ] E[¯ scluae codn oEq. to according calculated is r orsodt,adtu a edfie ytheir by defined be can thus and to, correspond ], Inter-chrom. component

0.25 # Chromosomes 0.5 10 20 30 40 50 h eoi oiin fCsi nindi- an in COs of positions genomic The 1 0 0100 50 1 2 10 123 Ratio ofinter/intra-chrom.Ratio shuffling 85 25 E[ I − Average COs/chromosome ¯ r 65 ] E[ n d tt adtr Ot e the get to CO mandatory 1 to it add and 1 s044 (C 0.474. is 75 ¯ r ] 4 iial,hligcntn h number the constant holding Similarly, ) nrae ulnal ihchromosome with sublinearly increases E[ 5 ¯ r ] 6 costence n opr it compare and nuclei the across 45 aiu oﬁuain fCOs of conﬁgurations Various )

aiu Onmesand numbers CO Various (A) 3. Intra-chrom. component 7 NSLts Articles Latest PNAS 0.05 i.S2 Fig. Appendix, SI 0.1 8 25 0 51015 5 hw xmlsfrom examples shows 9 5 uli o each For nuclei. E[ ¯ r ] ugs that suggest scoeto close is | f10 of 5

EVOLUTION A contains a bivalent of paired X chromosomes, which, in this case, has not been distinguished from the other bivalents (e.g., by FISH). Therefore, the X is included in the calculation for the oocyte, so the interchromosomal components of E[¯r] for the spermatocyte and oocyte are not expected to be equal. Although we have estimated them directly from the single cytological spreads in Fig. 4, they can be calculated exactly from ♂ known genomic lengths of the chromosomes. Substituting the chromosome lengths reported in assembly GRCh38.p11 of the B human reference genome (available online at https://www.ncbi. nlm.nih.gov/grc/human/data?asm=GRCh38.p11) (lengths are listed in Dataset S1) into the first term in Eq. 6, we find that the value for the spermatocyte should be 0.4730, close to the value of 0.4744 calculated in Fig. 4A, while the value for the oocyte should be 0.4744, close to the value of 0.4750 calculated in Fig. 4B. These close matches illustrate the reliability of physical length as a proxy for genomic length in the calculation of E[¯r]. Interestingly, although the female spread exhibits about 50% Fig. 4. Calculation of E[¯r] from CO patterns at meiosis I in humans. Left more COs than the male spread (74 vs. 48), the intrachromo- shows micrographs of immunostained spreads of a pachytene spermatocyte somal component of E[¯r] for the female spread is only about (A) and a pachytene oocyte (B). The red lines show localization of the synap- 30% larger than that of the male spread. This is expected from tonemal complex protein SYCP3 and demarcate the structural axes of the the general arguments given earlier concerning the diminishing chromosomes. The green dots along the axes show foci of the protein MLH1 returns of more COs (Fig. 3 D and F). and demarcate the positions of CO associations along the axes. The white We have carried out measurement of [¯r] using data from arrow in A points to the paired X and Y chromosomes. In B, the blue dots E along the chromosome axes mark the centromeres. The paired X chromo- 824 spermatocytes from 10 male humans (data from ref. 122) somes in B have not been identified and are included in the calculation of (provided by Fei Sun, Nantong University, Nantong, China). 9 E[¯r]. A, Center and B, Center show the traced axes and CO positions from Using Eq. , we calculate each chromosome’s average contribu- the spreads. From these, the lengths of the segments separated by the COs tion to the intrachromosomal component of E[¯r] (SI Appendix, can be measured and converted to fractions of the total physical axis length Table S1). Summing these values, we find that the average intra- (these measurements can be found in Dataset S1). The segments’ fractions of chromosomal component of E[¯r] is 0.0143, similar to the value total physical axis length are approximately equal to their fractions of total calculated for the single spermatocyte in Fig. 4A. Restricting genomic length for reasons described in the text. With these fractions of attention to the 715 cells with data for every chromosome allows genomic length, E[¯r] may be calculated for each spread according to Eq. 3, us to calculate the average and SD of the intrachromosomal com- and its inter- and intrachromosomal components can be determined accord- ponent of E[¯r] for each of the 10 individuals. The SD for each ing to Eq. 6 (A, Right and B, Right). Notice that we do not require individual individual is of order 0.0015 (SI Appendix, Table S2) or about ¯ chromosomes to be identified to calculate E[r] or its intra- and interchro- 10% of the average value. There is some variation among the mosomal components. (A) Modified from ref. 103 by permission of Oxford University Press. (B) Modified from ref. 39. individuals for the average value of the intrachromosomal component of E[¯r]: the minimum is 0.0130, while the maximum is 0.0153. Interestingly, the minimum and maximum do not come individual’s diploid genome has been haplophased. Haplophas- from the individuals with the smallest and largest average CO ing can be achieved by sequencing an extended pedigree involv- frequencies, respectively (SI Appendix, Table S2). In fact, the ing the individual (40, 73, 116); by sequencing multiple offspring, individual with the largest average value of the intrachromoso- gametes, and/or polar bodies of the individual (42, 52, 95, 97, 117, mal component of E[¯r] has only the 4th largest average number 118); or by isolating individual chromosomes from a somatic cell of COs per spermatocyte. This further highlights the importance and sequencing them separately (94, 119). of CO position for the amount of genetic shuffling and hints that r¯ can also be calculated from linkage maps derived from individuals might differ systematically in their CO positioning in pooled sequencing data [as obtained, for example, from pop- ways that affect the amount of shuffling in their gametes. ulation pedigrees (40) or pooled sperm (120)]. A linkage map gives the map distance (in centimorgans) between pairs of linked Analysis of Gamete Sequence Data. Hou et al. (97) sequenced markers. Using this, we can generate a grid of evenly spaced loci the products of multiple meioses (first polar bodies, second for each chromosome (to ensure even sampling of loci in the cal- polar bodies, and ova) of several females and haplophased these culation of r¯) and impute map distances between these by linear females directly from the sequences, allowing the CO points in interpolation from distances between true markers in the linkage those products to be determined. Fig. 5 is a modified version map (details are in SI Appendix). Thus, we generate a map dis- of figure S2 in ref. 97, showing the CO points along the chro- tance dij between every pair of linked loci (i, j ), which we can mosomes of an egg cell from one of the individuals. Because convert to a rate of shuffling rij using a map function (54) [e.g., relatives of the individual were not sequenced, the paternal and 1 Kosambi’s (121): rij = r(dij ) = 2 tanh (2dij )]. This gives a rate maternal sequences cannot be discerned. Therefore, calculation of shuffling for every linked locus pair, from which we calculate of E[¯r] proceeds according to Eq. 2, with its inter- and intrachro- the average intrachromosomal component of hr¯i using Eq. 10. mosomal components determined according to Eq. 5. Carrying out this calculation for all chromosomes, including the X, yields Measuring r in Humans E[¯r] = 0.4971, with an interchromosomal component of 0.4756 Analysis of Cytological Data. Fig. 4 shows the calculation of E[¯r] (calculated using chromosome lengths as they appear in Fig. 5) from the immunostained pachytene chromosomes of a single and an intrachromosomal component of 0.0215 (all details are human spermatocyte (103) and oocyte (39). For both sperma- in Dataset S2). The interchromosomal contribution is about 22 tocyte and oocyte, E[¯r] is close to its maximum value of 1/2. This times larger than the intrachromosomal contribution. Excluding high level of shuffling is due almost entirely to IA: in both cases, the X, E[¯r] = 0.4965, with an interchromosomal component of the interchromosomal component of E[¯r] is much larger than the 0.4735 and an intrachromosomal component of 0.0230. intrachromosomal component (from COs)—by a factor of about Fan et al. (119) haplophased a male using microfluidic tech- 35 in the male and 27 in the female. niques, and Wang et al. (94) sequenced 91 individual sperm cells The spermatocyte in Fig. 4A contains easily identified, par- from this male using the haplophasing from ref. 119 to determine tially paired X and Y chromosomes, while the oocyte in Fig. 4B the CO points in each sperm’s sequence (given in table S2 in

6 of 10 | www.pnas.org/cgi/doi/10.1073/pnas.1817482116 Veller et al. Downloaded by guest on September 24, 2021 Downloaded by guest on September 24, 2021 hfigC rqec n a eghd o aeinto take not length—do genetic map genome-wide of and measures frequency current value shuffling—CO common female most the The X, from the calculated Excluding Discussion 0.0215 5. of in Fig. value spread in the 0.0193. single-oocyte is egg the than the to single smaller from similar the but 0.0183, calculated 4 is 0.0177 Fig. chromosome, val- of estimating agreement X value of This the method S4). including map Table value, linkage Appendix, the from (SI above idates are well) calculated contributions as 0.0143 chromosome-specific of similar (the value spermatocytes the to 824 value similar male a 0.0138, yields This of lengths). chromosome known from lated sexes at both available for are maps Icelandic linkage (data of high-density generate pairs to father–offspring individuals 6,407 and Data. mother–offspring Map Linkage of Hutterite Analysis for cited 26.2 and previously (min- (52) the each males (117). Icelandic with COs males for compared 22.8 25.9 average, of 32) averages on maximum, show, 12; sperm imum, 91 the of variant Indeed, of a for reduction homozygous a is subsection he previous as the the expected, is than in average This lower spermatocytes The (0.0143). is from respectively. individual calculated 0.0178, this that and across for 0.0058 component component are intrachromosomal intrachromosomal sperm the 91 all variation of values of of The maximum (coefficient component component. 0.0026 intrachromosomal this is in the gametes of genomic across SD fixed variance the Eq. no (using from chromosomes therefore, deduced individual’s of average, the origin an of lengths is parental component sperm interchromosomal the each the about sperm, for 119, each length. information ref. in genomic sequences no in chromosome’s the is reported that of (as there position proxy chromosome Because our the each as chromosome used on S2) we SNP the dataset genome, last exceeded reference the human individual of the this of for lengths shuffled positions some because SNP component, times interchromosomal calcu- 37 the In lating contribution. about intrachromosomal average is the than contribution larger interchromosomal average The elre al. et Veller of component intrachromosomal the estimate the from h function then from available map We pairs Kosambi is rbar). locus the and code linked using loci all them generat- (MATLAB “pseudomarker” between for for spaced distances shuffling above evenly map of described of rates method map estimate linkage the a apply ing we sex, each for code from (MATLAB available respectively is calculations 0.0127, all and 0.4729 not of ponents are sequences of paternal of and gametes Eq. maternal use we Because known, 94). ref. r ¯ i E[¯ RNF212 sn Eq. using r ] f045,wt vrg ne-aditahoooa com- intrachromosomal and inter- average with 0.4856, of E[¯ eeta,i ooyosfr,i soitdwith associated is form, homozygous in that, gene r ] 10 –.%i Onmesi ae 18 123). (118, males in numbers CO in ∼5–7.5% n t opnns hsyed naeaevalue average an yields This components. its and teitrhoooa opnn scalcu- is component interchromosomal (the 8 ocluaeteaeaevlears l 91 all across value average the calculate to https://www.decode.com/addendum oge l 4)sqecd8,850 sequenced (41) al. et Kong https://github.com/cveller/rbar ≈ https://github.com/cveller/ 0. .Temnmmand minimum The 2). r ¯ h r ¯ cosgametes across h female The i. .Teeis, There 5). .For ). ). oisuflsterallsi meiosis. in alleles their shuffles account. loci into features into these measure, genome-wide of take This of both measure takes not a naturally do that proposed shuffling have also limitation, we shuffling Here, significant IA. aggregate account a Tra- of COs. is central measures than This shuffling ditional less COs. cause COs of terminal because positions the account h oiin fteCsidpnetyfo h miia dis- average empirical observed The the spermatocytes. from all independently across pooled COs tribution the of chro- positions contribution each the average for interferenceless intrachromosomal interference, an to without 94 calculate refs. COs we in sampling mosome used for method the 96 Following and from above. data al. spermatocyte described the et 122 using ref. males Goldstein human that for by In effect shuffling (127)]. made this Gorlova tify of been and amount Gorlov have and the points (126), increase [similar to cause tend they interference will intriguingly, COs Thus, among out. spread evenly more become Shuffling. of Genetic component intrachromosomal Increases Interference CO quantitatively. tested be to Our distinctions shuffling. of purpose of adaptive decomposition the of theories short-term for IA the We of Therefore, be component shuffling. typically se. aggregate will to per IA contributor that greatest shuffling example, the an aggregate as humans However, on using shown, shuffling. rely have of theories value short-term adaptive the the of theories long-term onsaogacrmsm) ne hsve,teC compo- many CO at the occur view, can this Under COs of nent chromosome). whereas shuf- a points, IA than along (because fixed effective points associations more at allelic are genomes loci reducing COs fles different that gradually at argued in genes has IA on (86) natural efficiently Burt that allows more (3–7)]. theory time operate over to “Fisher–Muller” correlations selection positing the allelic those [e.g., reducing and gradually advantage theories] 125) (1, long-term (2, Bank a Tangled sib-competition the and short-term e.g., 124) shuf- a of positing diversification; those value [offspring categories: adaptive advantage two the been into of long roughly Theories has fall relative 86). shuffling fling 85, of The sources 15, IA. as (1, from IA discussed and component COs a of and importance COs from component Sec- Mendel’s Law. and ond Crossing-Over of Contributions the Distinguishing data. cytological will and sequence that this standard pos- demonstrated from that this have of hope support we In We sibility, possible. where species). dif- practice across across common variation become (e.g., its and situations shuffling ferent genetic basis, nature, of the significance signifi- understanding and of of therefore, possibility production should, the the measure improve cantly in this occurs of that Application gametes. shuffling of amount Shuffling. aggregate Genetic of son r hudB h esr fCoc o vlainadCompari- and Evaluation for Choice of Measure the Be Should r ¯ ol emr motn hnteI opnn for component IA the than important more be would ehv hw that shown have We r ¯ r ¯ stepoaiiyta admycoe arof pair chosen randomly a that probability the is , a emr motn hnteC component CO the than important more be may r ¯ t opnns oie rmrf 7 Copyright 97. Elsevier. from ref. permission with from (2013), Modified components. its Eq. use then and which) knowing proportion 5. Fig. h rprino h g’ N hti paternal. is chromosome that each for DNA however, egg’s can, are the We which of calculate and proportion directly the maternal cannot sep- are we haplotypes Therefore, points the paternal. CO of pedi- by which extended tell arated an cannot of we sequencing gree, no With human haplophased female. a from obtained egg individual noissprt opnnsalw these allows components separate its into E[¯ r ] r ¯ yrsmln,i ahspermatocyte, each in resampling, by aclto of Calculation stems ietmaueo the of measure direct most the is p k hti foeprna rgn(without origin parental one of is that r ¯ r ¯ a edcmoe noa into decomposed be can sta ticessa COs as increases it that is r ¯ a edl emeasured be readily can NSLts Articles Latest PNAS ¯ r rmtesqec fan of sequence the from equan- we Appendix, SI rpryo the of property A 5 ocalculate to k | n the find f10 of 7 ¯ r and p,

EVOLUTION value of the intrachromosomal contribution to E[¯r] is about 15% would be easiest with sequence data, especially in species with higher than this interferenceless value. well-annotated genomes. For species without well-annotated CO interference, first described more than a century ago genomes, proxies of regional gene density, such as euchromatin (88), is a deeply conserved feature of the meiotic program content or guanine–cytosine richness, could be used. Correcting (128, 129). Despite this, its adaptive function, if any, remains for gene density with cytological data would be more compli- unclear (104, 130, 131). One category of hypotheses invokes cated, since it requires knowledge of the identity and orienta- mechanical advantages of spread-out COs either to bivalent sta- tion of the various chromosomes. At a crude resolution, this bility in prophase (132) or to homolog segregation at meiosis can be achieved with chromosome-specific FISH probes (and I (133). However, such ideas are challenged by the fact that if required, centromere localization). Finer-scale identification meiotic segregation proceeds without trouble in organisms that of different genomic regions in cytological spreads could be lack CO interference [e.g., fission yeast and Aspergillus (134)] achieved using high-resolution FISH (147–149). In any case, or that have had interference experimentally reduced or elimi- the usual problems would arise in deciding which parts of the nated (135, 136). This study raises the possibility of a qualitatively genome are “functional” (150, 151). different idea: that CO interference provides an evolutionary advantage via its effects on genetic transmission. By explicitly Taking Heterozygosity into Account. Genetic shuffling of alleles taking into account the positions of COs in quantifying how much at two loci in meiosis is practically relevant only if both loci shuffling they cause, we show that CO interference serves to are heterozygous. A simple modification of r¯ that measures the increase the shuffling caused by a given number of COs. This extent of such “effective” shuffling in meiosis is the proportion of finding, therefore, suggests a possible adaptive function of CO locus pairs that shuffle their alleles and are both heterozygous. interference. Again, we are mostly interested in the shuffling of functional elements, in which case estimation of this measure requires Additional Measures of Genetic Shuffling. r¯ is a suitable measure knowledge of the genomic landscape of functional heterozygosity of the total amount of shuffling, linearly aggregating the rates [e.g., nonsynonymous heterozygosity in protein-coding genes as of shuffling between different locus pairs. In this way, it quanti- available for humans (152)] together with knowledge of linkage fies the first-order effect of shuffling: the genetic diversification disequilibrium between nearby loci [e.g., for humans (153)]. of gametes/offspring (1). However, certain population genetic properties exhibit nonlinear dependence on the rates of shuffling Taking Gene Conversion into Account. Another source of shuf- between loci. For example, theoretical studies have determined fling, in addition to COs and IA, is gene conversion (GC) (154, that certain allelic associations can jointly increase in frequency 155): the unidirectional copying of a DNA tract from one chro- only if their constituent loci shuffle their alleles at a rate below matid (the donor) to a homologous chromatid (the acceptor) some critical threshold. This has been shown for coadapted either in mitosis or in meiosis. In SI Appendix, we show how gene complexes exhibiting positive fitness epistasis (137–139), to take both mitotic and meiotic GC into account in measur- “poison–antidote” gamete-killing haplotypes involved in meiotic ing r¯. In meiosis, chromosome pairing involves many points drive (11, 140–142), and associations between sex-determining of interaction between homologous chromatids [∼ 200 across genes and genes with sexually antagonistic fitness effects (53, all chromosomes in mouse spermatocytes (156)], each induced 143–145). Similarly, Hill–Robertson interference is effective only by a double-strand break on one of the chromatids. In most among loci that are tightly linked, shuffling their alleles at a rate organisms, only a minority of these interactions result in COs below a small threshold (which depends on the effective pop- (104, 157, 158), the majority instead resulting in non-COs. Both ulation size and other parameters) (146). In quantifying these outcomes are associated with short tracts of GC [e.g., ∼ 100 various properties at an aggregate level, a more suitable measure bp for non-CO GCs and ∼ 500 bp for CO GCs in mice (98, would be the proportion of locus pairs that shuffle their alleles at 156)]. These tracts, although numerous, are too short to seriously a rate below the critical threshold. Such measures would be infor- affect the total amount of shuffling [for example, the contribu- mative, for example, of which chromosomes are most likely to be tion from meiotic GCs in mammals will be of the order 10,000 co-opted as new sex chromosomes, of how susceptible a species times smaller than the contribution from COs (SI Appendix); is to the invasion of meiotic drive complexes (or within species, in yeast, this figure is smaller—about 200 (SI Appendix)]; how- which chromosomes are most likely to harbor such complexes), ever, they may have important local effects (154, 159). Mitotic or of the average fitness reduction caused by Hill–Robertson GC tracts result from homologous chromosome repair, are interference. Calculating these threshold-based aggregate mea- typically much longer than meiotic GC tracts (often of order sures would require estimates of the average rates of shuffling for 10 kb) (160, 161), and if they occur in germline cell divisions specific locus pairs. High-density linkage maps would, therefore, leading to meiocytes, can lead to shuffling of maternal and be especially promising for such measurements. paternal DNA in gametes (which should especially affect males, who have more germline cell divisions). Too little is presently Taking Gene Density into Account. The calculation of r¯, as we known about the global frequency of these events for us to have defined it, treats all regions of equal genomic length as give a reasonable quantitative estimate of how much shuffling of the same importance. However, the quantity that we seek to they cause. measure is really the amount of shuffling of functional genetic elements—it is largely irrelevant if two functionless pseudogenes ACKNOWLEDGMENTS. We thank Anthony Geneva, David Haig, Arbel shuffle, for example. Correcting the calculation of r¯ to take Harpak, Dan Hartl, Alison Hill, Thomas Lenormand, Pavitra Muralidhar, gene density into account is an important extension, a point Naomi Pierce, Molly Przeworski, Nadia Singh, and Martin White for helpful comments and Colin Donihue for help with figure preparation. C.V. thanks already noted by Burt et al. (27). This is especially so if the Bob Trivers for impressing on him the importance of CO position for the distribution of genes is nonrandom along or across chromo- amount of recombination. The research of N.K. was supported by NIH Grant somes. Correcting for gene densities of different genomic regions GM-044794.

1. Bell G (1982) The Masterpiece of Nature: The Evolution and Genetics of Sexuality 7. Felsenstein J (1974) The evolutionary advantage of recombination. Genetics 78:737– (University of California Press, Berkeley, CA). 756. 2. Williams GC (1975) Sex and Evolution (Princeton Univ Press, Princeton). 8. McDonald MJ, Rice DP, Desai MM (2016) Sex speeds adaptation by altering the 3. Fisher RA (1930) The Genetical Theory of Natural Selection (Clarendon, Oxford). dynamics of molecular evolution. Nature 531:233–236. 4. Muller HJ (1932) Some genetic aspects of sex. Am Nat 66:118–138. 9. Hamilton WD (1980) Sex versus non-sex versus parasite. Oikos 35:282–290. 5. Muller HJ (1964) The relation of recombination to mutational advance. Mut Res 1: 10. Hamilton WD, Axelrod R, Tanese R (1990) Sexual reproduction as an adaptation to 2–9. resist parasites (a review). Proc Natl Acad Sci USA 87:3566–3573. 6. Hill WG, Robertson A (1966) The effect of linkage on limits to artiﬁcial selection. 11. Haig D, Grafen A (1991) Genetic scrambling as a defence against meiotic drive. J Theor Genet Res 8:269–294. Biol 153:531–558.

8 of 10 | www.pnas.org/cgi/doi/10.1073/pnas.1817482116 Veller et al. Downloaded by guest on September 24, 2021 Downloaded by guest on September 24, 2021 4 rnvi ,Co 21)Srmln gs eoi rv n h vlto of evolution the and drive Meiotic eggs: Scrambling (2012) G Coop drive. Y, meiotic Brandvain and recombination, 14. Segregation, tetrads: in Games (2010) D Haig (2006) RL 13. Trivers A, Burt 12. 6 reJ,Mre M areC 19)Dfeecsi rsoe frequency crossover of test in A domestication: under Differences recombination of evolution (1996) The (2004) CC J Ross-Ibarra Laurie 17. of JM, theories two Mercer of test a JR, as frequencies True chiasma 16. Mammalian (1987) G Bell A, Burt 15. 9 uotB,PyerB 21)Eouino h eoi eobnto aei murid in rate recombination in genomic the recombination of of Evolution (2011) rate BA genomic Payseur BL, the Dumont of 19. Evolution (2008) BA Payseur BL, Dumont 18. 1 ro ,e l 21)Vraino h eoi eobnto adcp n properties and landscape recombination meiotic the of Variation (2017) al. et C, Brion 21. into Insights mammals: eutherian in recombination of Evolution (2013) al. et J, Segura 20. 5 ohe E hryJ,Ln ,Hn A asl J(02 eei oto fmam- of control Genetic (2002) TJ Hassold nat- PA, Hunt of A, architecture Lynn JP, genetic Cherry The KE, (2016) Koehler N 35. Singh T, Mackay W, Huang C, back- Hunter genetic male 34. of effects the Testing matter? males Do (2014) N in Singh recombination C, of Hunter landscapes many 33. The (2012) S Bailin R, Ratnappan in JM, Comeron value 32. recombination of change Genetic (1972) for role MG A sexes: Kidwell between difference 30. Recombination (2005) J Dutheil T, Lenormand 29. in Variation (2017) CM Smadja AW, Santure related SE, closely Johnston PGD, in Feulner variation J, rate Stapley Recombination 23. (2011) MAF Noor CS, Smukowski 22. 6 uotB,Boa W asu A(09 aito ngnmcrcmiainrates recombination genomic in Variation (2009) BA Payseur KW, Broman BL, Dumont 36. in value recombination of change Genetic (1972) MG Kidwell 31. recombination. in dimorphism sex of evolution The (2003) T Lenormand 28. recombina- understand to data and theory Connecting (2017) BA Payseur AL, Dapper 24. 1 ogA ta.(00 iesaercmiainrt ifrne ewe sexes, seg- chromosome between and recombination differences of maps rate Genome-wide (2015) recombination al. et CS, Fine-scale Ottolini (2010) 42. al. et Comprehensive (1998) A, JL Kong Weber 41. RL, White oocytes. VC, human Sheffield in JC, recombination Murray KW, Meiotic Broman (2009) 40. al. humans. et in success EY, reproductive Cheng and rate 39. Recombination (2004) mammalian al. and et length A, complex Kong synaptonemal of 38. Covariation (2002) al. et A, Lynn 37. recombination. in differences Sex (1991) PH Harvey G, Bell A, Burt selection. 27. sexual and recombination of rates in differences Sex (1988) RL Trivers rates recombination genomic 26. in Variation (2007) P Schmid-Hempel J, Gadau L, Wilfert 25. 6 asl ,e l 20)Ctlgclsuiso eoi eobnto nhmnmales. human in recombination meiotic of in studies Cytological frequency (2004) al. chiasma et T, of Hassold variation or 46. Adaptive Developmental rate: (1972) recombination in I Variation Variation Maudlin (2017) (2002) 45. ND Singh G MAF, Jones Noor KR, F, Ritz Franklin 44. J, Santos S, Armstrong E, Sanchez-Moran 43. 7 ez L ta.(05 xrm eeoeet ntemlclreet edn to leading events molecular the in heterogeneity Extreme (2005) al. et ML, Lenzi 47. elre al. et Veller eaercmiainrates. recombination female Nat n itiuinaogtresbigseiso Drosophila. hypotheses. two of species sibling three 523. among distribution and recombination. rodents. mammals. vrabodeouinr itnei yeasts. in distance evolutionary broad a over Sci interference. Biol crossover B and rates recombination affect that mechanisms ainmitcrcmiain .Vraini xhnefeunisaogmales among frequencies exchange in strains. mouse Variation inbred I. from recombination. melanogaster. meiotic Drosophila malian in rate recombination in 12:e1005951. variation ural in rates crossover meiotic 68:2718–2726. female on ground melanogaster. Drosophila some and selection. natural recombination Simulated II. low melanogaster. and high for genes. recombination-modifying selection of properties Artificial I. melanogaster. selection. haploid 163:811–822. Sci Biol B processes. Soc and R Patterns Trans eukaryotes: Philos across distribution and frequency recombination species. mn eeoeeu tc mice. stock heterogeneous among 277. evolution. rate tion brunneus. recombination maternal for selection show embryos rates. and oocytes human in regation individuals. and populations recombination. in variation sex-specific and Genet Hum Individual maps: genetic human 5:e1000661. Genet rates. exchange meiotic (Sinauer, BR Levin RE, Michod eds 270–286. Ideas, pp Current MA), of Sunderland, Examination An Sex: of Evolution insects. social of case the and taxa animal among yoe eoeRes Genome Cytogen not? of accessions eight among 162:1415–1422. frequency chiasma in h salsmn fcisaadrn eoi nhmnoocytes. human in I meiosis 76:112–127. during chiasmata of establishment the 176:404–413. rnsGenet Trends a Genet Nat 36:1203–1206. Heredity Genetics 280:20131945. Heredity Evolution 63:861–869. Nature mNat Am 47:727–735. 107:496–508. 187:643–657. LSBiol PLoS 33:364–374. 29:259–262. hlsTasRScBBo Sci Biol B Soc R Trans Philos 62:276–294. 107:249–255. 326:803–805. Science ee nConflict in Genes 163:105–112. LSGenet PLoS Genetics 372:20160455. 3:e63. Nature Genetics 296:2222–2225. Genetics 467:1099–1103. 162:297–306. 190:709–723. 8:e1002905. Blnp abig,MA). Cambridge, (Belknap, 182:1345–1349. LSGenet PLoS Genetics Genetics rspiamelanogaster Drosophila 372:20160469. Heredity rbdpi thaliana. Arabidopsis 70:433–443. 70:419–432. 13:e1006917. 98:189–197. Genetics vlBiol Evol J mJHmGenet Hum J Am rcRySoc Roy Proc Chorthippus LSGenet PLoS LSGenet PLoS . Drosophila Drosophila Evolution 142:507– Genetics Genetics 4:259– mJ Am The Nat Am 0 igM amnD(98 esnlvraino ham rqec in frequency chiasma of variation Seasonal (1978) D Hayman map M, linkage King high-density 70. A (2017) JM Pemberton PA, genome Ellis J, and Huisman conflict SE, genetic Johnston between 69. feedback evolutionary The distribution (2014) and frequency A chiasma Young of 68. comparison A (1980) GM Hewitt contrasted. meiocytes HL, female Fletcher and male 67. in Recombination (1977) by PE identified Perry individuals HG, admixed Callan in 66. rates Recombination (2011) al. et Americans. D, African Wegmann in recombination 65. of landscape The (2011) al. of et AG, strains of Hinch cytogenetics different population The 64. in (1981) J Parker arising G, Jones over L, Edgar crossing C, Whitehouse Variable in 63. (1955) over EE crossing of Levine control RP, genotypic Recombination Levine The are: 62. (1954) crossovers EE the Levine Where RP, (2004) Levine S 61. Keeney AJ, Jeffreys L, Kauppi in 60. studies interference. Cytological chromatid (1935) of A analysis Levan Statistical (1995) 59. TP Speed MS, McPeek H, Zhao 58. (1939) CD crossing-over. of Darlington biology The 57. (1937) CD Darlington 56. (1932) CD Darlington 55. (2012) M Ruvolo DL, environmental Hartl of instability the 54. and genome. antagonism Sexual human (2018) the C Veller of P, map Muralidhar recombination high-resolution 53. A (2002) al. et A, recombination Kong for variation 52. genetic of organization oocytes. The in (1986) RW formation Marks chiasma LD, and Brooks over 51. Crossing IX. meiosis Human (1985) aneuploidy M elevated Bojko underlies maturation 50. crossover of Inefficient studies (2017) al. Cytological et (2013) S, Wang T Hassold 49. PA, Hunt KW, Broman C, Rubio JR, Gruhn 48. 9 ulrH 11)Temcaimo crossing-over. of mechanism The (1916) HJ Muller 89. chromosomes. individual for maps (1973) MJD recombination White male 76. Human recombina- (2004) mouse al. et Male F, (2002) Sun T 75. Ashley J, in Wienberg variation for LK, Polymorphic maps Anderson (2007) L, recombination M Morley Froenicke in D, 74. distribution Hirschmann in MLH1 JT, distribution Burdick in VG, chiasma Variation Cheung (2006) in 73. al. variation et Inter-individual F, (1981) Sun G 72. Jones D, Laurie 71. 8 treatA 11)Telna ragmn fsxsxlne atr in factors sex-linked six of arrangement linear The (1913) selection—was AH of Sturtevant efficacy 88. (1972) the RE and Cleland recombination, 87. Sex, Perspective: (2000) A Burt 86. recombination. of importance The (1988) JF diversity. Crow and Entropy 85. in (2006) rates L diversity. Jost recombination of Low Measurement 84. (1949) (2017) EH T Simpson Lenormand 83. R, Zahab L, Theodosiou chiasma CR, from Haag recombination genetic 82. estimating for index new A distribu- (1992) PC chiasma Colombo in 81. differences Sex (1994) OY Gorlova AI, Zhelezova IP, recombination. Gorlov of and system Sex 80. (1988) inversion HL pericentric Fletcher the 79. in variability Geographical (1976) DB Weissman eusociality. and 78. numbers chromosome Insect (1979) PW Sherman 77. evselaphus). (Cervus deer red in (Bethesda) G3 landscapes recombination sexually-dimorphic reveals MA). Cambridge, University, (Harvard thesis PhD architecture. grasshoppers. of species three in sexes between Sci Biol B Lond Soc R Trans Philos inference. ancestry-based 476:170–175. capillaris pseudoobscura. Drosophila pseudoobscura. mammals. in distributions of species diploid some Genetics UK). Cambridge, Ed. 8th MA), Burlington, Learning, determination. sex Genet Nat in Commun Res Carlsberg meiosis. female human to, in prior or at, originate breaks. recombination double-strand in of establishment differences Sex-specific meiosis: human marmoratus mJHmGenet Hum J Am painting. chromosome by identified 71:1353–1368. autosome each for maps tion recombination. meiotic human males. human from chromosomes individual brunneus Chorthippus ssonb hi oeo association. of mode their by shown as pp MA), Sunderland, right? (Sinauer, Weismann BR Levin RE, Michod eds Ideas, 56–73. Current of ination transitions. sex-asex and species sexual frequency. data. distribution recombination in distribution differences chiasma sex in for Differences 166. reason chromosomes: mouse a marked as two along tion grasshopper the 935. Ed. 3rd UK), rspiamelanogaster Drosophila 139:1057–1065. .Cisavariation. Chiasma I. 31:241–247. Ga)Gkoia Reptilia). – (Gray)(Gekkonidae 7:2859–2870. Genetics rmrtoi pseudofasciata. Trimerotropis Evolution 74:521–531. Heredity eohr:Ctgntc n Evolution and Cytogenetics Oenothera: nmlCtlg n Evolution and Cytology Animal a clEvol Ecol Nat eetAvne nCytology in Advances Recent Ototr:Acrididae). (Orthoptera: h vlto fGntcSystems Genetic of Evolution The 50:43–72. Allium. 39:677–691. a Genet Nat Cell eeis nlsso ee n Genomes and Genes of Analysis Genetics: a e Genet Rev Nat 69:412–415. Genetics 54:337–351. . Genetics Heredity 168:977–989. mJHmGenet Hum J Am Hereditas 2:343–351. 277:227–233. Oikos 40:399–405. 43:847–853. 114:525–547. I h hoooemrhlg of morphology chromosome The VI. Allium, hlsTasRScBBo Sci Biol B Soc R Trans Philos x o A Zoo Exp J 47:95–103. LSOne PLoS 5:413–424. 113:363–375. 20:289–330. Nature Chromosoma u o Genet Mol Hum Nature Chromosoma Heredity Chromosoma 80:526–530. CmrdeUi rs,Cambridge, Press, Univ (Cambridge 8:e85075. mNat Am 331:492. NSLts Articles Latest PNAS Cucil London). (Churchill, Nature 14:43–59. h vlto fSx nExam- An Sex: of Evolution The 163:688–688. 47:409–416. 69:131–154. Aaei,London). (Academic, 50:193–221. 140:759–761. CmrdeUi Press, Univ (Cambridge 55:325–347. 15:2376–2391. 77:129–144. ee Res Genet 372:20160461. mNat Am mJHmGenet Hum J Am Jns&Bartlett & (Jones Phyllodactylus Drosophila, Drosophila | 113:925– 64:161– f10 of 9 Nature Crepis

EVOLUTION 90. Anderson LK, Reeves A, Webb LM, Ashley T (1999) Distribution of crossing over 124. Ghiselin MT (1974) The Economy of Nature and the Evolution of Sex (University of on mouse synaptonemal complexes using immunofluorescent localization of MLH1 California Press, Berkeley, CA). protein. Genetics 151:1569–1579. 125. Maynard Smith J (1976) A short-term advantage for sex and recombination through 91. Møens PB, et al. (2002) The time course and chromosomal localization of sib-competition. J Theor Biol 63:245–258. recombination-related proteins at meiosis in the mouse are compatible with models 126. Goldstein DB, Bergman A, Feldman MW (1993) The evolution of interference: that can resolve the early DNA-DNA interactions without reciprocal recombination. J Reduction of recombination among three loci. Theor Pop Biol 44:246–259. Cell Sci 115:1611–1622. 127. Gorlov IP, Gorlova OY (2001) Cost–benefit analysis of recombination and its 92. Shifman S, et al. (2006) A high-resolution single nucleotide polymorphism genetic application for understanding of chiasma interference. J Theor Biol 213:1–8. map of the mouse genome. PLoS Biol 4:e395. 128. Hillers KJ (2004) Crossover interference. Curr Biol 14:R1036–R1037. 93. Roach JC, et al. (2010) Analysis of genetic inheritance in a family quartet by whole- 129. Wang S, Zickler D, Kleckner N, Zhang L (2015) Meiotic crossover patterns: Obligatory genome sequencing. Science 328:636–639. crossover, interference and homeostasis in a single process. Cell Cycle 14:305–314. 94. Wang J, Fan HC, Behr B, Quake SR (2012) Genome-wide single-cell analysis 130. Lenormand T, Engelstadter¨ J, Johnston SE, Wijnker E, Haag CR (2016) Evolutionary of recombination activity and de novo mutation rates in human sperm. Cell mysteries in meiosis. Philos Trans R Soc B Biol Sci 371:20160001. 150:402–412. 131. Saito TT, Colaiacovo´ MP (2017) Regulation of crossover frequency and distribution 95. Lu S, et al. (2012) Probing meiotic recombination and aneuploidy of single sperm cells during meiotic recombination. Cold Spring Harb Symp Quant Biol 82:223–234. by whole-genome sequencing. Science 338:1627–1630. 132. van Veen J, Hawley R (2003) Meiosis: When even two is a crowd. Curr Biol 13:R831– 96. Mancera E, Bourgon R, Brozzi A, Huber W, Steinmetz LM (2008) High-resolution R833. mapping of meiotic crossovers and noncrossovers in yeast. Nature 454:479–485. 133. Koszul R, Meselson M, Van Doninck K, Vandenhaute J, Zickler D (2012) The centenary 97. Hou Y, et al. (2013) Genome analyses of single human oocytes. Cell 155:1492–1506. of Janssens’s chiasmatype theory. Genetics 191:309–317. 98. Cole F, et al. (2014) Mouse tetrad analysis provides insights into recombination 134. Egel R (1995) The synaptonemal complex and the distribution of meiotic recombina- mechanisms and hotspot evolutionary dynamics. Nat Genet 46:1072–1080. tion events. Trends Genet 11:206–208. 99. von Wettstein D, Rasmussen SW, Holm PB (1984) The synaptonemal complex in 135. Fernandes JB, Seguela-Arnaud´ M, Larchevequeˆ C, Lloyd AH, Mercier R (2018) genetic segregation. Annu Rev Genet 18:331–411. Unleashing meiotic crossovers in hybrid plants. Proc Natl Acad Sci USA 115:2431–2436. 100. Sherman JD, Stack SM (1995) Two-dimensional spreads of synaptonemal complexes 136. Serra H, et al. (2018) Massive crossover elevation via combination of HEI10 and recq4a from solanaceous plants. VI. High-resolution recombination nodule map for tomato recq4b during Arabidopsis meiosis. Proc Natl Acad Sci USA 115:2437–2442. (Lycopersicon esculentum). Genetics 141:683–708. 137. Crow JF, Kimura M (1965) Evolution in sexual and asexual populations. Am Nat 101. Zickler D, Kleckner N (1999) Meiotic chromosomes: Integrating structure and 99:439–450. function. Annu Rev Genet 33:603–754. 138. Weinreich DM, Chao L (2005) Rapid evolutionary escape by large populations from 102. Anderson LK, et al. (2014) Combined fluorescent and electron microscopic imaging local fitness peaks is likely in nature. Evolution 59:1175–1182. unveils the specific properties of two classes of meiotic crossovers. Proc Natl Acad Sci 139. Neher RA, Shraiman BI (2009) Competition between recombination and epistasis can USA 111:13415–13420. cause a transition from allele to genotype selection. Proc Natl Acad Sci USA 106:6866– 103. Hassold T, Sherman S, Hunt P (2000) Counting cross-overs: Characterizing meiotic 6871. recombination in mammals. Hum Mol Genet 9:2409–2419. 140. Hartl DL (1975) Modifier theory and meiotic drive. Theor Popul Biol 7:168–174. 104. Hunter N (2015) Meiotic recombination: The essence of heredity. Cold Spring Harbor 141. Liberman U (1976) Modifier theory of meiotic drive: Is Mendelian segregation stable? Perspect Biol 7:a016618. Theor Popul Biol 10:127–132. 105. Fransz P, et al. (1998) Cytogenetics for the model system Arabidopsis thaliana. Plant 142. Charlesworth B, Hartl DL (1978) Population dynamics of the segregation distorter J 13:867–876. polymorphism of Drosophila melanogaster. Genetics 89:171–192. 106. Ruiz-Herrera A, et al. (2017) Recombination correlates with synaptonemal com- 143. Rice WR (1986) On the instability of polygenic sex determination: The effect of sex- plex length and chromatin loop size in bovids–insights into mammalian meiotic specific selection. Evolution 40:633–639. chromosomal organization. Chromosoma 126:615–631. 144. van Doorn GS, Kirkpatrick M (2007) Turnover of sex chromosomes induced by sexual 107. Møens PB, Pearlman RE (1988) Chromatin organization at meiosis. Bioessays 9:151– conflict. Nature 449:909–912. 153. 145. van Doorn GS, Kirkpatrick M (2010) Transitions between male and female heteroga- 108. Kleckner N (2006) Chiasma formation: Chromatin/axis interplay and the role (s) of the mety caused by sex-antagonistic selection. Genetics 186:629–645. synaptonemal complex. Chromosoma 115:175–194. 146. McVean GAT, Charlesworth B (2000) The effects of Hill-Robertson interference 109. Li S, Meistrich M, Brock W, Hsu T, Kuo M (1983) Isolation and preliminary characteri- between weakly selected mutations on patterns of molecular evolution and variation. zation of the synaptonemal complex from rat pachytene spermatocytes. Exp Cell Res Genetics 155:929–944. 144:63–72. 147. Beliveau BJ, et al. (2012) Versatile design and synthesis platform for visualizing 110. Kleckner N, Storlazzi A, Zickler D (2003) Coordinate variation in meiotic pachytene genomes with Oligopaint FISH probes. Proc Natl Acad Sci USA 109:21301–21306. SC length and total crossover/chiasma frequency under conditions of constant DNA 148. Beliveau BJ, et al. (2015) Single-molecule super-resolution imaging of chromosomes length. Trends Genet 19:623–628. and in situ haplotype visualization using Oligopaint FISH probes. Nat Comm 6: 111. Novak I, et al. (2008) Cohesin Smc1β determines meiotic chromatin axis loop 7147. organization. J Cell Biol 180:83–90. 149. Wang S, et al. (2016) Spatial organization of chromatin domains and compartments 112. Keyl HG (1975) Lampbrush chromosomes in spermatocytes of Chironomus. Chromo- in single chromosomes. Science 353:598–602. soma 51:75–91. 150. Eddy SR (2012) The C-value paradox, junk DNA and ENCODE. Curr Biol 22:R898–R899. 113. Pukkila PJ, Lu BC (1985) Silver staining of meiotic chromosomes in the fungus, 151. Eddy SR (2013) The ENCODE project: Missteps overshadowing a success. Curr Biol Coprinus cinereus. Chromosoma 91:108–112. 23:R259–R261. 114. Lu BC (1993) Spreading the synaptonemal complex of Neurospora crassa. Chromo- 152. Lek M, et al. (2016) Analysis of protein-coding genetic variation in 60,706 humans. soma 102:464–472. Nature 536:285–291. 115. Zhang L, Liang Z, Hutchinson J, Kleckner N (2014) Crossover patterning by the beam- 153. The 1000 Genomes Project Consortium (2015) A global reference for human genetic film model: Analysis and implications. PLoS Genet 10:e1004042. variation. Nature 526:68–74. 116. Handyside AH, et al. (2010) Karyomapping: A universal method for genome 154. Spencer CC, et al. (2006) The influence of recombination on human genetic diversity. wide analysis of genetic disease based on mapping crossovers between parental PLoS Genet 2:e148. haplotypes. J Med Genet 47:651–658. 155. Cole F, Keeney S, Jasin M (2012) Preaching about the converted: How meiotic gene 117. Coop G, Wen X, Ober C, Pritchard JK, Przeworski M (2008) High-resolution mapping conversion influences genomic diversity. Ann NY Acad Sci 1267:95–102. of crossovers reveals extensive variation in fine-scale recombination patterns among 156. Zelazowski MJ, et al. (2017) Age-dependent alterations in meiotic recombination humans. Science 319:1395–1398. cause chromosome segregation errors in spermatocytes. Cell 171:601–614. 118. Chowdhury R, Bois PRJ, Feingold E, Sherman SL, Cheung VG (2009) Genetic analysis 157. Kleckner N, Zhang L, Weiner B, Zickler D (2012) Meiotic chromosome dynamics. of variation in human meiotic recombination. PLoS Genet 5:e1000648. Genome Organization and Function in the Cell Nucleus, ed Rippe K (Wiley-VCH, 119. Fan HC, Wang J, Potanina A, Quake SR (2011) Whole-genome molecular haplotyping Weinheim, Germany), pp 487–533. of single cells. Nat Biotech 29:51–57. 158. Cole F, et al. (2012) Homeostatic control of recombination is implemented 120. Jeffreys AJ, et al. (2004) Meiotic recombination hot spots and human dna diversity. progressively in mouse meiosis. Nat Cell Biol 14:424–430. Philos Trans R Soc B 359:141–152. 159. Andolfatto P, Nordborg M (1998) The effect of gene conversion on intralocus 121. Kosambi DD (1944) The estimation of map distance from recombination values. Ann associations. Genetics 148:1397–1399. Eugen 12:172–175. 160. Lee PS, et al. (2009) A fine-structure map of spontaneous mitotic crossovers in the 122. Lian J, et al. (2008) Variation in crossover interference levels on individual yeast Saccharomyces cerevisiae. PLoS Genet 5:e1000410. chromosomes from human males. Hum Mol Genet 17:2583–2594. 161. Hum YF, Jinks-Robertson S (2017) Mitotic gene conversion tracts associated with 123. Kong A, et al. (2008) Sequence variants in the RNF212 gene associate with genome- repair of a defined double-strand break in Saccharomyces cerevisiae. Genetics wide recombination rate. Science 319:1398–1401. 207:115–128.

10 of 10 | www.pnas.org/cgi/doi/10.1073/pnas.1817482116 Veller et al. Downloaded by guest on September 24, 2021