<<

Downloaded by guest on September 30, 2021 nrwG Clark G. Andrew Chen Nancy population pedigreed natural a in dynamics frequency www.pnas.org/cgi/doi/10.1073/pnas.1813852116 population date. to population natural in a change in frequencies evolutionary allele short-term of the of descriptions one with complete provides study success, most This reproductive contribution. primarily smaller and a are making survival change flow in changes frequency variation frequency allele to allele due in Observed variance time. of through par- proportion we directional size, the population under in titioned changes SNPs for account few that flow. a models gene actually Using for identified accounting selection appropriately We after to selection flow. short-term attributed gene been to have due shifts may frequency allele otherwise large that some of with recent contributions of substantial, contribution is genetic genetic immigrants the pre- different that show well the results Our generally found for founders. are We accounting changes pedigree. by known frequency dicted the allele the on observed to drift that dropping contributions model genetic gene and individual performed population estimate We of to patterns time. simulations shaping through in variation relative processes genetic the evolutionary different characterize informa- of directly genomic roles Scrub-Jays to Florida and of coerulescens) population pedigree (Aphelocoma natural extensive long-studied a use from through tion population we the Here, in individuals time. all among set relationships the of pedigree, under- population Fully the quantify. requires processes to these difficult standing often movement—are suc- how and reproductive fre- cess, survival, understand allele individual in underlying shape to processes changes—variation flow actual these is the gene However, time. genetics and 2018) through 10, selection, quencies population August natural review for of drift, (received 2018 genetic goal 19, November approved central and Norway, A Oslo, Oslo, of University Stenseth, C. Nils by Edited 38152 TN Memphis, Memphis, of 33960; University FL Venus, Station, Biological Archbold 44195; OH Cleveland, Foundation, Clinic c a eei otiuino nidvda sa niiulsrepro- individual’s an is individual an of expected long-term contribution The (4). genetic descendants of pedigree their dropping of down gene transmission Mendelian via of or simulations (1–3) i.e., simulations, analytically estimated be contri- can individual, genetic Individual given butions generations. a future by in contributed population allele the neutral to a of copies num- of expected ber the i.e., contributions, genetic individual expected segregation. Mendelian of stochasticity genetic the time, differential with over the pedigree combined population of the to terms individuals in of contributions understood mechanisms precisely evolutionary be these can driving in However, flow gene change. and the evolutionary selection, about natural drift, inferences genetic make of effects time—to in genetic variation through of the frequencies available, patterns allele are day samples current temporal the if on observe variation—or, rely rare directly typically population Instead, cannot studies in change. genetic often (except genetic We underlying inheritance processes drive). actual Mendelian meiotic of of cases laws the following A eateto ilg,Uiest fRcetr ohse,N 14627; NY Rochester, Rochester, of University , of Department etrfrPplto ilg,Uiest fClfri,Dvs A95616; CA Davis, California, of University Biology, Population for Center nweg ftepplto eirealw st trace to us allows pedigree population the of Knowledge vligntrlpplto sesnilyavs pedi- vast pedigree a this down essentially transmitted material is genetic population with gree, natural evolving n a,b,c,1 | vnJuric Ivan , e pedigrees n rhmCoop Graham and , | fitness d lsaJ Cosgrove J. Elissa , e | eateto oeua ilg eeis onl nvriy taa Y14850; NY Ithaca, University, Cornell Genetics, & Biology Molecular of Department eeflow gene g onl a fOntooy onl nvriy taa Y180 and 14850; NY Ithaca, University, Cornell Ornithology, of Lab Cornell a,b | eei drift genetic e edBowman Reed , d eateto uniaieHat cecs enrRsac nttt,Cleveland Institute, Research Lerner Sciences, Health Quantitative of Department b eateto vlto clg,Uiest fClfri,Dvs A95616; CA Davis, California, of University Ecology, & of Department aadpsto:Aldt n oeue nti td a efuda Figshare, at found be 1073/pnas.1813852116/-/DCSupplemental. y at online can information supporting contains study article This this in used code 1 and data 10.6084/m9.figshare.7044368 All deposition: Data the under Published Submission.y Direct PNAS a is article This interest.y of conflict no declare and authors data; The analyzed paper. G.C. y the and wrote N.C. G.C. collection; and data N.C., J.W.F.,N.C. performed tools; R.B., S.J.S. reagents/analytic new and E.J.C., J.W.F., contributed N.C., R.B., I.J. E.J.C., and research; N.C. designed research; G.C. performed S.J.S. and and A.G.C., N.C., contributions: Author hne l 1)gnrtdgnm-iesingle-nucleotide genome-wide generated (19) al. Recently, pedigree. et extensive an Chen on individuals of thousands measures fitness for lifetime in Station. resulted has Biological 1969 since Archbold study Intensive at coerulescens) Scrub- Florida (Aphelocoma of Jays population free-living sampled in exhaustively loci many an at change frequency allele predict and describe to gree spread temporal (15–18). the alleles individual study of to loss used and rarely dropping been gene and quan- have and pedigrees inbreeding, approaches in of empirical of date, used To basis consequences (14). more increasingly genetic fitness much variation, the are genetic trait understand and understand titative to (13) to populations diseases used natural human been of long gene models have pedi- of empirical calculations probability Also, 12). the gree (4, populations predict breeding human captive to in in and loss effects 9–11) founder (1–3, assess populations to future contributions to genetic vidual on the passed to the blocks from genomic beneficial (8). genotype, weakly generations of new, individual’s distribution a an of complex of (5–7). loss of survival fitness many probability the individual influences individual of of an of aspects measure value reproductive general the Indeed, a value, ductive owo orsodnesol eadesd mi:[email protected] Email: addressed. be should correspondence whom To f ihnantrlpopulation. occurs natural change a study evolutionary within evolution- This short-term change. of how frequency illustrates roles clearly allele genetic relative governing long-term processes the with ary quantify fitness and individual contributions link gen- future We to transmitted erations. mechanisms is the material population genetic characterize how directly natural underlying to sampled Scrub-Jays tree Florida exhaustively family of extensive an an of for study frequency we (pedigree) population Here carry. the they pop- in alleles among the changes move to and leading die, ulations, reproduce, individuals populations. nature, of composition In genetic the in change is Evolution Significance onW Fitzpatrick W. John , ee ecmiegnmcdt ihakonpplto pedi- population known a with data genomic combine we Here, indi- estimate to used been have pedigrees known of Analyses NSlicense.y PNAS . y g tpa .Schoech J. Stephan , h eateto ilgclSciences, Biological of Department f www.pnas.org/lookup/suppl/doi:10. va clg Program, Ecology Avian NSLts Articles Latest PNAS h , | f7 of 1

EVOLUTION polymorphism (SNP) data for nearly every individual in the Using the detailed population pedigree, we calculated both the population over the past two decades to demonstrate how genealogical and expected genetic contribution of each individ- decreased immigration into the population led to increased levels ual to the study population from 1990 to 2013. Fig. 1 A and B of inbreeding and decreased mean fitness over time. In this study, shows results for two illustrative males, both of whom first bred we link individual lifetime reproductive success with long-term in 1994. Male A lived until 2006 and had 41 offspring, whereas genetic contributions and allele frequency change. We show Male B only lived until 2000 and had 7 offspring. We define how the population pedigree presents a powerful opportunity an individual’s genealogical contribution to a given year as the to directly elucidate the relative roles of drift, gene flow, and proportion of nestlings in the birth cohort who are genealogi- selection in governing allele frequency dynamics over time. cally descended from the focal individual, while an individual’s expected genetic contribution is the expected proportion of alle- Results les at a locus in the nestling cohort that comes from the focal Individual and Genetic Contributions. First, we consider a individual. Beyond a few generations, few genealogical descen- series of inferences that can be made purely with the pedigree, dants are expected to inherit any genetic material, so the number ignoring the SNP genotypes for the moment. We estimated fit- of genealogical descendants should quickly outnumber the num- ness for all 926 individuals who bred in our study population ber of genetic descendants. Fig. 1 A–C nicely demonstrates this in 1990–2013 and were born before 2002 (the age cohorts who pattern in our data, providing empirical illustration for a sub- are all dead by the end of 2014). Lifetime reproductive success stantial body of theory on the relationship between genetic and was highly variable in our study population: The total num- genealogical ancestry (8, 22, 23). An individual’s genealogical ber of nestlings produced over an individual’s lifetime ranged contribution in 2013 is correlated with its expected genetic con- from 0 to 43, with 197 individuals (21%) producing no nestlings tribution in 2013 (Spearman’s ρ = 0.99, p < 2 × 10−16), but its despite having at least one breeding attempt (SI Appendix, Fig. genealogical contribution is significantly larger (paired Wilcoxon S1). Only 43% produced any grand-offspring (range 0 to 189), test, p < 2 × 10−16; Fig. 1C). and 33% produced great-grand-offspring (range 0 to 210). As Individual fitness is a central concept in evolutionary biol- might be expected in these monogamous birds in which the sexes ogy but is notoriously difficult to measure (24). Here, we tested experience equal annual mortality (20, 21), we found no signifi- for a relationship between various proxies for fitness and the cant differences in individual fitness between males and females expected genetic contribution to the population. All three mea- (Wilcoxon rank sum test, p > 0.65 for all three measures of sures of fitness (number of offspring, grand-offspring, and great- fitness). grand-offspring) are significantly correlated with both the total

ABC

17 3 0.3 2 1 5 64 50 264 0.3

7 8 11 12 4 26 49 464 114 6 3 64 54 55 56 3 21 111 73 74 75 76 78 80 82 97 28 62 106 109 110 112 36 113 64 77 146 21 257 258 3 276 13 265 44 266 267 268 269 270 271 272 273 274 275 277 125 316 10 27 317 318

18 4 19 13 20 5 84 17

51 52 53 102 107 108 58 59 60 63 65 67 68 69 70 472 473 474 481 482 483 484 485 489 490 491 492 115 116 117 118 48 119 9 120 121 122 123 126 127 128 234 235 239 240 243 244 245 16 17 18 19 22 23 617 24 30 31 32 33 97 81 162 163 164 124 182 183 184 142 143 144 145 155 156 165 166 167 3 168 129 130 131 25 132 136 79 137 138 141 147 157 148 149 247 150 169 170 173 174 175 176 185 66 186 187 188 207 208 209 210 248 262 88 200 201 219 220 221 84 85 86 89 90 91 92 326 327 40 328 263 278 279 280 281 282 283 284 72 285 35 286 287 288 289 290 291 292 293 294 295 297 301 302 303 260 304 315 329 330 331 332 333 296 466 467 47 468 319 320 46 321 83 322 323 324 325 334 61 335 57 336 45 337 338 339 344 345 29 346 20 133 347 41 599 15 104 601 38 616 602 259 0.02

15 11 21 10 14 22 23 24 74 84 75 9 76 26 27 28 42 43 44 48 49 68 72 73 85 86 87 88 89 13 90 91 12 92 93 94 95 96 97 1 121 118 115 116 117 0.2 135 241 242 236 237 238 626 627 98 99 100 101 103 171 172 190 191 304 192 193 211 212 213 214 232 233 226 227 228 202 229 230 231 158 150 159 160 161 151 152 153 197 198 199 222 223 224 225 252 253 194 195 196 216 217 218 250 43 251 564 498 499 500 501 359 603 261 604 154 607 565 364 609 249 608 373 298 299 300 37 307 309 310 311 312 305 306 486 487 488 493 494 495 496 497 181 360 504 505 509 42 510 512 517 518 519 520 534 535 536 537 511 105 348 349 34 400 401 402 422 423 424 445 340 327 342 71 343 560 356 373 134 109 341 346 139 351 465 352 464 353 87 385 386 387 388 403 404 405 96 406 425 105 370 167 365 366 367 368 369 14 371 372 390 391 411 412 413 414 373 374 375 415 416 641 440 441 442 457 458 459 600 618 619 620 621 622 623 624 625 630 631 632 633 634 605 606 610 611 612 613 628 629 215 661 635 662 0.2

8 25 7 6 65 66 67 69 35 36 37 71 16 34 122 46 47 53 54 2 55 70 119 120 82 83 109 110 111 98 99 100 101 102 103 104 105 106 107 108 112 113 114 122 123 124 125 126 127 128 129 131 77 132 133 45 134 135 136 137 146 147 148

437 438 439 449 450 451 452 177 178 516 179 180 203 204 205 206 254 255 256 574 576 577 578 579 636 614 218 615 185 637 638 639 663 189 664 665 648 649 650 651 658 659 660 642 643 313 314 532 95 533 538 521 522 523 524 539 540 541 542 546 547 544 545 554 555 556 433 434 435 436 355 357 358 379 380 381 389 354 566 246 567 568 140 569 571 572 573 595 596 597 426 308 427 428 429 463 479 480 361 362 363 376 377 350 575 378 631 382 383 384 396 397 398 502 503 506 39 507 94 508 525 526 527 528 531 443 444 453 454 455 456 93 392 393 394 570 395 417 418 419 97 420 421 475 476 477 478 399 407 408 409 410 652 653 654 670 671 672 673 674 675 676

551 552 553 644 645 646 647 657 655 656 666 667 668 669 557 558 559 561 562 563 587 588 589 590 591 592 593 594 460 461 462 446 447 448 580 581 582 598 640 529 530 543 548 549 550 430 431 432 469 470 471 583 584 585 586 513 514 515

29 30 31 32 33 38 39 40 41 50 51 52 56 57 58 78 79 80 81 61 62 63 64 130 59 60 138 139 140 141 142 143 144 145 0.01 0.1 0.1 Genealogical & genetic contribution Genealogical & genetic contribution 0.0 0.0 0.00 Expected genetic contribution in 2013 Expected genetic contribution 1990 1995 2000 2005 2010 1990 1995 2000 2005 2010 0.0 0.1 0.2 0.3 Year Year D Genealogical contribution in 2013 0.4 0.06 0.06

0.2 0.04 0.04

0.0

0.02 0.02 0.2 Expected genetic contribution Expected genetic contribution

0.00 0.00 Predicted allele frequency change 0.4

1990 1995 2000 2005 2010 1990 1995 2000 2005 2010 0.4 0.2 0.0 0.2 0.4 Year Year Observed allele frequency change

Fig. 1. Genealogical (Top) and expected genetic contributions (Bottom) to the study population over time for two males who first bred in 1994 with total lifetime reproductive success of (A) 41 and (B) 7. Blue lines indicate the proportion of nestlings each year who are genealogical descendants. Black lines indicate mean expected genetic contribution for each year, and gray shading is the 95% confidence interval for their contribution at a neutral locus. The pedigree of all descendants of each individual in the study population is shown, with an arrow indicating the focal individual, and solid symbols denoting individuals still alive in 2013. (C) Genealogical contributions and expected genetic contributions to the population in 2013 for all breeders born before 2002 who first bred in 1990 or later (926 individuals). The dotted line indicates a one-to-one relationship. (D) Predicted versus observed change in allele frequencies from 1999 to 2013.

2 of 7 | www.pnas.org/cgi/doi/10.1073/pnas.1813852116 Chen et al. Downloaded by guest on September 30, 2021 Downloaded by guest on September 30, 2021 grgt,epce ocnrbt 5 fteallspeetin present alleles the of 75% contribute to expected (Spear- significantly aggregate, cohort are that and in individuals 0.083 man’s of to number immigrant 0.003 the from with individual correlated range of 2013 2A). contributions in (Fig. cohorts onward genetic 1991 expected from population Total estimated study We immigrants our (19). new in year of appearing given contribution a genetic in expected cumulative adults the breeding all of 32 comprising 55% immigrants to with population, study our into gration Flow. Gene of Effect to 1999 from frequencies allele for (β in 2013 change frequency net allele overall the the (β predict year 1999– predict given any in perfectly in SNP SNP nearly each autosomal can considered each We are for 2013. immigrants predictions generated frequency We that flow. allele founder’s gene Note incorporates the approach SNP. this by so founders, weighted that year at that each of genotype population contributions the year genetic to given individual a founder the able in SNP be summing particular should by a we of simply theory, frequency In allele 2013. the predict to to 1999 in from dynamics cohort autoso- frequency birth allele 10,731 the investigate at we Here, total) 1989–1991, (19). individuals SNPs in mal born (3,404 nestling 1999–2013 every and nearly 1995, and adults all of >80% Predictions. Frequency Allele in considered generations fitness. of of number measure the the with increases contribution 2013 genetic in expected and fitness individual between tion 10 bak leefeunisoe iefra N ihsgicnl increasing significantly simulated with and SNP an (blue) frequencies. for allele Observed immigrant time (B) over intervals. frequencies immi- allele confidence (black) 95% shading of with the the immigrants, contribution show shows of line cohorts to genetic colored successive of Each expected contribution 1990. added after total mean population the the population). in our shows appearing in grants observed line first black were they The year the on (based immigrants 2. Fig. nestling 2013 the to (Spearman’s contribution genetic cohort expected (Spearman’s the and 2013 ison) to 1990 from ρ contribution genetic expected hne al. et Chen B A .2 .5 .8 respectively; 0.78, 0.85, 0.92, = Allele frequency Expected genetic contrib. −16 0.45 0.55 0.65 0.75 0.00 0.25 0.50 0.75 1.00 ρ o ahcomparison; each for h xetdgntccnrbto fdfeetchrso recent of cohorts different of contribution genetic expected The (A) 0 = .7 i.1D). Fig. 0.87; = 9019 0020 2010 2005 2000 1995 1990 .52, 0020 2010 2005 2000 p 0 = rvoswr hwdhg eeso immi- of levels high showed work Previous .Imgat riigsne19 r,in are, 1990 since arriving Immigrants .01). ρ .7 .3 .7 respectively; 0.87, 0.83, 0.57, = .Tecorrela- The S2). Fig. Appendix, SI npeiu ok egenotyped we work, previous In .9.Mr motnl,w can we importantly, More 0.99). = p ear a Ye r a Ye < 2 × 10 −16 o ahcompar- each for p < 2 × Fig. Appendix , (SI data observed to indepen- fit an S5 good sim- survived as a dropping SNPs gene year no provide The ); S4 each ulations years. Fig. across treat correction and we testing S2 multiple Table if Appendix , hits (SI test some dent find Table we Appendix, (SI years, 2013 dis- changes and false significant 1999 a S1 showed between SNPs At frequency 18 allele 3A). 0.25, in generated (Fig. of (FDR) shifts simulations rate frequency dropping covery to this gene allele shifts during the of years frequency from distribution adjacent allele all expected observed between compared the as We drift, from well period. changes and as time frequency flow 2013 allele gene net to both significant 1999 for of tested effects then the we for account accurately Selection. Short-Term large gene these of result that the likely shows actually flow. are approach frequencies allele the gene-dropping in and changes our pedigree selec- the but resemble on could tion, trajectories data sim- such of immigrants, the of absence in allele genotypes the reflected in In is results. change immigration ulation incoming any gene to genotypes, As due well. caus- founder frequency as likely with increase 2013, to begins frequency in dropping allele appearing population immigrants the in ing 0.71 the to SNP, founders increased this test, significantly (Mann–Kendall For immigrants time expecta- simulations. over incoming within in dropping frequency well gene the allele lies our yet 2013, from trajectory allele and tions frequency This 1999 2B). between allele Fig. 0.26 observed by in frequency shown in is increased example (one shifts frequency predictions frequency allele above. the generated to gene equal these is of simulations frequency allele dropping mean founder The observed SNP. the each for with previous genotypes con- simulations our began genetic Unlike we individual here 2013. tributions, generate to to 1999 simulations from pedigree-based cohorts SNP each nestling at the distributions frequency in allele is expected the 2013 model to and 1999 and difference between SNPs The large frequencies period: few time allele a 15-y in observe this over do shifts we frequency changes, of allele frequency majority the small While govern- show time. in SNPs over role trajectories important frequency an allele play ing could flow gene alleles that neutral predicted of 95% for y immigration. by 32 replaced average, We be on ). S3 to Fig. takes, Appendix, it (SI contri- that future the predict the project to into model immigrants a of fitted butions We cohort. nestling 2013 the erdcin epriindtepooto fall frequency allele of proportion and 1999– the immigration, partitioned in individuals, We three change reproduction. of just frequency to survival due differential allele change frequencies processes: in allele that variance constructed assume shap- We the we 2013. in for genome-wide, processes model variation genetic evolutionary a of different patterns of ing roles Time. relative Through neutral Frequencies the a Allele with in consistent Variance largely popu- is our pro- overall, in 2013 lower However, change to as model. frequency (27). 1999 well allele from 2001 as that lation in period) suggest 2001–2002. breeding results time in our later this selection compared and for (33% stronger ductivity 26% mortality to mean intense adult territories elevated led with unusually unburned slightly have this in observed may We and the competition at and (25), elevated 2001, (26) drought in February severe resulted in fire a occurred study of 50-y height the during area ned efudta eeflwaoecncuelreallele large cause can alone flow gene that found we Indeed, we immigrants, of contribution genetic expected high the With ,ecp ehp n20–02 h ags r norstudy our in fire largest The 2001–2002. in perhaps except ), n i.3.Frall rqec hfsbtenadjacent between shifts frequency allele For 3). Fig. and . o 1SP.W sdgn rpigsimulations dropping gene used We SNPs. 11 for >0.2 ie htorgn rpigsimulations dropping gene our that Given p 0 = ,fo .1i h 1990 the in 0.51 from .002), NSLts Articles Latest PNAS ial,t quantify to Finally, .5fr129 for >0.15 | f7 of 3

EVOLUTION AB 0.15 0.95

Observed Expected change change 0.90 0.10

Proportion 0.85 0.05 Allele frequency

0.80 0.00

−0.15 −0.10 −0.05 0.00 0.05 2000 2005 2010 Allele freq change Year C

4

2 −log10(p−value)

0 1 2 6 7 8 3 4 5 9 11 14 17 20 27 10 12 13 18 22 28 15 25 19 21 23 24 26 1A 1B 4A Un LGE22 Chromosome

Fig. 3. (A) Distribution of expected allele frequency shifts between 1999 and 2013 for the SNP shown in B (gray histogram). The blue line indicates the observed allele frequency change. (B) Observed (blue) and simulated (black) allele frequency trajectories for one of the significant SNPs in 1999–2013. Gray bars indicate 95% confidence intervals for the gene dropping simulations. (C) Manhattan plot for allele frequency shifts in 1999–2013. Significant SNPs (FDR < 0.25) are highlighted in orange.

change from year to year due to survival/reproduction and gene contributions to both individual fitness measures and allele fre- flow using a model that accounts for variation in population sizes quency change over time. In our population of Florida Scrub- over time and overlapping generations (Fig. 4). The change in Jays, we observed huge variation in individual fitness: 75% of allele frequency due to births is a result of both variation in fam- the 445 individuals who first bred in our population before 1997 ily size and Mendelian segregation of alleles in heterozygotes. have no living descendants by 2013, but 6 of these individuals We further divided the variance in allele frequency change due are each genealogical ancestors to >25% of the birth cohort in to births into these two components and found that the noise due 2013. However, many of these genealogical descendants receive to Mendelian segregation comprises 24 to 48% of the variance little genetic material from a particular ancestor, thanks to the due to births, and 12 to 23% of the overall variance. Our model vagaries of Mendelian segregation and recombination during results reflect patterns we observed in the field. For instance, meiosis (8, 22, 23). Here, we empirically show how genealogical the number of nestlings born in 2012 was unusually low (SI contributions outstrip expected genetic contributions after just a Appendix, Fig. S6), leading the survivors to have a disproportion- few generations. ate impact on allele frequency variation in 2011–2012. Overall, we found that 90% of the variance in allele frequencies is driven by variation in survival and reproductive success (fitness) among individuals. If variation in fitness is heritable, then the effects of drift can be compounded over the generations, even at unlinked loci (28–30). Simulations of allele frequency change on pedigrees in which we randomized family sizes over breeding individu- als showed that heritable variation in reproductive success has no detectable effect on the variance in allele frequency change in 1999–2013: The mean difference between randomized and observed pedigrees was −0.8%, with a 95% confidence interval of (−11.3%, 8.7%). These results suggest that drift is the pre- dominant force driving allele frequency change over time, which is consistent with our small . Discussion We capitalized on a long-term demographic study of Florida Fig. 4. Schematic and results for our model of the variance in autosomal allele frequency change from year to year due to survival (Surv)/reproduc- Scrub-Jays with extensive pedigree and genomic data to demon- tion (red/orange/yellow) or gene flow (Imm; blue). The variance in allele strate how short-term evolutionary processes operate in a natural frequency change due to births is further partitioned into the variance due population. We estimated genealogical and expected genetic to variation in family size (Fam size) and additional noise due to Mendelian contributions for hundreds of individuals, and linked genetic segregation of heterozygotes (Mend noise).

4 of 7 | www.pnas.org/cgi/doi/10.1073/pnas.1813852116 Chen et al. Downloaded by guest on September 30, 2021 Downloaded by guest on September 30, 2021 hne al. et Chen perfor- the that explored null are further We alleles correct alleles. neutral selected the for to unlinked variation drift provide are sim- fitness genetic pedigree heritable dropping compound for population gene model to the Therefore, act on (28–30). can ulations generations loci the unlinked variation over heritable at while (32), fitness drift num- genetic in offspring of part in information natural variation a gene encodes is However, ber of itself fitness. pedigree effects in selec- the variation for the test about as to as conservative, dropping well gene is using tion as that argue cohort, relat- could and One birth flow. time the over and sizes within population pedigree edness in variation population appropriately for founder simulations the accounted dropping gene observed on our genotypes, and conditioning founder contributions change By frequency genetic genotypes. allele predict individual can we from short which with this accuracy on the evidence alleles little single found Although on we flow. overall, selection frequen- timescale. years, gene directional allele adjacent strong and changing few of drift selection a of as in signatures cies such detect processes did fre- we to could allele selection to due large by attributed knowing be be as may of driven ordinarily population, that importance strongly shifts of quency the demography be emphasize underlying showed results sometimes the and Our can genome flow. the trajectories gene across and SNP SNPs drift individual that for both time of over contributions the change. frequency including allele to as selection should proportions viewed these Thus, be phenotypes. selection suc- individual reproductive and from between cess associations drift for Disentangling testing require selection. due would contri- due natural deviation the the see includes of sur- for heterozygotes, we bution of except in frequencies segregation births, Mendelian allele variation and to survival in into in change folded variation The to is genetic births. fre- their years and while allele vival appear, future in they to year included the contributions are occurs in immigrants only the discrepancy change model, than This quency our smaller 2A). com- in much (Fig. immigrants because, is generations of 4) over contribution (Fig. pounded genetic year expected to repro- cumulative frequency year and allele from to survival immigrants changes new individual are of in contribution changes The variation duction. the frequency by in allele driven an variation primarily Genome-wide, genetic play of (19). immigrants levels that population maintaining in showing role results important previous with sistent number the and offspring of (31). grand-offspring number the of the estimating in by of explore variation correlation could include we genetic they idea an because proxies (24), part, quality fitness in offspring accurate, Longer-term more prox- generations. be fitness more can for include higher is that and 2013 ies emigration descendants in of that contribution surrounding number genetic know the the expected of between we surveys correlation because annual The from areas. issue, low believe minor extremely We are a counted. rates is not are latter emigrants offspring, the of surviving have descendants still any a sample our and because great- in underestimate, individuals or the an of grand-offspring few be of success. may number reproductive however, total lifetime grand-offspring, the total of of estimates con- measures Our us giving our territory, in a on fidence breeder established once an move rarely become Scrub-Jays to they Florida contribution future. is the genetic in success population expected reproductive the individual’s lifetime an that found with we correlated success. using 24, reproductive ref. measured to lifetime typically Similar as is such but proxies generations single-generation future to tribution n fteraoswyw eets e eetdlc is loci selected few so detect we why reasons the of One changes frequency allele predict to dropping gene used We con- is immigrants of contribution genetic expected high The con- genetic individual’s an as defined is fitness Individual h oa niiul h xetdgntccnrbto fa niiult a as to calculated individual be an can of from year contribution comes genetic given that expected cohort The nestling individual. the focal in the alleles of proportion expected the as 10.6084/m9.figshare. Contributions. Figshare Genetic Expected at individuals. found be frequency 3,404 can in allele study SNPs minor this autosomal 7044368. in with 10,731 used SNPs of data removed consists All dataset we found final Here, be Our can Informa- 19. verification (19). pedigree ref. 1999–2013 and and in genotyping, genome- discovery, 1995, SNP 1989–1991, 15,416 on in at tion territories breeders and individuals core nestlings these all 3,984 1990. of in sampling genotyped in near-complete in starting we resulting monitored SNPs, study, wide consistently previous been a has In that area geographic a en fsrn s1--l etig teaea hc hyaefirst are they which at 2014 age of (the end we nestlings the Here, at 11-d-old grand-offspring, great-grand-offspring. banded). alive as additional additional still offspring still produce produce were define may were may who and who therefore grand-offspring 2014 offspring and had of had repro- 13% end 5% lifetime and of the individuals, the survey at before these unbiased died and alive Of all accurate indi- who an success. ensure breeders (926 ductive to of 2002 2014 cohorts We or of age before lifetime. grand-offspring, end to its adults born offspring, sample over all were individual our of given restricted for and number a by fitness later total produced the or individual great-grand-offspring counting 1990 estimate individu- by in 6,936 to viduals), bred of pedigree truncated first consists the and who pedigree used 1990 final in We analyses Our als. our accordingly. tract all study pedigree and began by detailed the we caused a 1990, artifacts have any before we avoid expansion To 36), pedigree. (19–21, population population accurate this in distances dispersal Commission Conservation Wildlife and Fish Wildlife Florida (LSSC-10-00205). and the permits and Fish (banding Archbold 23098), US Survey Geological 07732, and the US Univer- the by (0667), TE-117769), Cornell permitted Memphis (TE824723-8, Service and at of (AUP-006-R) Committees approved University Station Use was the Biological and fieldwork 2010-0015), Care All (IACUC population. Animal sity the Institutional in the reproductive are individuals by and groups all survival family all for of of success documentation nests all providing and monitored, months, pop- closely few entire every The banded, year. censused uniquely each is are individuals ulation population in immigrant entire of half the and identification in southern 33), allowing individuals the (19, All monitoring 35). 1969 Bowman, (34, began in 1989 Fitzpatrick, colleagues half and Woolfenden, northern Schoech, the decades. Biolog- Mumme, monitoring for Archbold began population at colleagues FL) A monitored and (Venus, (33). intensively Florida Station been in has ical scrub Scrub-Jays oak Florida to of restricted bird breeding atively Dataset. and System Study Methods and Materials increase will evolution short-term of dramatically. the consequences observe for and directly resources to causes ability genomic our As expand, time. populations pedigree pedigreed over governing population dynamics forces frequency a evolutionary allele the on to fitness. analyses insights to substantial SNP provide linked single haplotypes across even specific immigrants However, pinpoint quan- recent and of material, genome, contribution genetic the genetic surviving reproductive pedi- actual of between the the distribution tify relationship down the the blocks and explore genomic value could of we inheritance gree, the contributions. tracing genetic expected By just contributions of instead genetic individual actual each would realized, for information of quantification haplotype the and allow linkage of incorporation The in in variance variation decrease heritable frequencies. significantly that allele not found does and success year reproductive permuting by each fitness to within linked families alleles for dropping gene of mance o u eoi nlss efcsdo oestof set core a on focused we analyses, genomic our For natal limited and paternity extra-pair of rate low very the of Because pedigree. the down alleles single only traced have we Here G h lrd cu-a sannirtr,cooper- nonmigratory, a is Scrub-Jay Florida The = n 1 eqatf niiulgntccontribution genetic individual quantify We X m X p  2 1  g NSLts Articles Latest PNAS , 8trioisin territories ∼68 | <0.05. f7 of 5 [1]

EVOLUTION where n is the total number of nestlings born that year, m is the num- we generated a list of individuals present in our population each year in ber of nestlings related to the focal individual, p is the number of paths 1990–2013 and categorized them as survivors, immigrants, or nestlings (new in the pedigree linking the focal individual and the nestling, and g is the births; SI Appendix, Fig. S6A). We only included an individual in a given number of generations separating the focal individual from the nestling in year if it was observed in at least two months during March through June. that path (1–3). We used pedigree-based simulations to estimate expected We conservatively considered individuals who left our study population but individual genetic contributions instead. Our simulation results match the- later returned as survivors during the intervening time period to minimize oretical expectations but also provide estimates of the variance around the inflating the variance in allele frequencies. expected values. Let Nt be the total number of individuals in the population in year t, Ns We used gene dropping simulations to obtain expected genetic contri- be the number of individuals who survived from year t − 1 to t, Ni be the butions of individual breeders and of different immigrant cohorts to our number of new immigrants into the population in year t, and Nb be the population over time. A founder is by definition any individual in the pedi- number of individuals born in year t. Thus, the population size in year t is gree whose parents are unknown. Thus, all immigrants are founders. We Nt = Ns + Ni + Nb. If we denote the allele frequencies in each category as assigned genotypes to all founders as follows: For individual simulations, pj, then we can write the change in allele frequencies between years t − 1 we assigned the genotype “22” to the focal individual and “11” to all and t for a given SNP as other founders. To assess the expected genetic contributions of different immigrant cohorts, we assigned immigrants appearing in different years dif- Ns Ni Nb ∆p = (ps − pt−1) + (pi − pt−1) + (pb − pt−1). [4] ferent alleles. We then simulated Mendelian transmission of alleles down Nt Nt Nt the pedigree 1,000,000 times using custom C code. The distribution of allele counts in the nestling cohort each year gives the distribution of expected The variance in allele frequency change over time is then genetic contributions over time.     Ns 2 Ni 2 Var(∆p) = Var(ps − pt−1) + Var(pi − pt−1) N N Immigrant Contribution Projection. The proportion of resident alleles in the t t  N  N N birth cohort over time [r(t)] can be written as + b 2Var( − ) + 2 s b Cov( − , − ) pb pt−1 2 ps pt−1 pb pt−1 Nt Nt t r(t) = (1 − i) , [2] N N +2 i b Cov( − , − ). 2 pi pt−1 pb pt−1 [5] Nt where i is the per-year replacement rate by immigrant alleles, and t is the number of years following 1992. We began in 1992 because no parents in Note that we assume that survivors and immigrants in a given year are 1990–1992 were recent immigrants. We fitted this model using nonlinear unrelated and accordingly set Cov(ps − pt−1, pi − pt−1) = 0. least squares to estimate i, then used Eq. 2 to calculate the expected time We further partitioned the change in allele frequency due to the birth until neutral alleles were 95% replaced by immigrant alleles. cohort Var(pb − pt−1) into the change due to variation in family size and the deviation due to Mendelian segregation of alleles from heterozygotes

Allele Frequency Predictions. In the absence of selection, the allele frequency (∆pb,mend) (29). If pm and pf are the allele frequencies of the parents of an autosomal SNP in any given year can be written as a function of the weighted by the number of offspring they produced in year t, then individual genetic contributions of each founder and the founder allele fre-   quencies. Let F be the number of founders, Gi,y be the expected genetic 1 pb − pt−1 = (pm + pf ) − pt−1 + ∆pb,mend, [6] contribution of founder i to the population in year y, and pi be the allele 2 frequency of founder i. We can predict the expected allele frequency in year y as follows: where the first term denotes the expected change in allele frequencies due F to the variation in family size, and the second term denotes the additional X pˆy = Gi,y pi. [3] independent noise due to Mendelian transmission. We can then estimate i=1 the variance due to Mendelian noise as Here we iteratively trimmed the population pedigree until all founders were  1  genotyped, and estimated individual genetic contributions using simula- Var(∆pb,mend) = Var pb − (pm + pf ) , [7] tions on the trimmed pedigree. We evaluated prediction accuracy by fitting 2 linear regressions. with the alternate term for the variance due to family size variation following from Eq. 6. To generate expected allele frequency distributions Neutral Allele Dynamics. We estimated each of the terms on the left and right sides of Eq. 5 aver- over time, we used gene dropping simulations on a trimmed pedigree. For aged across all autosomal SNPs. We then divided each of the terms on the each SNP, we iteratively trimmed the pedigree until all founders in the final right by the total to quantify the proportion of allele frequency change due trimmed pedigree had a known genotype. Briefly, we removed all ungeno- to which individuals survive to the focal year, appear as new immigrants, typed founders and set all offspring of these individuals as founders, then or are born, as well as the contribution of survivors and immigrants to the repeated these two steps until all remaining founders had observed geno- birth cohort and Mendelian segregation of heterozygotes. We verified our types. Note that the trimmed pedigree can differ across SNPs because of model using simulations. variable missing data across individuals; however, missing data rates are low Although we have genomic data from nearly every individual present < ( 5%), so these differences are slight. Using the observed genotype for each in the population from 1999 to 2013, we still have a small number of founder, we simulated Mendelian transmission of alleles down the pedigree ungenotyped individuals in each year (SI Appendix, Fig. S6B). To account a million times and estimated allele frequencies each year in genotyped for missing genotypes, we corrected each term in Eq. 5 for sampling. Nor- nestlings from a core set of 54 to 76 territories. mally, the error in allele frequency estimation due to sampling can be We used Mann–Kendall tests from the R package Kendall (37) to test statistically modeled, but relatedness among individuals and nonrandom for trends in the allele frequencies of incoming immigrant cohorts through sampling make error estimation more complicated in this case. Therefore, time. We tested for net between 1999 and 2013, as we empirically calculated the error in allele frequency estimation using well as between all adjacent years during that time period, by comparing simulations. observed allele frequency shifts to the distribution of expected allele fre- To assess the effect of heritable variation in reproductive success on quency shifts generated from the gene dropping simulations. For each test, allele frequency change, we performed simulations that permuted parental we calculated p values by counting the number of simulations in which the assignments within each birth cohort, effectively breaking any inheritance simulated value is more different from the median value of all of the sim- of reproductive success while keeping variation in family sizes consistent. We ulations compared with the observed value. We used an FDR threshold of then simulated genotypes for 10,000 loci on the randomized pedigree and 0.25 for significance. the observed pedigree and compared the variance in net allele frequency change in 1999–2013. See SI Appendix for the full derivation of the model Variance in Allele Frequencies Model. To quantify the proportion of variance and more details on our simulations. All statistical analyses were done in R in the change in allele frequencies due to gene flow and variation in individ- (38). All code is available from Figshare 10.6084/m9.figshare.7044368. ual survival and reproductive success, we modeled the population as follows: Adults who survive or immigrate into the population then produce off- ACKNOWLEDGMENTS. We thank the many students and staff who collected spring. From our detailed census and other population monitoring records, the field data at Archbold Biological Station over the past half century.

6 of 7 | www.pnas.org/cgi/doi/10.1073/pnas.1813852116 Chen et al. Downloaded by guest on September 30, 2021 Downloaded by guest on September 30, 2021 8 ueoaA,e l 21)Rdfxgnm sebyietfisgnmcregions genomic identifies assembly genome fox Red human (2018) a al. against et selection natural AV, neutralizes Kukekova curse 18. Mother’s (2017) the al. of et estimate E, An Milot (2015) 17. M Przeworski C, Ober a M, in Stephens cryptic D, are Waggoner pattern Z, coat Gao of 16. and Selection (2012) al. et forward. way J, The Gratten pedigrees: Wild 15. (2008) J Pemberton 14. of (1986) effect equivalents EA The Thompson Founder pedigrees: (2005) 13. in representation YS founder Aulchenko of Analysis CM, (1989) Duijn RC van Lacy Quebec B, 12. of Oostra I, contribution MacKay genetic LM, the Pardo of 11. Variability (1995) M Tremblay E, Heyer 10. 0 un S olednG,FtptikJ,WieB 19)MlilcsDAfin- DNA Multi-locus (1999) BN White JW, con- Fitzpatrick Genomic GE, (2016) Woolfenden AG JS, Clark Quinn JW, 20. Fitzpatrick R, Bowman EJ, Cosgrove N, Chen 19. a upre yNtoa cec onain(S)Gat DEB0855879 Grants work (NSF) This Foundation comments. helpful Science for Arvid for National reviewer Coopons, Buffalo by anonymous the Vince supported an to and was and go help Jensen, Thanks Jeff statistical randomization. Agren, for pedigree Edge on Doc discussions to go Thanks hne al. et Chen .OBinE ebrR od ,Rgr 19)Fudrefc:Assmn fvariation of Assessment and effect: Founder value (1994) A reproductive Rogers L, between Jorde R, relation Kerber E, The O’Brien (2011) 9. AM Etheridge value. NH, reproductive Fisher’s of Barton theory A 8. (2006) A Grafen 7. (2010) D Charlesworth B, Charlesworth computer 6. by (1930) R analysis Fisher Pedigree (1986) 5. population. OA Ryder evolving B, an Read in JL, VandeBerg change JW, genetic MacCluer of 4. Measures (1980) JC reduction. Bear size DF, population Roberts of 3. effects Genetic (1968) DF Roberts 2. R LB, Jorde E, O’Brien 1. ˚ soitdwt aeadagesv behaviours. aggressive and tame with associated centuries. three over disease genetic humans. by carried 1254. lethal recessive of number average sheep. of population wild Baltimore). equivalents. genome founder and population. isolated genetically young a in drift genetic . deleterious some to 56:970–978. associated founders population founders. among contributions genetic in contribution. genetic 42. Vol CO), Village, Greenwood Company, simulation. Biol Hum 1088. Finland. Sottunga, in disease genetic epitn uprsgntcmngm nFoiascrub-jays. Florida in 45:1–10. monogamy genetic scrub-jay. supports gerprinting Florida endangered the in decline 26:2974–2979. population of sequences 52:773–786. o Biol Zoo h eeia hoyo aua Selection Natural of Theory Genetical The 5:147–160. eireAayi nHmnGenetics Human in Analysis Pedigree Genetics onnl ¨ o Ecol Mol fB ela O rksnA 18)Fudrefc and effect Founder (1988) AW Eriksson JO, Fellman B, of ¨ 188:953–973. o Biol Zoo 21:2977–2990. a clEvol Ecol Nat mJPy Anthropol Phys J Am lmnso vltoayGenetics Evolutionary of Elements u Biol Hum 8:111–123. a clEvol Ecol Nat 1:1400–1406. 66:185–204. rcRScB Soc R Proc n u Genet Hum Ann ahBiol Math J Ofr nvPes Oxford). Press, Univ (Oxford JhsHpisUi Press, Univ Hopkins (Johns 77:335–346. 2:1479–1491. ea clSociobiol Ecol Behav mJHmGenet Hum J Am Genetics Nature 275:613–621. 53:15–60. 69:288–295. 199:1243– 220:1084– urBiol Curr (Roberts 8 oeTa (2013) Team Core R 38. genomic The isolation-by-distance: Deconstructing (2011) A (2017) McLeod al. 37. et SM, success? and Aguillon reproductive increase endocrinology 36. helpers Reproductive Do (1992) (1991) RL Mumme MC 35. Moore RL, Mumme SJ, Schoech (1984) 34. JW evolution. to Fitzpatrick relation GE, in structure Woolfenden breeding and 33. population of Size correla- (1938) genetic S Perfect Wright (2014) 32. KJH Verweij H, Walum R, the take Kuja-Halkola Genes BP, fitness: of Zietsch transmission Cultural 31. (2005) F Austerlitz A, Sibert E, selection. Heyer under populations 30. of size Effective (1995) A Caballero E, Santiago programmes. selection 29. artificial Environment, in (2009) Inbreeding TE (1961) Wilcoxen A RK, Robertson Boughton 28. ES, Bridge MA, Rensel SJ, Schoech 27. (2003) C Valligny 2000-2001 26. the of Effects palmettos: Persistent (2002) CR Abrahamson W, Abrahamson 25. Pieti L, Gustafsson biparental JE, in Brommer ancestors ghost 24. of abundance and existence The (2015) M Genetic genetic Steel imply S, ancestry Gravel (2011) genealogical does 23. IJ extent what To Lovette (2008) SN M, Evans FA, Matsen Dent 22. JW, Fitzpatrick R, Bowman AK, Townsend 21. ..akoldeadtoa upr rmNtoa ntttso Health of 1353380. Institutes and 1262327 National Grants from NSF and and support R01-GM108779 N.C. N.C. Grant 1523665. additional Fund. Biology Athena in acknowledge Ornithology Fellowship G.C. of Postdoctoral NSF Lab by Cornell supported the was by and DEB1257628 and RFudto o ttsia optn,Ven) vial twww.R-project.org/. at Available 2015. Vienna). 15, September Computing, Accessed Statistical for Foundation (R https://CRAN.R-project.org/package=Kendall. at Available 2015. 1, 2.2. December Accessed Version Package R dispersal. limited of consequences jays scrub Florida 31:319–328. breeding cooperatively in coerulescens). inhibition c. (Aphelocoma breeding of mechanisms Bird Cooperative-Breeding Science human industrialized an in grandoffspring and offspring population. of number between tion lane. fast 139:1013–1030. 194. reproduction. of timing the and glucocorticoids, Use Habitat coerulescens) on drought contribution. genetic long-term for proxies as fitness individual populations. ancestry? Florida breeding cooperatively scrub-jays. in landscapes demographic variable across monogamy 87:430–431. rnsGenet Trends ho ou Biol Popul Theor ea Ecol Behav eeo eesadSbletonia. Sabal and repens Serenoa rcNt cdSiUSA Sci Acad Natl Proc ho ou Biol Popul Theor fet fa nes itrFr nFoiaSrbJy(Aphelocoma Scrub-Jay Florida on Fire Winter Intense an of Effects edl:KnalRn orlto n anKnalTedTest Trend Mann-Kendall and Correlation Rank Kendall Kendall: 22:464–470. :ALnug n niomn o ttsia Computing Statistical for Environment and Language A R: 21:234–239. VlaoaUi,Vlaoa PA). Villanova, Univ, (Villanova 74:182–190. PictnUi rs,Princeton). Press, Univ (Princeton 101:47–53. Condor ie ,Meril H, ainen ¨ LSGenet PLoS 111:1032–1036. 93:354–364. h lrd cu a–eorpyo a of Jay–Demography Scrub Florida The 20)Snl-eeainetmtsof estimates Single-generation (2004) J a ¨ l Sci Fla 13:e1006911. e opEndocrinol Comp Gen NSLts Articles Latest PNAS 65:281–292. mNat Am ea clSociobiol Ecol Behav ee Res Genet 163:201–207. 163:505–517. | Genetics 2:189– f7 of 7 .

EVOLUTION