A Population-Level Invasion by Transposable Elements Triggers

bioRxiv preprint doi: https://doi.org/10.1101/2020.02.11.944652; this version posted April 8, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 1 A population-level invasion by transposable 2 elements triggers genome expansion in a fungal 3 pathogen 4 5 6 Ursula Oggenfuss1, Thomas Badet1, Thomas Wicker2, Fanny E. Hartmann3,4, Nikhil K. Singh1, Leen 7 N. Abraham1, Petteri Karisto4,6, Tiziana Vonlanthen4, Christopher C. Mundt5, Bruce A. McDonald4, 8 Daniel Croll1,* 9 10 11 1 Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, 2000 Neuchâtel, 12 Switzerland 13 2 Institute for Plant and Microbial Biology, University of Zurich, Zurich, Switzerland 14 3 Ecologie Systématique Evolution, Bâtiment 360, Univ. Paris-Sud, AgroParisTech, CNRS, 15 Université Paris-Saclay, 91400 Orsay, France 16 4 Plant Pathology, Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland 17 5 Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331-2902, 18 USA 19 6 Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, 20 Switzerland 21 22 * Author for correspondence: [email protected] 23 24 25 26 27 28 29 30 Running title: Transposable element invasion triggers genome expansion 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.02.11.944652; this version posted April 8, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 31 ABSTRACT 32 Genome evolution is driven by the activity of transposable elements (TEs). The spread of TEs can 33 have deleterious effects including the destabilization of genome integrity and expansions. However, 34 the precise triggers of genome expansions remain poorly understood because genome size evolution 35 is typically investigated only among deeply divergent lineages. Here, we use a large population 36 genomics dataset of 284 individuals from populations across the globe of Zymoseptoria tritici, a major 37 fungal wheat pathogen. We built a robust map of genome-wide TE insertions and deletions to track a 38 total of 2’456 polymorphic loci within the species. We show that purifying selection substantially 39 depressed TE frequencies in most populations but some rare TEs have recently risen in frequency and 40 likely confer benefits. We found that specific TE families have undergone a substantial genome-wide 41 expansion from the pathogen's center of origin to more recently founded populations. The most 42 dramatic increase in TE insertions occurred between a pair of North American populations collected 43 in the same field at an interval of 25 years. We find that both genome-wide counts of TE insertions 44 and genome size have increased with colonization bottlenecks. Hence, the demographic history likely 45 played a major role in shaping genome evolution within the species. We show that both the activation 46 of specific TEs and relaxed purifying selection underpin this incipient expansion of the genome. Our 47 study establishes a model to recapitulate TE-driven genome evolution over deeper evolutionary 48 timescales. 49 2 bioRxiv preprint doi: https://doi.org/10.1101/2020.02.11.944652; this version posted April 8, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 50 INTRODUCTION 51 Transposable elements (TEs) are mobile repetitive DNA sequences with the ability to independently 52 insert into new regions of the genome. TEs are major drivers of genome instability and epigenetic 53 change (Eichler & Sankoff, 2003). Insertion of TEs can disrupt coding sequences, trigger 54 chromosomal rearrangements, or alter expression profiles of adjacent genes (Lim, 1988; Petrov et al., 55 2003; Slotkin & Martienssen, 2007; Hollister & Gaut, 2009; Oliver et al., 2013). Hence, TE activity 56 can have phenotypic consequences and impact host fitness. While TE insertion dynamics are driven 57 by the selfish interest for proliferation, the impact on the host can range from beneficial to highly 58 deleterious. The most dramatic examples of TE insertions underpinned rapid adaptation of populations 59 or species (Feschotte, 2008; Chuong et al., 2017), particularly following environmental change or 60 colonization events. Beneficial TE insertions are expected to experience strong positive selection and 61 rapid fixation in populations. However, most TE insertions have neutral or deleterious effects upon 62 insertions. Purifying selection is expected to rapidly eliminate deleterious insertions from populations 63 unless constrained by genetic drift (Walser et al., 2006; Baucom et al., 2008; Cridland et al., 2013; 64 Stuart et al., 2016; Lai et al., 2017; Stritt et al., 2017). Additionally, genomic defense mechanisms can 65 disable transposition activity. Across eukaryotes, epigenetic silencing is a shared defense mechanism 66 against TEs (Slotkin & Martienssen, 2007). Fungi evolved an additional and highly specific defense 67 system introducing repeat-induced point (RIP) mutations into any nearly identical set of sequences. 68 The relative importance of demography, selection and genomic defenses determining the fate of TEs 69 in populations remain poorly understood. 70 71 A crucial property predicting the invasion success of TEs in a genome is the transposition rate. TEs 72 tend to expand through family-specific bursts of transposition followed by prolonged phases of 73 transposition inactivity. Bursts of insertions of different retrotransposon families were observed across 74 eukaryotic lineages including Homo sapiens, Zea mays, Oryza sativa and Blumeria graminis (Shen et 75 al., 1991; SanMiguel et al., 1998; Eichler & Sankoff, 2003; Lu et al., 2017; Frantzeskakis et al., 2018). 76 Prolonged bursts without effective counter-selection are thought to underpin genome expansions. In 3 bioRxiv preprint doi: https://doi.org/10.1101/2020.02.11.944652; this version posted April 8, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 77 the symbiotic fungus Cenococcum geophilum, the burst of TEs resulted in a dramatically expanded 78 genome compared to closely related species (Peter et al., 2016). Similarly, a burst of a TE family in 79 brown hydras led to an approximately three-fold increase of the genome size compared to related 80 hydras (Wong et al., 2019). Across the tree of life, genome sizes vary by orders of magnitude and 81 enlarged genomes invariably show hallmarks of historic TE invasions (Kidwell, 2002). Population 82 size variation is among the few correlates of genome size across major groups, suggesting that the 83 efficacy of selection plays an important role in controlling TE activity (Lynch, 2007). Reduced 84 selection efficacy against deleterious TE insertions is expected to lead to a ratchet-like increase in 85 genome size. In fungi, TE-rich genomes often show an isochore structure alternating gene-rich and 86 TE-rich compartments (Rouxel et al., 2011). TE-rich compartments often harbor rapidly evolving 87 genes such as effector genes in pathogens or resistance genes in plants (Raffaele & Kamoun, 2012; 88 Jiao & Schneeberger, 2019). Taken together, incipient genome expansions are likely driven by 89 population-level TE insertion dynamics. 90 91 The fungal wheat pathogen Zymoseptoria tritici is one of the most important pathogens on crops 92 causing high yield losses (Torriani et al., 2015). The genome is completely assembled and shows size 93 variation between individuals sampled across the global distribution range (Feurtey et al., 2020; Badet 94 et al., 2020) (Goodwin et al., 2011). The TE content of the genome shows a striking variation of 17- 95 24% variation among individuals (Badet et al., 2020). Z. tritici recently gained major TE-mediated 96 adaptations to colonize host plants and tolerate environmental stress (Omrane et al., 2015, 2017; 97 Krishnan et al., 2018; Meile et al., 2018). Clusters of TEs are often associated with genes encoding 98 important pathogenicity functions (i.e. effectors), recent gene gains or losses (Hartmann & Croll, 99 2017), and major chromosomal rearrangements (Croll et al., 2013; Plissonneau et al., 2016). 100 Transposition activity of TEs also had a genome-wide impact on gene expression profiles during 101 infection (Fouché et al., 2019). The well-characterized demographic history of the pathogen and 102 evidence for recent TE-mediated adaptations make Z. tritici an ideal model to recapitulate the process 103 of TE insertion dynamics, adaptive evolution and changes in genome size at the population level. 104 4 bioRxiv preprint doi: https://doi.org/10.1101/2020.02.11.944652; this version posted April 8, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license.

A Population-Level Invasion by Transposable Elements Triggers

Genome Analysis of the Smallest Free-Living Eukaryote Ostreococcus

Genome Evolution: Mutation Is the Main Driver of Genome Size in Prokaryotes Gabriel A.B

Evolution of Genome Size

Best Practices for Whole Genome Sequencing Using the Sequel System

Decoding Non-Coding DNA: Trash Or Treasure?

How Much Non-Coding DNA?

Primer on Molecular Genetics

Genome Size Covaries More Positively with Propagule Size Than Adult Size: New Insights Into an Old Problem

The SARS-Cov-2 Genome: Variation, Implication and Application

A Chimeric Nuclease Substitutes a Phage CRISPR-Cas System

Genome Size and Chromosome Number Relationship Contradicts the Principle of Darwinian Evolution from Common Ancestor

Trends Between Gene Content and Genome Size in Prokaryotic Species with Larger Genomes Konstantinos T