The Genome of the Seagrass Zostera Marina Reveals Angiosperm Adaptation to the Sea Jeanine L
Total Page:16
File Type:pdf, Size:1020Kb
OPEN LETTER doi:10.1038/nature16548 The genome of the seagrass Zostera marina reveals angiosperm adaptation to the sea Jeanine L. Olsen1*, Pierre Rouzé2, Bram Verhelst2, Yao-Cheng Lin2, Till Bayer3, Jonas Collen4, Emanuela Dattolo5, Emanuele De Paoli6, Simon Dittami4, Florian Maumus7, Gurvan Michel4, Anna Kersting8,9, Chiara Lauritano5, Rolf Lohaus2, Mats Töpel10, Thierry Tonon4, Kevin Vanneste2, Mojgan Amirebrahimi11, Janina Brakel3, Christoffer Boström12, Mansi Chovatia11, Jane Grimwood11,13, Jerry W. Jenkins11,13, Alexander Jueterbock14, Amy Mraz15, Wytze T. Stam1, Hope Tice11, Erich Bornberg-Bauer8, Pamela J. Green16, Gareth A. Pearson17, Gabriele Procaccini5*, Carlos M. Duarte18, Jeremy Schmutz11,13, Thorsten B. H. Reusch3,19* & Yves Van de Peer2,20,21* Seagrasses colonized the sea1 on at least three independent occasions the Pacific and Atlantic10. A clone of Z. marina was sequenced from to form the basis of one of the most productive and widespread the Archipelago Sea, southwest Finland, using a combination of fos- coastal ecosystems on the planet2. Here we report the genome of mid-ends and whole-genome shotgun (WGS) approaches (Methods, Zostera marina (L.), the first, to our knowledge, marine angiosperm Supplementary Note 2). The 202.3 Mb Z. marina genome encodes 20,450 to be fully sequenced. This reveals unique insights into the protein-coding genes, 86.6% of which (17,511 genes, Supplementary genomic losses and gains involved in achieving the structural and Note 3.1) are supported by transcriptome data from leaves, roots and physiological adaptations required for its marine lifestyle, arguably flowers (Extended Data Fig. 1, Supplementary Notes 3.2–3.3 and the most severe habitat shift ever accomplished by flowering Supplementary Data 1–3). Genes are located in numerous gene-dense plants. Key angiosperm innovations that were lost include the islands separated by stretches of repeat elements accounting for 63% of entire repertoire of stomatal genes3, genes involved in the synthesis the non-gapped assembly (Extended Data Fig. 2, Supplementary Note of terpenoids and ethylene signalling, and genes for ultraviolet 3.1) as compared to only 13% in the only other sequenced alismatid, protection and phytochromes for far-red sensing. Seagrasses have the freshwater duckweek, Spirodela polyrhiza (Alismatales, Araceae)11. also regained functions enabling them to adjust to full salinity. Their Gypsy-type (32%) and Copia-type (20%) transposable elements contrib- cell walls contain all of the polysaccharides typical of land plants, ute to most of the repetitive DNA. Sequence divergence analysis suggests but also contain polyanionic, low-methylated pectins and sulfated that the genome retains copies from two distinct periods of invasion by galactans, a feature shared with the cell walls of all macroalgae4 Copia elements, but only one period for Gypsy elements (Extended Data and that is important for ion homoeostasis, nutrient uptake and Fig. 3a–c). Genes gained by Z. marina (‘accessory’) are located closer to O2/CO2 exchange through leaf epidermal cells. The Z. marina transposable elements than to conserved (‘single copy’) genes (Fisher’s genome resource will markedly advance a wide range of functional exact test, P < 0.0001) indicating that transposable elements may have ecological studies from adaptation of marine ecosystems played a role in genic adaptation. under climate warming5,6, to unravelling the mechanisms of We identified 36 conserved microRNAs with high confidence and osmoregulation under high salinities that may further inform our their predicted targets (Supplementary Note 3.4, Supplementary Data 4 understanding of the evolution of salt tolerance in crop plants7. and 5). A novel variant of miR528 (not present in Spirodela) was found Seagrasses are a polyphyletic assemblage of basal monocots belong- to be the only member of this miRNA family, and demonstrates that ing to four families in the Alismatales1,2 (Supplementary Note 1.1 this conserved miRNA is the only one ancestral to the entire monocot and Supplementary Fig. 1.1). As a functional group, they provide the lineage. Most likely, Z. marina did not take part in the subsequent birth foundation of highly productive ecosystems present along the coasts of miRNAs that are common to several other monocots12; nor did it of all continents except Antarctica, where they rival tropical rain experience or retain traces of prominent miRNA duplications. 8,9 forests and coral reefs in ecosystem services . In colonizing sedimen- Analysis of synonymous substitutions per synonymous site (KS) age tary shorelines of the world’s ocean, seagrasses found a vast new habitat distributions indicates that Z. marina carries the remnants of an inde- free of terrestrial competitors and insect pests but had to adapt to cope pendent, ancient whole-genome duplication (WGD) event (Fig. 2a, with new structural and physiological challenges related to full marine Supplementary Note 4.1)13. Duplicated segments account for ~9% conditions. of the Z. marina genome, probably an underestimate due to the Zostera marina (Zosteraceae), or eelgrass (Fig. 1), is the most wide- fragmented nature of the assembly. Zostera and Spirodela diverged spread species throughout the temperate northern hemisphere of somewhere between 135 and 107 million years ago (Mya)14 and 1Groningen Institute of Evolutionary Life Sciences (GELIFES), University of Groningen, PO Box 11103, 9700 CC Groningen, The Netherlands. 2Department of Plant Systems Biology, VIB and Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 927, B-9052 Ghent, Belgium. 3GEOMAR Helmholtz Centre for Ocean Research-Kiel, Evolutionary Ecology, Düsternbrooker Weg 20, D-24105 Kiel, Germany. 4Sorbonne Université, UPMC Univ Paris 06, CNRS, UMR 8227, Integrative Biology of Marine Models, Station Biologique de Roscoff, CS 90074, F-29688, Roscoff cedex, France. 5Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Naples, Italy. 6Dipartimento di Scienze Agrarie e Ambientali, University of Udine, Via delle Scienze 206, 33100 Udine, Italy. 7INRA, UR1164 URGI—Research Unit in Genomics-Info, INRA de Versailles-Grignon, Route de Saint-Cyr, Versailles 78026, France. 8Institute for Evolution and Biodiversity, Westfälische Wilhelms-University of Münster, Hüfferstrasse 1, D-48149 Münster, Germany. 9Institute for Computer Science, Heinrich Heine University, D-40255 Duesseldorf, Germany. 10Department of Biological and Environmental Sciences, Bioinformatics Infrastructure for Life Sciences (BILS), University of Gothenburg, Medicinaregatan 18A, 40530 Gothenburg, Sweden. 11Department of Energy Joint Genome Institute, 2800 Mitchell Dr., #100, Walnut Creek, California 94598, USA. 12Environmental and Marine Biology, Faculty of Science and Engineering, Åbo Akademi University, Artillerigatan 6, FI-20520 Turku/Åbo, Finland. 13HudsonAlpha Institute for Biotechnology, 601 Genome Way NW, Huntsville, Alabama 35806, USA. 14Marine Ecology Group, Nord University, Postbox 1490, 8049 Bodø, Norway. 15Amplicon Express, 2345 NE Hopkins Ct., Pullman, Washington 99163, USA. 16School of Marine Science and Policy, Department of Plant and Soil Sciences, Delaware Biotechnology Institute, University of Delaware, 15-Innovation Way, Newark, Delaware 19711, USA. 17Marine Ecology and Evolution, Centre for Marine Sciences (CCMAR), University of Algarve, 8005-139 Faro, Portugal. 18King Abdullah University of Science and Technology (KAUST), Red Sea Research Center (RSRC), Thuwal 23955-6900, Saudi Arabia. 19University of Kiel, Faculty of Mathematics and Natural Sciences, Christian-Albrechts-Platz 4, 24118 Kiel, Germany. 20Genomics Research Institute, University of Pretoria, Hatfield Campus, Pretoria 0028, South Africa. 21Bioinformatics Institute Ghent, Ghent University, Ghent B-9000, Belgium. *These authors contributed equally to this work. 18 FEBRUARY 2016 | VOL 530 | NATURE | 331 © 2016 Macmillan Publishers Limited. All rights reserved RESEARCH LETTER +616/–2,570 a +/– Gene families gain/loss Z. marina 11,749 3,844 20,537 b +104/–2,445 7 13,703 +536/–2,107 S. polyrhiza 12,132 3,660 19,623 +849/–1,420 +531/–2,104 6 16,044 B. distachyon 17,417 3,449 26,552 +1,234/–702 9 18,990 +1,674/–1,489 O. sativa 19,175 10,925 39,049 8 18,458 +4,730/–2,316 +1,808/–1,706 +3,024/–126 Z. mays 18,560 8,592 39,656 5 16,615 +870/–3,555 V. vinifera 13,916 7,059 26,346 +1,501/–1,953 10 16,601 P. trichocarpa 15,961 8,169 41,335 +1,252/–1,266 +4,019/–723 4 13,717 11 16,413 +463/–918 +633/–821 T. parvula 17,269 3,469 27,132 12 17,724 +3,854/–2,543 +527/–605 A. thaliana 17,646 3,509 27,416 +1,452/–218 3 10,421 +1,328/–2,039 A. trichopoda 13,006 7,799 26,846 +3,505 2 9,187 +1,895/–2,145 S. moellendorffii 10,171 4,364 22,273 +1,599/–1,078 +5,682 P. patens 9,708 12,618 32,273 1 5,682 +217/–1,751 +405 O. lucimarinus 4,553 2,603 7,742 13 6,087 +1,426/–1,043 C. reinhardtii 6,470 6,675 17,737 s Gene Genes families Orphan Figure 1 | Zostera marina and phylogenetic tree showing gene family number of predicted genes is indicated next to each species. Background expansion/contraction analysis compared with 13 representatives of the colours (top to bottom) are Alismatales, other monocots, dicots, mosses/ Viridiplantae. a, Gains and losses are indicated along branches and nodes. algae b, Typical Zostera marina meadow, Archipelago Sea, southwest The number of gene families, orphans (single-copy gene families) and Finland (photo by C.B.). phylogenomic dating13 of the Z. marina WGD suggests that it occurred Supplementary Data 6). While many genes are shared between Zostera 72–64 Mya (Fig. 2b), thus independently from the two WGDs reported and Spriodela, clearly some losses and gains are unique to Zostera in for S. polyrhiza11. This timeframe coincides with the initial diversi- relation to its marine environment, the alismatid lineage having set the fication of a freshwater clade that includes three of the four families stage for the subsequent freshwater–marine transition. Those unique of seagrasses (Supplementary Table 1.1) and with the Cretaceous– to Z.