1 Accurate and Sensitive Detection of Microbial Eukaryotes from Whole 1

Total Page:16

File Type:pdf, Size:1020Kb

1 Accurate and Sensitive Detection of Microbial Eukaryotes from Whole 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.07.22.216580; this version posted January 25, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 1 Accurate and sensitive detection of microbial eukaryotes from whole 2 metagenome shotgun sequencing 3 Abigail L. Lind1 and Katherine S. Pollard1,2,3,* 4 1. Gladstone Institute of Data Science and Biotechnology, San Francisco, CA. 5 2. Institute for Human Genetics, Department of Epidemiology and Biostatistics, and 6 Institute for Computational Health Sciences, University of California, San Francisco, 7 CA. 8 3. Chan Zuckerberg Biohub, San Francisco, CA. 9 * Corresponding author: [email protected] 10 11 Abstract 12 Background 13 Microbial eukaryotes are found alongside bacteria and archaea in natural microbial 14 systems, including host-associated microbiomes. While microbial eukaryotes are critical 15 to these communities, they are challenging to study with shotgun sequencing 16 techniques and are therefore often excluded. 17 18 Results 19 Here we present EukDetect, a bioinformatics method to identify eukaryotes in shotgun 20 metagenomic sequencing data. Our approach uses a database of 521,824 universal 21 marker genes from 241 conserved gene families, which we curated from 3,713 fungal, 22 protist, non-vertebrate metazoan, and non-streptophyte archaeplastid genomes and 23 transcriptomes. EukDetect has a broad taxonomic coverage of microbial eukaryotes, 24 performs well on low-abundance and closely related species, and is resilient against 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.07.22.216580; this version posted January 25, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 25 bacterial contamination in eukaryotic genomes. Using EukDetect, we describe the 26 spatial distribution of eukaryotes along the human gastrointestinal tract, showing that 27 fungi and protists are present in the lumen and mucosa throughout the large intestine. 28 We discover that there is a succession of eukaryotes that colonize the human gut during 29 the first years of life, mirroring patterns of developmental succession observed in gut 30 bacteria. By comparing DNA and RNA sequencing of paired samples from human stool, 31 we find that many eukaryotes continue active transcription after passage through the 32 gut, though some do not, suggesting they are dormant or nonviable. We analyze 33 metagenomic data from the Baltic Sea and find that eukaryotes differ across locations 34 and salinity gradients. Finally, we observe eukaryotes in Arabidopsis leaf samples, 35 many of which are not identifiable from public protein databases. 36 37 Conclusions 38 EukDetect provides an automated and reliable way to characterize eukaryotes in 39 shotgun sequencing datasets from diverse microbiomes. We demonstrate that it 40 enables discoveries that would be missed or clouded by false positives with standard 41 shotgun sequence analysis. EukDetect will greatly advance our understanding of how 42 microbial eukaryotes contribute to microbiomes. 43 44 Keywords: fungi, protists, algae, nematode, arthropod, choanoflagellate, microbial 45 eukaryotes, helminth, whole metagenome sequencing 46 47 Background 2 bioRxiv preprint doi: https://doi.org/10.1101/2020.07.22.216580; this version posted January 25, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 48 Eukaryotic microbes are ubiquitous in microbial systems, where they function as 49 decomposers, predators, parasites, and producers [1]. Eukaryotic microbes inhabit 50 virtually every environment on earth, from extremophiles found in geothermal vents, to 51 endophytic fungi growing within the leaves of plants, to parasites inside the animal gut. 52 Microbial eukaryotes have complex interactions with their hosts in both plant and animal 53 associated microbiomes. For example, in plants, microbial eukaryotes assist with 54 nutrient uptake and can protect against herbivory [2]. In animals, microbial eukaryotes in 55 the gastrointestinal tract metabolize plant compounds [3]. Microbial eukaryotes can also 56 cause disease in both plants and animals, as in the case of oomycete pathogens in 57 plants or cryptosporidiosis in humans [4,5]. In humans, microbial eukaryotes interact in 58 complex ways with the host immune system, and their depletion and low diversity in 59 microbiomes from industrialized societies mirrors the industrialization-driven “extinction” 60 seen for bacterial residents in the microbiome [6–8]. Outside of host associated 61 environments, microbial eukaryotes are integral to the ecology of aquatic and soil 62 ecosystems, where they are primary producers, partners in symbioses, decomposers, 63 and predators [9,10]. 64 Despite their importance, eukaryotic species are often not captured in studies of 65 microbiomes [11]. By far the most frequently used metabarcoding strategy for studying 66 host-associated and environmental microbiomes targets the 16S ribosomal RNA gene, 67 which is found exclusively in bacteria and archaea, and therefore cannot be used to 68 detect eukaryotes. Alternatively, amplicon sequencing of eukaryotic-specific (18S) or 69 fungal-specific (ITS) ribosomal genes are effectively used in studies specifically 70 targeting eukaryotes. Our understanding of the eukaryotic diversity in host-associated 3 bioRxiv preprint doi: https://doi.org/10.1101/2020.07.22.216580; this version posted January 25, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 71 and environmental microbiomes is largely due to advances in eukaryotic-specific 72 amplicon sequencing [12–15]. However, like all amplicon strategies they can suffer from 73 limited taxonomic resolution and difficulty in distinguishing between closely related 74 species [16,17]. 75 Unlike amplicon strategies, whole metagenome sequencing captures fragments 76 of DNA from the entire pool of species present in a microbiome, including eukaryotes. 77 Whole metagenome sequencing data is increasingly common in microbiome studies, as 78 these data can be used to assemble new genomes, disambiguate strains, and reveal 79 presence and absence of genes and pathways [18]. As well as these methods work for 80 identifying bacteria and archaea, multiple challenges remain to robustly and routinely 81 identify eukaryotes in whole metagenome sequencing datasets. First, eukaryotic 82 species are estimated to be present in many environments at lower abundances than 83 bacteria, and comprise a much smaller fraction of the reads of a shotgun sequencing 84 library than do bacteria [1,19]. Despite this limitation, interest in detecting rare taxa and 85 rare variants in microbiomes has increased alongside falling sequencing costs, resulting 86 in more and more microbiome deep sequencing datasets where eukaryotes could be 87 detected and analyzed in a meaningful way. 88 Apart from sequencing depth, a second major barrier to detecting eukaryotes 89 from whole metagenome sequencing is the availability of methodological tools. 90 Research questions about eukaryotes in microbiomes using whole metagenome 91 sequencing have primarily been addressed by using eukaryotic reference genomes, 92 genes, or proteins for either k-mer matching or read mapping approaches [20,21]. 93 However, databases of eukaryotic genomes and proteins have widespread 4 bioRxiv preprint doi: https://doi.org/10.1101/2020.07.22.216580; this version posted January 25, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 94 contamination from bacterial sequences, and these methods therefore frequently 95 misattribute bacterial reads to eukaryotic species [22,23]. Gene-based taxonomic 96 profilers, such as Metaphlan3, have been developed to detect eukaryotic species, but 97 these target a small subset of microbial eukaryotes (122 eukaryotic species as of the 98 mpa_v30 release) [24,25]. 99 To enable routine detection of eukaryotic species from any environment 100 sequenced with whole metagenome sequencing, we have developed EukDetect, a 101 bioinformatics approach that aligns metagenomic sequencing reads to a database of 102 universally conserved eukaryotic marker genes. As microbial eukaryotes are not a 103 monophyletic group, we have taken an inclusive approach and incorporated marker 104 genes from 3,713 eukaryotes, including 596 protists, 2010 fungi, 146 non-streptophyte 105 archaeplastids, and 961 non-vertebrate metazoans (many of which are microscopic or 106 near-microscopic). This gene-based strategy avoids the pitfall of spurious bacterial 107 contamination in eukaryotic genomes that has confounded other approaches. The 108 EukDetect pipeline will enable different
Recommended publications
  • Seasonal Variation in Abundance and Species Composition of the Parmales Community in the Oyashio Region, Western North Pacific
    Vol. 75: 207–223, 2015 AQUATIC MICROBIAL ECOLOGY Published online July 6 doi: 10.3354/ame01756 Aquat Microb Ecol Seasonal variation in abundance and species composition of the Parmales community in the Oyashio region, western North Pacific Mutsuo Ichinomiya1,*, Akira Kuwata2 1Prefectural University of Kumamoto, 3-1-100 Tsukide, Kumamoto 862-8502, Japan 2Tohoku National Fisheries Research Institute, Shinhamacho 3−27−5, Shiogama, Miyagi 985−0001, Japan ABSTRACT: Seasonal variation in abundance and species composition of the Parmales commu- nity (siliceous pico-eukaryotic marine phytoplankton) was investigated off the south coast of Hokkaido, Japan, in the western North Pacific. Growth rates under various temperatures (0 to 20°C) were also measured using 3 Parmales culture strains, Triparma laevis f. inornata, Triparma laevis f. longispina and Triparma strigata. Distribution of Parmales abundance was coupled with the occurrence of Oyashio water, which originates from the cold Oyashio Current. In March and May, the water temperature was usually low (<10°C) and the water column was vertically mixed. Parmales was often abundant (>1 × 102 cells ml−1) and evenly distributed from 0 down to 100 m. In contrast, when water stratification was well developed in July and October, Parmales was almost absent above the pycnocline at >15°C, but had an abundance of >1 × 102 cells ml−1 in the sub - surface layer of 30 to 50 m at <10°C. The seasonal variations in the vertical distributions of the 3 dominant species (Triparma laevis, Triparma strigata and Tetraparma pelagica) were similar to each other. Growth experiments revealed that Triparma laevis f. inornata and Triparma strigata, and Triparma laevis f.
    [Show full text]
  • Silicification in the Microalgae
    Silicification in the Microalgae Zoe V. Finkel 1 Silicifi cation in the Microalgae the cell wall, or Si may be bound to organic ligands associ- ated with the glycocalyx, or that Si may accumulate in peri- Silicon (Si) is the second most common element in the plasmic spaces associated with the cell wall (Baines et al. Earth’s crust (Williams 1981 ) and has been incorporated in 2012 ). In the case of fi eld populations of marine species from most of the biological kingdoms (Knoll 2003 ). Synechococcus , silicon to phosphorus ratios can approach In this review I focus on what is known about: Si accumula- values found in diatoms, and signifi cant cellular concentra- tion and the formation of siliceous structures in microalgae tions of Si have been confi rmed in some laboratory strains and some related non-photosynthetic groups, molecular and (Baines et al. 2012 ). The hypothesis that Si accumulates genetic mechanisms controlling silicifi cation, and the poten- within the periplasmic space of the outer cell wall is sup- tial costs and benefi ts associated with silicifi cation in the ported by the observation that a silicon layer forms within microalgae. This chapter uses the terminology recommended invaginations of the cell membrane in Bacillus cereus spores by Simpson and Volcani ( 1981 ): Si refers to the element and (Hirota et al. 2010 ). when the form of siliceous compound is unknown, silicic Signifi cant quantities of Si, likely opal, have been detected acid, Si(OH)4 , refers to the dominant unionized form of Si in in freshwater and marine green micro- and macro-algae (Fu aqueous solution at pH 7–8, and amorphous hydrated polym- et al.
    [Show full text]
  • Diversity and Oceanic Distribution of Parmales (Bolidophyceae)
    1 Supplementary materials 2 Supplementary Tables 3 Table S1: Primers and PCR conditions used in this study. 4 Table S2: Environmental sequences from GenBank and CAMERA databases matching 5 Bolidophyceae. Sequences highlighted in grey represent chimeras and were not considered in 6 our analysis. 7 Table S3: Characteristics of the Tara Oceans V9 18S rRNA data set. 8 Table S4: List of all Tara Oceans V9 OTUs classified as Bolidophyceae. 9 Table S5: The six major Tara Oceans V9 OTUs belonging to the Bolidophyceae. Number 10 of sequences for each OTU, % of total Bolidophyceae/Parmales sequences in the 4 fractions 11 considered, % stations where the OTU is present in the 0.8-5 µm fraction in surface and at 12 DCM and % of Bolidophyceae/Parmales sequences in 0.8-5 µm fraction. 13 14 Supplementary Figures 15 Fig. S1: Map of the Tara Oceans expedition with location of the stations sampled. 16 Fig. S2: Nucleotide signature of the six major OTUs belonging to Bolidophyceae in the V9 17 region of the 18S rRNA gene. The sequence of the diatom Phaeodactylum tricornutum is 18 used as an outgroup. 19 Fig. S3: Proportion of environmental nuclear 18S rRNA gene in each clade defined in the 18S 20 rRNA phylogenetic analysis (Fig. 2). 21 Fig. S4: Maximum-likelihood tree inferred from rRNA ITS (ITS1+5.8S RNA+ITS2) gene 22 nucleotide sequences. Legend as in Fig. 2. 23 Fig. S5: Maximum-likelihood tree inferred from rbcL gene nucleotide sequences. Legend as 24 in Fig. 2. 25 Fig. S6: Maximum-likelihood tree inferred from NADH dehydrogenase subunit1 (nad1) gene 26 nucleotide sequences.
    [Show full text]
  • Diel Shifts in Microbial Eukaryotic Activity in the North Pacific Subtropical Gyre
    ORIGINAL RESEARCH published: 10 October 2018 doi: 10.3389/fmars.2018.00351 A Hard Day’s Night: Diel Shifts in Microbial Eukaryotic Activity in the North Pacific Subtropical Gyre Sarah K. Hu*, Paige E. Connell, Lisa Y. Mesrop and David A. Caron Biological Sciences, University of Southern California, Los Angeles, CA, United States Molecular analysis revealed diel rhythmicity in the metabolic activity of single-celled microbial eukaryotes (protists) within an eddy in the North Pacific Subtropical Gyre (ca. 100 km NE of station ALOHA). Diel trends among different protistan taxonomic groups reflected distinct nutritional capabilities and temporal niche partitioning. Changes in relative metabolic activities among phototrophs corresponded to the light cycle, generally peaking in mid- to late-afternoon. Metabolic activities of protistan taxa with phagotrophic ability were higher at night, relative to daytime, potentially in response to increased availability of picocyanobacterial prey. Tightly correlated Operational Taxonomic Units throughout the diel cycle implicated the existence of parasitic and mutualistic relationships within the microbial eukaryotic community, underscoring the need to define and include these symbiotic interactions in marine food web Edited by: descriptions. This study provided a new high-resolution view into the ecologically Susana Agusti, important interactions among primary producers and consumers that mediate the King Abdullah University of Science and Technology, Saudi Arabia transfer of carbon to higher trophic levels. Characterizations of the temporal dynamics Reviewed by: of protistan activities contribute knowledge for predicting how these microorganisms Xin Lin, respond to environmental forcing factors. Xiamen University, China Roberta L. Hansman, Keywords: microbial eukaryotes, protists, diel periodicity, daily patterns, metabolic activity, microbial ecology, IAEA International Atomic Energy protistan ecology Agency, Monaco *Correspondence: Sarah K.
    [Show full text]
  • Growth Characteristics and Vertical Distribution of Triparma Laevis (Parmales) During Summer in the Oyashio Region, Western North Pacific
    Vol. 68: 107–116, 2013 AQUATIC MICROBIAL ECOLOGY Published online January 10 doi: 10.3354/ame01606 Aquat Microb Ecol Growth characteristics and vertical distribution of Triparma laevis (Parmales) during summer in the Oyashio region, western North Pacific Mutsuo Ichinomiya1,*, Miwa Nakamachi2, Yugo Shimizu3, Akira Kuwata2 1Prefectural University of Kumamoto, Tsukide 3−1−100, Higashi, Kumamoto 862−8502, Japan 2Tohoku National Fisheries Research Institute, Shinhamacho 3−27−5, Shiogama, Miyagi 985−0001, Japan 3National Research Institute of Fisheries Science, Fukuura 2−12−4, Kanazawa, Yokohama 236−8648, Japan ABSTRACT: The vertical and regional distribution of Triparma laevis (Parmales), a siliceous pico- sized eukaryotic marine phytoplankton species, was investigated during summer off the south coast of Hokkaido, Japan, in the western North Pacific. Growth characteristics were also studied in the laboratory using a recently isolated culture strain. T. laevis was abundant in the subsurface layer (30 to 50 m), where water temperature was <10°C, but it was absent above the pycnocline when temperatures were >15°C. Growth experiments revealed that T. laevis was able to grow at 0 to 10°C but not higher than 15°C, indicating that its depth distribution mainly depended on tem- perature. High irradiances resulted in increased growth rates of T. laevis, with the highest rates of 0.50 d−1 at 150 µmol m−2 s−1. Using measured daily incident photosynthetically available radiation and in situ light attenuation, the growth rates of T. laevis at 30 and 50 m were calculated as 0.02 to 0.34 and −0.01 to 0.08 d−1, respectively.
    [Show full text]
  • 9781107555655 Index.Pdf
    Cambridge University Press 978-1-107-55565-5 — Phycology Robert Edward Lee Index More Information INDEX Acanthopeltis , 95 Amphiprora , 501 aragonite , 87 , 88 , 89 , 115 , 178 , 283 , Acaryochloris marina , 41 , 85 Amphiroa , 118 416 , 424 , 477 , 511 Acetabularia , 22 , 170 , 171 , 172 Amphiscolops langerhansi , 284 Archaeplastida , 27 Achnanthes exigua , 379 , 382 Amphora , 365 , 374 , 390 Arctic , 58 Achnanthes longipes , 364 , 365 Amphora coffaeformis. , 369 Arthrobacter , 95 Achnanthidium minutissimum , 364 amylopectin , 20 , 21 , 39 , 86 , 135 , Arthrospira , 63 , 67 acritrachs , 267 311 , 510 , 516 Arthrospira fusiformis , 64 Acrochaetiales , 103 , 110 amyloplasts , 10 , 133 , 172 , 173 , 177 , Ascophyllum , 7 , 92 , 93 , 418 , 456 , Acrochaetium , 92 , 98 , 102 , 110 179 , 180 , 207 458 , 459 Acrochaetium asparagopsis , 102 amylose , 21 , 135 , 311 Astasia , 240 , 244 , 245 , 246 Acroseira , 436 Anabaena , 32 , 34 , 58 , 61 , 493 astaxanthin , 133 , 191 Actiniscus pentasterias , 268 Anabaena azollae , 53 Asterionella , 381 Actinocyclus subtilis , 367 Anabaena circinalis , 68 Asterionella formosa , 380 , 384 adelphoparasites , 93 , 128 Anabaena crassa , 39 Asterocytis , 103 , 107 agar , 10 , 94 , 95 , 96 , 97 , 119 , 122 , Anabaena cylindrica , 59 Attheya , 335 132 , 184 , 218 , 365 , 373 , 510 , 512 Anabaena fl os-aquae , 39 , 40 Audouinella , 110 agarophyte , 122 , 510 Anabaenopsis , 64 Aulosira fertilissima , 60 agars , 85 anatoxin , 61 , 492 , 493 Aulosira implexa , 68 agglutination , 138 , 187 , 230 , 510 androsporangia , 217
    [Show full text]
  • Parmales, a New Order of Marine Chrysophytes, with Descriptions of Three New Genera and Seven New Species',2
    J. Phyol. 23, 245-260 (1987) PARMALES, A NEW ORDER OF MARINE CHRYSOPHYTES, WITH DESCRIPTIONS OF THREE NEW GENERA AND SEVEN NEW SPECIES',2 Beatrice C. Booth3 School of Oceanography, University of Washington, Seattle, Washington 98 195 and HarzIej J. )\/larchant Antarctic Division, Department of Science, Channel Highway, Kingston, Tasmania 7 150, Australia ABSTRACT by Nishida (1986) and Takahashi et al. (1986). Here A neu order, Parrnales, in the Chryophyeae has cells we describe the entire group as a new order within wth szlzceous ualls rnacle up of round, trzradzate and the Chrysophyceae, an algal class with a number of sometunes oblong plates alljitting edge to edge. In the nez orders containing organisms with siliceous cell walls. farnzlj, Octolarninaceae, cell ualls haile ezght plates. Cell Unlike other siliceous organisms such as diatoms, ulalls in the neul gems Tetraparma haw four round there seems to be a wide range of variation, even plates and four trziadzate plates. Cell ualls in the new within a distinct form, in the size, length and density genus Triparma halie three round plates of equal size, of the ornamentation. For these reasons it is not one larger round plate, one trzradiate plate and three possible at this time to know if all the 21 distinct oblong plates In the neul fa,nilj, Pentalaminaceae, cell forms which have been observed to date are discrete walls hazv three round and two triradiate plates. A total species. We have therefore used the basic construc- of seim neu species andfour subspecies are described from tion and symmetry of the cell wall as conservative subarctic Pa@ and ,4ntarctic waters.
    [Show full text]
  • Parmales, Heterokontophyta)
    Effects of Silicon-Limitation on Growth and Morphology of Triparma laevis NIES-2565 (Parmales, Heterokontophyta) Kazumasa Yamada1, Shinya Yoshikawa1*, Mutsuo Ichinomiya2, Akira Kuwata3, Mitsunobu Kamiya1, Kaori Ohki1 1 Faculty of Marine Bioscience, Fukui Prefectural University, Obama, Japan, 2 Faculty of Environmental & Symbiotic Sciences, Prefectural University of Kumamoto, Tsukide, Higashi, Kumamoto, Japan, 3 Tohoku National Fisheries Research Institute, Shinhama-cho, Shiogama, Japan Abstract The order Parmales (Heterokontophyta) is a group of small-sized unicellular marine phytoplankton, which is distributed widely from tropical to polar waters. The cells of Parmales are surrounded by a distinctive cell wall, which consists of several siliceous plates fitting edge to edge. Phylogenetic and morphological analyses suggest that Parmales is one of the key organisms for elucidating the evolutionary origin of Bacillariophyceae (diatoms), the most successful heterokontophyta. The effects of silicon-limitation on growth and morphogenesis of plates were studied using a strain of Triparma laevis NIES-2565, which was cultured for the first time in artificial sea water. The cells of T. laevis were surrounded by eight plates when grown with sufficient silicon. However, plate formation became incomplete when cells were cultured in a medium containing low silicate (ca. ,10 mM). Cells finally lost almost all plates in a medium containing silicate concentrations lower than ca. 1 mM. However, silicon-limitation did not affect growth rate; cells continued to divide without changing their growth rate, even after all plates were lost. Loss of plates was reversible; when cells without plates were transferred to a medium containing sufficient silicate, regeneration of shield and ventral plates was followed by the formation of girdle and triradiate plates.
    [Show full text]
  • Sequence Analysis Confirms a New Algal Class Linda K
    Sequence analysis confirms a new algal class Linda K. Medlin, Yves Desdevises To cite this version: Linda K. Medlin, Yves Desdevises. Sequence analysis confirms a new algal class. Vie et Milieu /Life & Environment, Observatoire Océanologique - Laboratoire Arago, 2018. hal-02144544 HAL Id: hal-02144544 https://hal.archives-ouvertes.fr/hal-02144544 Submitted on 26 Jul 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. VIE ET MILIEU - LIFE AND ENVIRONMENT, 2018, 68 (2-3): 151-155 SEQUENCE analysis CONFIRMS A NEW ALGAL CLASS L. K. MEDLIN 1, Y. DESDEVISES 2 1 Marine Biological Association of the UK, The Citadel, Plymouth PL1 2PB, UK 2 Sorbonne Université, CNRS, UMR 7232, Biologie Intégrative des Organismes Marins, BIOM, Observatoire Océanologique, 66650 Banyuls/Mer, France MICROALGAL CLASS ABSTRACT. – As ���������������������������������������������������������������������������udged by its position in an SSU rRNA tree, a new algal class in the heter- CHLOROMORUM PHYLOGENETICS okonts has been recovered from a Bayesian phylogenetic analysis of 12 heterokont algal classes. The sequences were deposited in Genbank as raphidophytes, possibly because cells of these species resemble raphidophytes. A constrained tree placing them in the raphidiophytes yielded a significantly worse tree as determined by the Shimodaira-Hasegawa test (P = 0.0001).
    [Show full text]
  • Supplementary Information
    Supplementary information Methods 1. Creation of the plastid dataset We retrieved the protein annotations for 75 selected plastid genomes of Rhodophyta, Cryptophyta, Haptophyceae and Ochrophyta from the NCBI RefSeq database (https://www.ncbi.nlm.nih.gov/) (Supplementary Table 8). We used OrthoFinder (Emms and ​ ​ Kelly 2015) with a BLASTP E-value threshold of 1e-5 and an MCL inflation parameter of 1.5 to produce orthogroups (OGs). We filtered the 504 resulting OGs to retain those (108) containing ≥20 species (of which ≥1 Rhodophyta, ≥1 Stramenopiles, and either ≥1 Cryptophyta or ≥1 Haptophyceae). We first aligned the selected OGs with MAFFT (L-INS-i algorithm, 5000 iterations) (Katoh and Standley 2013), then enriched them by adding more species from genomic data (such as the five new species sequenced in this study) with Forty-Two (https://metacpan.org/release/Bio-MUST-Apps-FortyTwo). We checked for possible paralogy using methods that are described in the section about the construction of the nuclear dataset (see below) and found only one dubious OG, from which we manually removed four paralogous sequences. We further discarded 9 additional OGs with <30 species. Finally, to select unambiguously aligned positions, we applied a loose BMGE (Criscuolo et al. 2010) filter (entropy cutoff of 0.6 and gap cutoff of 0.4) on each aligned OG. 2. Creation of the mitochondrial dataset As for the plastid, we retrieved all the protein annotations available for stramenopiles mitochondrial genomes from the NCBI website (Supplementary Table 9). To this first set, we added the annotations of the five new species generated in this study, as well as some identified from genomic scaffolds of Labyrinthulomycetes and Xanthophyceae species presenting a high similarity to mitochondrial genomes, using MFannot server (Beck and Lang 2010; MFannot, organelle genome annotation websever; ​ http://megasun.bch.umontreal.ca/cgi-bin/mfannot/mfannotInterface.pl).
    [Show full text]
  • Sequencing and Phylogenetic Analysis of Chloroplast Genes in Freshwater Raphidophytes
    G C A T T A C G G C A T genes Article Sequencing and Phylogenetic Analysis of Chloroplast Genes in Freshwater Raphidophytes Ingrid Sassenhagen 1,* and Karin Rengefors 2 1 Laboratoire d’Océanologie et des Geosciences, UMR CNRS 8187, Université du Littoral Côte d’Opale, 62930 Wimereux, France 2 Aquatic Ecology, Department of Biology, Lund University, 22362 Lund, Sweden; [email protected] * Correspondence: [email protected] Received: 8 February 2019; Accepted: 20 March 2019; Published: 22 March 2019 Abstract: The complex evolution of chloroplasts in microalgae has resulted in highly diverse pigment profiles. Freshwater raphidophytes, for example, display a very different pigment composition to marine raphidophytes. To investigate potential differences in the evolutionary origin of chloroplasts in these two groups of raphidophytes, the plastid genomes of the freshwater species Gonyostomum semen and Vacuolaria virescens were sequenced. To exclusively sequence the organelle genomes, chloroplasts were manually isolated and amplified using single-cell whole-genome-amplification. Assembled and annotated chloroplast genes of the two species were phylogenetically compared to the marine raphidophyte Heterosigma akashiwo and other evolutionarily more diverse microalgae. These phylogenetic comparisons confirmed the high relatedness of all investigated raphidophyte species despite their large differences in pigment composition. Notable differences regarding the presence of light-independent protochlorophyllide oxidoreductase (LIPOR) genes among raphidophyte algae were also revealed in this study. The whole-genome amplification approach proved to be useful for isolation of chloroplast DNA from nuclear DNA. Although only approximately 50% of the genomes were covered, this was sufficient for a multiple gene phylogeny representing large parts of the chloroplast genes.
    [Show full text]
  • Evolution and Systematics of Plastids of Rhodophytic Branch V.A
    Evolution and Systematics of Plastids of Rhodophytic Branch V.A. Lyubetsky, R.A. Gershgorin, L.I. Rubanov, A.V. Seliverstov, and O.A. Zverkov Institute for Information Transmission Problems of the Russian Academy of Sciences (Kharkevich Institute), Bolshoy Karetny per. 19, build.1, Moscow 127051 Russia, [email protected] The genomes of plastids, semiautonomous organelles originating from cyanobacteria, have been studied in algae of the phylum Rhodophyta as well as in the species with plastids of secondary or tertiary origin from those of Rhodophyta. These include species of the superphyla Alveolata and Heterokonta (classes Bacillariophyceae, Bolidophyceae, Chrysophyceae, Dictyochophyceae, Eustigmatophyceae, Phaeophyceae, Xanthophyceae, and Raphidophyceae) as well as phyla Cryptophyta and Haptophyta. Their growth temperature ranges from -1.8°C in Phaeocystis antarctica (Smith et al. 1999) to 56°C in algae of the class Cyanidiophyceae living in hot springs. The unicellular alga Triparma laevis related to diatoms lives at 0°–10°C and cannot be found at temperatures above 15°C (Ichinomiya, Kuwata 2015). According to Claquin et al. (2008), the optimal growth temperatures for Lepidodinium chlorophorum is about 22°C; Emiliania huxleyi, 23°C; Thalassiosira pseudonana, 25°C; and Pseudo-nitzschia fraudulenta, 21°C. Global warming can substantially change the range of all algae. Similar changes have been observed in the past (Li et al. 2016). Intergenic lengths of three types were calculated in algal plastids: between convergent genes, between divergent genes, and between tandem genes. If neighboring genes overlap, the distance between them is set equal to zero. It is not unusual for convergent and tandem genes to overlap.
    [Show full text]