1,003 Reference Genomes of Bacterial and Archaeal Isolates Expand

Total Page:16

File Type:pdf, Size:1020Kb

Load more

RESOU r CE OPEN 1,003 reference genomes of bacterial and archaeal isolates expand coverage of the tree of life Supratim Mukherjee1,10, Rekha Seshadri1,10, Neha J Varghese1, Emiley A Eloe-Fadrosh1, Jan P Meier-Kolthoff2 , Markus Göker2 , R Cameron Coates1,9, Michalis Hadjithomas1, Georgios A Pavlopoulos1 , David Paez-Espino1 , Yasuo Yoshikuni1, Axel Visel1 , William B Whitman3, George M Garrity4,5, Jonathan A Eisen6, Philip Hugenholtz7 , Amrita Pati1,9, Natalia N Ivanova1, Tanja Woyke1, Hans-Peter Klenk8 & Nikos C Kyrpides1 We present 1,003 reference genomes that were sequenced as part of the Genomic Encyclopedia of Bacteria and Archaea (GEBA) initiative, selected to maximize sequence coverage of phylogenetic space. These genomes double the number of existing type strains and expand their overall phylogenetic diversity by 25%. Comparative analyses with previously available finished and draft genomes reveal a 10.5% increase in novel protein families as a function of phylogenetic diversity. The GEBA genomes recruit 25 million previously unassigned metagenomic proteins from 4,650 samples, improving their phylogenetic and functional interpretation. We identify numerous biosynthetic clusters and experimentally validate a divergent phenazine cluster with potential new chemical structure and antimicrobial activity. This Resource is the largest single release of reference genomes to date. Bacterial and archaeal isolate sequence space is still far from saturated, and future endeavors in this direction will continue to be a valuable resource for scientific discovery. Systematic surveys of the diversity of cultivated microorganisms have subsequent experiments. Typically, a type strain has well-character- lagged behind improvements in sequencing technologies. Traditionally, ized taxonomic and phenotypic data, isolation source metadata, and most isolate sequencing projects are chosen based on the clinical or other criteria, as defined by the International Code of Nomenclature biotechnological relevance of the target organisms or their physiology1. of Prokaryotes (ICNP)10. As of December 5, 2015, there were 12,981 In 2015, 43% of sequenced bacterial genomes comprised just ten human bacterial and archaeal species with valid, published names, with 650 pathogenic species. While sequencing different strains of the same spe- new type strains added (on average) every year11,12. However, despite cies aided our understanding of pathogenesis, the focus on specific bac- their importance, the genomes of only 826 type strains were publicly terial species results in a biased phylogenetic representation of sequence available at the start of this study. space. This skewed phylogeny narrowed our view of the functional and The Genomic Encyclopedia of Bacteria and Archaea (GEBA) pilot evolutionary diversity of microbial life. There is a direct correlation project presented the analysis of 56 type-strain genomes and validated between phylogenetic distance and novel function discovery2,3, which the usefulness of a phylogeny-driven ‘encyclopedia’ of bacteria and suggests that filling the gaps in the phylogenetic tree might result in a archaea3. We now present a substantially expanded data set (GEBA-I) Nature America, Inc., part of Springer Nature. All rights reserved. 4 7 substantial increase in new genes, protein families and pathways . comprising 1,003 reference genomes from 974 bacterial and 29 Reference genomes can fill phylogenetic gaps, but also serve as archaeal type strains. Our objectives were to provide an expanded 201 © anchors for the identification of sequence fragments from metage- reference genome catalog of broad phylogenetic and physiological nomic studies. Previous efforts to expand the bacterial and archaeal diversity, to determine how this catalog facilitates the discovery of reference genomes by targeted sequencing of phylogenetically under- protein families and expands the diversity of known functions, and to represented lineages have enabled vast improvements in taxonomic ascertain whether these type-strain genomes improve the recruitment assignment in metagenomic data sets5. Furthermore, access to and phylogenetic assignment of existing metagenomic sequences. completed genomes enables more accurate whole-genome-based taxonomic assignments6,7 and improved phylogenies8,9. RESULTS Bacterial and archaeal type strains are the representative unit of Increased phylogenetic diversity of microbial genomes a microbial species, and are chosen when the species name is estab- 974 bacterial and 29 archaeal genomes (from 579 genera in 21 phyla and lished. Type strains are maintained in at least two different culture 43 classes) were sequenced as part of the GEBA Initiative (GEBA-I), collections and provide easy access to source strain material for using a phylogeny-based scoring system for strain selection6,13. 1Department of Energy, Joint Genome Institute, Walnut Creek, California, USA. 2Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany. 3Department of Microbiology, University of Georgia, Athens, Georgia, USA. 4Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, Michigan, USA. 5NamesforLife, LLC, East Lansing, Michigan, USA. 6University of California Davis Genome Center, Davis, California, USA. 7Australian Centre for Ecogenomics, The University of Queensland, Brisbane, Queensland, Australia. 8School of Biology, Newcastle University, Newcastle upon, Tyne, UK. 9Present addresses: Zymergen Inc., Emeryville, California, USA (R.C.C.) and Roche Molecular Systems Inc., Pleasanton, California, USA (A.P.). 10These authors contributed equally to this work. Correspondence should be addressed to N.C.K. ([email protected]). Received 8 November 2016; accepted 21 April 2017; published online 12 June 2017; doi:10.1038/nbt.3886 NATURE BIOTECHNOLOGY ADVANCE ONLINE PUBLICATION 1 RESOU r CE 70 2 Of the 1,003 genomes presented, 396 GEBA-I genomes were the a 3 3 2 ActinobacteriaA first sequenced representative of a genus (Fig. 1a). The Caldithrixae, Chloroflexi 4 Deinococcus-Thermus Deferribacteres, Synergistetes and Thermodesulfobacteria (Fig. 1a) Saccharibacteria Crenarchaeot Lentisphaerae phyla have the most new genera. The most populous phyla, in terms Planctomycetes Thaumarchaeota Verrucomicrobia Chlamydiae 1 of numbers of genomes sequenced, were the Proteobacteria (with 330 a 12 Euryarchaeota genomes), Firmicutes (178), Bacteroidetes (163) and Actinobacteria GemmatimonadetesFibrobacteres Caldithrixae (157). The remaining 175 genomes belonged to 17 additional phyla, 72 including the only sequenced representative of the Caldithrixae phy- Ignavibacteriae Bacteroidetes 4 lum (Supplementary Table 1). The GEBA-I strains originate from Deferribacteres a multitude of habitats including extreme environments, terrestrial Chlorobi biomes, industrial waste and human body sites (Supplementary Fig. 1) Chrysiogenetes 52 Firmicutes/Tenericutes Phylum containing GEBA-I genome and unsurprisingly have diverse physiology, genome size and aver- Phylum without GEBA-I genome 2 Nitrospirae Prior to GEBA-I age G+C content (Supplementary Fig. 2). GEBA-I is a high-quality New genus added by GEBA-I Armatimonates Cyanobacteria reference resource with 99.4% (on average) genome completeness AcidobacteriaAc 14 2 (assessed using CheckM ; Supplementary Table 1). Annotation of Synergistetes Thermodesulfobi ThermodesulfobacteriaTh the 1,003 GEBA-I genomes resulted in 3,472,483 predicted genes from 2 Aquificae 9 Coprothermoba um narugense 3.75 Gbp of assembled sequence data (Supplementary Fig. 3 and icae Supplementary Table 1). All GEBA-I genomes are publicly avail- Dictyoglom 0 Caldiseric cter platensi ProteobacteriaPro i able through the Integrated Microbial Genomes with Microbiomes Thermotogae Fusobacteria 144 a (IMG/M) system15 and GenBank, and the corresponding strains a s through the respective culture collection (Supplementary Table 1). Spirochaetes Elusimicrobi To quantify the increase in phylogenetic diversity contributed by 2 2 GEBA-I genomes compared with all previously available, validly 4 named archaeal and bacterial species (i.e., type strains), we meas- b ured the diversity distance of all sequenced type strains in a compre- 100 hensive 16S rRNA gene tree6. The GEBA-I genomes increased the 90 phylogenetic distance threefold, expanding the overall diversity of 80 the type-strain sequence space by ~24% (Fig. 1b). Further, we applied 70 a whole-genome comparative analysis based on the average nucle- 60 otide identity to verify the relative novelty of the GEBA-I genomes 50 compared to a set of 14,625 control genomes. We found that the vast 40 majority (845/1,003) of the GEBA-I genomes were ‘singletons’ on the 30 basis of the proposed criteria for defining a “species group”7, verifying 20 that no other sequenced representative of that species is available. 10 Cumulative percentage of total bRPD 0 Before GEBA-1 Other type strains Expanding the universe of known proteins 0 A total of 3,402,887 protein-coding sequences were predicted from the 1,000 2,000 3,000 4,000 5,000 6,000 7,000 8,000 9,000 10,000 11,000 12,000 1,003 GEBA-I genomes. We compared this data set with 23,470,984 Number of strains non-redundant proteins from all available (14,625) control bacterial and archaeal genomes. Clustering ~26 million total proteins at ~30% Figure 1 GEBA-I strain phylogeny and distribution. (a) Maximum sequence identity over 80% alignment length using KClust resulted likelihood tree based on concatenated alignment
Recommended publications
  • Chemical Structures of Some Examples of Earlier Characterized Antibiotic and Anticancer Specialized

    Chemical Structures of Some Examples of Earlier Characterized Antibiotic and Anticancer Specialized

    Supplementary figure S1: Chemical structures of some examples of earlier characterized antibiotic and anticancer specialized metabolites: (A) salinilactam, (B) lactocillin, (C) streptochlorin, (D) abyssomicin C and (E) salinosporamide K. Figure S2. Heat map representing hierarchical classification of the SMGCs detected in all the metagenomes in the dataset. Table S1: The sampling locations of each of the sites in the dataset. Sample Sample Bio-project Site depth accession accession Samples Latitude Longitude Site description (m) number in SRA number in SRA AT0050m01B1-4C1 SRS598124 PRJNA193416 Atlantis II water column 50, 200, Water column AT0200m01C1-4D1 SRS598125 21°36'19.0" 38°12'09.0 700 and above the brine N "E (ATII 50, ATII 200, 1500 pool water layers AT0700m01C1-3D1 SRS598128 ATII 700, ATII 1500) AT1500m01B1-3C1 SRS598129 ATBRUCL SRS1029632 PRJNA193416 Atlantis II brine 21°36'19.0" 38°12'09.0 1996– Brine pool water ATBRLCL1-3 SRS1029579 (ATII UCL, ATII INF, N "E 2025 layers ATII LCL) ATBRINP SRS481323 PRJNA219363 ATIID-1a SRS1120041 PRJNA299097 ATIID-1b SRS1120130 ATIID-2 SRS1120133 2168 + Sea sediments Atlantis II - sediments 21°36'19.0" 38°12'09.0 ~3.5 core underlying ATII ATIID-3 SRS1120134 (ATII SDM) N "E length brine pool ATIID-4 SRS1120135 ATIID-5 SRS1120142 ATIID-6 SRS1120143 Discovery Deep brine DDBRINP SRS481325 PRJNA219363 21°17'11.0" 38°17'14.0 2026– Brine pool water N "E 2042 layers (DD INF, DD BR) DDBRINE DD-1 SRS1120158 PRJNA299097 DD-2 SRS1120203 DD-3 SRS1120205 Discovery Deep 2180 + Sea sediments sediments 21°17'11.0"
  • Deconstruction of Lignin: from Enzymes to Microorganisms

    Deconstruction of Lignin: from Enzymes to Microorganisms

    molecules Review Deconstruction of Lignin: From Enzymes to Microorganisms Jéssica P. Silva 1, Alonso R. P. Ticona 1 , Pedro R. V. Hamann 1, Betania F. Quirino 2 and Eliane F. Noronha 1,* 1 Enzymology Laboratory, Cell Biology Department, University of Brasilia, 70910-900 Brasília, Brazil; [email protected] (J.P.S.); [email protected] (A.R.P.T.); [email protected] (P.R.V.H.) 2 Genetics and Biotechnology Laboratory, Embrapa-Agroenergy, 70770-901 Brasília, Brazil; [email protected] * Correspondence: [email protected]; Tel.: +55-61-3307-2152 Abstract: Lignocellulosic residues are low-cost abundant feedstocks that can be used for industrial applications. However, their recalcitrance currently makes lignocellulose use limited. In natural environments, microbial communities can completely deconstruct lignocellulose by synergistic action of a set of enzymes and proteins. Microbial degradation of lignin by fungi, important lignin degraders in nature, has been intensively studied. More recently, bacteria have also been described as able to break down lignin, and to have a central role in recycling this plant polymer. Nevertheless, bacterial deconstruction of lignin has not been fully elucidated yet. Direct analysis of environmental samples using metagenomics, metatranscriptomics, and metaproteomics approaches is a powerful strategy to describe/discover enzymes, metabolic pathways, and microorganisms involved in lignin breakdown. Indeed, the use of these complementary techniques leads to a better understanding of the composition, function, and dynamics of microbial communities involved in lignin deconstruction. We focus on omics approaches and their contribution to the discovery of new enzymes and reactions that impact the development of lignin-based bioprocesses.
  • Evolution of the 3-Hydroxypropionate Bicycle and Recent Transfer of Anoxygenic Photosynthesis Into the Chloroflexi

    Evolution of the 3-Hydroxypropionate Bicycle and Recent Transfer of Anoxygenic Photosynthesis Into the Chloroflexi

    Evolution of the 3-hydroxypropionate bicycle and recent transfer of anoxygenic photosynthesis into the Chloroflexi Patrick M. Shiha,b,1, Lewis M. Wardc, and Woodward W. Fischerc,1 aFeedstocks Division, Joint BioEnergy Institute, Emeryville, CA 94608; bEnvironmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720; and cDivision of Geological and Planetary Sciences, California Institute of Technology, Pasadena, CA 91125 Edited by Bob B. Buchanan, University of California, Berkeley, CA, and approved August 21, 2017 (received for review June 14, 2017) Various lines of evidence from both comparative biology and the provide a hard geological constraint on these analyses, the timing geologic record make it clear that the biochemical machinery for of these evolutionary events remains relative, thus highlighting anoxygenic photosynthesis was present on early Earth and provided the uncertainty in our understanding of when and how anoxy- the evolutionary stock from which oxygenic photosynthesis evolved genic photosynthesis may have originated. ca. 2.3 billion years ago. However, the taxonomic identity of these A less recognized alternative is that anoxygenic photosynthesis early anoxygenic phototrophs is uncertain, including whether or not might have been acquired in modern bacterial clades relatively they remain extant. Several phototrophic bacterial clades are thought recently. This possibility is supported by the observation that to have evolved before oxygenic photosynthesis emerged, including anoxygenic photosynthesis often sits within a derived position in the Chloroflexi, a phylum common across a wide range of modern the phyla in which it is found (3). Moreover, it is increasingly environments. Although Chloroflexi have traditionally been thought being recognized that horizontal gene transfer (HGT) has likely to be an ancient phototrophic lineage, genomics has revealed a much played a major role in the distribution of phototrophy (8–10).
  • A Genomic Journey Through a Genus of Large DNA Viruses

    A Genomic Journey Through a Genus of Large DNA Viruses

    University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Virology Papers Virology, Nebraska Center for 2013 Towards defining the chloroviruses: a genomic journey through a genus of large DNA viruses Adrien Jeanniard Aix-Marseille Université David D. Dunigan University of Nebraska-Lincoln, [email protected] James Gurnon University of Nebraska-Lincoln, [email protected] Irina V. Agarkova University of Nebraska-Lincoln, [email protected] Ming Kang University of Nebraska-Lincoln, [email protected] See next page for additional authors Follow this and additional works at: https://digitalcommons.unl.edu/virologypub Part of the Biological Phenomena, Cell Phenomena, and Immunity Commons, Cell and Developmental Biology Commons, Genetics and Genomics Commons, Infectious Disease Commons, Medical Immunology Commons, Medical Pathology Commons, and the Virology Commons Jeanniard, Adrien; Dunigan, David D.; Gurnon, James; Agarkova, Irina V.; Kang, Ming; Vitek, Jason; Duncan, Garry; McClung, O William; Larsen, Megan; Claverie, Jean-Michel; Van Etten, James L.; and Blanc, Guillaume, "Towards defining the chloroviruses: a genomic journey through a genus of large DNA viruses" (2013). Virology Papers. 245. https://digitalcommons.unl.edu/virologypub/245 This Article is brought to you for free and open access by the Virology, Nebraska Center for at DigitalCommons@University of Nebraska - Lincoln. It has been accepted for inclusion in Virology Papers by an authorized administrator of DigitalCommons@University of Nebraska - Lincoln. Authors Adrien Jeanniard, David D. Dunigan, James Gurnon, Irina V. Agarkova, Ming Kang, Jason Vitek, Garry Duncan, O William McClung, Megan Larsen, Jean-Michel Claverie, James L. Van Etten, and Guillaume Blanc This article is available at DigitalCommons@University of Nebraska - Lincoln: https://digitalcommons.unl.edu/ virologypub/245 Jeanniard, Dunigan, Gurnon, Agarkova, Kang, Vitek, Duncan, McClung, Larsen, Claverie, Van Etten & Blanc in BMC Genomics (2013) 14.
  • Distribution of Long Linear and Branched Polyamines in the Thermophiles Belonging to the Domain Bacteria

    Distribution of Long Linear and Branched Polyamines in the Thermophiles Belonging to the Domain Bacteria

    Journal of Japanese Society for Extremophiles (2008) Vol.7 (1) Journal of Japanese Society for Extremophiles (2008), Vol. 7, 10-20 ORIGINAL PAPER Hamana Ka,b,e, Hosoya Ra, Yokota Ac, Niitsu Md, Hayashi He and Itoh Tb Distribution of long linear and branched polyamines in the thermophiles belonging to the domain Bacteria a Gunma University School of Health Sciences, Maebashi, Gunma 371-8514, Japan. bJapan Collection of Microorganisms, RIKEN, BioResource Center, Wako, Saitama 351-0198, Japan. c Institute of Molecular and Cellular Biosciences, The University of Tokyo, Tokyo 113-0032, Japan. d Faculty of Pharmaceutical Sciences, Josai University, Sakado, Saitama 350-0290, Japan. e Faculty of Engineering, Maebashi Institute of Technology, Maebashi, Gunma 371-0816, Japan. Corresponding author : Koei Hamana, [email protected] Phone : +81-27-220-8916, FAX : +81-27-220-8999 Received: April 3, 2008/ Reviced:May 26, 2008/ Acepted:June 3, 2008 Abstract Cellular polyamines of 44 newly validated have been published in eubacteria 15, 16). However, the eubacterial thermophiles growing at 45-80℃, belonging degree of thermophily is roughly estimated and not to eight orders (six phyla) of the domain Bacteria, were defined exactly. The cellular occurrence of long linear analyzed by HPLC and GC. A quaternary branched and/or branched polyamines in extremely thermophilic penta-amine, N4-bis(aminopropyl)norspermidine, was (or hyperthermophilic) eubacteria suggested that the found in Hydrogenivirga and Sulfurihydrogenibium extreme thermophiles (or hyperthermophiles) may have belonging to the order of Aquificales. Another some novel polyamine synthetic abilities possibly quaternary branched penta-amine, N4-bis(aminopropyl) associated with their thermophily 8-11, 13-15, 18, 23, 24).
  • Thermobaculum Terrenum' Type Strain (YNP1)

    Thermobaculum Terrenum' Type Strain (YNP1)

    Lawrence Berkeley National Laboratory Recent Work Title Complete genome sequence of 'Thermobaculum terrenum' type strain (YNP1). Permalink https://escholarship.org/uc/item/5d44s5qf Journal Standards in genomic sciences, 3(2) ISSN 1944-3277 Authors Kiss, Hajnalka Cleland, David Lapidus, Alla et al. Publication Date 2010-10-27 DOI 10.4056/sigs.1153107 Peer reviewed eScholarship.org Powered by the California Digital Library University of California Standards in Genomic Sciences (2010) 3:153-162 DOI:10.4056/sigs.1153107 Complete genome sequence of ‘Thermobaculum T terrenum’ type strain (YNP1 ) Hajnalka Kiss1, David Cleland2, Alla Lapidus3, Susan Lucas3, Tijana Glavina Del Rio3, Matt Nolan3, Hope Tice3, Cliff Han1, Lynne Goodwin1,3, Sam Pitluck3, Konstantinos Liolios3, Natalia Ivanova3, Konstantinos Mavromatis3, Galina Ovchinnikova3, Amrita Pati3, Amy Chen4, Krishna Palaniappan4, Miriam Land3,5, Loren Hauser3,5, Yun-Juan Chang3,5, Cynthia D. Jeffries3,5, Megan Lu3, Thomas Brettin3, John C. Detter1, Markus Göker6, Brian J. Tindall6, Brian Beck2, Timothy R. McDermott7, Tanja Woyke3, James Bristow3, Jonathan A. Eisen3,8, Victor Markowitz4, Philip Hugenholtz3, Nikos C. Kyrpides3, Hans-Peter Klenk6*, and Jan-Fang Cheng3 1 Los Alamos National Laboratory, Bioscience Division, Los Alamos, New Mexico, USA 2 ATCC- American Type Culture Collection, Manassas, Virginia, USA 3 DOE Joint Genome Institute, Walnut Creek, California, USA 4 Biological Data Management and Technology Center, Lawrence Berkeley National Laboratory, Berkeley, California, USA 5 Oak Ridge National Laboratory, Oak Ridge, Tennessee, USA 6 DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany 7 Thermal Biology Institute, Montana State University, Bozeman, Montana, USA 8 University of California Davis Genome Center, Davis, California, USA *Corresponding author: Hans-Peter Klenk Keywords: extreme thermal soil, thermoacidophile, Gram-positive, nonmotile, non-spore- forming, obligate aerobe, Incertae sedis, Chloroflexi, GEBA ‘Thermobaculum terrenum’ Botero et al.
  • Systema Naturae. the Classification of Living Organisms

    Systema Naturae. the Classification of Living Organisms

    Systema Naturae. The classification of living organisms. c Alexey B. Shipunov v. 5.601 (June 26, 2007) Preface Most of researches agree that kingdom-level classification of living things needs the special rules and principles. Two approaches are possible: (a) tree- based, Hennigian approach will look for main dichotomies inside so-called “Tree of Life”; and (b) space-based, Linnaean approach will look for the key differences inside “Natural System” multidimensional “cloud”. Despite of clear advantages of tree-like approach (easy to develop rules and algorithms; trees are self-explaining), in many cases the space-based approach is still prefer- able, because it let us to summarize any kinds of taxonomically related da- ta and to compare different classifications quite easily. This approach also lead us to four-kingdom classification, but with different groups: Monera, Protista, Vegetabilia and Animalia, which represent different steps of in- creased complexity of living things, from simple prokaryotic cell to compound Nature Precedings : doi:10.1038/npre.2007.241.2 Posted 16 Aug 2007 eukaryotic cell and further to tissue/organ cell systems. The classification Only recent taxa. Viruses are not included. Abbreviations: incertae sedis (i.s.); pro parte (p.p.); sensu lato (s.l.); sedis mutabilis (sed.m.); sedis possi- bilis (sed.poss.); sensu stricto (s.str.); status mutabilis (stat.m.); quotes for “environmental” groups; asterisk for paraphyletic* taxa. 1 Regnum Monera Superphylum Archebacteria Phylum 1. Archebacteria Classis 1(1). Euryarcheota 1 2(2). Nanoarchaeota 3(3). Crenarchaeota 2 Superphylum Bacteria 3 Phylum 2. Firmicutes 4 Classis 1(4). Thermotogae sed.m. 2(5).
  • Widespread Impact of Horizontal Gene Transfer on Plant Colonization of Land

    Widespread Impact of Horizontal Gene Transfer on Plant Colonization of Land

    ARTICLE Received 10 Jun 2012 | Accepted 20 Sep 2012 | Published 23 Oct 2012 DOI: 10.1038/ncomms2148 Widespread impact of horizontal gene transfer on plant colonization of land Jipei Yue1,2, Xiangyang Hu1,3, Hang Sun1, Yongping Yang1,3 & Jinling Huang2 In complex multicellular eukaryotes such as animals and plants, horizontal gene transfer is commonly considered rare with very limited evolutionary significance. Here we show that horizontal gene transfer is a dynamic process occurring frequently in the early evolution of land plants. Our genome analyses of the moss Physcomitrella patens identified 57 families of nuclear genes that were acquired from prokaryotes, fungi or viruses. Many of these gene families were transferred to the ancestors of green or land plants. Available experimental evidence shows that these anciently acquired genes are involved in some essential or plant- specific activities such as xylem formation, plant defence, nitrogen recycling as well as the biosynthesis of starch, polyamines, hormones and glutathione. These findings suggest that horizontal gene transfer had a critical role in the transition of plants from aquatic to terrestrial environments. On the basis of these findings, we propose a model of horizontal gene transfer mechanism in nonvascular and seedless vascular plants. 1 Key Laboratory of Biodiversity and Biogeography, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China. 2 Department of Biology, East Carolina University, Greenville, North Carolina 27858, USA. 3 Institute of Tibet Plateau Research, Chinese Academy of Sciences, Kunming 650201, China. Correspondence and requests for materials should be addressed to J.H. (email: [email protected]). NATURE COMMUNICATIONS | 3:1152 | DOI: 10.1038/ncomms2148 | www.nature.com/naturecommunications © 2012 Macmillan Publishers Limited.
  • Neo-Formation of Chromosomes in Bacteria

    Neo-Formation of Chromosomes in Bacteria

    bioRxiv preprint doi: https://doi.org/10.1101/264945; this version posted March 2, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Neo-formation of chromosomes in bacteria 1,2* 1,2,3** Olivier Poirion & Bénédicte Lafay 1 Université de Lyon, F-69134 Lyon, France 2 CNRS (French National Center for Scientific Research) UMR5005, Laboratoire Ampère, École Centrale de Lyon, 36 avenue Guy de Collongue, 69134 Écully, France 3 CNRS (French National Center for Scientific Research) UMR5558, Laboratoire de Biométrie et Bioogie Évolutive, Université Claude Bernard – Lyon 1, 43 boulevard du 11 novembre 1918, 69622 Villeurbanne cedex, France * Present address: Cancer Epidemiology Program, University of Hawaii Cancer Center, University of Hawaii at Manoa, 701 Ilalo Street Honolulu, HI 96813, USA ** Author for correspondence: [email protected] bioRxiv preprint doi: https://doi.org/10.1101/264945; this version posted March 2, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. Summary The genome of bacteria is classically separated into the essential, stable and slow evolving chromosomes and the accessory, mobile and rapidly evolving plasmids. This distinction was blurred with the characterization of multipartite genomes constituted of a standard chromosome and additional essential replicons, generally thought to be adapted plasmids. Whereas chromosomes are exclusively transmitted from mother to daughter cell during the bacterial cell cycle, in general, the duplication of plasmids is unlinked to the cell cycle and their transfer to daughter cell is random.
  • Nitrolancea Hollandica Gen. Nov., Sp. Nov., a Chemolithoautotrophic Nitrite-Oxidizing Bacterium Isolated from a Bioreactor Belonging to the Phylum Chloroflexi

    Nitrolancea Hollandica Gen. Nov., Sp. Nov., a Chemolithoautotrophic Nitrite-Oxidizing Bacterium Isolated from a Bioreactor Belonging to the Phylum Chloroflexi

    International Journal of Systematic and Evolutionary Microbiology (2014), 64, 1859–1865 DOI 10.1099/ijs.0.062232-0 Nitrolancea hollandica gen. nov., sp. nov., a chemolithoautotrophic nitrite-oxidizing bacterium isolated from a bioreactor belonging to the phylum Chloroflexi Dimitry Y. Sorokin,1,2 Dana Vejmelkova,3 Sebastian Lu¨cker,4 Galina M. Streshinskaya,5 W. Irene C. Rijpstra,6 Jaap S. Sinninghe Damste´,6 Robbert Kleerbezem,2 Mark van Loosdrecht,2 Gerard Muyzer7 and Holger Daims4 Correspondence 1Winogradsky Institute of Microbiology RAS, Moscow, Russia Dimitry Y. Sorokin 2Department of Biotechnology, TU Delft, The Netherlands [email protected] 3Department of Water Technology, ICT Prague, Czech Republic 4Department of Microbial Ecology, Ecology Centre, University of Vienna, Vienna, Austria 5Microbiology Department, Faculty of Biology, Lomonosov Moscow State University, Moscow, Russia 6Department of Marine Organic Biogeochemistry, NIOZ Royal Netherlands Institute for Sea Research, Den Burg, The Netherlands 7Department of Aquatic Microbiology, Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands A novel nitrite-oxidizing bacterium (NOB), strain LbT, was isolated from a nitrifying bioreactor with a high loading of ammonium bicarbonate in a mineral medium with nitrite as the energy source. The cells were oval (lancet-shaped) rods with pointed edges, non-motile, Gram-positive (by staining and from the cell wall structure) and non-spore-forming. Strain LbT was an obligately aerobic, chemolitoautotrophic NOB, utilizing nitrite or formate as the energy source and CO2 as the carbon source. Ammonium served as the only source of assimilated nitrogen. Growth with nitrite was optimal at pH 6.8–7.5 and at 40 6C (maximum 46 6C).
  • Plastid-Localized Amino Acid Biosynthetic Pathways of Plantae Are Predominantly Composed of Non-Cyanobacterial Enzymes

    Plastid-Localized Amino Acid Biosynthetic Pathways of Plantae Are Predominantly Composed of Non-Cyanobacterial Enzymes

    Plastid-localized amino acid biosynthetic pathways of Plantae are predominantly SUBJECT AREAS: MOLECULAR EVOLUTION composed of non-cyanobacterial PHYLOGENETICS PLANT EVOLUTION enzymes PHYLOGENY Adrian Reyes-Prieto1* & Ahmed Moustafa2* Received 1 26 September 2012 Canadian Institute for Advanced Research and Department of Biology, University of New Brunswick, Fredericton, Canada, 2Department of Biology and Biotechnology Graduate Program, American University in Cairo, Egypt. Accepted 27 November 2012 Studies of photosynthetic eukaryotes have revealed that the evolution of plastids from cyanobacteria Published involved the recruitment of non-cyanobacterial proteins. Our phylogenetic survey of .100 Arabidopsis 11 December 2012 nuclear-encoded plastid enzymes involved in amino acid biosynthesis identified only 21 unambiguous cyanobacterial-derived proteins. Some of the several non-cyanobacterial plastid enzymes have a shared phylogenetic origin in the three Plantae lineages. We hypothesize that during the evolution of plastids some enzymes encoded in the host nuclear genome were mistargeted into the plastid. Then, the activity of those Correspondence and foreign enzymes was sustained by both the plastid metabolites and interactions with the native requests for materials cyanobacterial enzymes. Some of the novel enzymatic activities were favored by selective compartmentation should be addressed to of additional complementary enzymes. The mosaic phylogenetic composition of the plastid amino acid A.R.-P. ([email protected]) biosynthetic pathways and the reduced number of plastid-encoded proteins of non-cyanobacterial origin suggest that enzyme recruitment underlies the recompartmentation of metabolic routes during the evolution of plastids. * Equal contribution made by these authors. rimary plastids of plants and algae are the evolutionary outcome of an endosymbiotic association between eukaryotes and cyanobacteria1.
  • Highly Abundant and Diverse Planktonic Freshwater Chloroflexi

    Highly Abundant and Diverse Planktonic Freshwater Chloroflexi

    bioRxiv preprint doi: https://doi.org/10.1101/366732; this version posted July 10, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 1 Hidden in plain sight - highly abundant and diverse planktonic freshwater Chloroflexi 2 3 Maliheh Mehrshad1*, Michaela M. Salcher2, Yusuke Okazaki3, Shin-ichi Nakano3, Karel Šimek1, 4 Adrian-Stefan Andrei1, Rohit Ghai1* 5 6 1 Biology Centre of the Czech Academy of Sciences, Institute of Hydrobiology, Department of Aquatic 7 Microbial Ecology, České Budějovice, Czech Republic 8 2 Department of Limnology, Institute of Plant Biology, University of Zurich, Seestrasse 187, CH-8802 9 Kilchberg, Switzerland 10 3 Center for Ecological Research, Kyoto University, 2-509-3 Hirano, Otsu, Shiga, 520-2113, Japan 11 12 *Corresponding authors: 13 Maliheh Mehrshad 14 Rohit Ghai 15 Institute of Hydrobiology, Department of Aquatic Microbial Ecology, Biology Centre ASCR 16 Na Sádkách 7, 370 05, České Budějovice, Czech Republic 17 Tel: 00420 38777 5819 18 Email: [email protected], 19 [email protected] 20 bioRxiv preprint doi: https://doi.org/10.1101/366732; this version posted July 10, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 21 Abstract 22 Background: Representatives of the phylum Chloroflexi, though reportedly highly abundant (up to 23 30% of total prokaryotes) in the extensive deep water habitats of both marine (SAR202) and 24 freshwater (CL500-11), remain uncultivated and uncharacterized.