Reconstruction of Bacterial Strain Genomes from Shotgun Metagenomic Reads

Total Page:16

File Type:pdf, Size:1020Kb

Reconstruction of Bacterial Strain Genomes from Shotgun Metagenomic Reads University of Central Florida STARS Electronic Theses and Dissertations, 2020- 2020 Reconstruction of Bacterial Strain Genomes from Shotgun Metagenomic Reads Xin Li University of Central Florida Part of the Computer Sciences Commons Find similar works at: https://stars.library.ucf.edu/etd2020 University of Central Florida Libraries http://library.ucf.edu This Doctoral Dissertation (Open Access) is brought to you for free and open access by STARS. It has been accepted for inclusion in Electronic Theses and Dissertations, 2020- by an authorized administrator of STARS. For more information, please contact [email protected]. STARS Citation Li, Xin, "Reconstruction of Bacterial Strain Genomes from Shotgun Metagenomic Reads" (2020). Electronic Theses and Dissertations, 2020-. 377. https://stars.library.ucf.edu/etd2020/377 RECONSTRUCTION OF BACTERIAL STRAIN GENOMES FROM SHOTGUN METAGENOMIC READS by XIN LI B.E. Yanshan University, 2010 M.S. Florida International University, 2013 A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Computer Science in the Department of Computer Science in the College of Engineering and Computer Science at the University of Central Florida Orlando, Florida Fall Term 2020 Major Professor: Haiyan Hu ABSTRACT It is necessary to study bacterial strains in environmental samples. The environmental samples are mixed DNA samples collected from the ocean, soil, lake, human body sites, etc. In a natural environment, they provide us new insights into the diversity of our earth. As for bacterial strains on or inside human bodies, to select the proper treatment for diseases caused by bacterial strains, it is critical to identify the corresponding strains and reconstruct their genomes. However, it is a challenge to do so with the DNA from a large number of unknown microbial species mixed together in an environmental sample. The majority of available computational methods depend on available sequenced genomes and marker genes, which can not fully discover the strains and reconstruct their genomes from the shotgun metagenomic reads. In this dissertation, we studied bacterial strain reconstruction, including one case study about shotgun metagenomic sequencing and two novel approaches to improve the performance of reconstructing bacterial strains. Firstly, we studied how newly sequenced genomes affect the analysis result from shotgun metagenomic datasets. In this study, we found two more new phyla that were related to colitis development compared with a previous study, and the two new phyla were also more statistically significant. Furthermore, we found that one major conclusion from the previous study was not supported by repeating the analysis with an updated marker gene database and tools in metagenomics. Secondly, to better analyze shotgun metagenomic datasets, BHap, a novel algorithm based on fuzzy flow networks and de Bruijn graph was developed to reconstruct bacterial strains. BHap had high precision, recall and F1 score and low susceptibility to sequence errors. It also outperformed existing tools in terms of better precision, better recall, higher F1 score and more accurate estimation of the number of strains. Last but not least, a second approach, mixtureS, was developed by considering all genome positions. MixtureS is based on the EM ii algorithms and the frequency difference of strains to distinguish different strains of a bacterial species in shotgun metagenomic datasets. Compared with several existing methods including BHap, mixtureS had a better performance in terms of precision, recall, the prediction accuracy of the strain numbers and abundance. Based on the developed BHap and mixtureS methods, we also developed two software tools, which will be valuable for future strain studies in metagenomics. iii ACKNOWLEDGMENTS I would sincerely express my gratitude to my advisor Dr. Haiyan Hu, who continually teaches me the importance of a good attitude for research and work. That is extremely important for my Ph.D. studies. It would be not possible to finish this dissertation without her guidance and help. I would also like to thank my co-advisor Dr.Xiaoman Li, who is a talented scientist. I have learned a lot from him for the past six years. He has given countless thoughtful suggestions to my research, and guide me in the right research direction. I also want to this opportunity to thank my committee members Dr. Saleh A. Naser and Dr. Liqiang Wang for serving in my committee. Their suggestions are very helpful for me to complete my dissertation. Finally, I would like to thank my parents. They are supporting me and encouraging me in many different ways through my whole graduate study. iv TABLE OF CONTENTS LIST OF FIGURES ..................................................................................................................... viii LIST OF TABLES ......................................................................................................................... ix CHAPTER ONE: INTRODUCTION ............................................................................................. 1 1.1 Metagenomics ....................................................................................................................... 1 1.2 16S ribosomal RNA and shotgun sequencing ...................................................................... 4 1.3 Bacterial strains ..................................................................................................................... 5 1.4 Other strain reconstruction method ....................................................................................... 7 1.5 Overview of the Dissertation ................................................................................................ 8 CHAPTER TWO: OLD METAGENOMIC DATA MEET NEWLY SEQUENCED GENOMES ......................................................................................................................................................... 9 2.1 Introduction ........................................................................................................................... 9 2.2 Materials and Methods ........................................................................................................ 12 2.2.1 Data and their processing ............................................................................................. 12 2.2.2 Database preparation and Centrifuge ........................................................................... 14 2.2.3 MetaPhlAn ................................................................................................................... 15 2.2.4 Statistical analysis ........................................................................................................ 15 2.3 Results ................................................................................................................................. 15 2.3.1 At least two new phyla may relate to colitis development in patients ......................... 15 2.3.2 Hundreds of lower taxa may relate to colitis development in patients ........................ 19 v 2.3.3 Many identified taxa may relate to colitis development based on literature ............... 24 2.3.4 Re-analyses with MetaPhlAn support that Actinobacteria is different between CF samples and PtC samples ...................................................................................................... 26 2.4 Discussion ........................................................................................................................... 30 CHAPTER THREE: AN APPROACH FOR BACTERIAL STRAIN RECONSTRUCTION BASED ON DE BRUIJN GRAPH ............................................................................................... 35 3.1 Introduction ......................................................................................................................... 35 3.2 Materials and Methods ........................................................................................................ 37 3.2.1 Simulated datasets ........................................................................................................ 37 3.2.2 Experimental datasets .................................................................................................. 42 3.2.3 BHap, a novel approach for strain reconstruction in bacterial populations ................. 44 3.2.4 Evaluation of BHap and other tools ............................................................................. 50 3.3 Results ................................................................................................................................. 50 3.3.1 BHap has a robust performance with varied parameter values .................................... 50 3.3.2 BHap reconstructs strains better than EVORhA on simulated datasets ...................... 57 3.3.3 BHap reconstructs strains better than EVORhA on experimental dataset ................... 67 3.4 Discussion ........................................................................................................................... 70 CHAPTER FOUR: A NOVEL TOOL FOR BACTERIAL STRAIN RECONSTRUCTION FROM READS ......................................................................................................................................... 72 4.1 Introduction ......................................................................................................................... 72 vi 4.2 Materials and Methods .......................................................................................................
Recommended publications
  • Doctoral Dissertation Template
    UNIVERSITY OF OKLAHOMA GRADUATE COLLEGE METAGENOMIC INSIGHTS INTO MICROBIAL COMMUNITY RESPONSES TO LONG-TERM ELEVATED CO2 A DISSERTATION SUBMITTED TO THE GRADUATE FACULTY in partial fulfillment of the requirements for the Degree of DOCTOR OF PHILOSOPHY By QICHAO TU Norman, Oklahoma 2014 METAGENOMIC INSIGHTS INTO MICROBIAL COMMUNITY RESPONSES TO LONG-TERM ELEVATED CO2 A DISSERTATION APPROVED FOR THE DEPARTMENT OF MICROBIOLOGY AND PLANT BIOLOGY BY ______________________________ Dr. Jizhong Zhou, Chair ______________________________ Dr. Meijun Zhu ______________________________ Dr. Fengxia (Felicia) Qi ______________________________ Dr. Michael McInerney ______________________________ Dr. Bradley Stevenson © Copyright by QICHAO TU 2014 All Rights Reserved. Acknowledgements At this special moment approaching the last stage for this degree, I would like to express my gratitude to all the people who encouraged me and helped me out through the past years. Dr. Jizhong Zhou, my advisor, is no doubt the most influential and helpful person in pursuing my academic goals. In addition to continuous financial support for the past six years, he is the person who led me into the field of environmental microbiology, from a background of bioinformatics and plant molecular biology. I really appreciated the vast training I received from the many interesting projects I got involved in, without which I would hardly develop my broad experienced background from pure culture microbial genomics to complex metagenomics. Dr. Zhili He, who played a role as my second advisor, is also the person I would like to thank most. Without his help, I could be still struggling working on those manuscripts lying in my hard drive. I definitely learned a lot from him in organizing massed results into logical scientific work—skills that will benefit me for life.
    [Show full text]
  • Acetobacteroides Hydrogenigenes Gen. Nov., Sp. Nov., an Anaerobic Hydrogen-Producing Bacterium in the Family Rikenellaceae Isolated from a Reed Swamp
    %paper no. ije063917 charlesworth ref: ije063917& New Taxa - Bacteroidetes International Journal of Systematic and Evolutionary Microbiology (2014), 64, 000–000 DOI 10.1099/ijs.0.063917-0 Acetobacteroides hydrogenigenes gen. nov., sp. nov., an anaerobic hydrogen-producing bacterium in the family Rikenellaceae isolated from a reed swamp Xiao-Li Su,1,2 Qi Tian,1,3 Jie Zhang,1,2 Xian-Zheng Yuan,1 Xiao-Shuang Shi,1 Rong-Bo Guo1 and Yan-Ling Qiu1 Correspondence 1Key Laboratory of Biofuels, Qingdao Institute of Bioenergy and Bioprocess Technology, Yan-Ling Qiu Chinese Academy of Sciences, Qingdao, Shandong Province 266101, PR China [email protected] 2University of Chinese Academy of Sciences, Beijing 100049, PR China 3Ocean University of China, Qingdao, 266101, PR China A strictly anaerobic, mesophilic, carbohydrate-fermenting, hydrogen-producing bacterium, designated strain RL-CT, was isolated from a reed swamp in China. Cells were Gram-stain- negative, catalase-negative, non-spore-forming, non-motile rods measuring 0.7–1.0 mm in width and 3.0–8.0 mm in length. The optimum temperature for growth of strain RL-CT was 37 6C (range 25–40 6C) and pH 7.0–7.5 (range pH 5.7–8.0). The strain could grow fermentatively on yeast extract, tryptone, arabinose, glucose, galactose, mannose, maltose, lactose, glycogen, pectin and starch. The main end products of glucose fermentation were acetate, H2 and CO2. Organic acids, alcohols and amino acids were not utilized for growth. Yeast extract was not required for growth; however, it stimulated growth slightly. Nitrate, sulfate, sulfite, thiosulfate, elemental sulfur and Fe(III) nitrilotriacetate were not reduced as terminal electron acceptors.
    [Show full text]
  • Noncontiguous Finished Genome Sequence and Description Of
    TAXONOGENOMICS:GENOME OF A NEW ORGANISM Noncontiguous finished genome sequence and description of Gabonia massiliensis gen. nov., sp. nov. G. Mourembou1,2, J. Rathored1, A. Ndjoyi-Mbiguino4, J. B. Lekana-Douki3,5, F. Fenollar1, C. Robert1, P.-E. Fournier1, D. Raoult1,6 and J. C. Lagier1 1) Aix Marseille Université, URMITE, UM63, CNRS 7278, IRD 198, INSERM 1095, Marseille, France, 2) Ecole Doctorale Régionale d’Afrique Centrale, 3) Unité de Parasitologie Médicale (UPARAM) CIRMF, Franceville, 4) Département de Microbiologie, Laboratoire National de référence IST/sida, Faculté de Médecine, 5) Département de Parasitologie Mycologie et de Médecine Tropicale, Université des Sciences de la Santé B.P., Libreville, Gabon and 6) Special Infectious Agents Unit, King Fahd Medical Research Center, King Abdulaziz University, Jeddah, Saudi Arabia Abstract Culturomics coupled with taxonogenomics is currently used to isolate and characterize new bacteria. Here we describe the features and complete genome sequence of Gabonia massiliensis strain GM3, an anaerobic Gram negative, non-spore-forming and catalase-positive bacillus isolated from a stool specimen of a healthy Gabonese male youth. Belonging to a new genus called Gabonia, it exhibits a genome of 4 261 752 bp including 37.9% GC content and 3,288 predicted genes. New Microbes and New Infections © 2015 The Authors. Published by Elsevier Ltd on behalf of European Society of Clinical Microbiology and Infectious Diseases. Keywords: Culturomics, Gabonia massiliensis gen. nov. et sp. nov., genome, taxonogenomics, gut microbiota Original Submission: 28 September 2015; Revised Submission: 3 November 2015; Accepted: 4 November 2015 Article published online: 24 November 2015 including 16S rRNA sequence similarity, GC percentage and Corresponding author: J.C.
    [Show full text]
  • Antibitoic Treatment for Tuberculosis Induces a Profound Dysbiosis of the Gut Microbiome That Persists Long After Therapy Is Completed
    ANTIBITOIC TREATMENT FOR TUBERCULOSIS INDUCES A PROFOUND DYSBIOSIS OF THE GUT MICROBIOME THAT PERSISTS LONG AFTER THERAPY IS COMPLETED A Thesis Presented to the Faculty of the Weill Cornell Graduate School of Medical Sciences in Partial Fulfillment of the Requirements for the Degree of Masters of Science by Matthew F. Wipperman May 2017 © 2017 Matthew F. Wipperman ABSTRACT Mycobacterium tuberculosis, the cause of Tuberculosis (TB), infects one third of the world’s population and causes substantial mortality worldwide. In its shortest format, treatment of drug sensitive TB requires six months of multidrug therapy with a mixture of broad spectrum and mycobacterial specific antibiotics, and treatment of multidrug resistant TB is much longer. The widespread use of this regimen worldwide makes this one the largest exposures of humans to antimicrobials, yet the effects of antimycobacterial agents on intestinal microbiome composition and long term stability are unknown. We compared the microbiome composition, assessed by both 16S rDNA and metagenomic DNA sequencing, of Haitian TB cases during antimycobacterial treatment and following cure by 6 months of TB therapy. TB treatment does not perturb overall diversity, but nonetheless dramatically depletes multiple immunologically significant commensal bacteria. The perturbation by TB therapy lasts at least 1.5 years after completion of treatment, indicating that the effects of TB treatment are long lasting and perhaps permanent. These results demonstrate that TB treatment has dramatic and durable effects on the intestinal microbiome and highlight unexpected extreme consequences of treatment for the world’s most common infection on human ecology. BIOGRAPHICAL SKETCH NAME POSITION TITLE Wipperman, Matthew Frederick Postdoctoral Researcher at eRA COMMONS USER NAME Memorial Sloan Kettering Cancer Center MFWIPPERMAN DEGREE INSTITUTION AND (if MM/YY FIELD OF STUDY LOCATION applicable) Franklin & Marshall College B.A.
    [Show full text]
  • Muribaculaceae Genomes Assembled from Metagenomes Suggest Genetic Drivers of Differential Response to Acarbose Treatment in Mice
    bioRxiv preprint doi: https://doi.org/10.1101/2020.07.01.183202; this version posted December 24, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 1 Muribaculaceae genomes assembled from metagenomes suggest genetic drivers of differential response to acarbose treatment in mice Byron J. Smitha,* Richard A. Millerb Thomas M. Schmidta,c,# aDepartment of Ecology & Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA. bDepartment of Pathology and Geriatrics Center, University of Michigan, Ann Arbor, MI, USA. cDepartment of Internal Medicine, University of Michigan, Ann Arbor, MI, USA. *Present address: J. David Gladstone Institutes, San Francisco, CA, USA. #Correspondence: [email protected] Running title: Genomes of acarbose-responsive Muribaculaceae Word Count: Abstract: 249 Importance: 149 Manuscript: 6,369 bioRxiv preprint doi: https://doi.org/10.1101/2020.07.01.183202; this version posted December 24, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 2 Abstract The drug acarbose (ACA) is used to treat diabetes, and, by inhibiting α-amylase in the small intestine, increases the amount of starch entering the lower digestive tract. This results in changes to the composition of the microbiota and their fermentation products. Acarbose also increases longevity in mice, an effect that has been correlated with increased production of the short-chain fatty acids propionate and butyrate.
    [Show full text]
  • Assessment of Microbial and Growth Response of Broilers Fed a Commercial Prebiotic Sang in Lee University of Arkansas, Fayetteville
    University of Arkansas, Fayetteville ScholarWorks@UARK Theses and Dissertations 5-2016 Assessment of Microbial and Growth Response of Broilers Fed a Commercial Prebiotic Sang In Lee University of Arkansas, Fayetteville Follow this and additional works at: http://scholarworks.uark.edu/etd Part of the Food Microbiology Commons Recommended Citation Lee, Sang In, "Assessment of Microbial and Growth Response of Broilers Fed a Commercial Prebiotic" (2016). Theses and Dissertations. 1516. http://scholarworks.uark.edu/etd/1516 This Thesis is brought to you for free and open access by ScholarWorks@UARK. It has been accepted for inclusion in Theses and Dissertations by an authorized administrator of ScholarWorks@UARK. For more information, please contact [email protected]. Assessment of Microbial and Growth Response of Broilers Fed a Commercial Prebiotic A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Cell and Molecular Biology by Sang In Lee University of Arkansas Bachelors of Science in Animal Science, 2013 May 2016 University of Arkansas This thesis is approved for recommendation to the Graduate Council ___________________________________ Dr. Steven C. Ricke Thesis Director ___________________________________ ___________________________________ Dr. Young Min Kwon Dr. Jeffrey A. Lewis Committee Member Committee Member Abstract Prebiotic refers to nondigestible food ingredients that enhance the health of the host by selectively promoting one or more number of beneficial bacteria. Gibson and Roberfroid (1995) initially described prebiotics as compounds that neither hydrolyzed nor absorbed in upper part of the gastrointestinal tract, selectively stimulates growth of beneficial bacteria, enhance health of the host and able to amend microorganism population to healthier states.
    [Show full text]
  • Genomic Characterization of the Uncultured Bacteroidales Family S24-7 Inhabiting the Guts of Homeothermic Animals Kate L
    Ormerod et al. Microbiome (2016) 4:36 DOI 10.1186/s40168-016-0181-2 RESEARCH Open Access Genomic characterization of the uncultured Bacteroidales family S24-7 inhabiting the guts of homeothermic animals Kate L. Ormerod1, David L. A. Wood1, Nancy Lachner1, Shaan L. Gellatly2, Joshua N. Daly1, Jeremy D. Parsons3, Cristiana G. O. Dal’Molin4, Robin W. Palfreyman4, Lars K. Nielsen4, Matthew A. Cooper5, Mark Morrison6, Philip M. Hansbro2 and Philip Hugenholtz1* Abstract Background: Our view of host-associated microbiota remains incomplete due to the presence of as yet uncultured constituents. The Bacteroidales family S24-7 is a prominent example of one of these groups. Marker gene surveys indicate that members of this family are highly localized to the gastrointestinal tracts of homeothermic animals and are increasingly being recognized as a numerically predominant member of the gut microbiota; however, little is known about the nature of their interactions with the host. Results: Here, we provide the first whole genome exploration of this family, for which we propose the name “Candidatus Homeothermaceae,” using 30 population genomes extracted from fecal samples of four different animal hosts: human, mouse, koala, and guinea pig. We infer the core metabolism of “Ca. Homeothermaceae” to be that of fermentative or nanaerobic bacteria, resembling that of related Bacteroidales families. In addition, we describe three trophic guilds within the family, plant glycan (hemicellulose and pectin), host glycan, and α-glucan, each broadly defined by increased abundance of enzymes involved in the degradation of particular carbohydrates. Conclusions: “Ca. Homeothermaceae” representatives constitute a substantial component of the murine gut microbiota, as well as being present within the human gut, and this study provides important first insights into the nature of their residency.
    [Show full text]
  • Appendix 1. Validly Published Names, Conserved and Rejected Names, And
    Appendix 1. Validly published names, conserved and rejected names, and taxonomic opinions cited in the International Journal of Systematic and Evolutionary Microbiology since publication of Volume 2 of the Second Edition of the Systematics* JEAN P. EUZÉBY New phyla Alteromonadales Bowman and McMeekin 2005, 2235VP – Valid publication: Validation List no. 106 – Effective publication: Names above the rank of class are not covered by the Rules of Bowman and McMeekin (2005) the Bacteriological Code (1990 Revision), and the names of phyla are not to be regarded as having been validly published. These Anaerolineales Yamada et al. 2006, 1338VP names are listed for completeness. Bdellovibrionales Garrity et al. 2006, 1VP – Valid publication: Lentisphaerae Cho et al. 2004 – Valid publication: Validation List Validation List no. 107 – Effective publication: Garrity et al. no. 98 – Effective publication: J.C. Cho et al. (2004) (2005xxxvi) Proteobacteria Garrity et al. 2005 – Valid publication: Validation Burkholderiales Garrity et al. 2006, 1VP – Valid publication: Vali- List no. 106 – Effective publication: Garrity et al. (2005i) dation List no. 107 – Effective publication: Garrity et al. (2005xxiii) New classes Caldilineales Yamada et al. 2006, 1339VP VP Alphaproteobacteria Garrity et al. 2006, 1 – Valid publication: Campylobacterales Garrity et al. 2006, 1VP – Valid publication: Validation List no. 107 – Effective publication: Garrity et al. Validation List no. 107 – Effective publication: Garrity et al. (2005xv) (2005xxxixi) VP Anaerolineae Yamada et al. 2006, 1336 Cardiobacteriales Garrity et al. 2005, 2235VP – Valid publica- Betaproteobacteria Garrity et al. 2006, 1VP – Valid publication: tion: Validation List no. 106 – Effective publication: Garrity Validation List no. 107 – Effective publication: Garrity et al.
    [Show full text]
  • Natronoflexus Pectinivorans Gen. Nov. Sp. Nov., an Obligately Anaerobic
    Extremophiles (2011) 15:691–696 DOI 10.1007/s00792-011-0399-7 ORIGINAL PAPER Natronoflexus pectinivorans gen. nov. sp. nov., an obligately anaerobic and alkaliphilic fermentative member of Bacteroidetes from soda lakes D. Y. Sorokin • A. N. Panteleeva • T. P. Tourova • E. N. Kaparullina • G. Muyzer Received: 22 July 2011 / Accepted: 26 August 2011 / Published online: 14 September 2011 Ó The Author(s) 2011. This article is published with open access at Springerlink.com Abstract Anaerobic enrichment with pectin at pH 10 and Phylogenetic analysis based on 16S rRNA sequences placed moderate salinity inoculated with sediments from soda lakes of the isolate into the phylum Bacteroidetes as a separate the Kulunda Steppe (Altai, Russia) resulted in the isolation of a lineage within the family Marinilabilaceae. On the basis of novel member of the Bacteroidetes, strain AP1T. The cells are distinct phenotype and phylogeny, the soda lake isolate long, flexible, Gram-negative rods forming pink carotenoids. AP1T is proposed to be assigned in a new genus and species The isolate is an obligate anaerobe, fermenting various carbo- Natronoflexus pectinivorans (=DSM24179T = UNIQEM hydrates to acetate and succinate. It can hydrolyze and utilize U807T). pectin, xylan, starch, laminarinandpullulanasgrowthsub- strates. Growth is possible in a pH range from 8 to 10.5, with an Keywords Natronoflexus pectinivorans Á Pectin Á optimum at pH 9.5, and at a salinity range from 0.1 to 2 M Na?. Haloalkaliphilic Á Soda lakes Introduction Communicated by A. Oren. Pectin, a natural polymer of partially methylated galact- Nucleotide sequence accession number: the GenBank/EMBL uronic acid, is an important component of plant biomass T accession number of the 16S rRNA gene sequences of strain AP1 is acting as a glue for the cellulose fibrils.
    [Show full text]
  • Complete Thesis
    University of Groningen The interplay between genetics, the microbiome, DNA‐methylation & gene‐expression Bonder, Marc Jan IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document version below. Document Version Publisher's PDF, also known as Version of record Publication date: 2017 Link to publication in University of Groningen/UMCG research database Citation for published version (APA): Bonder, M. J. (2017). The interplay between genetics, the microbiome, DNA‐methylation & gene‐ expression. University of Groningen. Copyright Other than for strictly personal use, it is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license (like Creative Commons). The publication may also be distributed here under the terms of Article 25fa of the Dutch Copyright Act, indicated by the “Taverne” license. More information can be found on the University of Groningen website: https://www.rug.nl/library/open-access/self-archiving-pure/taverne- amendment. Take-down policy If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim. Downloaded from the University of Groningen/UMCG research database (Pure): http://www.rug.nl/research/portal. For technical reasons the number of authors shown on this cover page is limited to 10 maximum. Download date: 10-10-2021 The interplay between genetics, the microbiome, DNA-methylation & gene-expression Marc Jan Bonder Marc Jan Bonder The interplay between genetics, the microbiome, DNA-methylation & gene-expression Second edition.
    [Show full text]
  • Novel Bacterial Taxa in the Human Microbiome
    Novel Bacterial Taxa in the Human Microbiome Kristine M. Wylie1*., Rebecca M. Truty2*., Thomas J. Sharpton2, Kathie A. Mihindukulasuriya1, Yanjiao Zhou1, Hongyu Gao1, Erica Sodergren1, George M. Weinstock1, Katherine S. Pollard2,3 1 The Genome Institute, Washington University School of Medicine, St. Louis, Missouri, United States of America, 2 Gladstone Institutes, University of California San Francisco, San Francisco, California, United States of America, 3 Division of Biostatistics, Institute for Human Genetics, University of California San Francisco, San Francisco, California, United States of America Abstract The human gut harbors thousands of bacterial taxa. A profusion of metagenomic sequence data has been generated from human stool samples in the last few years, raising the question of whether more taxa remain to be identified. We assessed metagenomic data generated by the Human Microbiome Project Consortium to determine if novel taxa remain to be discovered in stool samples from healthy individuals. To do this, we established a rigorous bioinformatics pipeline that uses sequence data from multiple platforms (Illumina GAIIX and Roche 454 FLX Titanium) and approaches (whole-genome shotgun and 16S rDNA amplicons) to validate novel taxa. We applied this approach to stool samples from 11 healthy subjects collected as part of the Human Microbiome Project. We discovered several low-abundance, novel bacterial taxa, which span three major phyla in the bacterial tree of life. We determined that these taxa are present in a larger set of Human Microbiome Project subjects and are found in two sampling sites (Houston and St. Louis). We show that the number of false- positive novel sequences (primarily chimeric sequences) would have been two orders of magnitude higher than the true number of novel taxa without validation using multiple datasets, highlighting the importance of establishing rigorous standards for the identification of novel taxa in metagenomic data.
    [Show full text]
  • Molecular Profiling of Bacterial Species in the Caecum of Geese
    Original Paper Czech J. Anim. Sci., 56, 2011 (4): 192–203 Molecular profiling of bacterial species in the caecum of geese B.Y. Liu, Z.Y. Wang, H.R. Wang, P. Hu, D. Xu, Q. Wang College of Animal Science and Technology, Yangzhou University, Yangzhou, Jiangsu Province, P.R. China ABSTRACT: The purpose of this study was to analyse the microbial diversity in the caecum of geese using a 16S ribosomal RNA gene (rRNA) clone library approach. A total of 160 clones and 124 clones were sequenced and phylogenetically analysed from the contents and mucosa of the caecum of Yang Zhou geese, respectively. The result indicated that there was a rich variety of bacteria in the caecum contents. Forty-six operational taxonomic units (OTUs) based on a 98% similarity criterion were classified in the contents of goose caecum, as compared to 29 OTUs based on a 97% similarity criterion in the mucosa of goose caecum. The sequences were assigned to 7 and 5 groups in the contents and mucosa of goose caecum, respectively. Contents of goose caecum were dominantly occupied by Clostridia-related species (58.7%) with other abundant sequences being related to Bacteroidetes (26.9%) and Erysipelotrichi (11.2%). Gammaproteobacteria (59.6%) and Clostridia (20.1%) were predominant in the mucosa of goose caecum. Keywords: geese; caecum; microbiota; phylogenetic analysis Unlike other avian species, the goose is a kind is considered to be beneficial not only to their nu- of waterfowl with the relatively developed paired trition, but also to the health of animals. Reports caecum, which can take advantage of fibrous on the microbial ecology in the caecum of geese plant materials partly.
    [Show full text]