ESCMID Online Lecture Library © by Author

Total Page:16

File Type:pdf, Size:1020Kb

ESCMID Online Lecture Library © by Author Overview of Software & Databases for Rapid High-throughput Sequence Analysis in Clinical & Public Health Microbiology © by author ESCMID OnlineDag HarmsenLecture Library University of Münster, Germany Short CV – Dag Harmsen • 1983–1991, Medical Schools University Antwerpen, Belgium and Univ. Würzburg, Germany • 1991, MD and doctoral thesis (‘Nucleo-capsid gene of Coronavirus HCV-229E’, V. ter Meulen) • 1992 – 2000, Institute of Hygiene & Microbiology (J. Heesemannn, Candida spp. & Aspergillus spp. & M. Frosch, Mycobacterium spp.) / Internal Medicine (K. Kochsiek) / Institute of Virology (V. ter Meulen), Univ. Würzburg, Germany • 1998, Inclusion in the German Medical Microbiology and Infectious Epidemiology specialist register • 2000-2001, Head of R&D CREATOGEN diagnostics, CREATOGEN AG, Augsburg, Germany • 2002-2004, Institute of Hygiene (H. Karch, MRSA), University of Münster, Germany • 2003, co-founder and shareholder of a bioinformatics company (Ridom GmbH, Münster, Germany) • 2004– , Full Professorship Department of Periodontology, Univ. Münster, Germany • July 2005 – July 2008, Temporary Head of the Department of Periodontology, Univ. Münster • August 2008 – , Head of Research Department of Periodontology • July 2005 – , Member of the Executive© Board by of the author International Committee on Systematics of Prokaryotes (ICSP) of the ‘International Union of Microbiological Societies’ (IUMS) • June 2007 – July 2012, Member of the ASM Professional Development Committee • OctoberESCMID 2012 - , ASM Ambassador Online in Germany Lecture Library • ECDC & EFSA technical advisor for genotyping and NGS/WGS • Scientific interests: molecular diagnostic, epidemiology, and phylogeny of microorganisms; applied bioinformatics in microbiology Commercial Disclosure Dag Harmsen is co-founder and partial owner of a bioinformatics company (Ridom GmbH, Münster, Germany) that develops software for DNA sequence analysis. Recently Ridom and Ion Torrent/Thermo Fisher (Waltham, MA) partnered and released SeqSphere+ software to speed and simplify whole genome ©based by bacterial author typing. ESCMID Online Lecture Library Next Generation Sequencing (NGS) AS = Amplicon Sequencing DNA WGS = Whole Genome Sequencing MPSS = Massive Parallel Signature Sequencing AS WGS Single molecule PCR Cloning 3. Generation Immobilized Arrayed Nanopore in vitro polymerase in vivo fragments readers 2. Generation (PacBio; (Helicos) (Oxford, Genia) VisiGen) Braslavsky et al., Levene et al., Kasianowicz et al., © by author2003 [PubMed] 2003 [PubMed] 1996 [PubMed] Cloning Sequencing Hybridization Sanger method in vivo by synthesis and ligation ESCMIDSanger et al., Online Lecture Library 1977 [PubMed] Polony / 454 Roche / Illumina MPSS Ion Torrent (Solexa) ABI SOLiD Modified from: Hall (2007). J. Marguiles et al., Bennett et al., Shendure et al., Brenner et al., Exp. Biol. 210: 1518 [PubMed] 2005 [PubMed] 2005 [PubMed] 2005 [PubMed] 2000 [PubMed] It’s the Consensus Genome-wide Gene by Gene de novo Consensus Accuracy Venn diagram of de novo consensus Details accuracy for PGM, MiSeq and GSJ . Consensus errors were analyzed for 4,632 coding NCBI Sakai reference genes retrieved from MIRA de novo assemblies using SeqSphere+ for all 3 platforms . Number of variants confirmed by bidirectional Sanger sequencing indicated in parentheses . Validation of the 8 substitution and 15 indel variants identified using all 3 NGS platforms, suggested that either the Sakai © by authorstrain experienced micro-evolutionary changes or the genome sequence deposited in 2001 contains sequencing PGM, Ion Torrent Personal Genome Machine 300bp; MiSeq, Illumina MiSeq 2x 250bp PE; errors ESCMIDGSJ, 454 GS Junior with GSJ Titanium Online chemistry; Lecture Library bp, base pairs Jünemann et al. (2013). Nature Biotechnology 31: 294 [PubMed]. Real Cost of Sequencing Cost per Mb of DNA Sequence Moore's law $10,000.00 $1,000.00 $100.00 $10.00 © by author $1.00 ESCMID Online Lecture Library $0.10 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 Sboner et al. (2011). Genome Biol. 12: 125 [PubMed]. Real Cost of Sequencing Sample collection and Data reduction Sequencing Downstream experimental design Data management analyses 100% 80 60 40 Data management Data © by author 20 ESCMID Online Lecture Library 0% Pre-NGS Now Future (Approximately 2000) (Approximately 2010) (Approximately 2020) Sboner et al. (2011). Genome Biol. 12: 125 [PubMed]. Microbial Genomics Data, Workflow & Bioinformatics Tasks 16S rDNA community profiling Other ‚omics‘ applications ... pathogenicity, pathogenicity amplicon resistance profiling … … community profiling & meta-genome functional analysis microbial NGS reads pathogen diseased discovery human tissue Proteomics Shendure et al. (2012). Nature Biotechnol. 30: 1084 [PubMed]. strain attribution shot-gun © by authorsurveillance, plain language phylogeny & report resistom pure culture / de novo automated assembled annotated ESCMID Onlinesingle cell WGS Lecture Library reference assisted semi-automated assembly annotation PCR signatures synteny WGS; whole genome sequencing. General Hardware Requirements for NGS Software © by author RAM ≥ 32GB ESCMID Online Lecture Library HD ≥ 1TB (Raid) Metagenomic Analysis / Strain Attribution Composition or pattern matching approach • uses predetermined patterns in the data (e.g., taxonomic clade markers [MetaPhlAn], k- mer frequency) often together with sophisticated classification algorithms • require intensive preprocessing of the genomic database before application and classification results can change drastically depending on size and composition of genomic database Taxonomic mapping approach • typically rely on a ‘lowest common ancestor’ • usually only highly accurate for higher taxonomic levels • e.g., MEGAN , MG-RAST , CARMA3 Whole-genome assembly approach • often lead to most accurate strain identification Meyer et al. (2008). BMC Bioinformatics 19: 386 [PubMed]. • computational very intensive (e.g., PathSeq) Pathoscope © by author • capitalizes on a Bayesian statistical framework • takes NGS reads as input; considers sequencing errors and sequence & mappingESCMID quality; provides Online posterior matches Lecture to a known Library database • accurately discriminates for presence of multiple species or closely related strains of the same species • considers cases when the sample species/strain is not in reference database • fast because no database preprocessing or sequence assembly FrancisFrancis et al. et (2013).al. (2013). Genome Genome Res. Res. 23: 23: Epub 1721 ahead [PubMed of print]. [PubMed]. De novo vs. Reference Assisted Assembly De novo assembly slower (produces e.g. ACE/AMOS files); ‘hypothesis free’ assembly (resistance) frequently used for panmictic/non-clonal bacteria (e.g., N. meningitidis) e.g., MIRA, Newbler, or Velvet © by author Reference-assistedESCMID assembly Online Lecture Library fast (produces SAM/BAM files); detects only what is present in reference frequently used for monomorphic/ clonal bacteria (e.g., M. tuberculosis) or lineage-only for non- clonal bacteria e.g., bwa GABenchToB: A Genome Assembly Benchmark Tuned on Bacteria and Benchtop Sequencers Objecves • Focus on practical questions • benchtop platforms (MiSeq, PGM, GS Junior) • single library (single or paired end reads) • real data (no synthetic data) • assembler specific default procedures and parameters • parallelization on single dedicated host • including commercial assemblers • benchmark run time and memory usage • Bacterial genomes only Supermicro A+ Server • Escherichia coli O157:H7 SAKAI • Staphylococcus aureus© COL by author 4 CPUs / 48 Cores • Mycobacterium tuberculosis H37Rv 128GB RAM • all with complete reference 500GB local storage • whole GC-range: 66% (H37), 51% (SAKAI), 33% (COL) ~6.000 Euro (mid/ 2011) ESCMID• coverage down-sampled Online Lecture Library • 75-fold: MiSeq 2x150bp, MiSeq 2x250bp, PGM400bp • 40-fold: PGM 200bp, GSJ Jünemann et al. (2014). unpublished. Effect of Coverage on N50 Contig Size for Benchtop NGS Platforms Less coverage means higher through-put (cheaper NGS) and faster de novo assembly (OLC). For PGM © by authorEffect of coverage400bp optimalon N50 contig size coverageand memory lies requirements in aroundan E. coli ESCMID Online Lecturede Librarynovo assembly50- 60x(simulated! MiSeq 2x 250bp PE (blue), PGM 300bp (red) and GSJ (green) MIRAIllumina de 75 bp paired-end data sets novo assemblies at different average coverage obtained by randomwith sub- a 200 bp insert size assembled sampling of original E. coli datasets. N50, a statistic for describingusing the Velvet 0.7.31). distribution of contig lengths in an assembly; kb, kilo bases. Illumina Technical Note (2010). De Novo Jünemann et al. (2013).Assembly Nature Using Biotechnology Illumina Reads. 31: 294 [PubMed]. De novo Assembler Assembler selecon criteria • processing single and paired-end reads • not relying on mate-pair libraries • no requirement on multiple libraries • only de Bruijn and OLC approaches Assemblers included in other studies but not considered here • ARACHNE: long Sanger reads and mate-pair libraries • MaSuRCA: relies on paired-end data • SGA: relies on paired-end© bydata author • ALLPATH-LG: needs at least one paired-end and one mate-pair library • Phusion: needs mate-pair data ESCMID Online Lecture Library 14 De novo Assembler Benchmarks Main metrics • Quast1 used to assess contiguity and accuracy (also used by GAGE-B) • NGA50 • # misassemblies: misassemblies (inversion, relocation, translocation) & local misassemblies (misassembly
Recommended publications
  • Proquest Dissertations
    Automated learning of protein involvement in pathogenesis using integrated queries Eithon Cadag A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Washington 2009 Program Authorized to Offer Degree: Department of Medical Education and Biomedical Informatics UMI Number: 3394276 All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent upon the quality of the copy submitted. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. UMI Dissertation Publishing UMI 3394276 Copyright 2010 by ProQuest LLC. All rights reserved. This edition of the work is protected against unauthorized copying under Title 17, United States Code. uest ProQuest LLC 789 East Eisenhower Parkway P.O. Box 1346 Ann Arbor, Ml 48106-1346 University of Washington Graduate School This is to certify that I have examined this copy of a doctoral dissertation by Eithon Cadag and have found that it is complete and satisfactory in all respects, and that any and all revisions required by the final examining committee have been made. Chair of the Supervisory Committee: Reading Committee: (SjLt KJ. £U*t~ Peter Tgffczy-Hornoch In presenting this dissertation in partial fulfillment of the requirements for the doctoral degree at the University of Washington, I agree that the Library shall make its copies freely available for inspection. I further agree that extensive copying of this dissertation is allowable only for scholarly purposes, consistent with "fair use" as prescribed in the U.S.
    [Show full text]
  • QUANTIFYING TRENDS in BACTERIAL VIRULENCE and PATHOGEN-ASSOCIATED GENES THROUGH LARGE SCALE BIOINFORMATICS ANALYSIS By
    QUANTIFYING TRENDS IN BACTERIAL VIRULENCE AND PATHOGEN-ASSOCIATED GENES THROUGH LARGE SCALE BIOINFORMATICS ANALYSIS by Anastasia Amber Fedynak B.Comp., University of Guelph, 2003 B.Sc., University of Alberta, 2002 THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY In the Department of Molecular Biology and Biochemistry © Anastasia Fedynak 2007 SIMON FRASER UNIVERSITY 2007 All rights reserved. This work may not be reproduced in whole or in part, by photocopy or other means, without permission of the author. APPROVAL Name: Anastasia Amber Fedynak Degree: Doctor of Philosophy Title of Thesis: Quantifying trends in bacterial virulence and pathogen-associated genes through large scale bioinformatics analysis Examining Committee: Chair: Dr. Bruce P. Brandhorst Professor, Department of Molecular Biology and Biochemistry Dr. Fiona S.L. Brinkman Senior Supervisor Associate Professor, Department of Molecular Biology and Biochemistry Dr. Lisa Craig Supervisor Assistant Professor, Department of Molecular Biology and Biochemistry Dr. Kay C. Wiese Supervisor Associate Professor, School of Computing Science Dr. Jack Chen Internal Examiner Associate Professor, Department of Molecular Biology and Biochemistry Dr. Gee W. Lau External Examiner Assistant Professor, College of Veterinary Medicine, University of Illinois at Urbana-Champaign Date Defended: Monday December 10, 2007 ii SIMON FRASER UNIVERSITY LIBRARY Declaration of Partial Copyright Licence The author, whose copyright is declared on the title page of this work, has granted to Simon Fraser University the right to lend this thesis, project or extended essay to users of the Simon Fraser University Library, and to make partial or single copies only for such users or in response to a request from the library of any other university, or other educational institution, on its own behalf or for one of its users.
    [Show full text]
  • Comparative Genome Analyses of Vibrio Anguillarum Strains Reveal a Link with Pathogenicity Traits
    Comparative genome analyses of Vibrio anguillarum strains reveal a link with pathogenicity traits Castillo Bermúdez, Daniel Elías; Alvise, Paul D.; Xu, Ruiqi; Zhang, Faxing; Middelboe, Mathias; Gram, Lone Published in: mSystems DOI: 10.1128/mSystems.00001-17 Publication date: 2017 Document version Publisher's PDF, also known as Version of record Document license: CC BY Citation for published version (APA): Castillo Bermúdez, D. E., Alvise, P. D., Xu, R., Zhang, F., Middelboe, M., & Gram, L. (2017). Comparative genome analyses of Vibrio anguillarum strains reveal a link with pathogenicity traits. mSystems, 2(1), [e00001- 17]. https://doi.org/10.1128/mSystems.00001-17 Download date: 01. Oct. 2021 RESEARCH ARTICLE Ecological and Evolutionary Science crossm Comparative Genome Analyses of Vibrio anguillarum Strains Reveal a Link with Pathogenicity Traits Daniel Castillo,a Paul D. Alvise,b Ruiqi Xu,c Faxing Zhang,d Mathias Middelboe,a Lone Gramb Downloaded from Marine Biological Section, University of Copenhagen, Helsingør, Denmarka; Department of Biotechnology and Biomedicine, Technical University of Denmark, Kgs Lyngby, Denmarkb; Copenhagen Bio Science Park, Copenhagen, Denmarkc; BGI Park, Yantian District, Shenzhen, Chinad ABSTRACT Vibrio anguillarum is a marine bacterium that can cause vibriosis in many fish and shellfish species, leading to high mortalities and economic losses in aqua- Received 5 January 2017 Accepted 30 culture. Although putative virulence factors have been identified, the mechanism of January 2017 Published 28 February 2017 Citation Castillo D, Alvise PD, Xu R, Zhang F, http://msystems.asm.org/ pathogenesis of V. anguillarum is not fully understood. Here, we analyzed whole- Middelboe M, Gram L. 2017. Comparative genome sequences of a collection of V.
    [Show full text]
  • Whole Genome Sequencing Revealed Host Adaptation-Focused Genomic
    www.nature.com/scientificreports OPEN Whole genome sequencing revealed host adaptation-focused genomic plasticity of pathogenic Received: 08 October 2015 Accepted: 09 December 2015 Leptospira Published: 02 February 2016 Yinghua Xu1,*, Yongzhang Zhu2,*, Yuezhu Wang4,*, Yung-Fu Chang5, Ying Zhang1, Xiugao Jiang7, Xuran Zhuang2, Yongqiang Zhu4, Jinlong Zhang1, Lingbing Zeng2, Minjun Yang4, Shijun Li8, Shengyue Wang4, Qiang Ye1, Xiaofang Xin1, Guoping Zhao4,6, Huajun Zheng3,4, Xiaokui Guo2 & Junzhi Wang1 Leptospirosis, caused by pathogenic Leptospira spp., has recently been recognized as an emerging infectious disease worldwide. Despite its severity and global importance, knowledge about the molecular pathogenesis and virulence evolution of Leptospira spp. remains limited. Here we sequenced and analyzed 102 isolates representing global sources. A high genomic variability were observed among differentLeptospira species, which was attributed to massive gene gain and loss events allowing for adaptation to specific niche conditions and changing host environments. Horizontal gene transfer and gene duplication allowed the stepwise acquisition of virulence factors in pathogenic Leptospira evolved from a recent common ancestor. More importantly, the abundant expansion of specific virulence-related protein families, such as metalloproteases-associated paralogs, were exclusively identified in pathogenic species, reflecting the importance of these protein families in the pathogenesis of leptospirosis. Our observations also indicated that positive selection played a crucial role on this bacteria adaptation to hosts. These novel findings may lead to greater understanding of the global diversity and virulence evolution of Leptospira spp. Leptospirosis, caused by pathogenic spirochetes of the genus Leptospira, is one of the most widespread and signifi- cant zoonotic diseases in the world1. A wide variety of mammalian hosts can serve as infection reservoirs.
    [Show full text]
  • MOCAT2: a Metagenomic Assembly, Annotation and Profiling Framework Jens Roat Kultima1, Luis Pedro Coelho1, Kristoffer Forslund1, Jaime Huerta-Cepas1, Simone S
    Bioinformatics, 32(16), 2016, 2520–2523 doi: 10.1093/bioinformatics/btw183 Advance Access Publication Date: 8 April 2016 Applications Note Genome analysis MOCAT2: a metagenomic assembly, annotation and profiling framework Jens Roat Kultima1, Luis Pedro Coelho1, Kristoffer Forslund1, Jaime Huerta-Cepas1, Simone S. Li1,2, Marja Driessen1, Anita Yvonne Voigt1,3, Georg Zeller1, Shinichi Sunagawa1 and Peer Bork1,3,4,5,* 1Structural and Computational Biology Unit, European Molecular Biology Laboratory, 69117 Heidelberg, Germany, 2School of Biotechnology and Biomolecular Sciences, University of New South Wales, 2052 Sydney, Australia, 3Molecular Medicine Partnership Unit, University of Heidelberg and European Molecular Biology Laboratory, 69120 Heidelberg, Germany, 4Department of Bioinformatics, Biocenter, University of Wu¨rzburg, 97074 Wu¨rzburg, Germany and 5Max Delbru¨ck Centre for Molecular Medicine, 13125 Berlin, Germany *To whom correspondence should be addressed. Associate Editor: John Hancock Received on January 8, 2016; revised on March 15, 2016; accepted on April 1, 2016 Abstract Summary: MOCAT2 is a software pipeline for metagenomic sequence assembly and gene prediction with novel features for taxonomic and functional abundance profiling. The automated generation and efficient annotation of non-redundant reference catalogs by propagating pre-computed assign- ments from 18 databases covering various functional categories allows for fast and comprehensive functional characterization of metagenomes. Availability and Implementation: MOCAT2 is implemented in Perl 5 and Python 2.7, designed for 64-bit UNIX systems and offers support for high-performance computer usage via LSF, PBS or SGE queuing systems; source code is freely available under the GPL3 license at http://mocat.embl.de. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.
    [Show full text]
  • Detailed Analysis of Metagenome Datasets Obtained from Biogas
    Eikmeyer et al. Biotechnology for Biofuels 2013, 6:49 http://www.biotechnologyforbiofuels.com/content/6/1/49 RESEARCH Open Access Detailed analysis of metagenome datasets obtained from biogas-producing microbial communities residing in biogas reactors does not indicate the presence of putative pathogenic microorganisms Felix G Eikmeyer1, Antje Rademacher2, Angelika Hanreich2, Magdalena Hennig1, Sebastian Jaenicke3, Irena Maus1, Daniel Wibberg1, Martha Zakrzewski3, Alfred Pühler1, Michael Klocke2 and Andreas Schlüter1* Abstract Background: In recent years biogas plants in Germany have been supposed to be involved in amplification and dissemination of pathogenic bacteria causing severe infections in humans and animals. In particular, biogas plants are discussed to contribute to the spreading of Escherichia coli infections in humans or chronic botulism in cattle caused by Clostridium botulinum. Metagenome datasets of microbial communities from an agricultural biogas plant as well as from anaerobic lab-scale digesters operating at different temperatures and conditions were analyzed for the presence of putative pathogenic bacteria and virulence determinants by various bioinformatic approaches. Results: All datasets featured a low abundance of reads that were taxonomically assigned to the genus Escherichia or further selected genera comprising pathogenic species. Higher numbers of reads were taxonomically assigned to the genus Clostridium. However, only very few sequences were predicted to originate from pathogenic clostridial species. Moreover, mapping of metagenome reads to complete genome sequences of selected pathogenic bacteria revealed that not the pathogenic species itself, but only species that are more or less related to pathogenic ones are present in the fermentation samples analyzed. Likewise, known virulence determinants could hardly be detected. Only a marginal number of reads showed similarity to sequences described in the Microbial Virulence Database MvirDB such as those encoding protein toxins, virulence proteins or antibiotic resistance determinants.
    [Show full text]
  • Whole Genome Sequencing for Foodborne Disease Surveillance Landscape Paper
    Whole genome sequencing for foodborne disease surveillance Landscape paper Whole genome sequencing for foodborne disease surveillance Landscape paper Whole genome sequencing for foodborne disease surveillance: landscape paper ISBN 978-92-4-151386-9 © World Health Organization 2018 Some rights reserved. This work is available under the Creative Commons Attribution-NonCommercial-ShareAlike3.0 IGO licence (CC BY-NC-SA 3.0 IGO; https://creativecommons.org/licenses/by-nc-sa/3.0/igo). Under the terms of this licence, you may copy, redistribute and adapt the work for non-commercial purposes, provided the work is appropriately cited, as indicated below. In any use of this work, there should be no suggestion that WHO endorses any specific organization, products or services. The use of the WHO logo is not permitted. If you adapt the work, then you must license your work under the same or equivalent Creative Commons licence. If you create a translation of this work, you should add the following disclaimer along with the suggested citation: “This translation was not created by the World Health Organization (WHO). WHO is not responsible for the content or accuracy of this translation. The original English edition shall be the binding and authentic edition”. Any mediation relating to disputes arising under the licence shall be conducted in accordance with the mediation rules of the World Intellectual Property Organization. Suggested citation. Whole genome sequencing for foodborne disease surveillance: landscape paper. Geneva: World Health Organization; 2018. Licence: CC BY-NC-SA 3.0 IGO. Cataloguing-in-Publication (CIP) data. CIP data are available at http://apps.who.int/iris.
    [Show full text]
  • Leveraging Public Information About Pathogens for Disease Outbreak
    Leveraging Public Tonia Korves Information about Matthew Peterson Pathogens for Disease Wenling Chang Outbreak Investigations The views, opinions and/or findings contained in this report are those of The MITRE Corporation and should March 2013 not be construed as an official government position, policy, or decision, unless designated by other documentation. Approved for Public Release; Distribution Unlimited. 13-1292 Department No.: G62, E52X Project No.: 51MSR601-BA ©2013 The MITRE Corporation. All rights reserved. Bedford, MA Abstract With recent technological advances in DNA sequencing and access to information via internet databases, the amount of information about pathogen strains in the public domain is growing rapidly. This information can be leveraged to help identify the origins of pathogens that cause disease outbreaks. This report describes potential uses for public data in outbreak investigations, key data types, the formats and locations of pathogen data in public sources, and tools MITRE is designing for assembling and integrating information during disease outbreak investigations. iii Acknowledgements The authors thank Lynette Hirschman for advice and helpful suggestions on this work and document. iv Introduction Pathogens are capable of causing disease outbreaks with devastating consequences, including loss of life, societal disruption, and large economic costs. High consequence disease outbreaks can be naturally-occurring such as the devastating 1918 global influenza epidemic, accidentally-triggered such as the 2010-2012 Haitian cholera epidemic, and deliberately-caused such as the anthrax attacks in the fall of 2001. From national security, law enforcement, and public health perspectives, it is important to determine where an outbreak pathogen came from, and whether the outbreak was natural, accidental, or intentional.
    [Show full text]
  • Using Genomics to Track Global Antimicrobial Resistance
    Downloaded from orbit.dtu.dk on: Oct 02, 2021 Using Genomics to Track Global Antimicrobial Resistance Hendriksen, Rene S.; Bortolaia, Valeria; Tate, Heather; Tyson, Gregory H; Aarestrup, Frank Møller; McDermott, Patrick F. Published in: Frontiers in Public Health Link to article, DOI: 10.3389/fpubh.2019.00242 Publication date: 2019 Document Version Publisher's PDF, also known as Version of record Link back to DTU Orbit Citation (APA): Hendriksen, R. S., Bortolaia, V., Tate, H., Tyson, G. H., Aarestrup, F. M., & McDermott, P. F. (2019). Using Genomics to Track Global Antimicrobial Resistance. Frontiers in Public Health, 7, [242]. https://doi.org/10.3389/fpubh.2019.00242 General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. Users may download and print one copy of any publication from the public portal for the purpose of private study or research. You may not further distribute the material or use it for any profit-making activity or commercial gain You may freely distribute the URL identifying the publication in the public portal If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim. REVIEW published: 04 September 2019 doi: 10.3389/fpubh.2019.00242 Using Genomics to Track Global Antimicrobial Resistance Rene S. Hendriksen 1*, Valeria Bortolaia 1, Heather Tate 2, Gregory H.
    [Show full text]
  • Metagenomic Insights Into Chlorination Effects on Microbial Antibiotic Resistance in Drinking Water
    water research 47 (2013) 111e120 Available online at www.sciencedirect.com journal homepage: www.elsevier.com/locate/watres Metagenomic insights into chlorination effects on microbial antibiotic resistance in drinking water Peng Shi a, Shuyu Jia a, Xu-Xiang Zhang a,b,*, Tong Zhang b, Shupei Cheng a, Aimin Li a,* a State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210046, China b Environmental Biotechnology Laboratory, Department of Civil Engineering, The University of Hong Kong, Hong Kong SAR, China article info abstract Article history: This study aimed to investigate the chlorination effects on microbial antibiotic resistance Received 26 June 2012 in a drinking water treatment plant. Biochemical identification, 16S rRNA gene cloning and Received in revised form metagenomic analysis consistently indicated that Proteobacteria were the main antibiotic 3 September 2012 resistant bacteria (ARB) dominating in the drinking water and chlorine disinfection greatly Accepted 22 September 2012 affected microbial community structure. After chlorination, higher proportion of the Available online 4 October 2012 surviving bacteria was resistant to chloramphenicol, trimethoprim and cephalothin. Quantitative real-time PCRs revealed that sulI had the highest abundance among the Keywords: antibiotic resistance genes (ARGs) detected in the drinking water, followed by tetA and tetG. Antibiotic resistance genes Chlorination caused enrichment of ampC, aphA2, blaTEM-1, tetA, tetG, ermA and ermB, but sulI Chlorine disinfection was considerably removed ( p < 0.05). Metagenomic analysis confirmed that drinking water Mobile genetic elements chlorination could concentrate various ARGs, as well as of plasmids, insertion sequences Drinking water and integrons involved in horizontal transfer of the ARGs.
    [Show full text]
  • Learning Virulent Proteins from Integrated Query Networks Eithon Cadag1*, Peter Tarczy-Hornoch2 and Peter J Myler2,3
    Cadag et al. BMC Bioinformatics 2012, 13:321 http://www.biomedcentral.com/1471-2105/13/321 METHODOLOGY ARTICLE Open Access Learning virulent proteins from integrated query networks Eithon Cadag1*, Peter Tarczy-Hornoch2 and Peter J Myler2,3 Abstract Background: Methods of weakening and attenuating pathogens’ abilities to infect and propagate in a host, thus allowing the natural immune system to more easily decimate invaders, have gained attention as alternatives to broad-spectrum targeting approaches. The following work describes a technique to identifying proteins involved in virulence by relying on latent information computationally gathered across biological repositories, applicable to both generic and specific virulence categories. Results: A lightweight method for data integration is used, which links information regarding a protein via a path-based query graph. A method of weighting is then applied to query graphs that can serve as input to various statistical classification methods for discrimination, and the combined usage of both data integration and learning methods are tested against the problem of both generalized and specific virulence function prediction. Conclusions: This approach improves coverage of functional data over a protein. Moreover, while depending largely on noisy and potentially non-curated data from public sources, we find it outperforms other techniques to identification of general virulence factors and baseline remote homology detection methods for specific virulence categories. Background database of pathogen genomes, lists 801 different species Though recent decades have seen a decrease in mortality and strains bacteria and eukarya infectious to mankind related to infectious disease, new dangers have appeared [4]. The availability and dissemination of this data has in the form of emerging and re-emerging pathogens as allowed many new discoveries in virulence research to well as the continuing threat of weaponized infectious stem at least partly from computational methods.
    [Show full text]
  • Excess Labile Carbon Promotes the Expression of Virulence Factors in Coral Reef Bacterioplankton
    OPEN The ISME Journal (2018) 12, 59–76 www.nature.com/ismej ORIGINAL ARTICLE Excess labile carbon promotes the expression of virulence factors in coral reef bacterioplankton Anny Cárdenas1,2,3, Matthew J Neave3, Mohamed Fauzi Haroon3,4, Claudia Pogoreutz1,3,5, Nils Rädecker3,5, Christian Wild5, Astrid Gärdes1 and Christian R Voolstra3 1Leibniz Center for Tropical Marine Ecology (ZMT), Bremen, Germany; 2Max Plank Institute for Marine Microbiology, Bremen, Germany; 3Red Sea Research Center, Biological and Environmental Sciences and Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia; 4Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA and 5Marine Ecology Group, Faculty of Biology and Chemistry, University of Bremen, Germany Coastal pollution and algal cover are increasing on many coral reefs, resulting in higher dissolved organic carbon (DOC) concentrations. High DOC concentrations strongly affect microbial activity in reef waters and select for copiotrophic, often potentially virulent microbial populations. High DOC concentrations on coral reefs are also hypothesized to be a determinant for switching microbial lifestyles from commensal to pathogenic, thereby contributing to coral reef degradation, but evidence is missing. In this study, we conducted ex situ incubations to assess gene expression of planktonic microbial populations under elevated concentrations of naturally abundant monosaccharides (glucose, galactose, mannose, and xylose) in algal exudates and sewage inflows. We assembled 27 near-complete (470%) microbial genomes through metagenomic sequencing and determined associated expression patterns through metatranscriptomic sequencing. Differential gene expres- sion analysis revealed a shift in the central carbohydrate metabolism and the induction of metalloproteases, siderophores, and toxins in Alteromonas, Erythrobacter, Oceanicola, and Alcanivorax populations.
    [Show full text]