Pathogenomics

Total Page:16

File Type:pdf, Size:1020Kb

Load more

Pathogenomics From patient to bioinformatician Dieter Bulach Torsten Seemann M.Sc(Bioinf) - University of Melbourne - Tue 1 May 2012 The "rules" ● Conversation, not lecture ● Ask questions at any time ● Activities and quizzes are interspersed. These have yellow background like this slide. ● Please turn your phones to silent. ● Let's start! Overview ● Medical issue ○ sample collection from patient ○ sample preparation ● Genome sequencing ○ experimental design ● Bioinformatics ○ identify SNPs - read mapping, to reference ■ phylogenomic tree ○ identify novel DNA - de novo assembly, no reference ■ annotation ● Biological interpretation What is a pathogen? ● infectious agent or "germ" ● microbe that causes disease in its host ○ organism ■ virus, bacterium, fungus, protozoa, parasite ■ most are harmless or even beneficial ○ host ■ human, animal, plant ○ transmission ■ any "hole" in you - inhalation, ingestion, wound ○ virulence ■ how bad, how quick, mortality What type of pathogen are these? HIV Malaria Golden Staph Powdery mildew Black death (Plague) Glandular Fever How do they work? ● Adhesion ○ bind to host cell surface - interferes normal process ● Colonization ○ take over parts of the body - upsets processes ● Invasion ○ produce proteins to disrupt host cells, allow entry ● Immunosuppression ○ for example, produce proteins to bind to antibodies ● Toxins ○ proteins/metabolites that are poison to the host Patient scenario ● Hospital patient with indwelling catheter ○ risk of pathogens entering the bloodstream ○ this is not normal, and is called septicemia ○ sepsis is the whole body inflammatory response to it ● Need to defeat the pathogen ○ most likely bacterial in this case ○ need to identify the bacterium and characteristics of the bacterium ■ antibiotic resistance profile eg. MRSA, VRE ■ might even want to know where it came from Sample collection ● Take patient blood ○ send to pathology ● Centrifuge ○ slow spin to remove human cells ○ fast spin to pellet bacterial cells ● Streak onto agar media ○ first emulsify the pellet to make it spreadable ○ grow for 24 hours, likely to be monoculture Traditional Microbiology ● Phenotype based: ○ look at cells under microscope ○ Gram staining - cell walls ○ biochemical tests ==> identification of the bacterium ■ genus and species ● PCR based testing: ○ 16s ribosomal RNA ■ common to all bacteria, differs slightly per strain ■ identify genus >90%, species >65%, unknown 10% ○ Multi-locus sequence typing (MLST) ■ sequence ~8 conserved genes ■ each strain/genus has its own MLST pattern ==> faster but limited - need prior knowledge WGS for diagnostics ● Whole Genome Sequencing ○ fast and no prerequisite knowledge about the pathogen ○ Microbiologist won't be superseded!! ■ just different tools ■ sequence data set: will still do all the 'tests' to identify and profile Purify DNA ● DNA extraction kit ○ lyse cells and digest (proteinaseK) ○ centrifuge to remove cell debris ○ pass lysate through column ■ DNA sticks to a DNA binding matrix ○ wash bound DNA ○ lower salt concentration - release bound DNA ○ precipitate: dubiously familiar stringy white pellet ■ salt and ethanol ● Extract DNA from strawberries at home! ○ detergent - breaks cells (octoploid genome) ○ strainer/pantyhose - remove particulate matter ○ salt - aids DNA precipitation ○ alcohol - precipitates DNA, keeps rest in solution Library preparation ● Enough DNA? ○ each technology requires different amounts ● Library type ○ shotgun, short paired, or long paired reads? ○ different construction methods eg. circularization ● Size selection ○ nebulize, sonication, enzymatic methods ○ run on gel + scalpel, or fancier methods ● Amplification ○ lots DNA : use PCR methods eg. emulsion PCR ○ little DNA : multiple displacement amplification ■ random hexamers, high fidelity polymerase ■ whole genome amplification for single-cell seq High throughput DNA-Seq ● Lots of technologies at market ○ 454, Illumina, SOLiD, Ion Torrent, PacBio ● Each has its ups and downs ○ speed, yield, length, price, quality, labour, reliability ● Technology trend ○ Illumina is currently the best choice ○ Most mature technology ○ Produces direct "base space" ie. A,G,T,C ○ Easiest data to work with Current technology Length Length Paired Mate Method Quality Yield "Space" (now) (soon) end? pairs? Yes Yes Illumina 150 250 base (→800bp) (→3kb) +++++ ++++ Yes 454 500 900 No flow (→8kb) +++ ++ Yes SOLiD 75 75 No colour (~4kb) +++ +++++ Ion Testing 100 200 Testing flow Torrent (~4kb) ++ +++ PacBio 2000 6000+ No No + + base? Read types ● Single end, "shotgun" ○ ===>--------- ○ sequence from one end of a fragment ● Paired end ○ ==>--------<== ○ sequence from both ends of the same fragment ○ space between mates is the "insert size" (< 800bp) ○ ● Mate pair ○ ==>--~~~~--<== ○ sequence both ends of a pseudo-fragment ○ this allows us to use longer insert sizes (> 800bp) Read "spaces" ● Example read ○ ACTGGGTCC ● Base space ○ get native bases: A,C,T,G,G,G,T,C,C ● Flow space ○ get base flows: A*1, C*1, T*1, G*3, T*1, C*2 ○ mis-count when n > 3 (homopolymers) ● Colour space ○ get di-base encoding: T:X,X,X,X,X,X,X ○ theoretically useful, but messy overall Read filtering ● Sequencing is a multi-step process ○ ost steps are biological - so there will be errors! ● Bacterial genome sequencing ○ usually excess sequence, can afford to discard ● Why filter? ○ reduce size of data set to deal with ■ need less RAM and CPU ○ improve average reads quality ■ better results What to filter on ● Phred base qualities ○ Q<20 still means >1% error! ● ambiguous bases ie. "N" ○ these should have low Q scores anyway ● reads that are too short ○ too ambiguous to map, too short to assemble ● widowed reads ○ reads, that after filtering, no longer have a mate Sequenced it, now what? ● How is it different? ○ compare to known closely related "reference" strain ● Types of differences ○ deleted DNA - in reference, not in ours ○ duplicated DNA - extra copies in ours ○ novel DNA - in ours, not in reference ○ SNPs - single nucleotide polymorphisms (1bp subst) ○ indels - short insertions or deletions (usually 1bp) ■ sometimes indels fall under "SNPs" banner ○ structural variation - rearrangements, inversions ■ small scale, large scale Read mapping - large scale x1 coverage Conserved Deleted x3 x2 Conserved Read mapping - medium scale Read mapping - small scale Reference sequence Depth Errors Are we seeing everything? ● Hmm, some of our reads didn't map ○ sequencing artifacts (some) ○ contamination (maybe - RA sneezed into sample?) ○ DNA in our sample but not in reference (yes) ■ need to de novo assemble ● Other comparisons more difficult ○ structural change, rearrangements ■ read length & insert size are limiting factors ○ read mapping is not the answer to every question ■ particularly with non-model organisms De novo genome assembly ● De novo ○ Latin - "from the beginning", "afresh", "anew" ○ Without reference to any other genomes ● "Genome assembly is impossible." ○ A/Prof. Mihai Pop - leading assembly researcher! Assembling bacteria ● Genomes ○ DNA, single organism, ~1 sequence ● Transcriptomes ○ RNA (cDNA), single organism, ~4000 sequences ● Meta-genomes ○ DNA, mix of organisms, >10 sequences ○ eg. human gut microbiome, oral cavity ● Meta-transcriptomes ○ RNA (cDNA), mix of organisms, >40000 sequences! Types of assemblers ● Greedy ○ find two best matching reads, join them, iterate ● Overlap-Layout-Consensus ○ collate all overlaps into a graph and finds a path ● de Bruijn graph (pronounced "brown") ○ break reads into k-mers, overlap is 100%id k-1 ● String graph ○ represents all that is inferable from the reads ○ encompasses OLC and DBGs Assembly algorithm ● Find all overlaps between all reads ○ naively this is O(N2) for N reads, but good heuristics ○ parameters are: min. overlap, min %identity ● Build a graph from these overlaps ○ nodes/arcs <=> reads/overlaps <=> vertices/edges ● Simplify the graph ○ because real-world reads have errors ● Trace a single path through the graph ○ Read off the consensus of bases as you go The tyranny of repeats ● Assembler would output 7 contig sequences ○ path is broken at ambiguous decision points ○ read/pair length limits ability to resolve repeats How many contigs will be produced? More complex graph Contigs Connections Reality bites Shared vertices are repeats. Scaffolding ● Use paired reads to join contigs ○ reads with their mates in different contigs in a consistent manner suggests adjacency ● A difficult constraint problem ○ distance between mates ("insert size") variable ○ repeats cause ambiguous mate placement ○ some assemblers do it, separate scaffolders exist Contig ordering ● Optical maps ○ wet lab method, real experimental evidence ○ chromosome sized restriction site map ● Align to reference genome ○ fit contigs best as possible against known reference ○ some contigs will fit if split (DNA rearrangement) ○ expect orphan contigs (novel DNA) Genome closure ● Finished genome ○ one contig per replicon in original sample ○ bacterial chromosomes/plasmids usually circular ● Labour intensive ○ design primers around gaps, PCR, Sanger ○ Fosmid/BAC libraries for larger inconsistencies ● Why bother? ○ no close reference exists ○ ensures you didn't miss anything ○ understand whole genome architecture ○ simplifies all downstream analysis Annotation ● Annotation is the process of identifying important features in a genome ○ gene - protein product, promoter, signal sequences ■ ~1000 per Mbp in bacteria, coding dense ○ tRNA - transfer RNA ■ ~30 per bacteria cover all codons (wobble base)
Recommended publications
  • The Pathogenomics and Evolution of Anthrax-Like Bacillus Cereus Isolates and Plasmids

    The Pathogenomics and Evolution of Anthrax-Like Bacillus Cereus Isolates and Plasmids

    The Pathogenomics and Evolution of Anthrax-like Bacillus cereus Isolates and Plasmids A white paper proposal submitted by: Geraldine A. Van der Auwera, Ph.D. Harvard Medical School Michael Feldgarden, Ph.D. Genomic Sequencing Center for Infectious Diseases The Broad Institute of MIT and Harvard 1 Executive summary A key member of the Bacillus cereus group, Bacillus anthracis is defined by phenotypic and molecular characteristics that are conferred by two large plasmids, pXO1 and pXO2. However the very concept of B. anthracis as a distinct species has been called into question by recent discoveries of “intermediate” isolates identified as B. cereus and B. thuringiensis but possessing features similar to those of B. anthracis, including large plasmids that share a common backbone with pXO1 and/or pXO2. Many of these “intermediate” isolates possess potential or demonstrated lethal pathogenic properties and are sometimes called “anthrax-like”, even though they do not meet the strict definition of anthrax-causing B. anthracis. We recently showed that pXO1- and pXO2- like plasmids are widely prevalent in environmental isolates of the B. cereus group. Because B. anthracis-like isolates do not possess all the molecular hallmarks of typical B. anthracis, there is a significant risk that they would escape being flagged as dangerous. Consequently, accidental infection by naturally occurring pathotypes which are not immediately recognized as life-threatening could present a serious health concern. Such cases have already been reported, some with a fatal outcome. The second risk posed by these B. anthracis-like isolates could be the intentional use as “stealth anthrax” bioweapon, either in natural form or with genetic modifications that would require only minimal skills and facilities to produce.
  • 14S802 - Strain Specific Pathogenicity of Staphylococcus Aureus Final Report

    14S802 - Strain Specific Pathogenicity of Staphylococcus Aureus Final Report

    14S802 - Strain Specific Pathogenicity of Staphylococcus aureus Final Report This project was funded under the Department of Agriculture, Food and the Marine Competitive Funding Programme. SUMMARY Mastitis is a costly endemic disease for the dairy industry. It is primarily caused by bacterial infection and is the most common reason for antibiotic use in dairy cows in Ireland. Staphylococcus aureus is the most common mastitis pathogen in Ireland and the S. aureus strains that cause mastitis belong to specific bovine-adapted lineages. Current selection for mastitis-resistance is based on the host immune response, as determined by somatic cell count (SCC). However, the ability of S. aureus to evade and subvert the host immune response is well known, including the ability to internalise and survive within host cells. This project tested the hypothesis that bovine intramammary infection with different S. aureus strains results in differential activation of the host immune response. Supporting this hypothesis, significant differences between S. aureus lineages in their ability to internalise within bovine mammary epithelial cells were found with some strains internalising at higher levels than others. It was also found that some strains induced higher expression of cytokines and chemokines responsible for attracting immune cells and these strains induced mammary epithelial cells to produce factors that attracted somatic cells, while other strains did not. Differences in disease presentation in vivo in cows infected with different strains were also observed, indicating strain-specific virulence. Significantly higher somatic cell count and anti- Staphylococcus IgG and significantly lower milk yield were observed in response to infection with a more virulent strain.
  • The Role of Genomics in the Identification, Prediction, and Prevention of Biological Threats

    The Role of Genomics in the Identification, Prediction, and Prevention of Biological Threats

    Perspective The Role of Genomics in the Identification, Prediction, and Prevention of Biological Threats W. Florian Fricke, David A. Rasko, Jacques Ravel* Institute for Genome Sciences (IGS), University of Maryland School of Medicine, Baltimore, Maryland, United States of America Since the publication in 1995 of the first Biodefense Funding for of programs, the genome sequences of over complete genome sequence of a free-living Genomic Research 90,000 influenza viruses were rapidly organism, the bacterium Haemophilus influ- generated and are now deposited in enzae [1], more than 1,000 genomes of Since the anthrax letter attacks of 2001, GenBank (http://www.ncbi.nlm.nih.gov/ species from all three domains of life— when letters containing anthrax spores genomes/FLU/aboutdatabase.html). Be- Bacteria, Archaea, and Eukarya—have were mailed to several news media offices cause of the availability of large sequencing been completed and a staggering 4,300 and two Democratic senators in the capacity and the large amount of informa- are in progress (not including an even United States, killing five people and tion, the response to the 2009 H1N1 larger number of viral genome projects) infecting 17 others, funding agencies in influenza pandemic was rapid and efficient (GOLD, Genomes Online Database v. the US and other countries have priori- (Box 2): Genomics information was gener- 2.0; http://www.genomesonline.org/gold. tized research projects on organisms that ated within days and validated diagnostic cgi, as of August 2009). Whole-genome might potentially challenge our security tools were approved within weeks [5,6]. A shotgun sequencing remains the standard and economy should they be used as global response was made possible through in biomedical, biotechnological, environ- biological weapons.
  • Microbes and Metagenomics in Human Health an Overview of Recent Publications Featuring Illumina® Technology TABLE of CONTENTS

    Microbes and Metagenomics in Human Health an Overview of Recent Publications Featuring Illumina® Technology TABLE of CONTENTS

    Microbes and Metagenomics in Human Health An overview of recent publications featuring Illumina® technology TABLE OF CONTENTS 4 Introduction 5 Human Microbiome Gut Microbiome Gut Microbiome and Disease Inflammatory Bowel Disease (IBD) Metabolic Diseases: Diabetes and Obesity Obesity Oral Microbiome Other Human Biomes 25 Viromes and Human Health Viral Populations Viral Zoonotic Reservoirs DNA Viruses RNA Viruses Human Viral Pathogens Phages Virus Vaccine Development 44 Microbial Pathogenesis Important Microorganisms in Human Health Antimicrobial Resistance Bacterial Vaccines 54 Microbial Populations Amplicon Sequencing 16S: Ribosomal RNA Metagenome Sequencing: Whole-Genome Shotgun Metagenomics Eukaryotes Single-Cell Sequencing (SCS) Plasmidome Transcriptome Sequencing 63 Glossary of Terms 64 Bibliography This document highlights recent publications that demonstrate the use of Illumina technologies in immunology research. To learn more about the platforms and assays cited, visit www.illumina.com. An overview of recent publications featuring Illumina technology 3 INTRODUCTION The study of microbes in human health traditionally focused on identifying and 1. Roca I., Akova M., Baquero F., Carlet J., treating pathogens in patients, usually with antibiotics. The rise of antibiotic Cavaleri M., et al. (2015) The global threat of resistance and an increasingly dense—and mobile—global population is forcing a antimicrobial resistance: science for interven- tion. New Microbes New Infect 6: 22-29 1, 2, 3 change in that paradigm. Improvements in high-throughput sequencing, also 2. Shallcross L. J., Howard S. J., Fowler T. and called next-generation sequencing (NGS), allow a holistic approach to managing Davies S. C. (2015) Tackling the threat of anti- microbial resistance: from policy to sustainable microbes in human health.
  • Molecular Complexity of Successive Bacterial Epidemics Deconvoluted by Comparative Pathogenomics

    Molecular Complexity of Successive Bacterial Epidemics Deconvoluted by Comparative Pathogenomics

    Molecular complexity of successive bacterial epidemics deconvoluted by comparative pathogenomics Stephen B. Beresa,b, Ronan K. Carrolla,b, Patrick R. Sheaa,b, Izabela Sitkiewicza,b, Juan Carlos Martinez-Gutierreza,b, Donald E. Lowc, Allison McGeerd, Barbara M. Willeyd, Karen Greend, Gregory J. Tyrrelld, Thomas D. Goldmanf, Michael Feldgardeng, Bruce W. Birreng, Yuriy Fofanovh, John Boosi, William D. Wheatoni, Christiane Honischf, and James M. Mussera,b,1 aCenter for Molecular and Translational Human Infectious Diseases Research, The Methodist Hospital Research Institute, and bDepartment of Pathology, The Methodist Hospital, Houston, TX 77030; cOntario Agency for Health Protection and Promotion, and University of Toronto, Toronto, ON M5G 1X5, Canada; dDepartment of Microbiology, Mount Sinai Hospital, Toronto, ON M5G 1X5, Canada; eDepartment of Laboratory medicine and Pathology, University of Alberta, Edmonton, AB T6G 2J2, Canada; fSequenom, Inc., San Diego, CA 92121; gBroad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA 02142; hDepartment of Computer Science and Department of Biology and Biochemistry, University of Houston, Houston, TX 77204; and iRTI International, Research Triangle Park, NC 27709 Edited* by Charles R. Cantor, Sequenom Inc., San Diego, CA, and approved December 14, 2009 (received for review September 30, 2009) Understanding the fine-structure molecular architecture of bacterial century that GAS has the capacity to cause epidemics charac- epidemics has been a long-sought goal of infectious disease research. terized by rapid increase in disease frequency and severity. For We used short-read-length DNA sequencing coupled with mass example, Weech (6) described an epidemic of septic scarlet fever spectroscopy analysis of SNPs to study the molecular pathogenomics in Yunnanfu, China, that killed 50,000 people, fully 25% of the of three successive epidemics of invasive infections involving 344 population of the province.
  • Microbial Risk Assessment Guideline

    Microbial Risk Assessment Guideline

    EPA/100/J-12/001 USDA/FSIS/2012-001 MICROBIAL RISK ASSESSMENT GUIDELINE PATHOGENIC MICROORGANISMS WITH FOCUS ON FOOD AND WATER Prepared by the Interagency Microbiological Risk Assessment Guideline Workgroup July 2012 Microbial Risk Assessment Guideline Page ii DISCLAIMER This guideline document represents the current thinking of the workgroup on the topics addressed. It is not a regulation and does not confer any rights for or on any person and does not operate to bind USDA, EPA, any other federal agency, or the public. Further, this guideline is not intended to replace existing guidelines that are in use by agencies. The decision to apply methods and approaches in this guideline, either totally or in part, is left to the discretion of the individual department or agency. Mention of trade names or commercial products does not constitute endorsement or recommendation for use. Environmental Protection Agency (EPA) (2012). Microbial Risk Assessment Guideline: Pathogenic Microorganisms with Focus on Food and Water. EPA/100/J-12/001 Microbial Risk Assessment Guideline Page iii TABLE OF CONTENTS Disclaimer .......................................................................................................................... ii Interagency Workgroup Members ................................................................................ vii Preface ............................................................................................................................. viii Abbreviations ..................................................................................................................
  • Comparative Pathogenomics of Aeromonas Veronii from Pigs in South Africa: Dominance of the Novel ST657 Clone

    Comparative Pathogenomics of Aeromonas Veronii from Pigs in South Africa: Dominance of the Novel ST657 Clone

    microorganisms Article Comparative Pathogenomics of Aeromonas veronii from Pigs in South Africa: Dominance of the Novel ST657 Clone Yogandree Ramsamy 1,2,3,* , Koleka P. Mlisana 2, Daniel G. Amoako 3 , Akebe Luther King Abia 3 , Mushal Allam 4 , Arshad Ismail 4 , Ravesh Singh 1,2 and Sabiha Y. Essack 3 1 Medical Microbiology, College of Health Sciences, University of KwaZulu-Natal, Durban 4000, South Africa; [email protected] 2 National Health Laboratory Service, Durban 4001, South Africa; [email protected] 3 Antimicrobial Research Unit, College of Health Sciences, University of KwaZulu-Natal, Durban 4000, South Africa; [email protected] (D.G.A.); [email protected] (A.L.K.A.); [email protected] (S.Y.E.) 4 Sequencing Core Facility, National Institute for Communicable Diseases, National Health Laboratory Service, Johannesburg 2131, South Africa; [email protected] (M.A.); [email protected] (A.I.) * Correspondence: [email protected] Received: 9 November 2020; Accepted: 15 December 2020; Published: 16 December 2020 Abstract: The pathogenomics of carbapenem-resistant Aeromonas veronii (A. veronii) isolates recovered from pigs in KwaZulu-Natal, South Africa, was explored by whole genome sequencing on the Illumina MiSeq platform. Genomic functional annotation revealed a vast array of similar central networks (metabolic, cellular, and biochemical). The pan-genome analysis showed that the isolates formed a total of 4349 orthologous gene clusters, 4296 of which were shared; no unique clusters were observed. All the isolates had similar resistance phenotypes, which corroborated their chromosomally mediated resistome (blaCPHA3 and blaOXA-12) and belonged to a novel sequence type, ST657 (a satellite clone).
  • Genomic Diversity of Escherichia Coli Isolates from Non-Human Primates in the Gambia

    Genomic Diversity of Escherichia Coli Isolates from Non-Human Primates in the Gambia

    bioRxiv preprint doi: https://doi.org/10.1101/2020.02.29.971309; this version posted March 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 1 Genomic diversity of Escherichia coli isolates from non-human primates in the Gambia 2 Ebenezer Foster-Nyarko1,2, Nabil-Fareed Alikhan1, Anuradha Ravi1, Gaëtan Thilliez1, 3 Nicholas Thomson1, David Baker1, Gemma Kay1, Jennifer D. Cramer3, Justin O’Grady1, 4 Martin Antonio2,4, Mark J. Pallen1,5† 5 6 1 Quadram Institute Bioscience, Norwich Research Park, Norwich, Norfolk, United Kingdom 7 2 Medical Research Council Unit the Gambia at the London School of Hygiene and Tropical 8 Medicine, Atlantic Boulevard Road, Fajara, The Gambia 9 3 American Public University System, Charles Town, WV, USA 10 4 Microbiology and Infection Unit, Warwick Medical School, University of Warwick, 11 Coventry, United Kingdom 12 5 School of Veterinary Medicine, University of Surrey, Guildford, Surrey, United Kingdom. 13 14 †Correspondence: Professor Mark Pallen, Quadram Institute Bioscience, Norwich Research 15 Park, Norwich, Norfolk, United Kingdom 16 Email: [email protected] 17 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.02.29.971309; this version posted March 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.
  • Pangenome Analysis Reveals Genetic Isolation in Campylobacter Hyointestinalis Subspecies Adapted to Different Mammalian Hosts

    Pangenome Analysis Reveals Genetic Isolation in Campylobacter Hyointestinalis Subspecies Adapted to Different Mammalian Hosts

    www.nature.com/scientificreports OPEN Pangenome analysis reveals genetic isolation in Campylobacter hyointestinalis subspecies adapted to diferent mammalian hosts Daniela Costa 1,2, Simon Lévesque3, Nitin Kumar4, Pablo Fresia1,5, Ignacio Ferrés 1, Trevor D. Lawley4 & Gregorio Iraola 1,4,6* Campylobacter hyointestinalis is an emerging pathogen currently divided in two subspecies: C. hyointestinalis subsp. lawsonii which is predominantly recovered from pigs, and C. hyointestinalis subsp. hyointestinalis which can be found in a much wider range of mammalian hosts. Despite C. hyointestinalis being reported as an emerging pathogen, its evolutionary and host-associated diversifcation patterns are still vastly unexplored. For this reason, we generated whole-genome sequences of 13 C. hyointestinalis subsp. hyointestinalis strains and performed a comprehensive comparative analysis including publicly available C. hyointestinalis subsp. hyointestinalis and C. hyointestinalis subsp. lawsonii genomes, to gain insight into the genomic variation of these diferentially-adapted subspecies. Both subspecies are distinct phylogenetic lineages which present an apparent barrier to homologous recombination, suggesting genetic isolation. This is further supported by accessory gene patterns that recapitulate the core genome phylogeny. Additionally, C. hyointestinalis subsp. hyointestinalis presents a bigger and more diverse accessory genome, which probably refects its capacity to colonize diferent mammalian hosts unlike C. hyointestinalis subsp. lawsonii that is presumably host-restricted. This greater plasticity in the accessory genome of C. hyointestinalis subsp. hyointestinalis correlates to a higher incidence of genome-wide recombination events, that may be the underlying mechanism driving its diversifcation. Concordantly, both subspecies present distinct patterns of gene families involved in genome plasticity and DNA repair like CRISPR-associated proteins and restriction-modifcation systems.
  • Snapshot Study of Whole Genome Sequences of Escherichia Coli from Healthy Companion Animals, Livestock, Wildlife, Humans and Food in Italy

    Snapshot Study of Whole Genome Sequences of Escherichia Coli from Healthy Companion Animals, Livestock, Wildlife, Humans and Food in Italy

    antibiotics Article Snapshot Study of Whole Genome Sequences of Escherichia coli from Healthy Companion Animals, Livestock, Wildlife, Humans and Food in Italy Elisa Massella 1, Cameron J. Reid 2, Max L. Cummins 2, Kay Anantanawat 2, Tiziana Zingali 2, Andrea Serraino 1 , Silvia Piva 1, Federica Giacometti 1 and Steven P. Djordjevic 2,* 1 Department of Veterinary Medical Sciences, University of Bologna, 40126 Bologna, Italy; [email protected] (E.M.); [email protected] (A.S.); [email protected] (S.P.); [email protected] (F.G.) 2 The ithree Institute, University of Technology Sydney, City Campus, Ultimo, NSW 2007, Australia; [email protected] (C.J.R.); [email protected] (M.L.C.); [email protected] (K.A.); [email protected] (T.Z.) * Correspondence: [email protected] Received: 15 September 2020; Accepted: 5 November 2020; Published: 6 November 2020 Abstract: Animals, humans and food are all interconnected sources of antimicrobial resistance (AMR), allowing extensive and rapid exchange of AMR bacteria and genes. Whole genome sequencing (WGS) was used to characterize 279 Escherichia coli isolates obtained from animals (livestock, companion animals, wildlife), food and humans in Italy. E. coli predominantly belonged to commensal phylogroups B1 (46.6%) and A (29%) using the original Clermont criteria. One hundred and thirty-six sequence types (STs) were observed, including different pandemic (ST69, ST95, ST131) and emerging (ST10, ST23, ST58, ST117, ST405, ST648) extraintestinal pathogenic Escherichia coli (ExPEC) lineages. Eight antimicrobial resistance genes (ARGs) and five chromosomal mutations conferring resistance to highest priority critically important antimicrobials (HP-CIAs) were identified (qnrS1, qnrB19, mcr-1, blaCTX-M1,15,55, blaCMY-2, gyrA/parC/parE, ampC and pmrB).
  • Pathogenomic Characterisation of a Novel, Layer-Associated Avian

    Pathogenomic Characterisation of a Novel, Layer-Associated Avian

    Pathogenomic characterisation of a novel, layer‐associated Avian Pathogenic Escherichia coli Thesis submitted in accordance with the requirements of the University of Liverpool for the degree of Doctor in Philosophy by Charlotte Rose Collingwood July 2016 Author’s Declaration I declare that the work in this dissertation was carried out in accordance with the University’s Regulations and Code of Practice for Research. Except where indicated, the work is my own. Work done in collaboration, or with the assistance of others, is indicated as such. Any views expressed in the dissertation are those of the author. Signed: Date: i Acknowledgements Firstly I would like to thank my supervisor, Paul Wigley, for his help, support, guidance and most importantly encouragement during my studies. Your expertise, good humour, and kindness have been unparalleled. I’ve also learnt more about certain (obscure?) musical genres than I could have imagined. I’d also like to thank Craig Winstanley, in particular in enabling me to get the 3770 genome sequenced. Without your timely input, this particular chapter would be considerably briefer. I would also like to thank many people in the labs at Leahurst and in the Ronald Ross building for their help and friendship during my studies. Whether experiments went wrong or right; there was always someone there to commiserate or congratulate. There are too many to name individually, but special thanks go to Georgina Crayford, Elli Wright and Camilla Brena, you have helped more than you could know. On a practical note, I would like to thank Roy Chaudhuri for his expertise in genomics and phylogenetics, Mark Stevens for the kind gift of the χ7122 isolate, and Tim Wallis at Ridgeway Biologicals for sharing the SPS‐associated isolates used in this project.
  • The Perfect Condition for the Rising of Superbugs: Person-To-Person Contact and Antibiotic Use Are the Key Factors Responsible F

    The Perfect Condition for the Rising of Superbugs: Person-To-Person Contact and Antibiotic Use Are the Key Factors Responsible F

    antibiotics Article The Perfect Condition for the Rising of Superbugs: Person-to-Person Contact and Antibiotic Use Are the Key Factors Responsible for the Positive Correlation between Antibiotic Resistance Gene Diversity and Virulence Gene Diversity in Human Metagenomes Célia P. F. Domingues 1,2,† , João S. Rebelo 1,† , Joël Pothier 3 , Francisca Monteiro 1 , Teresa Nogueira 1,2,*,‡ and Francisco Dionisio 1,* 1 cE3c—Centre for Ecology, Evolution and Environmental Changes, Faculdade de Ciências, Universidade de Lisboa, 1749-016 Lisboa, Portugal; [email protected] (C.P.F.D.); [email protected] (J.S.R.); [email protected] (F.M.) 2 INIAV—National Institute for Agrarian and Veterinary Research, Bacteriology and Mycology Laboratory, 2780-157 Oeiras, Portugal 3 Atelier de Bioinformatique, ISYEB, UMR 7205 CNRS MNHN UPMC EPHE, Muséum National d’Histoire Naturelle, CP 50, 45 rue Buffon, F-75005 Paris, France; [email protected] * Correspondence: [email protected] (T.N.); [email protected] (F.D.) Citation: Domingues, C.P.F.; Rebelo, † These authors have contributed equally to this work. J.S.; Pothier, J.; Monteiro, F.; Nogueira, ‡ The Institution is Located in Oeiras, and the Author’s Office is in 4485-655 Vairão, Portugal. T.; Dionisio, F. The Perfect Condition for the Rising of Superbugs: Abstract: Human metagenomes with a high diversity of virulence genes tend to have a high diversity Person-to-Person Contact and of antibiotic-resistance genes and vice-versa. To understand this positive correlation, we simulated Antibiotic Use Are the Key Factors the transfer of these genes and bacterial pathogens in a community of interacting people that take Responsible for the Positive antibiotics when infected by pathogens.