Identification of Macrolide Resistance Alleles in Environmental Metagenomes Robert Schmieder1,2, Anca Segall3, Molly Schmid4 and Robert Edwards2 1 Computational Science Research Center, SDSU; 2 Department of Computer Science, SDSU; 3 Department of Biology, SDSU; 4 Keck Graduate Institute of Applied Life Sciences E D W A R D S L A B A T S A N D I E G O S T A T E U N I V E R S I T Y Why do we care about antibiotic resistance ? How do we identify resistance alleles ? What did we find ? (cont.)
An bio cs are substances An bio c misuse 1 Iden fy 23S rRNA‐like sequences in metagenomes Base frequencies that kill or inhibit growth contributes to emergence of 296 publicly available metagenomes (over 42M sequence reads) DB of 23S rRNA sequences Base frequencies for the 13 of bacteria from fully sequenced bacterial posi ons of interest listed by resistant bacteria genomes in NCBI their posi on in the E. coli K‐12 MG1655 23S rRNA gene. The Bacteria can bars are in order of (S)ILVA, (D)B, and (M)etagenomes. develop resistance Growing problem of BLAST 480 Species 23S DB SILVA counts are based on an bio c resistance 755 Strains 6,685 23S rRNA sequences in health care and with length between 2,200 bp and 3,200 bp (as of version Modified image from: www.answersingenesis.org 2211 Sequences commercial farming 100). DB counts are based on Aim: Examine 23S rRNA drug resistance determinants in metagenomes from 2,211 unique 23S rRNA 2 Iden fy base subs tu ons in candidate sequences sequences of all completely different environments. The results may contribute to characterizing the sequenced bacterial genomes reservoir of an bio c resistance determinants in an environment. Map muta on sites in reference organism E. coli K‐12 MG1655 to other species in the NCBI database and with using global mul ple alignments length between 2,200 bp and 3,200 bp (as of 11/23/09). The number of metagenomic What is macrolide resistance ? A2058C/G/T A2059C/T Muta on sites sequences are shown above G2057A A2062C the bars. Macrolides are an bio cs that inhibit bacterial protein synthesis G2032A/C/T Sequence logo of region composed of 2 amino or neutral sugars a ached to lactone ring of variable size in domain V showing high conserva on over Several key posi ons were invariant in all tested metagenomes. Posi on 754 shows the 480 different species highest frequency of base subs tu ons, followed by posi on 2057, 2611, and 2032. Overall Macrolide groups infrequent occurrence of resistance alleles. 14‐membered: Clarithromycin, Dirithromycin, 2% G2032A 7% G2057A Erythromycin and Roxithromycin G2032A: Helicobacter pylori J99 ‐ source pa ent was exposed to macrolides Muta on frequencies # Metagenomes 15‐membered: Azithromycin G2057A: Chlamydia trachoma s D/UW‐3/CX ‐ clinical isolates obtained a er ineffec ve therapy Frequency of reference and altered # Sequences of urogenital chlamydiasis with fluoroquinolones and macrolides 16‐membered: Josamycin, Midecamycin, Miocamycin, sequences in the metagenomes. The Rokitamyin, Spiramycin and Tylosin majority of sequences similar to the Figure taken from Poehlsgaard and Douthwaite (2005) Analysis workflow resistance regions were found in the Macrolide resistance mechanisms microbial metagenomes. The human Preprocess metagenomic dataset Summary of the computa onal gut samples contained the highest • Ribosomal modifica on (methyla on or muta on) process that was integrated into a number of altered and/or resistance sequences. • Efflux of the macrolide out of the cell Point muta ons in 23S rRNA Run BLASTn against 23S rRNA database pipeline for automa c analysis An addi onal web‐service allows the • Macrolide inac va on Parse BLAST results data processing through a user‐ friendly interface. Modifica on by point muta on(s) in 23S rRNA Realign candidate sequences The bases at the alleles known to Domain II Domain V (pep dyl‐transferase region) determine resistance to macrolides Iden fy and count base subs tu ons were used to calculate the frequency of resistance for each Calculate muta on frequencies single metagenome.
Modified image from rna.ucsc.edu/rnacenter/ribosome_images.html Muta on site What did we find ? What comes next ? Single base subs tu on Streptococcus pneumoniae at posi on 2059 will cause a >100‐fold increase in resistance against the macrolide Erythromycin Healthy Japanese individuals The study will be extended to analyze addi onal sites in the 23S rRNA gene that allow No an bio cs for at least 4 weeks bacteria to gain resistance against other an bio cs such as Oxazolidinones. Furthermore, a Why use metagenomes ? web‐service is under development to iden fy resistance alleles in metagenomic data sets. Metagenomes are community genome sequences with minimal sampling bias. Global ocean survey Further information In combina on with next‐genera on sequencing techniques, analysis of metagenomic sequences can efficiently assess the reservoir of resistance Please contact [email protected]. This poster is available online at determinants in an environment. h p://edwards.sdsu.edu/posters/macrolide_resistance2.pdf