Classification of Genomic Sequences by Latent Semantic Analysis

Total Page:16

File Type:pdf, Size:1020Kb

Classification of Genomic Sequences by Latent Semantic Analysis University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Theses, Dissertations, and Student Research Electrical & Computer Engineering, Department from Electrical & Computer Engineering of Summer 8-2012 Classification of Genomic Sequences yb Latent Semantic Analysis Samuel F. Way University of Nebraska-Lincoln, [email protected] Follow this and additional works at: https://digitalcommons.unl.edu/elecengtheses Part of the Bioinformatics Commons, Computational Biology Commons, and the Other Electrical and Computer Engineering Commons Way, Samuel F., "Classification of Genomic Sequences yb Latent Semantic Analysis" (2012). Theses, Dissertations, and Student Research from Electrical & Computer Engineering. 42. https://digitalcommons.unl.edu/elecengtheses/42 This Article is brought to you for free and open access by the Electrical & Computer Engineering, Department of at DigitalCommons@University of Nebraska - Lincoln. It has been accepted for inclusion in Theses, Dissertations, and Student Research from Electrical & Computer Engineering by an authorized administrator of DigitalCommons@University of Nebraska - Lincoln. CLASSIFICATION OF GENOMIC SEQUENCES BY LATENT SEMANTIC ANALYSIS by Samuel F. Way A THESIS Presented to the Faculty of The Graduate College at the University of Nebraska In Partial Fulfilment of Requirements For the Degree of Master of Science Major: Electrical Engineering Under the Supervision of Professor Khalid Sayood and Professor Ozkan Ufuk Nalbantoglu Lincoln, Nebraska August, 2012 CLASSIFICATION OF GENOMIC SEQUENCES BY LATENT SEMANTIC ANALYSIS Samuel F. Way, M. S. University of Nebraska, 2012 Adviser: Khalid Sayood and Ozkan Ufuk Nalbantoglu Evolutionary distance measures provide a means of identifying and organizing related organisms by comparing their genomic sequences. As such, techniques that quantify the level of similarity between DNA sequences are essential in our efforts to decipher the genetic code in which they are written. Traditional methods for estimating the evolutionary distance separating two ge- nomic sequences often require that the sequences first be aligned before they are compared. Unfortunately, this preliminary step imposes great computational burden, making this class of techniques impractical for applications involving a large number of sequences. Instead, we desire new methods for differentiating genomic sequences that eliminate the need for sequence alignment. Here, we present a novel approach for identifying evolutionarily similar DNA se- quences using a theory and collection of techniques called latent semantic analysis (LSA). In the field of information retrieval, LSA techniques are used to identify text documents with related underlying concepts, or \latent semantics". These techniques capture the inherent structure within a collection of documents, and in much the same way, we extend these approaches to infer the biological structure through which a collection of organisms are related. In doing so, we develop a computationally ef- ficient means of identifying evolutionarily similar DNA sequences that is especially well-suited for partitioning large sets of biological data. iii ACKNOWLEDGMENTS First and foremost, I would like to thank my committee and academic advisors for their help throughout my graduate studies. In particular, no words could ever express my appreciation for Dr. Khalid Sayood and the impact that he has had on my life, both academically and personally. I feel very fortunate to have found an advisor who is as patient and caring as he is, and I can't thank him enough for his support and guidance. I would also like to thank Dr. Michael Hoffman for his invaluable insight and help in preparing this research. He and Dr. Sayood have shown me how absolutely fun research can be, and I thank them both for making my time at UNL incredibly enjoyable. Next, I would like to thank Dr. Ufuk Nalbantoglu for being a great friend and constant source of ideas throughout my graduate career. Beyond my committee, I want thank Dr. David Russell, who has been an amazing role model, teacher, and friend over the past few years. His high expectations have motivated me to always strive to do quality work, and with him, I also thank my other co-workers at Red Cocoa, especially Mark Nispel, David McCreight, Chris Hruby, and Andy Lenhart. I also need to thank Dr. Mark Bauer for his incredible support and guidance, as well as our collaborators at Names for Life and the University of Nebraska Medical Center. I owe a special thanks and many boxes of cookies to Cheryl Wemhoff and Terri Eastin for helping me get everything lined up to graduate this semester. They helped me to overcome a number of obstacles this summer, and I thank them for all of their patience. I also need to thank Jay Carlson for his help with Adobe Illustrator and for letting me vent my frustrations. Finally, my thanks also go to my family and friends for their amazing support throughout my academic journey. Since day one, my parents have provided me with iv the means and inspiration to accomplish great things, and they've always been there to help me in any way they can. Words can't express how grateful I am for their many sacrifices and undying support. Lastly, I would like to thank my beautiful girlfriend, Laura Norris, whose love and encouragement has carried me through the last nearly eight years of my life. She inspires me to become a better person every day, and this work would not have been possible without her support. v Contents Contents v 1 Introduction 1 2 Biology Background 3 2.1 Introduction to DNA Structure . 3 2.2 Function of DNA . 6 3 Comparing DNA Sequences 8 3.1 Alignment-based Distance Measures . 8 3.1.1 Jukes-Cantor . 9 3.1.2 Kimura . 13 3.2 Alignment-free Distance Measures . 15 3.2.1 Relative Complexity Metric . 16 3.2.2 K-mer Based Genomic Signatures . 19 3.2.2.1 The Average Mutual Information Profile . 20 3.2.2.2 The Relative Abundance Index . 22 4 Latent Semantic Analysis 25 4.1 Introduction to LSA . 27 vi 4.2 Dimensionality Reduction Techniques . 31 4.2.1 Singular Value Decomposition . 32 4.2.1.1 Mathematical Derivation . 33 4.2.1.2 Dimensionality Reduction via SVD . 35 4.2.2 Non-negative Matrix Factorization . 36 4.2.2.1 Mathematical Derivation . 38 4.2.2.2 Dimensionality Reduction via NMF . 40 4.3 Adaptation to Sequence Data . 41 5 Visualizing High-Dimensional Datasets 43 5.1 Method . 45 5.2 Implementation . 47 5.3 Results and Discussion . 48 5.3.1 Validation of Taxonomy Data . 49 5.3.2 Map Creation From Proximity Data . 50 5.3.3 Discussion . 51 6 Clustering of 16s Ribosomal Genes 54 6.1 Background Information . 54 6.2 Method . 55 6.3 Results and Discussion . 56 7 Identification and Removal of Host DNA Fragments from Metage- nomics Datasets 67 7.1 Construction of a synthetic dataset . 68 7.2 Method . 70 7.3 Results . 73 vii 7.4 Discussion . 77 8 Using LSA-NMF to Design Microarray Probes 79 8.1 Background Information . 79 8.2 Method . 81 8.3 Results and Discussion . 82 9 Conclusion and Recommendations for Future Work 89 9.1 Recommendations for Future Work . 90 Bibliography 94 1 Chapter 1 Introduction Since the days of the Human Genome Project, the world has witnessed an ongoing revolution in the field of biology. In fact, the changes in this area are so profound that many believe we are currently experiencing the advent of a \New Biology", one that relies on concepts and approaches from engineering, computational sciences, mathematics, and many other fields[1]. The integration of these disciplines signi- fies the emergence of biology as an informational science and is necessitated by the introduction of many new technologies for producing biological data. Some of the most important examples of these new technologies are platforms for massively parallel, next-generation DNA sequencing. Next-generation sequencing (NGS) replaces the traditional Sanger sequencing method with automated, high- throughput technologies that produce unprecedented amounts of sequence data at ever-increasing speeds. What's more, the cost to purchase and operate these platforms has declined rapidly over the years, and as a result, the number of research centers using these technologies has skyrocketed. Given the ability to quickly sequence genetic material, scientists are constantly discovering new applications for next-generation platforms. However, with so many 2 scientists generating such massive amounts of sequence data, it is becoming increas- ingly difficult to make sense of all this new information. As such, it is imperative that we devise efficient techniques for classifying and organizing genomic sequences such that they may be quickly identified and retrieved. Traditional methods for sequence comparison are often computationally inefficient and therefore should not be used in large-scale implementations. Instead, in this re- search, we present a new approach for differentiating genomic sequences based on latent semantic analysis, a theory and collection of techniques born out of the fields of natural language processing and information retrieval. Using this new means com- parison, we explore several experiments that will demonstrate the approach as an attractive alternative to traditional distance measures that is especially well-suited for large-scale applications. 3 Chapter 2 Biology Background Before we go about introducing methods for comparing DNA sequences, we should first provide some biological background to establish what these sequences are and why they are important. It is commonly understood that DNA is a sort of genetic code or blueprint that is used to construct living things. For the purposes of our discussion, we will require only a slightly more detailed explanation, which we present here. In addition, this section will be used to introduce most of the terminology and conventions that will be used throughout subsequent chapters. 2.1 Introduction to DNA Structure DNA stands for deoxyribonucleic acid and is a double-stranded macromolecule com- prised of four basic structural units called nucleotides.
Recommended publications
  • Iron Transport Strategies of the Genus Burkholderia
    Zurich Open Repository and Archive University of Zurich Main Library Strickhofstrasse 39 CH-8057 Zurich www.zora.uzh.ch Year: 2015 Iron transport strategies of the genus Burkholderia Mathew, Anugraha Posted at the Zurich Open Repository and Archive, University of Zurich ZORA URL: https://doi.org/10.5167/uzh-113412 Dissertation Published Version Originally published at: Mathew, Anugraha. Iron transport strategies of the genus Burkholderia. 2015, University of Zurich, Faculty of Science. Iron transport strategies of the genus Burkholderia Dissertation zur Erlangung der naturwissenschaftlichen Doktorwürde (Dr. sc. nat.) vorgelegt der Mathematisch-naturwissenschaftlichen Fakultät der Universität Zürich von Anugraha Mathew aus Indien Promotionskomitee Prof. Dr. Leo Eberl (Vorsitz) Prof. Dr. Jakob Pernthaler Dr. Aurelien carlier Zürich, 2015 2 Table of Contents Summary .............................................................................................................. 7 Zusammenfassung ................................................................................................ 9 Abbreviations ..................................................................................................... 11 Chapter 1: Introduction ....................................................................................... 14 1.1.Role and properties of iron in bacteria ...................................................................... 14 1.2.Iron transport mechanisms in bacteria .....................................................................
    [Show full text]
  • Broad-Spectrum Antimicrobial Activity by Burkholderia Cenocepacia Tatl-371, a Strain Isolated from the Tomato Rhizosphere
    RESEARCH ARTICLE Rojas-Rojas et al., Microbiology DOI 10.1099/mic.0.000675 Broad-spectrum antimicrobial activity by Burkholderia cenocepacia TAtl-371, a strain isolated from the tomato rhizosphere Fernando Uriel Rojas-Rojas,1 Anuar Salazar-Gómez,1 María Elena Vargas-Díaz,1 María Soledad Vasquez-Murrieta, 1 Ann M. Hirsch,2,3 Rene De Mot,4 Maarten G. K. Ghequire,4 J. Antonio Ibarra1,* and Paulina Estrada-de los Santos1,* Abstract The Burkholderia cepacia complex (Bcc) comprises a group of 24 species, many of which are opportunistic pathogens of immunocompromised patients and also are widely distributed in agricultural soils. Several Bcc strains synthesize strain- specific antagonistic compounds. In this study, the broad killing activity of B. cenocepacia TAtl-371, a Bcc strain isolated from the tomato rhizosphere, was characterized. This strain exhibits a remarkable antagonism against bacteria, yeast and fungi including other Bcc strains, multidrug-resistant human pathogens and plant pathogens. Genome analysis of strain TAtl-371 revealed several genes involved in the production of antagonistic compounds: siderophores, bacteriocins and hydrolytic enzymes. In pursuit of these activities, we observed growth inhibition of Candida glabrata and Paraburkholderia phenazinium that was dependent on the iron concentration in the medium, suggesting the involvement of siderophores. This strain also produces a previously described lectin-like bacteriocin (LlpA88) and here this was shown to inhibit only Bcc strains but no other bacteria. Moreover, a compound with an m/z 391.2845 with antagonistic activity against Tatumella terrea SHS 2008T was isolated from the TAtl-371 culture supernatant. This strain also contains a phage-tail-like bacteriocin (tailocin) and two chitinases, but the activity of these compounds was not detected.
    [Show full text]
  • Horizontal Gene Transfer to a Defensive Symbiont with a Reduced Genome
    bioRxiv preprint doi: https://doi.org/10.1101/780619; this version posted September 24, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 1 Horizontal gene transfer to a defensive symbiont with a reduced genome 2 amongst a multipartite beetle microbiome 3 Samantha C. Waterwortha, Laura V. Flórezb, Evan R. Reesa, Christian Hertweckc,d, 4 Martin Kaltenpothb and Jason C. Kwana# 5 6 Division of Pharmaceutical Sciences, School of Pharmacy, University of Wisconsin- 7 Madison, Madison, Wisconsin, USAa 8 Department of Evolutionary Ecology, Institute of Organismic and Molecular Evolution, 9 Johannes Gutenburg University, Mainz, Germanyb 10 Department of Biomolecular Chemistry, Leibniz Institute for Natural Products Research 11 and Infection Biology, Jena, Germanyc 12 Department of Natural Product Chemistry, Friedrich Schiller University, Jena, Germanyd 13 14 #Address correspondence to Jason C. Kwan, [email protected] 15 16 17 18 1 bioRxiv preprint doi: https://doi.org/10.1101/780619; this version posted September 24, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. 19 ABSTRACT 20 The loss of functions required for independent life when living within a host gives rise to 21 reduced genomes in obligate bacterial symbionts. Although this phenomenon can be 22 explained by existing evolutionary models, its initiation is not well understood.
    [Show full text]
  • Molecular Study of the Burkholderia Cenocepacia Division Cell Wall Operon and Ftsz Interactome As Targets for New Drugs
    Università degli Studi di Pavia Dipartimento di Biologia e Biotecnologie “L. Spallanzani” Molecular study of the Burkholderia cenocepacia division cell wall operon and FtsZ interactome as targets for new drugs Gabriele Trespidi Dottorato di Ricerca in Genetica, Biologia Molecolare e Cellulare XXXIII Ciclo – A.A. 2017-2020 Università degli Studi di Pavia Dipartimento di Biologia e Biotecnologie “L. Spallanzani” Molecular study of the Burkholderia cenocepacia division cell wall operon and FtsZ interactome as targets for new drugs Gabriele Trespidi Supervised by Prof. Edda De Rossi Dottorato di Ricerca in Genetica, Biologia Molecolare e Cellulare XXXIII Ciclo – A.A. 2017-2020 Cover image: representation of the Escherichia coli division machinery. Created by Ornella Trespidi A tutta la mia famiglia e in particolare a Diletta Table of contents Abstract ..................................................................................................... 1 Abbreviations ............................................................................................ 3 1. Introduction ........................................................................................... 5 1.1 Cystic fibrosis 5 1.1.1 CFTR defects in CF 5 1.1.2 Pathophysiology of CF 7 1.1.3 CFTR modulator therapies 11 1.2 CF-associated lung infections 12 1.2.1 Pseudomonas aeruginosa 13 1.2.2 Staphylococcus aureus 14 1.2.3 Stenotrophomonas maltophilia 15 1.2.4 Achromobacter xylosoxidans 16 1.2.5 Non-tuberculous mycobacteria 17 1.3 The genus Burkholderia 18 1.3.1 Burkholderia cepacia complex 20 1.3.2 Bcc species as CF pathogens 22 1.4 Burkholderia cenocepacia 23 1.4.1 B. cenocepacia epidemiology 23 1.4.2 B. cenocepacia genome 25 1.4.3 B. cenocepacia pathogenicity and virulence factors 26 1.4.3.1 Quorum sensing 29 1.4.3.2 Biofilm 32 1.4.4 B.
    [Show full text]
  • The Organization of the Quorum Sensing Luxi/R Family Genes in Burkholderia
    Int. J. Mol. Sci. 2013, 14, 13727-13747; doi:10.3390/ijms140713727 OPEN ACCESS International Journal of Molecular Sciences ISSN 1422-0067 www.mdpi.com/journal/ijms Article The Organization of the Quorum Sensing luxI/R Family Genes in Burkholderia Kumari Sonal Choudhary 1, Sanjarbek Hudaiberdiev 1, Zsolt Gelencsér 2, Bruna Gonçalves Coutinho 1,3, Vittorio Venturi 1,* and Sándor Pongor 1,2,* 1 International Centre for Genetic Engineering and Biotechnology (ICGEB), Padriciano 99, Trieste 32149, Italy; E-Mails: [email protected] (K.S.C.); [email protected] (S.H.); [email protected] (B.G.C.) 2 Faculty of Information Technology, PázmányPéter Catholic University, Práter u. 50/a, Budapest 1083, Hungary; E-Mail: [email protected] 3 The Capes Foundation, Ministry of Education of Brazil, Cx postal 250, Brasilia, DF 70.040-020, Brazil * Authors to whom correspondence should be addressed; E-Mails: [email protected] (V.V.); [email protected] (S.P.); Tel.: +39-40-375-7300 (S.P.); Fax: +39-40-226-555 (S.P.). Received: 30 May 2013; in revised form: 20 June 2013 / Accepted: 24 June 2013 / Published: 2 July 2013 Abstract: Members of the Burkholderia genus of Proteobacteria are capable of living freely in the environment and can also colonize human, animal and plant hosts. Certain members are considered to be clinically important from both medical and veterinary perspectives and furthermore may be important modulators of the rhizosphere. Quorum sensing via N-acyl homoserine lactone signals (AHL QS) is present in almost all Burkholderia species and is thought to play important roles in lifestyle changes such as colonization and niche invasion.
    [Show full text]
  • Identification of the Essential Genome of Burkholderia Cenocepacia K56-2 to Uncover Novel
    Identification of the Essential Genome of Burkholderia cenocepacia K56-2 to Uncover Novel Antibacterial Targets by April S. Gislason A Thesis submitted to the Faculty of Graduate Studies of The University of Manitoba in partial fulfillment of the requirements for the degree of Doctor of Philosophy Department of Microbiology University of Manitoba Winnipeg, Manitoba, Canada Copyright © 2017 April S. Gislason ii Abstract Burkholderia cenocepacia K56-2 is a clinical isolate of the Burkholderia cepacia complex, a group of intrinsically antibiotic resistant opportunistic pathogens, for which few treatment options are available. The identification of essential genes is an important step for exploring new antibiotic targets. Previously, a conditional growth mutant library was created using transposon mutagenesis to insert a rhamnose-inducible promoter into the genome of B. cenocepacia. It was then demonstrated that conditional growth mutants under-expressing an antibiotic target were sensitized to sub-lethal concentrations of that antibiotic. This thesis describes the use of next generation sequencing to measure the enhanced sensitivity of conditional growth mutants after being grown together competitively. Evidence is provided to show that the sensitivity of a GyrB conditional growth mutant to its cognate antibiotic, novobiocin, was enhanced due to competitive growth. Screening eight antibiotics with known mechanisms of action against a pilot conditional growth mutant library revealed a previously uncharacterized two-component system involved antibiotic resistance in B. cenocepacia. With the goal of exploring new Burkholderia antibacterial targets, the essential genome of B. cenocepacia K56-2 was identified by sequencing a high density transposon mutant library. Comparison of the B. cenocepacia K56-2 essential gene set with the essential genomes predicted for B.
    [Show full text]
  • Diazotrophic Burkholderia Species Isolated from the Amazon Region Exhibit
    Systematic and Applied Microbiology 35 (2012) 253–262 Contents lists available at SciVerse ScienceDirect Systematic and Applied Microbiology jo urnal homepage: www.elsevier.de/syapm Diazotrophic Burkholderia species isolated from the Amazon region exhibit phenotypical, functional and genetic diversity a,1,2 b,3 b,3 c,4 Krisle da Silva , Alice de Souza Cassetari , Adriana Silva Lima , Evie De Brandt , d c,4 b,∗ Eleanor Pinnock , Peter Vandamme , Fatima Maria de Souza Moreira a Microbiologia Agrícola Graduate Programme, Departamento de Biologia, Universidade Federal de Lavras, Campus UFLA, 37200-000 Lavras, Minas Gerais, Brazil b Laboratório de Biologia, Microbiologia e Processos Biológicos do Solo, Departamento de Ciência do Solo, Universidade Federal de Lavras, Campus UFLA, 37200-000 Lavras, Minas Gerais, Brazil c Laboratorium voor Microbiologie, Faculteit Wetenschappen, Universiteit Gent, K. L. Ledeganckstraat 35, 9000 Gent, Belgium d Department of Biological Sciences, Warwick University, Coventry CV4 7AL, United Kingdom a r t i c l e i n f o a b s t r a c t Article history: Forty-eight Burkholderia isolates from different land use systems in the Amazon region were compared Received 25 January 2012 to type strains of Burkholderia species for phenotypic and functional characteristics that can be used Received in revised form 21 April 2012 to promote plant growth. Most of these isolates (n = 46) were obtained by using siratro (Macroptilium Accepted 24 April 2012 atropurpureum – 44) and common bean (Phaseolus vulgaris –2) as the trap plant species; two isolates were obtained from nodules collected in the field from Indigofera suffruticosa and Pithecellobium sp. The Keywords: evaluated characteristics were the following: colony characterisation on “79” medium, assimilation of Macroptilium atropurpureum different carbon sources, enzymatic activities, solubilisation of phosphates, nitrogenase activity and anti- Phaseolus vulgaris fungal activity against Fusarium oxysporium f.
    [Show full text]
  • The Evolutionary Diversity and Biological Function of Phenazine Metabolite Biosynthesis in Burkholderia SPP
    The University of Southern Mississippi The Aquila Digital Community Master's Theses Summer 2019 The Evolutionary Diversity and Biological Function of Phenazine Metabolite Biosynthesis in Burkholderia SPP Samuel Hendry University of Southern Mississippi Follow this and additional works at: https://aquila.usm.edu/masters_theses Part of the Bacteriology Commons Recommended Citation Hendry, Samuel, "The Evolutionary Diversity and Biological Function of Phenazine Metabolite Biosynthesis in Burkholderia SPP" (2019). Master's Theses. 653. https://aquila.usm.edu/masters_theses/653 This Masters Thesis is brought to you for free and open access by The Aquila Digital Community. It has been accepted for inclusion in Master's Theses by an authorized administrator of The Aquila Digital Community. For more information, please contact [email protected]. THE EVOLUTIONARY DIVERSITY AND BIOLOGICAL FUNCTION OF PHENAZINE METABOLITE BIOSYNTHESIS IN BURKHOLDERIA SPP by Samuel Hendry A Thesis Submitted to the Graduate School, the College of Arts and Sciences and the School of Biological, Environmental, and Earth Sciences at The University of Southern Mississippi in Partial Fulfillment of the Requirements for the Degree of Master of Science Approved by: Dmitri Mavrodi, Committee Chair Janet Donaldson Mohamed Elasri ____________________ ____________________ ____________________ Dr. Dmitri Mavrodi Dr. Jake Schaefer Dr. Karen S. Coats Committee Chair Director of School Dean of the Graduate School August 2019 COPYRIGHT BY Samuel Hendry 2019 Published by the Graduate School ABSTRACT Burkholderia encompass a group of ubiquitous Gram-negative bacteria that include numerous saprophytes, as well as several species that cause infections in animals, immunocompromised patients, and plants. Some species of Burkholderia produce colored redox-active secondary metabolites called phenazines (Phz).
    [Show full text]
  • Bacterial Comparative Genomics
    CENTRO DE INVESTIGACION Y DE ESTUDIOS AVANZADOS DEL INSTITUTO POLITECNICO NACIONAL UNIDAD IRAPUATO “Explorando los mecanismos moleculares que mantienen la simbiosis Rhizopus microsporus – Burkholderia rhizoxinica” Presenta LCG José Roberto Bermúdez Barrientos Para obtener el grado de Maestro en Ciencias en Biología Integrativa Directores de tesis: Dra. Laila Pamela Partida Martínez Dr. Cei Leander Gastón Abreu Goodger Irapuato, Guanajuato Febrero de 2016 1 CENTRO DE INVESTIGACION Y DE ESTUDIOS AVANZADOS DEL INSTITUTO POLITECNICO NACIONAL UNIDAD IRAPUATO “Exploring the molecular mechanisms maintaining the Rhizopus microsporus - Burkholderia rhizoxinica symbiosis” Presents LCG José Roberto Bermúdez Barrientos To obtain the grade of Master in Sciences in Integrative Biology Thesis directors: Dr. Laila Pamela Partida Martínez Dr. Cei Leander Gastón Abreu Goodger Irapuato, Guanajuato February de 2016 2 Agracecimientos Este trabajo se realizó en el Centro de Investigación y de Estudios Avanzados del IPN - Unidad Irapuato. Bajo la dirección de Dra. Laila Pamela Partida Martínez y de Dr. Cei Abreu Goodger En el Laboratorio de Interacciones Microbianas perteneciente al Departamento de Ingeniería Genética y en el Laboratorio de Genómica computacional del RNA perteneciente a la Unidad de Genómica Avanzada. En colaboración con la Dra Teresa Pawlowska, Plant Pathology & Plant-Microbe Biology Lab, Plant Science Department, Cornell University, Ithaca, NY. Gracias al apoyo del Consejo Nacional de Ciencia y Tecnología (CONACyT) a través de los Proyectos CB-2009-01-133388, CB-2010-01-151007, INFR-2012-01- 187799 y la beca 358014 Este trabajo se realizó del 1 de Septiembre de 2013 al 1 de Octubre de 2015 Miembros del comité de asesores: Dr. Alfredo Herrera Estrella, Unidad de Genómica Avanzada, CINVESTAV Irapuato Dr.
    [Show full text]
  • Genomic Analysis of the Potential for Aromatic Compounds
    bs_bs_banner Environmental Microbiology (2012) 14(5), 1091–1117 doi:10.1111/j.1462-2920.2011.02613.x Minireview Genomic analysis of the potential for aromatic compounds biodegradation in Burkholderialesemi_2613 1091..1117 Danilo Pérez-Pantoja,1 Raúl Donoso,1,2 in the catabolic clusters of these pathways indicating Loreine Agulló,3 Macarena Córdova,3 recent events in its evolutionary history. In addition, a Michael Seeger,3 Dietmar H. Pieper4 and significant bias towards secondary chromosomes, Bernardo González1,2* now termed chromids, is observed in the distribution 1Center for Advanced Studies in Ecology and of catabolic genes across multipartite genomes, Biodiversity. Millennium Nucleus in Plant Functional which is consistent with a genus-specific character. Genomics. Facultad de Ciencias Biológicas, P. Strains isolated from environmental sources such as Universidad Católica de Chile. Santiago, Chile. soil, rhizosphere, sediment or sludge show a higher 2Facultad de Ingeniería y Ciencias, Universidad Adolfo content of catabolic genes in their genomes com- Ibáñez. Santiago, Chile. pared with strains isolated from human, animal or 3Laboratorio de Microbiología Molecular y Biotecnología plant hosts, but no significant difference is found Ambiental, Departamento de Química, Center for among Alcaligenaceae, Burkholderiaceae and Coma- Nanotechnology and Systems Biology, Universidad monadaceae families, indicating that habitat is more Técnica Federico Santa María, Valparaíso, Chile. of a determinant than phylogenetic origin in shaping 4Microbial Interactions and Processes Research Group, aromatic catabolic versatility. Department of Medical Microbiology, HZI – Helmholtz Centre for Infection Research. Braunschweig, Germany. Introduction Aromatic compounds are widespread in nature, being Summary found as lignin and petroleum components, xenobiotic The relevance of the b-proteobacterial Burkholderi- chemicals, aromatic amino acids and constituents of plant ales order in the degradation of a vast array of exudates, among other sources.
    [Show full text]
  • Research Article Burkholderia Contaminans Biofilm Regulating Operon and Its Distribution in Bacterial Genomes
    Hindawi Publishing Corporation BioMed Research International Volume 2016, Article ID 6560534, 13 pages http://dx.doi.org/10.1155/2016/6560534 Research Article Burkholderia contaminans Biofilm Regulating Operon and Its Distribution in Bacterial Genomes Olga L. Voronina,1 Marina S. Kunda,1 Natalia N. Ryzhova,1 Ekaterina I. Aksenova,1 Andrey N. Semenov,1 Yulia M. Romanova,1,2 and Alexandr L. Gintsburg1,2 1 N.F. Gamaleya Federal Research Center for Epidemiology and Microbiology, Ministry of Health of Russia, Gamaleya Street 18, Moscow 123098, Russia 2I.M. Sechenov First Moscow State Medical University, Moscow 119991, Russia Correspondence should be addressed to Olga L. Voronina; [email protected] Received 24 February 2016; Accepted 8 November 2016 Academic Editor: Vassily Lyubetsky Copyright © 2016 Olga L. Voronina et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Biofilm formation by Burkholderia spp. is a principal cause of lung chronic infections in cystic fibrosis patients. A “lacking biofilm production” (LBP) strain B. contaminans GIMC4587:Bct370-19 has been obtained by insertion modification of clinical strain with plasposon mutagenesis. It has an interrupted transcriptional response regulator (RR) gene. The focus of our investigation was a two-component signal transduction system determination, including this RR. B. contaminans clinical and LBP strains were analyzed by whole genome sequencing and bioinformatics resources. A four-component operon (BiofilmReg) has a key role in biofilm formation. The relative location (i.e., by being separated by another gene) of RR and histidine kinase genes is uniquein BiofilmReg.
    [Show full text]
  • Burkholderia Bacteria Produce Multiple Potentially Novel Molecules That Inhibit Carbapenem-Resistant Gram-Negative Bacterial Pathogens
    antibiotics Article Burkholderia Bacteria Produce Multiple Potentially Novel Molecules that Inhibit Carbapenem-Resistant Gram-Negative Bacterial Pathogens Eliza Depoorter 1 , Evelien De Canck 1, Tom Coenye 2 and Peter Vandamme 1,* 1 Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, 9000 Ghent, Belgium; [email protected] (E.D.); [email protected] (E.D.C.) 2 Laboratory of Pharmaceutical Microbiology, Department of Pharmaceutical Analysis, Ghent University, 9000 Ghent, Belgium; [email protected] * Correspondence: [email protected]; Tel.: +32-9264-5113 Abstract: Antimicrobial resistance in Gram-negative pathogens represents a global threat to human health. This study determines the antimicrobial potential of a taxonomically and geographically diverse collection of 263 Burkholderia (sensu lato) isolates and applies natural product dereplication strategies to identify potentially novel molecules. Antimicrobial activity is almost exclusively present in Burkholderia sensu stricto bacteria and rarely observed in the novel genera Paraburkholderia, Caballero- nia, Robbsia, Trinickia, and Mycetohabitans. Fourteen isolates show a unique spectrum of antimicrobial activity and inhibited carbapenem-resistant Gram-negative bacterial pathogens. Dereplication of the molecules present in crude spent agar extracts identifies 42 specialized metabolites, 19 of which repre- sented potentially novel molecules. The known identified Burkholderia metabolites include toxoflavin, reumycin, pyrrolnitrin,
    [Show full text]