Identification and Classification of Known and Putative Antimicrobial Compounds Produced by a Wide Variety of Bacillales Species Zhao, Xin; Kuipers, Oscar P
Total Page:16
File Type:pdf, Size:1020Kb
University of Groningen Identification and classification of known and putative antimicrobial compounds produced by a wide variety of Bacillales species Zhao, Xin; Kuipers, Oscar P Published in: BMC Genomics DOI: 10.1186/s12864-016-3224-y IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document version below. Document Version Publisher's PDF, also known as Version of record Publication date: 2016 Link to publication in University of Groningen/UMCG research database Citation for published version (APA): Zhao, X., & Kuipers, O. P. (2016). Identification and classification of known and putative antimicrobial compounds produced by a wide variety of Bacillales species. BMC Genomics, 17(1), [882]. https://doi.org/10.1186/s12864-016-3224-y Copyright Other than for strictly personal use, it is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license (like Creative Commons). Take-down policy If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim. Downloaded from the University of Groningen/UMCG research database (Pure): http://www.rug.nl/research/portal. For technical reasons the number of authors shown on this cover page is limited to 10 maximum. Download date: 28-09-2021 Zhao and Kuipers BMC Genomics (2016) 17:882 DOI 10.1186/s12864-016-3224-y RESEARCH ARTICLE Open Access Identification and classification of known and putative antimicrobial compounds produced by a wide variety of Bacillales species Xin Zhao1,2 and Oscar P. Kuipers1* Abstract Background: Gram-positive bacteria of the Bacillales are important producers of antimicrobial compounds that might be utilized for medical, food or agricultural applications. Thanks to the wide availability of whole genome sequence data and the development of specific genome mining tools, novel antimicrobial compounds, either ribosomally- or non-ribosomally produced, of various Bacillales species can be predicted and classified. Here, we provide a classification scheme of known and putative antimicrobial compounds in the specific context of Bacillales species. Results: We identify and describe known and putative bacteriocins, non-ribosomally synthesized peptides (NRPs), polyketides (PKs) and other antimicrobials from 328 whole-genome sequenced strains of 57 species of Bacillales by using web based genome-mining prediction tools. We provide a classification scheme for these bacteriocins, update the findings of NRPs and PKs and investigate their characteristics and suitability for biocontrol by describing per class their genetic organization and structure. Moreover, we highlight the potential of several known and novel antimicrobials from various species of Bacillales. Conclusions: Our extended classification of antimicrobial compounds demonstrates that Bacillales provide a rich source of novel antimicrobials that can now readily be tapped experimentally, since many new gene clusters are identified. Keywords: Antimicrobials, Bacillales, Bacillus, Genome-mining, Lanthipeptides, Sactipeptides, Thiopeptides, NRPs, PKs Background (bacteriocins) [4], as well as non-ribosomally synthesized Most of the species of the genus Bacillus and related peptides (NRPs) and polyketides (PKs) [5]. Firmicutes are Gram-positive, aerobic endospore-forming The discovery of biosynthetic gene clusters of antimicro- and rod-shaped bacteria, which are found in diverse bial compounds by genome mining is a rewarding task, environments such as soil and clay, rocks, dust, aquatic because this methodology can lead to the identification and environments, on vegetation, in food and in the gastro- subsequent isolation of novel molecules of pharmacological intestinal tracts of various insects and animals [1]. and biotechnological interest [6]. Various powerful tools Antimicrobial compounds have been used for a variety of with broad databases have been created for the automated purposes, such as delaying spoilage by plant pathogens in screening of bacteriocin gene clusters. BAGEL3 (http:// agriculture and extending product shelf life in the food bagel.molgenrug.nl/) is a versatile fast genome-mining tool industry [2, 3]. In particular, Bacillus strains are known to valid not only for modified- and non-modified bacteriocins, produce a wide variety of biocontrol metabolites, inclu- but also for non-bactericidal ribosomally produced and ding the ribosomally synthesized antimicrobial peptides posttranslationally modified peptides (RiPPs) [7]. The corre- sponding database [7] contains all the records belonging to one of the three classes of proteins being core to BAGEL3: * Correspondence: [email protected] Class I contains RiPPs of less than 10 kDa, which currently 1Department of Molecular Genetics, University of Groningen, Nijenborgh 7, Groningen 9747AG, The Netherlands is divided into more than 12 supported subclasses; Class II Full list of author information is available at the end of the article © The Author(s). 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. Zhao and Kuipers BMC Genomics (2016) 17:882 Page 2 of 18 contains unmodified peptides not fitting the criteria of the Fig. 1). We also highlight examples of each class by describ- first database; Class III contains antimicrobial proteins ing the genetics and structure, with a keen eye on biocontrol larger than 10 kDa. BAGEL3 uses DNA nucleotide se- properties and applications. Within the genus Bacillus, B. quences in FASTA format as input; multiple sequence subtilis, B. amyloliquefaciens, B. licheniformis, B. cereus and entries per file are allowed. The input DNA sequences are B. thuringiensis are the best studied species for antimicro- analyzed in parallel via two different approaches; one is the bials production [12]. Genome mining and subsequent context of bacteriocin- or RiPP gene-based mining, the analyses and classification ofantimicrobialsofotherless other is precursor (structural gene)-based mining directly explored Bacillales, including Paenibacillus, Brevibacillus, by Glimmer, which increases the success rate and lowers Alicyclolacillus, Anoxybacillus, Lysinibacillus and Geobacil- the need for manual evaluation of results [7]. The output is lus will be also included in this analysis, revealing interesting visualized in an html page, by a table of putative bac- new features and distributions. teriocins or modified peptides classified into the detailed bacteriocin class found in the mining sequence; graphics of Results gene clusters; annotation of each ORF in the context; as Classification of antimicrobial peptides encountered in well as detailed information of putative bacteriocins, such as Bacillales BLAST hits in the bacteriocin database, or the pI (Isoelec- The main classification scheme for ribosomally synthe- tric point) value. A detailed prediction of the gene clusters sized antimicrobial peptides currently available is that of of NRPs, PKs and other antimicrobials is provided by anti- the lactic acid bacteria (LAB) bacteriocins [13], which was SMASH (http://antismash.secondarymetabolites.org), a web recently reviewed and revised by Alvarez-Sieiro et al. [14]. server and stand-alone tool for the automatic genomic iden- The main classification scheme for RiPPs (Class I) was tification and analysis of biosynthetic gene clusters [8–10]. provided by the paper of Arnison et al. [15]. Although A database of classes specific for many types [10] of biosyn- some bacteriocins produced by Bacillus are similar to thesis signature genes is constructed by Hidden Markov those of LAB’s, the Bacillus antimicrobial compound Models (pHMMs) covering a wide range of known or classification system now is lagging behind that of LAB putative secondary metabolite compounds. The anti- classifications. Conveniently, BAGEL3 can be used for SMASH web server allows uploading of sequence files of mining bacteriocin gene clusters, some of which were not not only a variety of types (FASTA, GBK, or EMBL files), identified before. Moreover, some cryptic gene clusters of but also GenBank/RefSeq accession numbers. Gene clusters bacteriocins were identified that have not been isolated are first predicted and identified by Glimmer and pHMMs, yet from wild type microorganisms. In this study, we respectively. Subsequently, several downstream analyses can identified 583 putative bacteriocin gene clusters from 328 be performed by different modules: NRPS/PKS domain strains of 57 species of Bacillales (Table 1), and these gene analysis and annotation; prediction of the core chemical clusters were further classified into three classes harboring structure of PKSs and NRPSs; ClusterBlast gene cluster 46 types of bacteriocins covering 50 species of Bacillales comparative analysis; active enzyme site analysis; and (Additional file 2: Table S2) according