The Classification of Orphans Is Improved by Combining Searches in Both Proteomes and Genomes

Total Page:16

File Type:pdf, Size:1020Kb

The Classification of Orphans Is Improved by Combining Searches in Both Proteomes and Genomes bioRxiv preprint doi: https://doi.org/10.1101/185983; this version posted May 19, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. The classification of orphans is improved by combining searches in both proteomes and genomes. Walter Basile1,2, Marco Salvatore1,2, Arne Elofsson1,2,3,*, 1 Science for Life Laboratory, Stockholm University SE-171 21 Solna, Sweden 2 Department of Biochemistry and Biophysics, Stockholm University, SE-106 91 Stockholm, Sweden 3 Swedish e-Science Research Center (SeRC) * Corresponding author: [email protected] Abstract The identification of de novo created genes is important as it provides a glimpse on the evolutionary processes of gene creation. Potential de novo created genes are identified by selecting genes that have no homologs outside a particular species, but for an accurate detection this identification needs to be correct. Genes without any homologs are often referred to as orphans; in addition to de novo created ones, fast evolving genes or genes lost in all related genomes might also be classified as orphans. The identification of orphans is dependent on: (i) a method to detect homologs and (ii) a database including genes from related genomes. Here, we set out to investigate how the detection of orphans is influenced by these two factors. Using Saccharomyces cerevisiae we identify that best strategy is to use a combination of searching annotated proteins and a six-frame translation of all ORFs from closely related genomes. Using this strategy we obtain a set of 54 orphans in Drosophila melanogaster and 38 in Drosophila pseudoobscura, significantly less than what is reported in some earlier studies. Introduction 1 Even after twenty years of genome sequencing some protein-coding genes appear to have 2 no homologs. These genes are often referred to as orphans, a definition that dates back 3 to the sequencing of Saccharomyces cerevisiae [1{3]. It was believed that most of them 4 would disappear when more genomes would be sequenced. However that is not the case, 5 even after thousands of genomes have been sequenced [4{7]. After the initial studies in 6 yeast, orphans have also been identified in Drosophila [8{10], mammals [11], 7 primates [12{16], plants [17,18], Bacteria [19{21] and viruses [22,23]. 8 The term \orphan" might refer to genes without any detectable homologs or it might 9 refer to only de novo created genes. The first definition would include genes that have 10 homologs that all are too distant to be detected, possibly due to extensive gene loss in 11 closely related species, lack of closely related species in the database searched, or 12 because of fast evolution of the gene. Given better methods to detect homology and the 13 inclusion of more closely related species in the database, the number of non-de novo 14 created orphans should decrease. 15 If a gene truly does not have any homologs it must have been created by some 16 mutations turning a non-coding DNA region into a protein coding gene. It is likely that 17 PLOS 1/21 bioRxiv preprint doi: https://doi.org/10.1101/185983; this version posted May 19, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. this is a widespread phenomenon, at least in yeast [24]. However, in evolutionary terms 18 only a few of these genes will get fixed in the population. It is plausible that every 19 protein superfamily was created by such mechanisms. However, it is also possible that 20 at least some superfamilies are created by rearranging shorter stretches of coding 21 regions [25]. A correct identification of orphans is therefore important for understanding 22 the selective pressures affecting novel gene creation throughout evolutionary history. In 23 particular, an imprecise detection, that includes fast evolving genes in the orphan set, 24 might cause the incorrect picture that de novo created genes have the same properties 25 as fast evolving genes. 26 In some studies, different levels of orphanicity have been used [4,26{28]. Here, a gene 27 is assigned to have a certain age - given that homologs are only found within genomes 28 belonging to a certain clade. For instance, a gene can be specific to primates, to 29 mammals or eukaryotes. The correct classification of the age of a gene is difficult, as it 30 would require accurate detection of very distant homologs. If this is not done correctly, 31 fast-evolving genes will appear to be younger than they are. Taken these difficulties into 32 account we only study the most recent orphans and have not used gene age in this study. 33 In short, the identification of orphans starts by searching for homologous genes in a 34 set of related genomes and then labeling as orphans the genes lacking hits. The results 35 will be dependent on the database used and the homology detection method. In general, 36 better homology detection [29{31] would potentially reduce the number of genes 37 assigned as orphans [4]. 38 Another complication when it comes to identification of de novo created genes is 39 that it is almost impossible to know if an ORF is a functional gene from its genomic 40 sequence alone. For genes with homologs conservation across species is the best method 41 to identify if a gene is functionally important or not, but by definition this is not 42 possible for orphans. An ORF might look like a gene but is never expressed under any 43 circumstances, i.e. not being functional. Alternatively, an ORF can be missed as it 44 appears not to be coding. This difficulty applies both to the ORFs in the query species 45 and to the genomes searched. Therefore, just having a hit to a genomic location does 46 not guarantee that the corresponding region is an actual functional gene [5,28, 32]. 47 However, the reverse is also possible, i.e. that a hit to a non-functional ORF represents 48 homology with a common functional ancestral gene, i.e. the function was lost in the 49 other species but present in the ancestral one. Other factors, such as the removal of 50 CDS overlapping annotated genes, the study of the syntenic regions, or experimental 51 studies, improve the efficacy of detection of de novo created genes. 52 Several different estimates of the orphans of S. cerevisiae have been 53 proposed [4, 5, 27, 28,33]. Surprisingly, the obtained set of orphans is quite different 54 between these studies. One of the reasons of these differences is the use of different 55 datasets [28]. For example all S. cerevisiae ORFs are considered in Carvunis et al. [5], 56 but only the annotated ones are present in Ekman et al. [4]. 57 The goal of this study was to develop a robust strategy for orphan detection in 58 sequenced genomes. We used a set of homology search methods (blastn, blastp and 59 tblastn ) and compared the classified orphans with earlier sets. To avoid some of the 60 problems described above we decided to: (i) use only annotated genes, (ii) assume that 61 every significant hit in a related genome shows that a gene is not de novo created, 62 although we know that this would not identify very recently de novo created genes, (iii) 63 use a similar set of genomes as in earlier studies. We then explore how the inclusion of 64 different genomes and the use of different homology detection tools influence the 65 detection of orphans first in S. cerevisiae and thereafter in D. melanogaster and D. 66 pseudoobscura. At the end we propose a strategy that combines the use of tblastn and 67 blastp. Using this strategy a conservative estimate of the number of in Drosophila 68 melanogaster and Drosophila pseudoobscura is obtained. 69 PLOS 2/21 bioRxiv preprint doi: https://doi.org/10.1101/185983; this version posted May 19, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Materials and Methods 70 S. cerevisiae data 71 5,917 Saccharomyces cerevisiae protein-coding genes (reference strain s288c, NCBI 72 taxon id 559292) were downloaded from the Saccharomyces Genome Database 73 (SGD) [34]. Only coding and annotated ORFs were considered, and we also removed 74 genes labeled as \dubious", mitochondrial or 2-micron plasmid encoded genes, resulting 75 in a set of 5,894 genes. 76 For homology detection we used fourteen genomes from the Saccharomycetales order, 77 see Table 1. Five of these genomes belong to the Saccharomyces genus. All genomes are 78 fully sequenced with an assembly status equal to \contigs", \scaffold", \chromosomes" 79 or \complete genome". Full genome sequences were obtained from NCBI Genome 80 Project, while proteomes were downloaded from Uniprot [35]. It should be noted that 81 many better annotated fungal genomes are available today. However, we choose to use 82 this set, as it is representative of what was used in earlier studies. 83 Estimation of evolutionary distances between genomes 84 To estimate the phylogenetic distance between the genomes we used the amino acid 85 sequence of the well-conserved gene mre11 (nuclease subunit of the MRX complex, 86 involved in DNA repair) as a proxy for evolutionary distance.
Recommended publications
  • (Vles) in the Yeast Debaryomyces Hansenii
    toxins Article New Cytoplasmic Virus-Like Elements (VLEs) in the Yeast Debaryomyces hansenii Xymena Połomska 1,* ,Cécile Neuvéglise 2, Joanna Zyzak 3, Barbara Zarowska˙ 1, Serge Casaregola 4 and Zbigniew Lazar 1 1 Department of Biotechnology and Food Microbiology, Faculty of Biotechnology and Food Science, Wrocław University of Environmental and Life Sciences (WUELS), 50-375 Wroclaw, Poland; [email protected] (B.Z.);˙ [email protected] (Z.L.) 2 SPO, INRAE, Montpellier SupAgro, Université de Montpellier, 34060 Montpellier, France; [email protected] 3 Department of Microbiology, Laboratory of Microbiome Immunobiology, Ludwik Hirszfeld Institute of Immunology and Experimental Therapy, Polish Academy of Sciences, 53-114 Wroclaw, Poland; [email protected] 4 INRAE, AgroParisTech, Micalis Institute, CIRM-Levures, Université Paris-Saclay, 78350 Jouy-en-Josas, France; [email protected] * Correspondence: [email protected]; Tel.: +48-71-3207-791 Abstract: Yeasts can have additional genetic information in the form of cytoplasmic linear dsDNA molecules called virus-like elements (VLEs). Some of them encode killer toxins. The aim of this work was to investigate the prevalence of such elements in D. hansenii killer yeast deposited in culture collections as well as in strains freshly isolated from blue cheeses. Possible benefits to the host from harboring such VLEs were analyzed. VLEs occurred frequently among fresh D. hansenii isolates (15/60 strains), as opposed to strains obtained from culture collections (0/75 strains). Eight new different systems were identified: four composed of two elements and four of three elements. Full sequences of three new VLE systems obtained by NGS revealed extremely high conservation Citation: Połomska, X.; Neuvéglise, among the largest molecules in these systems except for one ORF, probably encoding a protein C.; Zyzak, J.; Zarowska,˙ B.; resembling immunity determinant to killer toxins of VLE origin in other yeast species.
    [Show full text]
  • Comparative Genomics of Protoploid Saccharomycetaceae
    Downloaded from genome.cshlp.org on October 5, 2021 - Published by Cold Spring Harbor Laboratory Press Evolution of protoploid yeast genomes ___________________________________________________________________________ Comparative genomics of protoploid Saccharomycetaceae. The Génolevures Consortium (1) Running title: Evolution of protoploid yeast genomes Key words: protein families, synteny, tandems, annotation, SONS, ancestor genome Corresponding author: Jean Luc Souciet Université de Strasbourg, CNRS, UMR 7156 Institut de Botanique, 28 rue Goethe, F-67000 Strasbourg, France Tel: 33 3 90 24 18 17 FAX: 33 3 90 24 20 28 e-mail: [email protected] (1) List of participants and affiliations appear at the end of the paper 1 Downloaded from genome.cshlp.org on October 5, 2021 - Published by Cold Spring Harbor Laboratory Press Evolution of protoploid yeast genomes ___________________________________________________________________________ Abstract Our knowledge on yeast genomes remains largely dominated by the extensive studies on Saccharomyces cerevisiae and the consequences of its ancestral duplication, leaving the evolution of the entire class of hemiascomycetes only partly explored. We concentrate here on five species of Saccharomycetaceae, a large subdivision of hemiascomycetes, that we call “protoploid” because they diverged from the S. cerevisiae lineage prior to its genome duplication. We determined the complete genome sequences of three of these species, Kluyveromyces (Lachancea) thermotolerans and Saccharomyces (Lachancea) kluyveri (two members of the newly described Lachancea clade) and Zygosaccharomyces rouxii. We included in our comparisons the previously available sequences of Klyveromyces lactis and Ashbya (Eremothecium) gossypii. Despite their broad evolutionary range and significant individual variations in each lineage, the five protoploid Saccharomycetaceae share a core repertoire of ca.
    [Show full text]
  • Global Expression Analysis of the Yeast Lachancea
    Washington University School of Medicine Digital Commons@Becker Open Access Publications 11-2013 Global expression analysis of the yeast Lachancea (saccharomyces) kluyveri reveals new URC genes involved in pyrimidine catabolism Anna Andersson Rasmussen Lund University Dineshkumar Kandasamy Lund University Halfdan Beck Lund University Seth D. Crosby Washington University School of Medicine in St. Louis Olof Bjornberg Lund University See next page for additional authors Follow this and additional works at: https://digitalcommons.wustl.edu/open_access_pubs Recommended Citation Rasmussen, Anna Andersson; Kandasamy, Dineshkumar; Beck, Halfdan; Crosby, Seth D.; Bjornberg, Olof; Schnackerz, Klaus D.; and Piskur, Jure, ,"Global expression analysis of the yeast Lachancea (saccharomyces) kluyveri reveals new URC genes involved in pyrimidine catabolism." Eukaryotic Cell.13,1. 31-42. (2013). https://digitalcommons.wustl.edu/open_access_pubs/2116 This Open Access Publication is brought to you for free and open access by Digital Commons@Becker. It has been accepted for inclusion in Open Access Publications by an authorized administrator of Digital Commons@Becker. For more information, please contact [email protected]. Authors Anna Andersson Rasmussen, Dineshkumar Kandasamy, Halfdan Beck, Seth D. Crosby, Olof Bjornberg, Klaus D. Schnackerz, and Jure Piskur This open access publication is available at Digital Commons@Becker: https://digitalcommons.wustl.edu/open_access_pubs/2116 Global Expression Analysis of the Yeast Lachancea (Saccharomyces) kluyveri Reveals New URC Genes Involved in Pyrimidine Catabolism Anna Andersson Rasmussen, Dineshkumar Kandasamy, Downloaded from Halfdan Beck, Seth D. Crosby, Olof Björnberg, Klaus D. Schnackerz and Jure Piskur Eukaryotic Cell 2014, 13(1):31. DOI: 10.1128/EC.00202-13. Published Ahead of Print 1 November 2013. http://ec.asm.org/ Updated information and services can be found at: http://ec.asm.org/content/13/1/31 These include: SUPPLEMENTAL MATERIAL Supplemental material on January 28, 2014 by Washington University in St.
    [Show full text]
  • Conservation of Mrna Quality Control Factor Ski7 and Its Diversification Through Changes in Alternative Splicing and Gene Duplication
    Conservation of mRNA quality control factor Ski7 and its diversification through changes in alternative splicing and gene duplication Alexandra N. Marshalla, Jaeil Hana, Minseon Kima, and Ambro van Hoofa,1 aDepartment of Microbiology and Molecular Genetics, University of Texas Health Science Center at Houston and University of Texas Graduate School of Biomedical Sciences, Houston, TX 77030 Edited by Lynne E. Maquat, University of Rochester School of Medicine and Dentistry, Rochester, NY, and approved May 7, 2018 (received for review February 2, 2018) Eukaryotes maintain fidelity of gene expression by preferential Ski7 function was initially thought to have arisen after the du- degradation of aberrant mRNAs that arise by errors in RNA plication, we have previously shown that the single Lachancea processing reactions. In Saccharomyces cerevisiae, Ski7 plays an kluyveri ortholog can carry out both Ski7 and Hbs1 functions and important role in this mRNA quality control by mediating mRNA thus, that Ski7 function was already present before whole ge- degradation by the RNA exosome. Ski7 was initially thought to be nome duplication (8, 10, 11). In addition, we have shown that the restricted to Saccharomyces cerevisiae and close relatives because L. kluyveri gene encodes two different proteins through alter- the SKI7 gene and its paralog HBS1 arose by whole genome du- native splicing with a shorter splice isoform performing the plication (WGD) in a recent ancestor. We have recently shown that function of HBS1 and a longer splice isoform performing the the preduplication gene was alternatively spliced and that Ski7 function of SKI7 (10). Furthermore, based on sequence analysis of function predates WGD.
    [Show full text]
  • Searching for Telomerase Rnas in Saccharomycetes
    bioRxiv preprint doi: https://doi.org/10.1101/323675; this version posted May 16, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Article TERribly Difficult: Searching for Telomerase RNAs in Saccharomycetes Maria Waldl 1,†, Bernhard C. Thiel 1,†, Roman Ochsenreiter 1, Alexander Holzenleiter 2,3, João Victor de Araujo Oliveira 4, Maria Emília M. T. Walter 4, Michael T. Wolfinger 1,5* ID , Peter F. Stadler 6,7,1,8* ID 1 Institute for Theoretical Chemistry, University of Vienna, Währingerstraße 17, A-1090 Wien, Austria; {maria,thiel,romanoch}@tbi.univie.ac.at, michael.wolfi[email protected] 2 BioInformatics Group, Fakultät CB Hochschule Mittweida, Technikumplatz 17, D-09648 Mittweida, Germany; [email protected] 3 Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstraße 16-18, D-04107 Leipzig, Germany 4 Departamento de Ciência da Computação, Instituto de Ciências Exatas, Universidade de Brasília; [email protected], [email protected] 5 Center for Anatomy and Cell Biology, Medical University of Vienna, Währingerstraße 13, 1090 Vienna, Austria 6 German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Competence Center for Scalable Data Services and Solutions, and Leipzig Research Center for Civilization Diseases, University Leipzig, Germany 7 Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, D-04103 Leipzig, Germany 8 Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe, NM 87501 * Correspondence: MTW michael.wolfi[email protected]; PFS [email protected] † These authors contributed equally to this work.
    [Show full text]
  • Evaluating Accessibility, Usability and Interoperability of Genome-Scale Metabolic Models for Diverse Yeasts Species
    Evaluating accessibility, usability and interoperability of genome-scale metabolic models for diverse yeasts species Downloaded from: https://research.chalmers.se, 2021-09-30 12:00 UTC Citation for the original published paper (version of record): Domenzain Del Castillo Cerecer, I., Li, F., Kerkhoven, E. et al (2021) Evaluating accessibility, usability and interoperability of genome-scale metabolic models for diverse yeasts species FEMS Yeast Research, 21(1) http://dx.doi.org/10.1093/femsyr/foab002 N.B. When citing this work, cite the original published paper. research.chalmers.se offers the possibility of retrieving research publications produced at Chalmers University of Technology. It covers all kind of research output: articles, dissertations, conference papers, reports etc. since 2004. research.chalmers.se is administrated and maintained by Chalmers Library (article starts on next page) FEMS Yeast Research, 21, 2021, foab002 doi: 10.1093/femsyr/foab002 Advance Access Publication Date: 11 January 2021 Minireview MINIREVIEW Evaluating accessibility, usability and interoperability of genome-scale metabolic models for diverse yeasts species Ivan´ Domenzain1,2,†,‡,FeiranLi1,2,†,§, Eduard J. Kerkhoven1,2 and Verena Siewers1,2,* 1Department of Biology and Biological Engineering, Chalmers University of Technology, Kemivagen¨ 10, SE-412 96, Gothenburg, Sweden and 2Novo Nordisk Foundation Center for Biosustainability, Chalmers University of Technology, Kemivagen¨ 10, SE-412 96, Gothenburg, Sweden ∗Corresponding author: Department of Biology and Biological Engineering, Chalmers University of Technology, Kemivagen¨ 10, SE-412 96 Gothenburg, Sweden. E-mail: [email protected] One sentence summary: Review on computational models of metabolism for diverse yeast species, their development history, applications and critical assessment of their accessibility, usability and interoperability.
    [Show full text]
  • The Genetics of Non-Conventional Wine Yeasts: Current Knowledge and Future Challenges
    MINI REVIEW published: 11 January 2016 doi: 10.3389/fmicb.2015.01563 The Genetics of Non-conventional Wine Yeasts: Current Knowledge and Future Challenges Isabelle Masneuf-Pomarede 1, 2, Marina Bely 1, Philippe Marullo 1, 3 and Warren Albertin 1, 4* 1 ISVV, Unité de Recherche Œnologie EA 4577, USC 1366 Institut National de la Recherche Agronomique, Bordeaux INP, University Bordeaux, Villenave d’Ornon, France, 2 Bordeaux Sciences Agro, Gradignan, France, 3 Biolaffort, Bordeaux, France, 4 ENSCBP, Bordeaux INP, Pessac, France Saccharomyces cerevisiae is by far the most widely used yeast in oenology. However, during the last decade, several other yeasts species has been purposed for winemaking as they could positively impact wine quality. Some of these non-conventional yeasts (Torulaspora delbrueckii, Metschnikowia pulcherrima, Pichia kluyveri, Lachancea thermotolerans, etc.) are now proposed as starters culture for winemakers in mixed fermentation with S. cerevisiae, and several others are the subject of various studies (Hanseniaspora uvarum, Starmerella bacillaris, etc.). Along with their biotechnological use, the knowledge of these non-conventional yeasts greatly increased these last 10 years. The aim of this review is to describe the last updates and the current state-of-art of Edited by: the genetics of non-conventional yeasts (including S. uvarum, T. delbrueckii, S. bacillaris, Gemma Beltran, Universitat Rovira i Virgili, Spain etc.). We describe how genomics and genetics tools provide new data into the population Reviewed by: structure and biodiversity of non-conventional yeasts in winemaking environments. Aspasia Nisiotou, Future challenges will lie on the development of selection programs and/or genetic Technological Educational Institute of improvement of these non-conventional species.
    [Show full text]
  • Whole-Genome Sequencing and Intraspecific Analysis of the Yeast Species Lachancea Quebecensis
    Whole-genome sequencing and intraspecific analysis of the yeast species Lachancea quebecensis. Kelle C Freel, Anne Friedrich, Véronique Sarilar, Hugo Devillers, Cécile Neuvéglise, Joseph Schacherer To cite this version: Kelle C Freel, Anne Friedrich, Véronique Sarilar, Hugo Devillers, Cécile Neuvéglise, et al.. Whole- genome sequencing and intraspecific analysis of the yeast species Lachancea quebecensis.. Genome Biology and Evolution, Society for Molecular Biology and Evolution, 2016, 8 (3), pp.733-741. 10.1093/gbe/evv262. hal-02635461 HAL Id: hal-02635461 https://hal.inrae.fr/hal-02635461 Submitted on 27 May 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Genome Biology and Evolution Advance Access published January 5, 2016 doi:10.1093/gbe/evv262 Whole-genome sequencing and intraspecific analysis of the yeast species Lachancea quebecensis Downloaded from http://gbe.oxfordjournals.org/ Kelle C. Freel1†, Anne Friedrich1†, Véronique Sarilar2,3, Hugo Devillers2,3, Cécile Neuvéglise2,3 and Joseph Schacherer1* at INRA-VERS-V.G.M.bioinf on February 15, 2016 1. Department of Genetics, Genomics and Microbiology, University of Strasbourg - CNRS, UMR7156, Strasbourg, France 2. INRA, UMR1319 Micalis, Jouy-en-Josas, France 3. AgroParisTech, UMR1319 Micalis, Jouy-en-Josas, France †These authors contributed equally to this work *Author for Correspondence: Joseph Schacherer, Department of Genetics, Genomics and Microbiology, University of Strasbourg, Strasbourg, France, +33 03 68 85 19 61, [email protected] Short title: Population genomics of L.
    [Show full text]
  • D-Fructose Assimilation and Fermentation by Yeasts
    microorganisms Article D-Fructose Assimilation and Fermentation by Yeasts Belonging to Saccharomycetes: Rediscovery of Universal Phenotypes and Elucidation of Fructophilic Behaviors in Ambrosiozyma platypodis and Cyberlindnera americana Rikiya Endoh *, Maiko Horiyama and Moriya Ohkuma Microbe Division/Japan Collection of Microorganisms, RIKEN BioResource Research Center (RIKEN BRC-JCM), 3-1-1 Koyadai, Tsukuba, Ibaraki 305-0074, Japan; [email protected] (M.H.); [email protected] (M.O.) * Correspondence: [email protected] Abstract: The purpose of this study was to investigate the ability of ascomycetous yeasts to as- similate/ferment D-fructose. This ability of the vast majority of yeasts has long been neglected since the standardization of the methodology around 1950, wherein fructose was excluded from the standard set of physiological properties for characterizing yeast species, despite the ubiquitous presence of fructose in the natural environment. In this study, we examined 388 strains of yeast, mainly belonging to the Saccharomycetes (Saccharomycotina, Ascomycota), to determine whether they can assimilate/ferment D-fructose. Conventional methods, using liquid medium containing Citation: Endoh, R.; Horiyama, M.; yeast nitrogen base +0.5% (w/v) of D-fructose solution for assimilation and yeast extract-peptone Ohkuma, M. D-Fructose Assimilation +2% (w/v) fructose solution with an inverted Durham tube for fermentation, were used. All strains and Fermentation by Yeasts examined (n = 388, 100%) assimilated D-fructose, whereas 302 (77.8%) of them fermented D-fructose. Belonging to Saccharomycetes: D D Rediscovery of Universal Phenotypes In addition, almost all strains capable of fermenting -glucose could also ferment -fructose. These and Elucidation of Fructophilic results strongly suggest that the ability to assimilate/ferment D-fructose is a universal phenotype Behaviors in Ambrosiozyma platypodis among yeasts in the Saccharomycetes.
    [Show full text]
  • Unlocking a Signal of Introgression from Codons in Lachancea Kluyveri Using a Mutation-Selection Model Cedric Landerer1,2,3* , Brian C
    Landerer et al. BMC Evolutionary Biology (2020) 20:109 https://doi.org/10.1186/s12862-020-01649-w RESEARCH ARTICLE Open Access Unlocking a signal of introgression from codons in Lachancea kluyveri using a mutation-selection model Cedric Landerer1,2,3* , Brian C. O’Meara1,2, Russell Zaretzki2,4 and Michael A. Gilchrist1,2 Abstract Background: For decades, codon usage has been used as a measure of adaptation for translational efficiency and translation accuracy of a gene’s coding sequence. These patterns of codon usage reflect both the selective and mutational environment in which the coding sequences evolved. Over this same period, gene transfer between lineages has become widely recognized as an important biological phenomenon. Nevertheless, most studies of codon usage implicitly assume that all genes within a genome evolved under the same selective and mutational environment, an assumption violated when introgression occurs. In order to better understand the effects of introgression on codon usage patterns and vice versa, we examine the patterns of codon usage in Lachancea kluyveri, a yeast which has experienced a large introgression. We quantify the effects of mutation bias and selection for translation efficiency on the codon usage pattern of the endogenous and introgressed exogenous genes using a Bayesian mixture model, ROC SEMPPR, which is built on mechanistic assumptions about protein synthesis and grounded in population genetics. Results: We find substantial differences in codon usage between the endogenous and exogenous genes, and show that these differences can be largely attributed to differences in mutation bias favoring A/T ending codons in the endogenous genes while favoring C/G ending codons in the exogenous genes.
    [Show full text]
  • Extraordinary Diversity of Telomeres, Telomerase Rnas and Their
    www.nature.com/scientificreports OPEN Extraordinary diversity of telomeres, telomerase RNAs and their template regions in Saccharomycetaceae Vratislav Peska1*, Petr Fajkus1,3, Michal Bubeník1,3, Václav Brázda1, Natália Bohálová1,2, Vojtěch Dvořáček1, Jiří Fajkus1,3 & Sònia Garcia4* Telomerase RNA (TR) carries the template for synthesis of telomere DNA and provides a scafold for telomerase assembly. Fungal TRs are long and have been compared to higher eukaryotes, where they show considerable diversity within phylogenetically close groups. TRs of several Saccharomycetaceae were recently identifed, however, many of these remained uncharacterised in the template region. Here we show that this is mainly due to high variability in telomere sequence. We predicted the telomere sequences using Tandem Repeats Finder and then we identifed corresponding putative template regions in TR candidates. Remarkably long telomere units and the corresponding putative TRs were found in Tetrapisispora species. Notably, variable lengths of the annealing sequence of the template region (1–10 nt) were found. Consequently, species with the same telomere sequence may not harbour identical TR templates. Thus, TR sequence alone can be used to predict a template region and telomere sequence, but not to determine these exactly. A conserved feature of telomere sequences, tracts of adjacent Gs, led us to test the propensity of individual telomere sequences to form G4. The results show highly diverse values of G4-propensity, indicating the lack of ubiquitous conservation of this feature across Saccharomycetaceae. Chromosomes are capped by nucleoprotein structures called telomeres, which are involved in several important functions such as the regulation of gene expression, recombination, mitosis and meiosis1–4.
    [Show full text]
  • Mating-Type Switching by Chromosomal Inversion In
    Mating-type switching by chromosomal inversion PNAS PLUS in methylotrophic yeasts suggests an origin for the three-locus Saccharomyces cerevisiae system Sara J. Hanson, Kevin P. Byrne, and Kenneth H. Wolfe1 UCD Conway Institute, School of Medicine and Medical Science, University College Dublin, Dublin 4, Ireland Edited by Jasper Rine, University of California, Berkeley, CA, and approved October 7, 2014 (received for review August 19, 2014) Saccharomyces cerevisiae has a complex system for switching the S. cerevisiae (subphylum Saccharomycotina) and Schizosaccharomyces mating type of haploid cells, requiring the genome to have three pombe (Taphrinomycotina). A large clade of nonswitching fil- mating-type (MAT)–like loci and a mechanism for silencing two of amentous ascomycetes, the Pezizomycotina, is related more closely them. How this system originated is unknown, because the three- to S. cerevisiae than to S. pombe (Fig. 1). Despite their apparently locus system is present throughout the family Saccharomyceta- independent origins, the switching systems of S. cerevisiae (3) and ceae, whereas species in the sister Candida clade have only one S. pombe (4, 5) have many parallels, including the presence of three locus and do not switch. Here we show that yeasts in a third clade, MAT-like loci (two of which are silenced) with triplicated guide the methylotrophs, have a simpler two-locus switching system sequences, a mechanism for specifically inducing a DNA break at based on reversible inversion of a section of chromosome with MAT, donor bias (27), and cell-lineage tracking. However, the MATa genes at one end and MATalpha genes at the other end. details of switching and silencing are quite different in the two In Hansenula polymorpha the 19-kb invertible region lies beside species (3, 4).
    [Show full text]