A Study on the Intron Landscape of the Cytochrome B Gene, and Mitochondrial-Encoded N-Acetyltransferase and Ribosomal Protein S3 Genes
Total Page:16
File Type:pdf, Size:1020Kb
A study on the intron landscape of the cytochrome b gene, and mitochondrial-encoded N-acetyltransferase and ribosomal protein S3 genes of fungi in the subphylum Pezizomycotina by Alvan Wai A Thesis to the Faculty of Graduate Studies of The University of Manitoba in partial fulfilment of the requirements of the degree of MASTER OF SCIENCE Department of Microbiology University of Manitoba Winnipeg, Manitoba, Canada Copyright © 2018 by Alvan Wai Abstract The Master’s program was based on mainly in silico work, employing several bioinformatics techniques/tools. The main focus of the research involved comparative sequence analysis of fungal mitochondrial genomes, which has previously been noted to be a rich source of mobile elements. The work is divided into two projects. The first was the study of intron distribution within the cytochrome b gene of filamentous fungi of the phylum Ascomycota. Cytochrome b genes (129) were sampled from the public database, GenBank, and compared via multiple sequence alignments. A total of 21 unique intron insertion sites were observed. Two introns displaying unique intron arrangements were explored in greater depths. One appears to be capable of alternative splicing, the other encodes an open reading frame interrupted by an intron. The results of the first project demonstrate the diversity of intron insertions within a single gene and its impact on genome evolution, and the potential to uncover novel introns. The second project was based on prior research that noted the presence/absence of N-acetyltransferase and ribosomal protein S3 genes within fungal mitochondrial genomes of the Ascomycota. In order to determine the uniqueness of this presence/absence, the protein sequences of the two genes were sampled from two different databases, GenBank and MycoCosm (Joint Genome Institute). Sequences were retrievable from two different compartments, nucleus and mitochondria. The two data sets were separately aligned, and each were subjected to phylogenetic analyses. The results suggest separate origins for the two genes. The N-acetyltransferase gene appears to have a nuclear origin, but some fungi have obtained a mitochondrial version(s). The ribosomal protein S3 gene appears to be of mitochondrial origin, but some fungi appear to encode two copies of separate origin (mitochondrial or nuclear) in the nuclear genome. The underlying mechanism of transfer of genetic elements between nucleus and mitochondria has yet been elucidated. ii Acknowledgements The Master’s project was supported by the Natural Sciences and Engineering Research Council of Canada (NSERC) and the Graduate Enhancement of Tri-Council Stipends (GETS) program of the Faculty of Graduate Studies (University of Manitoba; Winnipeg, Manitoba, Canada). Much appreciation goes to Dr. Steven Zimmerly (University of Calgary; Calgary, Alberta, Canada) for all the advice he gave with regards to modeling of group II introns. Much appreciation also goes to my committee members, Drs. Deborah Court and Sean McKenna for all the advice they have given me throughout the Master’s program. We would like to acknowledge Hay M, La D, Carta A, Chen T, and Corkery T, for all their contributions to the cytochrome b intron landscape project. Valuable comments by two anonymous reviewers of the paper, that Chapter 2 was based on, were greatly appreciated. I am very grateful for everyone, both past and present, that has accompanied me through my Master’s project. Everyone has taught me a lot with regards to not only to materials related to the laboratory but also to various aspects of life. A special acknowledgement goes my parents and siblings for their support and patience during the Master’s program. My deepest gratitude and appreciation goes to Dr. Georg Hausner. I would like to especially thank him for his patience, understanding, and vast insight of not only research but life as well. iii Table of contents Abstract ii Acknowledgements iii List of tables viii List of figures ix List of abbreviations x Chapter 1. Literature review 1 1.1. The Fungi 1 1.2. Fungal mitochondrial genomes 2 1.2.1. Size variation of mitochondrial genomes 5 1.2.2. Self-splicing introns 5 1.2.2.1. Group I introns 6 1.2.2.2. Group II introns 8 1.2.2.3. Group III introns 11 1.2.3. Mobile elements: Mobile introns and intron-encoded proteins 12 1.2.3.1. Homing endonucleases 13 1.2.3.1.1. LAGLIDADG homing endonucleases 13 1.2.3.1.2. GIY-YIG homing endonucleases 17 1.2.4. Introns and fungal phenotypes 18 1.2.4.1. Introns and virulence 18 iv 1.2.4.2. Introns and antibiotic resistance 19 1.2.4.3. Intron and growth and senescence 20 1.2.5. Applications of introns and intron-encoded proteins 23 1.3. Research objectives and rationale 24 Chapter 2: The intron landscape of the mtDNA cytb gene among the Ascomycota: introns and intron-encoded open reading frames 26 2.1. Abstract 28 2.2. Introduction 28 2.3. Materials and methods 30 2.3.1. Nucleic acid extraction, cytb gene amplification, and DNA sequencing 30 2.3.2. Data mining for cytb sequences 32 2.3.3. Sequence analysis 32 2.3.4. Intron folding 33 2.4. Results 34 2.4.1. The cytb gene architecture among the Ascomycota: the cytb intron landscape 34 2.4.2. Group II introns and novel intron arrangements 53 2.4.3. Group I introns and intron-encoded ORFs 54 2.5. Discussion 59 2.5.1. cytb intron landscape 59 v 2.5.2. Homing endonuclease genes: From intron invasion, to intron/HEG mutualism, to drift, and decay 61 2.5.3. Evolutionary strategies for optimizing or regulating expression of intron ORFs 66 2.6. Conclusions 66 Chapter 3: Intron-encoded ribosomal proteins and N-acetyltransferases within the mitochondrial genomes of fungi: Here today, gone tomorrow? 68 3.1. Abstract 69 3.2. Introduction 69 3.3. Material and Methods 72 3.3.1. Sequence analysis 72 3.3.2. Intron folding 73 3.3.3. Phylogenetic analysis of NAT and RPS3 amino acid sequences 73 3.4. Results 74 3.4.1. Among the Pezizomycotina, the rps3 gene can be intron-encoded, freestanding, or missing from the mitochondrial genome 74 3.4.2. Encoding pattern of the rps3 gene within the Pezizomycetes 74 3.4.3. Encoding pattern of the rps3 gene within the Dothideomycetes 86 3.4.3.1. The rps3 gene is absent from mitochondrial genomes of the Capnodiales (Dothideomycetes) 87 3.4.4. Encoding pattern of the rps3 gene within the Leotiomycetes 90 3.4.5. Encoding pattern of the rps3 gene within the Lecanoromycetes 90 vi 3.4.6. Encoding pattern of the rps3 gene within the Eurotiomycetes, Sordariomycetes, and Xylonomycetes 90 3.4.7. “Hitchhiking” rps3 genes 93 3.4.8. In fungal mitochondrial genomes, NAT open reading frames can be intron-encoded or freestanding 93 3.4.9. Phylogenetic analysis of mitochondrial- and nuclear-encoded NATs 94 3.4.10. More additions to mitochondrial genomes: members of the aminotransferase superfamily 99 3.5. Discussion 102 3.5.1. The position of the rps3 gene among the Pezizomycotina 102 3.5.2. The case of the lost mtDNA rps3 gene in some members of the Capnodiales 103 3.5.3. Speculations on the formation of freestanding rps3 genes 103 3.5.4. The NAT gene is a recent arrival from the nuclear genome 105 3.6. Conclusions 109 Chapter 4: Conclusions and future directions 110 4.1. The intron landscape of the cytochrome b gene 111 4.2. Mitochondrial-encoded N-acetyltransferase and ribosomal protein S3 113 4.3. Mitochondrial genomes of fungi are diverse: contributions by introns and genes 115 4.4. Potential future work to consider 115 References 118 vii List of tables Table 1.1. Summary of “typical” gene content in mitochondrial genomes of fungi in the phylum Ascomycota. 4 Table 2.1. Summary of introns inserted in the cytb gene sampled from 129 Pezizomycotina fungi from the National Center for Biotechnology Information database. 35 Table 2.2. Summary of the number of introns and the intron/open reading frame configurations observed in the cytb gene, sampled from 129 Pezizomycotina fungi from the National Center for Biotechnology Information database. 47 Table 3.1. The encoding pattern of the rps3 gene among Pezizomycotina fungi surveyed from the Joint Genome Institute MycoCosm web portal and the National Center for Biotechnology Information. 77 Table 3.2. The encoding pattern of the NAT gene among Pezizomycotina fungi surveyed from the National Center for Biotechnology Information. 95 viii List of figures Figure 1.1. Schematic of conserved secondary structures adopted by group I and group II introns. 10 Figure 1.2. The structure of the LAGLIDADG homing endonuclease, I-CreI. 16 Figure 2.1. Phylogenetic tree of cytb genes from fungi of the phylum Ascomycota. 50 Figure 2.2. Intron landscape of the cytochrome b gene. 52 Figure 2.3. The cytb-289 intron of Chrysoporthe austroafricana (KT380883.1). 56 Figure 2.4. The cytb-506 intron of Chaetomium thermophilum (JN007486.1). 58 Figure 2.5. Proposed intron and homing endonuclease life cycle. 63 Figure 3.1. Schematic phylogenetic tree based on current phylogeny of the Ascomycota. 76 Figure 3.2. The proposed secondary structure of the nad5-710 intron of Phyllosticta citriasiana (JGI). 89 Figure 3.3. Phylogenetic analysis of mitochondrial- and nuclear-encoded RPS3 amino acid sequences from fungi of the order Capnodiales. 92 Figure 3.4. Phylogenetic analysis of mitochondrial-encoded NAT amino acid sequences from fungi of the subphylum Pezizomycotina. 101 Figure 3.5. Encoding patterns noted in the cytb-429 intron of fungi of the Pezizomycotina. 108 ix List of abbreviations aa amino acid(s) AAT aminotransferase AT adenine, thymine atp/ATP adenosine triphosphate