Snopatrol: How Many Snorna Genes Are There? Paul P Gardner*1, Alex Bateman1 and Anthony M Poole2,3

Snopatrol: How Many Snorna Genes Are There? Paul P Gardner*1, Alex Bateman1 and Anthony M Poole2,3

Gardner et al. Journal of Biology 2010, 9:4 http://jbiol.com/content/9/1/4 MINIREVIEW SnoPatrol: how many snoRNA genes are there? Paul P Gardner*1, Alex Bateman1 and Anthony M Poole2,3 See research articles http://www.biomedcentral.com/1471-2164/10/515 and http://www.biomedcentral.com/1471-2164/11/61 Abstract rRNA, and in the Cajal body of the eukaryote cell (small Cajal body scaRNPs), where they modify small nuclear Small nucleolar RNAs (snoRNAs) are among the most RNAs (snRNAs), the RNA constituents of the spliceo- evolutionarily ancient classes of small RNA. Two some [2]. Recently, HBII-52, a human C/D snoRNA, has experimental screens published in BMC Genomics been shown to regulate splicing of serotonin receptor 2C expand the eukaryotic snoRNA catalog, but many more mRNA, indicating a wider role in gene regulation [3], and snoRNAs remain to be found. another C/D snoRNA has been shown to be expressed from the Epstein-Barr virus genome [4]. As our know- ledge of snoRNAs expands beyond RNA modification What are snoRNAs? and hints at wider regulatory roles, there is a need to e biosynthesis of eukaryote ribosomes is complex, identify the full repertoire of snoRNAs in a genome and involving numerous processing events to generate mature establish when and on what RNAs they act. Against this ribosomal RNAs (rRNAs) and the subsequent assembly backdrop, experimental screens that trawl organism-by- of processed rRNAs with dozens of ribosomal proteins. organism for snoRNAs are vital, as bioinformatic screens Small nucleolar RNAs (snoRNAs) are central to ribosome have so far failed to provide a robust computational maturation, being required in key cleavage steps to alternative to labour-intensive experimental methods of generate individual rRNAs, and in their capacity as guides RNA identification. Two recent papers in BMC Genomics for site-specific modification of rRNA. In the rRNA of by Zhang et al. [5] and Liu et al. [6] report the identi fic- the budding yeast Saccharomyces cerevisiae, on the order ation of novel snoRNAs from the rhesus monkey Macaca of 100 snoRNA-guided modifications are made during mulatta and the filamentous fungus Neurospora crassa, the biosynthesis of a single ribosome; this number is respectively. Both sets of authors experimentally investi- approximately double in humans. Around half of these gated snoRNA pools by sequencing cDNAs derived from modifications are methylations of the 2’ position on RNA extracted from their species of interest. Subsequent ribose, and are carried out by C/D-box small nucleolar bioinformatics analysis was used by each group to classify ribo nucleoproteins (snoRNPs), which consist of a guide sequences as either of the two snoRNA classes or other- snoRNA acting in concert with several proteins, includ- wise. ese approaches netted 48 H/ACA and 32 C/D ing Nop1p, the RNA methylase component of the box snoRNAs in the monkey and 20 H/ACA and 45 C/D snoRNP. e remaining modifications produce pseudo- box snoRNAs in the fungus. Studies like these are vital to uridine, an isomer of uridine, and are guided by H/ACA- the extension of our knowledge of how complements of box snoRNPs, with the Cbf5p subunit performing the snoRNAs vary through evolution. Given the intense pseudouridylation reaction [1]. Figure 1 illustrates the effort required for such analyses, it is worth taking stock inter action between the two types of snoRNA and their and asking, where are the current gaps in our knowledge respective RNA targets. of snoRNAs? Over the past decade, the snoRNA universe has expanded rapidly. H/ACA- and C/D-family RNAs have The taxonomic distribution of known snoRNAs been discovered in Archaea (where they are dubbed To investigate the taxonomic distribution of the known sRNAs, as Archaea lack nucleoli), and likewise modify snoRNAs and highlight where potential new discoveries can be made, we have gathered data from the Pfam (protein families), Rfam (RNA families), Genomes Online *Correspondence: [email protected] 1Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, (GOLD) and EMBL databases (Figure 2). e Rfam CB10 1SA, UK database uses experimentally validated ncRNA sequences Full list of author information is available at the end of the article that have been deposited in EMBL to search for homologous sequences across all nucleotide sequences © 2010 BioMed Central Ltd © 2010 BioMed Central Ltd (see the red and pink bars in Figure 2). e results show Gardner et al. Journal of Biology 2010, 9:4 Page 2 of 4 http://jbiol.com/content/9/1/4 5’ x C’ 3’ bo bo D’ x NΨ NΨ Me Me x 3’ 5’ bo D x 3’ bo C 5’ H box ACA 3’ 5’ Eukaryotes: Archaea: Cbf5 Cbf5 Gar1 Gar1 5’ 3’ Nop10 Nop10 Nhp2 L7Ae Eukaryotes: Archaea: Fibrillarin (Nop1) Fibrillarin Nop56 Nop5 Nop58 L7Ae 15.5kDa/Snu13 Figure 1. snoRNA structure. The structure of a H/ACA snoRNA (left) and a C/D box snoRNA (right). The targets for RNA modification are shown in blue. The most important snoRNA-associated proteins are listed below. that for many major taxonomic clades there are few or no sequence was published nearly a decade ago and eight known snoRNAs annotated. further projects are in progress, yet despite this apparent In the Archaea, annotated snoRNAs are notably absent wealth of information no snoRNAs have been identifed. from the taxon Halobacterium, for which a genome But like diplomonads, micro­sporidia clearly have sequence has been available for nearly 10 years and which components of the snoRNA machinery and almost has been proposed to contain snoRNAs on the basis of certainly utilize snoRNAs. The absence, therefore, is due the presence of the snoRNP-associated proteins fibril­ to the fact that snoRNAs have not been experimentally larin and Nop56/58 [7]. In fact, only 33% of the crenar­ determined, and current bioinformatics methods are not chaeal and 60% of the euryarchaeal groups carry known sensitive enough to reliably identify snoRNAs in these or predicted snoRNAs, and numbers of snoRNAs are taxa from sequence analyses alone, so none have been very low in the Euryarchaeota. Still within the Archaea, inferred by homology. snoRNAs have been annotated in some methanococcal As expected, the Metazoa are comparatively well genomes, predicted on the basis of homology to experi­ studied; there is a host of supporting experimental and mentally validated snoRNAs from members of the bioinformatics evidence for snoRNAs across the meta­ Thermoprotei [8]. zoa, with the exception of the Cnidaria and the Platy­ Some eukaryotic taxa fare little better. For example, in helminthes, which currently only have bioinfor­matically the unicellular diplomonads (Diplomonadida; Figure 2), predicted snoRNAs based upon sequence similarity to such as Giardia lamblia, there are no snoRNA families other metazoan snoRNAs. listed in Rfam, although putative snoRNA-like RNAs The genome sequence for the parasitic protozoan have been reported from G. lamblia [9,10]. Databases Trichomonas vaginalis (a parabasalid; Figure 2) bears one such as Rfam inevitably lag behind the current literature; lonely C/D-box snoRNA annotation for a homolog of the we expect that these missing snoRNAs will be included fungal snoRNA snR52/Z13. Furthermore, this is a rather in future releases. low-scoring hit (26.12 bits, E-value = 1.04e+02) to an The case of the microsporidia (unicellular organisms otherwise exclusively fungal family and the Trichomonas allied to the fungi) is interesting in that one genome sequence has some differences from the canonical C- and Gardner et al. Journal of Biology 2010, 9:4 Page 3 of 4 http://jbiol.com/content/9/1/4 Chlorophyta (green algae) (Figure 2), both have good Crenarchaeota Thermoprotei snoRNA coverage, which is based on both bioinformatics Halobacteria Archaea and intensive experimental study of green plant Methanobacteria snoRNAs. Methanococci Finally, the Stramenopiles (Figure 2) have fve Euryarchaeota completed and one draft genome project according to the Methanomicrobia GOLD database. Both the two main lineages of Thermococci stramenopiles, Bacillariophyta and Oomycetes, have Thermoplasmata reasonable numbers of predicted snoRNAs based on Alveolata Apicomplexa homology to other lineages (9 and 75, respectively), though none has been experimentally validated. Whereas Amoebozoa Archamoebae counts of Pfam domains and rRNAs indicate that the Diplomonadida Hexamitidae snoRNP machinery is present in all known taxa of Euglenozoa Kinetoplastida Archaea and Eukaryota, surprisingly it seems to be Dikarya absent from Oomycetes. However, this lack is likely to be Fungi due to the protein sequences not yet being included in Microsporidia the public sequence databases rather than bona fide loss Eukaryota Arthropoda of the snoRNP machinery. Chordata Metazoa Future directions for snoRNA research Cnidaria Up to now, bioinformatics approaches for de novo predic­ Nematoda tion of snoRNAs have not been a great success. As shown Platyhelminthes by Figure 2, a homology search using experimentally Parabasalidea Trichomonada verified snoRNAs, as performed by the Rfam database, Chlorophyta has some success in identifying snoRNAs in taxonomic Viridiplantae lineages where no experiments have yet been performed. Streptophyta But many of these predictions need further validation Bacillariophyta Stramenopiles before they can be entirely trusted. Using additional Oomycetes information such as genomic context and target infor­ma­ tion could prove quite useful in this regard [11,12]. The Color Length snoRNP protein domains from Pfam <10 growing host of orphan snoRNAs - that is, snoRNAs SSU rRNA regions from Rfam <100 lacking a target-modification site - are especially interest­ GOLD genome projects <1000 All snoRNA regions from Rfam <10,000 ing in that several lines of evidence hint at a possible All published snoRNA sequences in Rfam <100,000 regulatory role, as with human HBII-52 [3]. The snoRNA universe is thus likely to expand in function, phylogenetic Figure 2. The taxonomic distribution of existing snoRNA annotations.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    4 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us