A New Domain in Nuclear Proteins Peter Stoilov, Ilona Rafalska and Stefan Stamm
Total Page:16
File Type:pdf, Size:1020Kb
Research Update TRENDS in Biochemical Sciences Vol.27 No.10 October 2002 495 Summary 2 Haering, C.H. et al. (2002) Molecular architecture differential regulation along arms versus the The recent structural studies have of SMC proteins and the yeast cohesin complex. centric region. Cell 98, 249–259 narrowed down the number of possible Mol. Cell 9, 773–788 13 Laloraya, S. et al. (2000) Chromosomal addresses 3 Losada, A. et al. (2000) Identification and of the cohesin component Mcd1p. J. Cell Biol. 151, models for sister chromatid cohesion and characterization of SA/Scc3p subunits in the 1047–1056 provide a clearer view of how the Xenopus and human cohesin complexes. J. Cell 14 Ciosk, R. et al. (2000) Cohesin’s binding to components of the cohesin complex Biol. 150, 405–416 chromosomes depends on a separate complex interact. Nonetheless, several key 4 Sumara, I. et al. (2000) Characterization of consisting of Scc2 and Scc4 proteins. Mol. Cell 5, questions regarding the mechanism of vertebrate cohesin complexes and their 243–254 regulation in prophase. J. Cell Biol. 151, 15 Hartman, T. et al. (2000) Pds5p is an essential sister chromatid cohesion still need to be 749–762 chromosomal protein required for both sister addressed: can cohesins form higher-order 5 Pasierbek, P. et al. (2001) A Caenorhabditis chromatid cohesion and condensation in structures, how is the cohesin complex elegans cohesion protein with functions in meiotic Saccharomyces cerevisiae. J. Cell Biol. 151, recruited to specific regions of the chromosome pairing and disjunction. Genes Dev. 613–626 15, 1349–1360 16 Panizza, S. et al. (2000) Pds5 cooperates with chromosome, how does the passage of the 6 Hirano, T. (2002) The ABCs of SMC proteins: cohesin in maintaining sister chromatid cohesion. replication fork lead to the linkage of sister two-armed ATPases for chromosome Curr. Biol. 10, 1557–1564 chromatids and what role does ATP condensation, cohesion, and repair. Genes Dev. 16, 17 Skibbens, R.V. et al. (1999) Ctf7p is essential for hydrolysis play in sister chromatid 399–414 sister chromatid cohesion and links mitotic cohesion. Undoubtedly, the establishment 7 Hirano, M. et al. (2001) Bimodal activation of chromosome structure to the DNA replication SMC ATPase by intra- and inter-molecular machinery. Genes Dev. 13, 307–319 and maintenance of sister chromatid interactions. EMBO J. 20, 3238–3250 18 Ross, K.E. et al. (2002) Separase: a conserved cohesin is a dynamic process and many of 8 Cohen-Fix, O. (2001) The making and protease separating more than just sisters. its moving parts still need to be elucidated. breaking of sister chromatid cohesion. Cell 106, Trends Cell Biol. 12, 1–3 137–140 19 Dej, K.J. et al. (2000) Separation anxiety at the Acknowledgements 9 Toth, A. et al. (1999) Yeast cohesin complex centromere. Trends Cell Biol. 10, 392–399 requires a conserved protein, Eco1p(Ctf7), 20 Cohen-Fix, O. (2000) Sister chromatid separation: We thank Ann Corsi, April Robbins, to establish cohesion between sister falling apart at the seams. Curr. Biol. 10, Ritu Agarwal, Paul Megee, chromatids during DNA replication. Genes Dev. R816–819 Doug Koshland and Tatsuya Hirano for 13, 320–333 stimulating and helpful discussions. 10 Tanaka, T. et al. (1999) Identification of cohesin association sites at centromeres and along Joseph L. Campbell chromosome arms. Cell 98, 847–858 Orna Cohen-Fix* References 11 Megee, P.C. et al. (1999) The centromeric sister The Laboratory of Molecular and Cellular 1 Anderson, D.E. et al. (2002) Condensin and chromatid cohesion site directs Mcd1p binding to cohesin display different arm conformations with adjacent sequences. Mol. Cell 4, 445–450 Biology, NIDDK, NIH 8 Center Drive, characteristic hinge angles. J. Cell Biol. 156, 12 Blat, Y. et al. (1999) Cohesins bind to preferential Bethesda, MD 20892, USA. 419–424 sites along yeast chromosome III, with *e-mail: [email protected] Protein Sequence Motif YTH: a new domain in nuclear proteins Peter Stoilov, Ilona Rafalska and Stefan Stamm A novel 100–150-residue domain has been intron sequences by the spliceosome is In addition, the alternative splicing identified in the human splicing factor crucial for gene expression. However, the patterns of the splicing factor SRp20 YT521-B and its Drosophila and yeast sequences of the two splice sites and the and hTra2-β pre-mRNAs are altered homologues. Homology searches show branch point that are recognized by by YT521-B [6]. YT521 does not belong that the domain is typical for the spliceosome components are clearly to any of the known splicing factor eukaryotes and is particularly abundant in insufficient to identify the exons in the families. The only sequence feature plants. It is predicted to adopt a mixed pre-mRNA sequence. Correct recognition that it shares with some of the splicing α-helix–β-sheet fold and to bind to RNA. of exons is achieved by the cooperative factors is an RE/D repeat. Here we We propose the name YTH (for YT521-B action of multiple splicing factors report the identification of a new homology) for the domain. auxiliary to the spliceosome [2]. These domain in splicing factor YT521-B factors bind in a sequence-specific and a number of proteins of unknown Published online: 5 September 2002 manner to the pre-mRNA [3,4] and function that could be involved in recruit the spliceosome components to RNA binding. One of the most prominent features of the splice sites. The nuclear protein eukaryotic genes is their discontinuity. YT521 has been identified in Domain characterization Analysis of the working draft of the two-hybrid screens with splicing During BLAST searches to identify human genome has shown that, on factors [5,6]. It interacts with several YT521-B homologues, a conserved part of average, introns account for 95% of the splicing factors, both in two-hybrid and the protein was identified between pre-mRNA [1]. The precise removal of co-immunoprecipitation assays [5,6]. residues 356 and 499 of the rat YT521-B http://tibs.trends.com 0968-0004/02/$ – see front matter © 2002 Elsevier Science Ltd. All rights reserved. PII: S0968-0004(02)02189-8 496 Research Update TRENDS in Biochemical Sciences Vol.27 No.10 October 2002 Fig. 1. Multiple alignment of sequences containing YT homology (YTH) domains. The sequences are denoted with the closely related vertebrate homologues GeneBank identifier of the corresponding nucleotide sequence followed by the species name. The GeneBank identifier of YT521. for the Saccharomyces cerevisiae refers to protein entry in the database. Translations of partial EST sequences are indicated with an asterisk. The sequences were identified using searches of the translated EST and the nonredundant We used SMART [9,10] to search the nucleotide databases at NCBI using rat YT521-B (shown in italic). Wherever available, the conceptual translation SMART and PFAM databases for provided in the database was used for the alignment. If no conceptual translation was provided (i.e. for the EST additional known domains in the sequences), the reading frame identified by the BLAST search was used if it did not contain stop codons or unknown proteins that contain YTH. The search residues. Redundant sequences originating from the same species were omitted. The similar residues that are present in all sequences are shaded (green) using the PAM250 matrix. The identical residues are shaded in yellow. The failed to identify any known domains secondary structure, as predicted by the PHD program, is shown on top. E denotes extended (β-strands) structure and except for three C–C–C–H-type zinc H denotes the predicted α-helices. The multiple sequence alignment (accession number ALIGN_000432) has been finger domains in GI:18397519, an deposited at the European Bioinformatics Institute (ftp://ftp.ebi.ac.uk/pub/databases/embl/align/). A. thaliana predicted protein of unknown function. Therefore, we protein. We therefore performed a The sequences from these searches conclude that the YTH domain defines a PSI-BLAST [7] search with this portion of that aligned to the full length of the query new protein family. the YT521-B protein, using an exclusion and had an E value <10−6 were aligned The putative secondary structure was threshold of 0.005. After four iterations, using CLUSTALW [8] after the determined using the PHD program [11]. multiple proteins from Arabidopsis redundancies had been removed (Fig. 1). The domain is predicted to have a mixed thaliana, Oryza sativa, Homo sapiens, Additional BLAST searches against the αβ-fold, with four α-helices and six Mus musculus, Drosophila melanogaster, genome databases confirmed that the β-strands. The conservation pattern Plasmodium falciparum and conserved region is present exclusively follows the predicted secondary Saccharomyces cerevisiae where found to in eukaryotic genomes. structure, with three blocks of conserved have similar regions, with E values in the The conserved region appears to sequence separated by loops of variable range 10−11–10−60. Many of these protein define a new domain in these proteins, size. Notable features of the domain are sequences were derived from automated which we termed the YT homology (YTH) the highly conserved aromatic residues gene prediction and were not confirmed domain. The YTH domain is usually located in the β-sheet. by mRNA or EST sequences. To identify located in the middle of the protein Most of the proteins identified in the sequences of existing proteins, we sequence. The domain shows remarkable BLAST searches are of plant origin, with performed BLAST [7] searches with conservation across a wide species 13 distinct sequences coming from a the conserved region against the range, with 14 invariant and 19 single species (A. thaliana). It is unclear translated nonredundant nucleotide highly conserved residues.