Letter to the Editor

Total Page:16

File Type:pdf, Size:1020Kb

Letter to the Editor Letter to the Editor Evolution of Ribonuclease Inhibitor by Exon Duplication Marcia C. Haigis,* Eric S. Haag,* and Ronald T. Raines*² *Department of Biochemistry and ²Department of Chemistry, University of Wisconsin±Madison The leucine-rich repeat (LRR) is a prevalent struc- Shapiro, and Acharya 1997; Kajava 1998) (®g. 1A and tural motif that mediates speci®c protein-protein inter- B). By using the National Center for Biotechnology In- actions (Kobe and Deisenhofer 1995b). The LRR is de- formation (NCBI) nr database (http://www.ncbi.nlm. ®ned by a region of 20±29 residues that contains the nih.gov), we identi®ed three potential transcripts ob- consensus sequence: XLXXLXLXXN, where L, N, and tained from human placental, HeLa, and testis cDNA X are leucine, asparagine, and any residue, respectively. libraries (GI 5 186260, 35843, and 7328055, respec- Typically, LRRs are displayed in 1±30 tandem copies tively). These clones are 1,682, 1,921, and 2,982 bp, within a protein. respectively. Although alternative splice forms of the 59- The ®rst crystalline structure of an LRR-containing untranslated region have been identi®ed in placenta, protein revealed a new protein fold (Kobe and Deisen- Northern blot analysis revealed that the RI gene is ex- hofer 1993) (®g. 1A and B). In porcine ribonuclease in- pressed as a single transcript in placental and HeLa cells hibitor (RI [Hofsteenge 1997]), the LRRs are arranged (Lee et al. 1988; Schneider et al. 1988; Crawford, Hag- tandemly in a horseshoe shape with 16 a-helices encas- erty, and Beutler 1989). Using these transcripts, we ing a parallel b-sheet of 17 strands. Hydrophobic forces searched the NCBI database for a corresponding geno- between the consensus leucine residues appear to sta- mic sequence and identi®ed a region of DNA on human bilize the b-sheet. RI is able to bind tightly to members chromosome 11 that overlaps cosmids cSRL125c1 and of the ribonuclease (RNase) A superfamily, which not cSRL135f4 and contains the RI gene (GI 5 2281060). only catalyze RNA degradation but also mediate angio- The genomic locus of human RI spans more than genesis, cytotoxicity, and the host-defense response 12 kb (®g. 1C). All three transcripts that we identi®ed (D'Alessio and Riordan 1997; Raines 1998; Leland and possess a colinear open reading frame encoded by nine Raines 2001). 39-terminal exons, numbered 2±10, as determined by the The RI fold is conserved in other proteins contain- presence of consensus splice donor and acceptor sites. ing LRRs. Human U2A9 is a small nuclear protein that The transcripts differ in their 59-untranslated region and contains ®ve LRRs and binds to U2B0 to form an active contain zero, one, or two alternatively spliced exons. As spliceosomal complex. Another protein, Skp2, contains all transcripts encode an identical protein sequence, the both 10 LRRs that function as a protein-binding domain functional signi®cance of alternative splice forms is un- and an F-box motif that binds to the catalytic core of known. Remarkably, all but the ®rst and last coding ex- ubiquitin-protein ligases. The LRRs in U2A9 and Skp2 ons are exactly 171 bp in length. In addition, all introns adopt an RI-like fold (Price, Evans, and Nagai 1998; are of phase 2, which denotes codon interruption after Schulman et al. 2000). The crystalline structures of the second nucleotide. Hence, any of the internal exons U2A9 bound to U2B0 and those of RI bound to RNase could be moved or removed without disrupting the A and angiogenin reveal that the parallel b-sheet formed alignment of LRRs encoded by neighboring exons. by the LRRs furnishes the surface for interacting with A striking relationship exists between the intron- other proteins (Kobe and Deisenhofer 1995a; Papageor- exon structure and the tertiary structure of RI (®g. 1). giou, Shapiro, and Acharya 1997; Price, Evans, and Na- Each of the internal exons encodes 57 residues. Except gai 1998). for the junction after the N-terminus, which occurs with- The modularity of the LRR is consistent with the in a longer b-strand, every exon translation product cor- evolution of RI by gene duplication. To test this hy- relates with a pair of b-strands and a-helices without pothesis, we investigated the genomic structure of hu- any disruption (®g. 1A and B). Thus, the LRR motif man RI. Our ®ndings provide a new insight into the within RI is de®ned completely by its intron-exon evolution of RI as well as the many proteins containing structure. LRRs or other modules. The phasing and position of the RI introns are con- Human RI consists of alternating 28- or 29-residue sistent with exon ampli®cation during evolution (de LRR units (Kobe and Deisenhofer 1993; Papageorgiou, Souza et al. 1997; Gilbert, de Souza, and Long 1997; Fedorov et al. 1998). To test this hypothesis, we inves- tigated the similarity of the coding exons. RI exons were Abbreviations: LRR, leucine-rich repeat; RI, ribonuclease aligned using the PILEUP program from the Wisconsin inhibitor. Package of Genetics Computer Group (Madison, Wis.). Key words: angiogenin, exon duplication, leucine-rich repeat, ri- Although the terminal exons (2 and 10) have low se- bonuclease A, ribonuclease inhibitor, RNA. quence identity, the internal exons (3±9) can be aligned Address for correspondence and reprints: Ronald T. Raines, De- without gaps and have considerable sequence identity partment of Biochemistry, University of Wisconsin±Madison, 433 Babcock Drive, Madison, Wisconsin 53706-1544. (®g. 2A). Pairwise comparisons revealed that exons 3± E-mail: [email protected]. 9 are 50%±60% identical. In addition, translation prod- Mol. Biol. Evol. 19(6):959±963. 2002 ucts corresponding to exons 3±9 were aligned and de- q 2002 by the Society for Molecular Biology and Evolution. ISSN: 0737-4038 termined to be 30%±50% identical (®g. 2C). 959 960 Haigis et al. ing genomes diverged. To distinguish between these dis- tinct mechanisms, we performed phylogenetic analyses on the internal coding exons of human, mouse, and pig RI. The mouse RI exons were identi®ed by a twofold approach: analyzing existing sequences and generating new sequence data. By using the NCBI mouse genome database, we identi®ed genomic sequences that contain several exons of mouse RI. In addition, we generated new sequence data to verify the remaining intron-exon junctions. To obtain the mouse RI locus, we screened a mouse 129 library (Genome Systems, St. Louis, Mo.) by the PCR using the primers: 59 GGTGCAAAGACA- TCAGCTCAGCAGTCCAAGC 39 and 59 CCTGGA- GCACCAGACCCACACC 39. Like the human gene, mouse RI consists of seven internal exons that are 171 bp in length and are separated by phase 2 introns. The pig exons were inferred from the cDNA for pig RI (GI 5 M58700) by using the genomic structures of human and mouse RI. The internal exons from human, pig, and mouse were aligned as described previously (®g. 2A). Sequence alignment reveals strong identity between analogous exons. Neighbor-Joining phylogenetic trees built by using a Kimura two-parameter correction have strongly supported clades for analogous exons between species with bootstrap values of 99%±100% (®g. 2B). A maximum parsimony analysis of the data set also strongly supports the existence of clades for each exon with bootstrap values of 98%±100%. Apparently, the tandem repeats within RI are extremely stable and ho- mogenization has not taken place, as that would have resulted in a tree in which the internal exons from each species form a strongly supported clade. A consistent internal branching pattern did not emerge with strong support from either analysis. Furthermore, the branches leading to each exon clade are extremely long, which is consistent with a nearly simultaneous divergence of each clade. Thus, we conclude from this analysis that exons 3±9 were most likely duplicated in a radiation that occurred rapidly early in the history of RI. Members of the RNase A superfamily apparently exist in all tetra- pods (D'Alessio and Riordan 1997). Thus, it is likely that RI was assembled after the origin of tetrapods in the late Devonian, approximately 360 MYA, but before the divergence of eutherian mammals. FIG. 1.ÐCorrelation of the exons of human RI with its LRR units. A, Our data indicate several unusual features about the Ribbon diagram of RI was created using the atomic coordinates de- evolution of RI. The internal exons of RI are homolo- rived by X-ray diffraction analysis (Papageorgiou, Shapiro, and Achar- gous and stable. In addition, the branching pattern sug- ya 1997) and the program MOLSCRIPT (Kraulis 1991). Each exon- gests that the internal exon duplications occurred rapidly encoded sequence is in a different color. The N- and C-termini are labeled. B, Same as A, except rotated by ;908 around the horizontal and then diverged over a long period of time. To assess axis. C, Physical map of the human RI gene. Alternate forms isolated whether positive selection has been a factor in the di- from placenta, HeLa cells, and testes are depicted as RIa,RIb, and vergence of the RI exons, we measured the rate of non- RIg, respectively. Exons and introns are drawn to scale. Start and stop synonymous versus synonymous changes in human ex- codons are indicated. Exon colors match those of exon-encoded se- ons 3±9 (Nei and Gojobori 1986). The ratios of non- quences in A and B. synonymous-to-synonymous changes range from 0.31 to 1.00. Although the nonsynonymous-to-synonymous ra- The high degree of similarity and their identical tio is not always a de®nitive indicator of positive selec- length strongly suggests that the internal exons of RI are tion (Wyckoff, Want, and Wu 2000), the low ratios for homologues related by exon duplication.
Recommended publications
  • The TIM Barrel Fold Nagarajan D
    The TIM barrel fold Nagarajan D. and Nanajkar N. Comments and corrections: Line 10: fix “αhelices” in “α-helices”. Lines 11-12: C-terminal loops are important for catalytic activity, while N-terminal loops are important for the stability of the TIM-barrels. This should be mentioned. Line 14: The reference #7 is not related to the statement. Line 14: There is a new EC classe (EC.7, translocases). Change “5 of 6” in “5 of 7”. Lines 26-27: It is not correct to state that the shear number of 8 for the TIM-barrels is due to “their staggered nature”. Most of the β-barrels have a staggered nature, but their shear number is not 8. Line 27: The reference #2 is imprecise. Wierenga did not defined himself the shear number of TIM-barrel proteins. Please check the 2 papers of Murzin AG, 1994, “Principle determining the structure of β-sheet barrels in proteins,” I and II, and the paper of Liu W, 1998, “Shear numbers of protein β-barrels: definition refinements and statistics”. Line 29: Again, it is not correct to state that the 4-fold geometric symmetry depends on the stagger. Since the number of strands (n) is equal to the Shear number (S), side-chains point alternatively towards the pore and the core, giving a 4-fold symmetry. Line 37: “historically” is a bit exaggerated for a reference dated 2015, especially if it comes from the author itself. Find a true historic reference, or just mention that you defined the regions “core” and “pore”. Line 43: “Consequently” is misleading.
    [Show full text]
  • Smurflite: Combining Simplified Markov Random Fields With
    SMURFLite: combining simplified Markov random fields with simulated evolution improves remote homology detection for beta-structural proteins into the twilight zone Noah M. Daniels 1, Raghavendra Hosur 2, Bonnie Berger 2∗, and Lenore J. Cowen 1∗ 1Department of Computer Science, Tufts University, Medford, MA 02155 2Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139 ABSTRACT are limited in their power to recognize remote homologs because of Motivation: One of the most successful methods to date for their inability to model statistical dependencies between amino-acid recognizing protein sequences that are evolutionarily related has residues that are close in space but far apart in sequence (Lifson and been profile Hidden Markov Models (HMMs). However, these models Sander (1980); Zhu and Braun (1999); Olmea et al. (1999); Cowen do not capture pairwise statistical preferences of residues that are et al. (2002); Steward and Thorton (2002)). hydrogen bonded in beta sheets. These dependencies have been For this reason, many have suggested (White et al. (1994); partially captured in the HMM setting by simulated evolution in the Lathrop and Smith (1996); Thomas et al. (2008); Liu et al. (2009); training phase and can be fully captured by Markov Random Fields Menke et al. (2010); Peng and Xu (2011)) that more powerful (MRFs). However, the MRFs can be computationally prohibitive when Markov Random Fields (MRFs) be used. MRFs employ an auxiliary beta strands are interleaved in complex topologies. dependency graph which allows them to model more complex We introduce SMURFLite, a method that combines both simplified statistical dependencies, including statistical dependencies that Markov Random Fields and simulated evolution to substantially occur between amino-acid residues that are hydrogen bonded in beta improve remote homology detection for beta structures.
    [Show full text]
  • Modeling and Predicting Super-Secondary Structures of Transmembrane Beta-Barrel Proteins Thuong Van Du Tran
    Modeling and predicting super-secondary structures of transmembrane beta-barrel proteins Thuong van Du Tran To cite this version: Thuong van Du Tran. Modeling and predicting super-secondary structures of transmembrane beta-barrel proteins. Bioinformatics [q-bio.QM]. Ecole Polytechnique X, 2011. English. NNT : 2011EPXX0104. pastel-00711285 HAL Id: pastel-00711285 https://pastel.archives-ouvertes.fr/pastel-00711285 Submitted on 23 Jun 2012 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. THESE` pr´esent´ee pour obtenir le grade de DOCTEUR DE L’ECOLE´ POLYTECHNIQUE Sp´ecialit´e: INFORMATIQUE par Thuong Van Du TRAN Titre de la th`ese: Modeling and Predicting Super-secondary Structures of Transmembrane β-barrel Proteins Soutenue le 7 d´ecembre 2011 devant le jury compos´ede: MM. Laurent MOUCHARD Rapporteurs Mikhail A. ROYTBERG MM. Gregory KUCHEROV Examinateurs Mireille REGNIER M. Jean-Marc STEYAERT Directeur Laboratoire d’Informatique UMR X-CNRS 7161 Ecole´ Polytechnique, 91128 Plaiseau CEDEX, FRANCE Composed with LATEX !c Thuong Van Du Tran. All rights reserved. Contents Introduction 1 1Fundamentalreviewofproteins 5 1.1 Introduction................................... 5 1.2 Proteins..................................... 5 1.2.1 Aminoacids............................... 5 1.2.2 Properties of amino acids .
    [Show full text]
  • Development and Characterization of Novel Bioluminescent Reporters of Cellular Activity by Derrick C. Cumberbatch Dissertation
    Development and Characterization of Novel Bioluminescent Reporters of Cellular Activity By Derrick C. Cumberbatch Dissertation Submitted to the Faculty of the Graduate School of Vanderbilt University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY in Biological Sciences May 10, 2019 Nashville, Tennessee Approved: C. David Weaver, Ph.D. Douglas McMahon, Ph.D. Qi Zhang, Ph.D. Carl Johnson, Ph.D. To my beloved and supportive wife Alicia, and to my parents Cameron and Marcia Cumberbatch. ii ACKNOWLEDGEMENTS This work was made possible by financial support from the NIMH grants MH107713 and MH116150 awarded to Carl Johnson, Ph.D. as well as funds provided by the Vanderbilt University Dissertation Enhancement Grant, graciously awarded to me by the Graduate School. I appreciate Dr. Carl Johnson for taking me into his lab and providing me with ample tools that aided in the successful completion of my Ph.D. I would like to express gratitude to my committee members Drs. David Weaver, Douglas McMahon, Qi Zhang, and the late Dr. Donna Webb for guiding me through the process of becoming a competent researcher. I would also like to make special mention of Jie Yang, Ph.D. whose persistent efforts and sage advice were an ever-present help during my graduate studies. His one-on-one training provided me with many skills that will serve me well as a molecular biologist. Meaningful contributions from the other current and past members of the Johnson lab group, Yao Xu, Ph.D., Tetsuya Mori, Ph.D., Shuqun Shi, Ph.D., Chi Zhao, Ph.D., Peijun Ma, Ph.D., He Huang, Ph.D., Kathryn Campbell, Briana Wyzinski, Kevin Kelly, Maria Luisa Jabbur, Carla O’Neale and Ian Dew deserve to be highlighted here as well.
    [Show full text]
  • The Structure of Small Beta Barrels
    bioRxiv preprint doi: https://doi.org/10.1101/140376; this version posted May 24, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. The Structure of Small Beta Barrels Philippe Youkharibache*, Stella Veretnik1, Qingliang Li, Philip E. Bourne*1 National Center for Biotechnology Information, The National Library of Medicine, The National Institutes of Health, Bethesda Maryland 20894 USA. *To whom correspondence should be addressed at [email protected] and [email protected] 1 Current address: Department of Biomedical Engineering, The University of Virginia, Charlottesville VA 22908 USA. 1 bioRxiv preprint doi: https://doi.org/10.1101/140376; this version posted May 24, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Abstract The small beta barrel is a protein structural domain, highly conserved throughout evolution and hence exhibits a broad diversity of functions. Here we undertake a comprehensive review of the structural features of this domain. We begin with what characterizes the structure and the variable nomenclature that has been used to describe it. We then go on to explore the anatomy of the structure and how functional diversity is achieved, including through establishing a variety of multimeric states, which, if misformed, contribute to disease states.
    [Show full text]
  • Folding-TIM Barrel
    Protein Folding Practical September 2011 Folding up the TIM barrel Preliminary Examine the parallel beta barrel that you constructed, noting the stagger of the strands that was needed to connect the ends of the 8-stranded parallel beta sheet into the 8-stranded beta barrel. Notice that the stagger dictates which side of the sheet is on the inside and which is on the outside. This will be key information in folding the complete TIM linear peptide into the TIM barrel. Assembling the full linear peptide 1. Make sure the white beta strands are extended correctly, and the 8 yellow helices (with the green loops at each end) are correctly folded into an alpha helix (right handed with H-bonds to the 4th ahead in the chain). 2. starting with a beta strand connect an alpha helix and green loop to make the blue-red connecting peptide bond. Making sure that you connect the carbonyl (red) end of the beta strand to the amino (blue) end of the loop-helix-loop. Secure the just connected peptide bond bond with a twist-tie as shown. 3. complete step 2 for all beta strand/loop-helix-loop pairs, working in parallel with your partners 4. As pairs are completed attach the carboxy end of the strand- loop-helix-loop to the amino end of the next strand-loop-helix-loop module and secure the new peptide bond with a twist-tie as before. Repeat until the full linear TIM polypeptide chain is assembled. Make sure all strands and helices are still in the correct conformations.
    [Show full text]
  • New Mesh Headings for 2018 Single Column After Cutover
    New MeSH Headings for 2018 Listed in alphabetical order with Heading, Scope Note, Annotation (AN), and Tree Locations 2-Hydroxypropyl-beta-cyclodextrin Derivative of beta-cyclodextrin that is used as an excipient for steroid drugs and as a lipid chelator. Tree locations: beta-Cyclodextrins D04.345.103.333.500 D09.301.915.400.375.333.500 D09.698.365.855.400.375.333.500 AAA Domain An approximately 250 amino acid domain common to AAA ATPases and AAA Proteins. It consists of a highly conserved N-terminal P-Loop ATPase subdomain with an alpha-beta-alpha conformation, and a less-conserved C- terminal subdomain with an all alpha conformation. The N-terminal subdomain includes Walker A and Walker B motifs which function in ATP binding and hydrolysis. Tree locations: Amino Acid Motifs G02.111.570.820.709.275.500.913 AAA Proteins A large, highly conserved and functionally diverse superfamily of NTPases and nucleotide-binding proteins that are characterized by a conserved 200 to 250 amino acid nucleotide-binding and catalytic domain, the AAA+ module. They assemble into hexameric ring complexes that function in the energy-dependent remodeling of macromolecules. Members include ATPASES ASSOCIATED WITH DIVERSE CELLULAR ACTIVITIES. Tree locations: Acid Anhydride Hydrolases D08.811.277.040.013 Carrier Proteins D12.776.157.025 Abuse-Deterrent Formulations Drug formulations or delivery systems intended to discourage the abuse of CONTROLLED SUBSTANCES. These may include physical barriers to prevent chewing or crushing the drug; chemical barriers that prevent extraction of psychoactive ingredients; agonist-antagonist combinations to reduce euphoria associated with abuse; aversion, where controlled substances are combined with others that will produce an unpleasant effect if the patient manipulates the dosage form or exceeds the recommended dose; delivery systems that are resistant to abuse such as implants; or combinations of these methods.
    [Show full text]
  • Tertiary Structure
    Comments Structural motif v sequence motif polyproline (“PXXP”) motif for SH3 binding “RGD” motif for integrin binding “GXXXG” motif within the TM domain of membrane protein Most common type I’ beta turn sequences: X – (N/D/G)G – X Most common type II’ beta turn sequences: X – G(S/T) – X 1 Putting it together Alpha helices and beta sheets are not proteins—only marginally stable by themselves … Extremely small “proteins” can’t do much 2 Tertiary structure • Concerns with how the secondary structure units within a single polypeptide chain associate with each other to give a three- dimensional structure • Secondary structure, super secondary structure, and loops come together to form “domains”, the smallest tertiary structural unit • Structural domains (“domains”) usually contain 100 – 200 amino acids and fold stably. • Domains may be considered to be connected units which are to varying extents independent in terms of their structure, function and folding behavior. Each domain can be described by its fold, i.e. how the secondary structural elements are arranged. • Tertiary structure also includes the way domains fit together 3 Domains are modular •Because they are self-stabilizing, domains can be swapped both in nature and in the laboratory PI3 kinase beta-barrel GFP Branden & Tooze 4 fluorescence localization experiment Chimeras Recombinant proteins are often expressed and purified as fusion proteins (“chimeras”) with – glutathione S-transferase – maltose binding protein – or peptide tags, e.g. hexa-histidine, FLAG epitope helps with solubility, stability, and purification 5 Structural Classification All classifications are done at the domain level In many cases, structural similarity implies a common evolutionary origin – structural similarity without evolutionary relationship is possible – but no structural similarity means no evolutionary relationship Each domain has its corresponding “fold”, i.e.
    [Show full text]
  • Molecular Modeling 2021 Lecture 3 -- Tues Feb 2
    Molecular Modeling 2021 lecture 3 -- Tues Feb 2 Protein classification SCOP TOPS Contact maps domains Domains To a cell biologist a domain is a sequential unit within a gene, usually with a specific function. To a structural biologist a domain is a compact globular unit within a protein, classified by its 3D structure. 2 A domain is... • ... an autonomously-folding substructure of a protein. • ... > 30 residues, but typically < 200. May be bigger. • ...usually has a single hydrophobic core • ... usually composed of one chain (occasionally composed of multiple chains) • ...is usually composed on one contiguous segment (occasionally made of discontiguous segments of the same chain) 3 SARS-CoV-2 spike protein — a multi domain protein 4 SCOPe -- classification of domains !http://scop.berkeley.edu similar secondary structure (1) class content (2) fold vague structural homology (3) superfamily Clear structural homology (4) family increasing structural similarity structural increasing (5) protein Clear sequence homology (6) species nearly identical sequences individual structures SCOPe -- class 1. all α (289) classes of domains 2. all β (178) 3. α/β (148) 4. α+β (388) 5. multidomain (71) 6. membrane (60) 7. small (98) Not true classes of globular 8. coiled coil (7) protein domains 9. low-resolution (25) 10. peptides (148) 11. designed proteins (44) 12. artifacts (1) Proteins of the same class conserve secondary structure content SCOPe -- fold level within α/β proteins -- Mainly parallel beta sheets (beta-alpha-beta units) TIM-barrel (22) swivelling beta/beta/alpha domain (5) Many folds have historical spoIIaa-like (2) names. “TIM” barrel was flavodoxin-like (10) first seen in TIM.
    [Show full text]
  • Beta-Barrel Scaffold of Fluorescent Proteins: Folding, Stability and Role in Chromophore Formation
    Author's personal copy CHAPTER FOUR Beta-Barrel Scaffold of Fluorescent Proteins: Folding, Stability and Role in Chromophore Formation Olesya V. Stepanenko*, Olga V. Stepanenko*, Irina M. Kuznetsova*, Vladislav V. Verkhusha**,1, Konstantin K. Turoverov*,1 *Institute of Cytology of Russian Academy of Sciences, St. Petersburg, Russia **Department of Anatomy and Structural Biology and Gruss-Lipper Biophotonics Center, Albert Einstein College of Medicine, Bronx, NY, USA 1Corresponding authors: E-mails: [email protected]; [email protected] Contents 1. Introduction 222 2. Chromophore Formation and Transformations in Fluorescent Proteins 224 2.1. Chromophore Structures Found in Fluorescent Proteins 225 2.2. Autocatalytic and Light-Induced Chromophore Formation and Transformations 228 2.3. Interaction of Chromophore with Protein Matrix of β-Barrel 234 3. Structure of Fluorescent Proteins and Their Unique Properties 236 3.1. Aequorea victoria GFP and its Genetically Engineered Variants 236 3.2. Fluorescent Proteins from Other Organisms 240 4. Pioneering Studies of Fluorescent Protein Stability 243 4.1. Fundamental Principles of Globular Protein Folding 244 4.2. Comparative Studies of Green and Red Fluorescent Proteins 248 5. Unfolding–Refolding of Fluorescent Proteins 250 5.1. Intermediate States on Pathway of Fluorescent Protein Unfolding 251 5.2. Hysteresis in Unfolding and Refolding of Fluorescent Proteins 258 5.3. Circular Permutation and Reassembly of Split-GFP 261 5.4. Co-translational Folding of Fluorescent Proteins 263 6. Concluding Remarks 267 Acknowledgments 268 References 268 Abstract This review focuses on the current view of the interaction between the β-barrel scaf- fold of fluorescent proteins and their unique chromophore located in the internal helix.
    [Show full text]
  • 101 Appendix 1. List of Candidate Genes in the Mapped QTL Intervals
    Appendix 1. List of candidate genes in the mapped QTL intervals (for Chapter 3) Chromosome Gene Gene PFAM Gene Name Name Start (bp) End (bp) ID PFAM Description Glyma01g21480 Gm01 26542533 26543925 Glyma01g21500 Gm01 26596310 26599861 PF02617 ATP -dependent Clp protease adaptor protein ClpS Glyma01g21510 Gm01 26672736 26675940 PF01490 Transmembrane amino acid transporter protein Glyma01g21530 Gm01 26738538 26739700 Glyma01g21540 Gm01 26740712 26741148 Glyma01g21570 Gm01 26771298 26774049 Glyma01g21580 Gm01 26814404 26816960 Glyma01g21590 Gm01 26883579 26885938 Glyma01g21620 Gm01 26918445 26921076 Glyma01g21660 Gm01 27034015 27042113 PF08154 NLE (NUC135) domain Glyma01g21680 Gm01 27083934 27086891 Glyma01g21710 Gm01 27124069 27134363 PF00076 RNA recognition motif. (a.k.a.
    [Show full text]
  • Evidence for the Preferential Reuse of Sub‐Domain Motifs in Primordial Protein Folds
    Received: 4 January 2021 Revised: 15 April 2021 Accepted: 28 April 2021 DOI: 10.1002/prot.26089 RESEARCH ARTICLE Evidence for the preferential reuse of sub-domain motifs in primordial protein folds Leonhard Heizinger | Rainer Merkl Institute of Biophysics and Physical Biochemistry, University of Regensburg, Abstract Regensburg, Germany A comparison of protein backbones makes clear that not more than approximately Correspondence 1400 different folds exist, each specifying the three-dimensional topology of a pro- Rainer Merkl, Institute of Biophysics and tein domain. Large proteins are composed of specific domain combinations and many Physical Biochemistry, University of Regensburg, 93040 Regensburg, Germany. domains can accommodate different functions. These findings confirm that the reuse Email: [email protected] of domains is key for the evolution of multi-domain proteins. If reuse was also the Funding information driving force for domain evolution, ancestral fragments of sub-domain size exist that Deutsche Forschungsgemeinschaft, Grant/ are shared between domains possessing significantly different topologies. For the Award Numbers: ME 2259/4-1, SFB 960 fully automated detection of putatively ancestral motifs, we developed the algorithm Fragstatt that compares proteins pairwise to identify fragments, that is, instantiations of the same motif. To reach maximal sensitivity, Fragstatt compares sequences by means of cascaded alignments of profile Hidden Markov Models. If the fragment sequences are sufficiently similar, the program determines and scores the structural concordance of the fragments. By analyzing a comprehensive set of proteins from the CATH database, Fragstatt identified 12 532 partially overlapping and structurally similar motifs that clustered to 134 unique motifs. The dissemination of these motifs is limited: We found only two domain topologies that contain two different motifs and generally, these motifs occur in not more than 18% of the CATH topologies.
    [Show full text]