Structural Approaches to Protein Sequence Analysis

Total Page:16

File Type:pdf, Size:1020Kb

Structural Approaches to Protein Sequence Analysis Structural approaches to protein sequence analysis by David Tudor Jones Department of Biochemistry and Molecular Biology University College Gower Street London WCIE 6BT This thesis is submitted in fulfilment of the requirements for the degree of Doctor of Philosophy from the University of London. April 17, 1993 ProQuest Number: 10045889 All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent upon the quality of the copy submitted. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. uest. ProQuest 10045889 Published by ProQuest LLC(2016). Copyright of the Dissertation is held by the Author. All rights reserved. This work is protected against unauthorized copying under Title 17, United States Code. Microform Edition © ProQuest LLC. ProQuest LLC 789 East Eisenhower Parkway P.O. Box 1346 Ann Arbor, Ml 48106-1346 Abstract Various protein sequence analysis techniques are described, aimed at improving the prediction of protein structure by means of pattern matching. To investigate the possibility that improvements in amino acid comparison matrices could result in improvements in the sensitivity and accuracy of protein sequence alignments, a method for rapidly calculating amino acid mutation data matrices from large sequence data sets is presented. The method is then applied to the membrane-spanning segments of integral membrane proteins in order to investigate the nature of amino acid mutability in a lipid environment. Whilst purely sequence analytic techniques work well for cases where some residual sequence similarity remains between a newly characterized protein and a protein of known 3-D structure, in the harder cases, there is little or no sequence similarity with which to recognize proteins with similar folding patterns. In the light of these limitations, a new approach to protein fold recognition is described, which uses a statistically derived pairwise potential to evaluate the compatibility between a test sequence and a library of structural templates, derived from solved crystal structures. The method, which is called optimal sequence threading, proves to be highly successful, and is able to detect the common TIM barrel fold between a number of enzyme sequences, which has not been achieved by any previous sequence analysis technique. Finally, a new method for the prediction of the secondary structure and topology of membrane proteins is described. The method employs a set of statistical tables compiled from well-characterized membrane protein data, and a novel dynamic programming algorithm to recognize membrane topology models by expectation maximization. The statistical tables show definite biases towards certain amino acid species on the inside, middle and outside of a cellular membrane. Acknowledgements In many ways this section was the hardest of all to write. It seems that 1 have received help and encouragement in one way or another from just about everyone 1 have met during my PhD course. Unfortunately, due to the limits of both space and memory, 1 can only hope to mention a fraction of those who have played an important part in the completion of this project. My sincerest apologies to anyone 1 have omitted. It goes without saying that 1 wouldn’t be writing this were it not for the help, encourage­ ment, trust, friendship and of course education 1 have received from my two supervisors. Professor Janet Thornton and Dr Willie Taylor. In many senses, it is hard for a PhD student to actually choose a supervisor (let alone TWO supervisors) - they are pretty much chosen for you by the constraints of geography, project availability, timing, luck and of course whether or not the supervisor in question actually wants you to work on their project. All 1 can say is that if 1 could have chosen from every supervisor in the field, 1 could not have made a better choice. 1 greatly look forward to our continued friendship and collaboration in the future. Thanks to the following: Dr Christine Orengo for all the general help, encouragement, technical discussions on algorithms over the years. Above all, thanks for never physically ejecting me from your office when I’ve been looking for someone to talk to and you’ve had a desk-full of work to do! Dr Tom Flores for all the help and advice, and the many discussions we’ve had on topics too numerous to mention - most of which left me out of my depth after about 2 milliseconds. Acknowledgements Dr Mark Swindells for the coffee, ice-cream, The Simpsons™, extremely stimulating technical arguments, and humour beyond the call of duty. Don’t stay in Japan for ever! Dr Simon Hubbard for even more coffee, ice-cream, discussions, the brilliant wit, and keeping a grip on reality even when the rest of the world appeared to be auditioning for a part in a sequel to One Flew Over the Cuckoo's Nest. Dr Nigel Brown for teaching me about Un*x, and the world of real computers. Dr Richard Mott for much helpful advice on statistics and many useful suggestions. Dr Tom Kirkwood for making me welcome in the Laboratory of Mathematical Biology at the National Institute for Medical Research (NIMR), where some of this work was carried out. And the following people for help, discussion, computer code, jokes but above all just for making my workplace fun to be in: Dr Andras Azodi, Steve Gardner, Dr Michael Green, Dr Gail Hutchinson, Dr Axel Kowald, Dr Roman Laskowski, Malcolm Mac Arthur, Dr Laurence Pearl, Frances Richardson, Dr Jus Singh, Helen Stirk, Dr Dek Woolfson, and Dr Marketa Zvelebil. This work was supported by a SERC CASE studentship with the MRC National Institute for Medical Research, Mill Hill, London. To my parents - Rose and Tudor Jones. Contents Abstract........................................................................................................................ 2 Acknowledgements..................................................................................................... 3 Contents ..................................................................................................................... 6 List of Figures.............................................................................................................. 10 List of Tables................................................................................................................ 13 Commonly used abbreviations .................................................................................. 15 Standard amino acid codes......................................................................................... 15 Chapter 1 - Pattern Matching Methods in ProteinStructure Prediction 16 1.1 Protein structure prediction from first principles ............................................ 17 1.2 Pattern matching methods in the prediction of protein structure and fu n c tio n......................................................................................................... 18 1.3 Consensus methods .......................................................................................... 23 1.4 Regular ex pressions .......................................................................................... 31 1.5 Specific structural patterns............................................................................... 32 1.6 Pattern assessment............................................................................................ 34 1.7 Prediction of progress at la s t .......................................................................... 36 Chapter 2 - The Calculation of New Amino Acid Comparison M atrices 40 2.1 Introduction......................................................................................................... 41 2.2 Types of scoring matrix .................................................................................. 41 6 Contents 2.3 Construction of the raw PAM matrix ........................................................... 44 2.4 Calculation of relative mutabilities............................................................... 45 2.5 Calculation of the mutation probability m atrix ........................................... 46 2.6 Calculating the log-odds m atrix .................................................................... 48 2.7 Automating the procedure ............................................................................... 48 2.8 Program implementation .................................................................................. 54 2.9 Results ................................................................................................................. 54 2.10 Summary............................................................................................................ 61 2.11 Alignment parameter optimization .............................................................. 64 2.12 A mutation data matrix for transmembrane proteins ............................... 70 2.13 Discussion......................................................................................................... 77 Chapter 3 - Protein Tertiary Structure Prediction by Fold Recognition .......... 84 3.1 Introduction......................................................................................................... 85 3.2 A limited number of folds ................................................................................ 88 3.3 The fold library ................................................................................................
Recommended publications
  • MATHEMATICAL TECHNIQUES in STRUCTURAL BIOLOGY Contents 0. Introduction 4 1. Molecular Genetics: DNA 6 1.1. Genetic Code 6 1.2. T
    MATHEMATICAL TECHNIQUES IN STRUCTURAL BIOLOGY J. R. QUINE Contents 0. Introduction 4 1. Molecular Genetics: DNA 6 1.1. Genetic code 6 1.2. The geometry of DNA 6 1.3. The double helix 6 1.4. Larger organization of DNA 7 1.5. DNA and proteins 7 1.6. Problems 7 2. Molecular Genetics: Proteins 10 2.1. Amino Acids 10 2.2. The genetic code 10 2.3. Amino acid template 11 2.4. Tetrahedral geometry 11 2.5. Amino acid structure 13 2.6. The peptide bond 13 2.7. Protein structure 14 2.8. Secondary structure 14 3. Frames and moving frames 19 3.1. Basic definitions 19 3.2. Frames and gram matrices 19 3.3. Frames and rotations 20 3.4. Frames fixed at a point 20 3.5. The Frenet Frame 20 3.6. The coiled-coil 22 3.7. The Frenet formula 22 3.8. Problems 24 4. Orthogonal transformations and Rotations 25 4.1. The rotation group 25 4.2. Complex form of a rotation 28 4.3. Eigenvalues of a rotation 28 4.4. Properties of rotations 29 4.5. Problems 30 5. Torsion angles and pdb files 33 5.1. Torsion Angles 33 5.2. The arg function 34 5.3. The torsion angle formula 34 5.4. Protein torsion angles. 35 5.5. Protein Data Bank files. 35 1 2 J. R. QUINE 5.6. Ramachandran diagram 36 5.7. Torsion angles on the diamond packing 37 5.8. Appendix, properties of cross product 38 5.9. Problems 38 6.
    [Show full text]
  • Folding-TIM Barrel
    Protein Folding Practical September 2011 Folding up the TIM barrel Preliminary Examine the parallel beta barrel that you constructed, noting the stagger of the strands that was needed to connect the ends of the 8-stranded parallel beta sheet into the 8-stranded beta barrel. Notice that the stagger dictates which side of the sheet is on the inside and which is on the outside. This will be key information in folding the complete TIM linear peptide into the TIM barrel. Assembling the full linear peptide 1. Make sure the white beta strands are extended correctly, and the 8 yellow helices (with the green loops at each end) are correctly folded into an alpha helix (right handed with H-bonds to the 4th ahead in the chain). 2. starting with a beta strand connect an alpha helix and green loop to make the blue-red connecting peptide bond. Making sure that you connect the carbonyl (red) end of the beta strand to the amino (blue) end of the loop-helix-loop. Secure the just connected peptide bond bond with a twist-tie as shown. 3. complete step 2 for all beta strand/loop-helix-loop pairs, working in parallel with your partners 4. As pairs are completed attach the carboxy end of the strand- loop-helix-loop to the amino end of the next strand-loop-helix-loop module and secure the new peptide bond with a twist-tie as before. Repeat until the full linear TIM polypeptide chain is assembled. Make sure all strands and helices are still in the correct conformations.
    [Show full text]
  • Supplemental Table 7. Every Significant Association
    Supplemental Table 7. Every significant association between an individual covariate and functional group (assigned to the KO level) as determined by CPGLM regression analysis. Variable Unit RelationshipLabel See also CBCL Aggressive Behavior K05914 + CBCL Emotionally Reactive K05914 + CBCL Externalizing Behavior K05914 + K15665 K15658 CBCL Total K05914 + K15660 K16130 KO: E1.13.12.7; photinus-luciferin 4-monooxygenase (ATP-hydrolysing) [EC:1.13.12.7] :: PFAMS: AMP-binding enzyme; CBQ Inhibitory Control K05914 - K12239 K16120 Condensation domain; Methyltransferase domain; Thioesterase domain; AMP-binding enzyme C-terminal domain LEC Family Separation/Social Services K05914 + K16129 K16416 LEC Poverty Related Events K05914 + K16124 LEC Total K05914 + LEC Turmoil K05914 + CBCL Aggressive Behavior K15665 + CBCL Anxious Depressed K15665 + CBCL Emotionally Reactive K15665 + K05914 K15658 CBCL Externalizing Behavior K15665 + K15660 K16130 KO: K15665, ppsB, fenD; fengycin family lipopeptide synthetase B :: PFAMS: Condensation domain; AMP-binding enzyme; CBCL Total K15665 + K12239 K16120 Phosphopantetheine attachment site; AMP-binding enzyme C-terminal domain; Transferase family CBQ Inhibitory Control K15665 - K16129 K16416 LEC Poverty Related Events K15665 + K16124 LEC Total K15665 + LEC Turmoil K15665 + CBCL Aggressive Behavior K11903 + CBCL Anxiety Problems K11903 + CBCL Anxious Depressed K11903 + CBCL Depressive Problems K11903 + LEC Turmoil K11903 + MODS: Type VI secretion system K01220 K01058 CBCL Anxiety Problems K11906 + CBCL Depressive
    [Show full text]
  • Tertiary Structure
    Comments Structural motif v sequence motif polyproline (“PXXP”) motif for SH3 binding “RGD” motif for integrin binding “GXXXG” motif within the TM domain of membrane protein Most common type I’ beta turn sequences: X – (N/D/G)G – X Most common type II’ beta turn sequences: X – G(S/T) – X 1 Putting it together Alpha helices and beta sheets are not proteins—only marginally stable by themselves … Extremely small “proteins” can’t do much 2 Tertiary structure • Concerns with how the secondary structure units within a single polypeptide chain associate with each other to give a three- dimensional structure • Secondary structure, super secondary structure, and loops come together to form “domains”, the smallest tertiary structural unit • Structural domains (“domains”) usually contain 100 – 200 amino acids and fold stably. • Domains may be considered to be connected units which are to varying extents independent in terms of their structure, function and folding behavior. Each domain can be described by its fold, i.e. how the secondary structural elements are arranged. • Tertiary structure also includes the way domains fit together 3 Domains are modular •Because they are self-stabilizing, domains can be swapped both in nature and in the laboratory PI3 kinase beta-barrel GFP Branden & Tooze 4 fluorescence localization experiment Chimeras Recombinant proteins are often expressed and purified as fusion proteins (“chimeras”) with – glutathione S-transferase – maltose binding protein – or peptide tags, e.g. hexa-histidine, FLAG epitope helps with solubility, stability, and purification 5 Structural Classification All classifications are done at the domain level In many cases, structural similarity implies a common evolutionary origin – structural similarity without evolutionary relationship is possible – but no structural similarity means no evolutionary relationship Each domain has its corresponding “fold”, i.e.
    [Show full text]
  • Downloaded from Ref
    bioRxiv preprint doi: https://doi.org/10.1101/201152; this version posted October 10, 2017. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Touching proteins with virtual bare hands: how to visualize protein-drug complexes and their dynamics in virtual reality Erick Martins Ratamero,1 Dom Bellini,2 Christopher G. Dowson,2 and Rudolf A. R¨omer1, ∗ 1Department of Physics, University of Warwick, Coventry, CV4 7AL, UK 2School of Life Sciences, University of Warwick, Coventry, CV4 7AL, UK (Dated: Revision : 1:0, compiled October 10, 2017) Abstract The ability to precisely visualize the atomic geometry of the interactions between a drug and its protein target in structural models is critical in predicting the correct modifications in previously identified inhibitors to create more effective next generation drugs. It is currently common practice among medicinal chemists while attempting the above to access the information contained in three-dimensional structures by using two-dimensional projections, which can preclude disclosure of useful features. A more precise visualization of the three-dimensional configuration of the atomic geometry in the models can be achieved through the implementation of immersive virtual reality (VR). In this work, we present a freely available software pipeline for visualising protein structures through VR. New customer hardware, such as the HTC Vive and the Oculus Rift utilized in this study, are available at reasonable prices. Moreover, we have combined VR visualization with fast algorithms for simulating intramolecular motions of protein flexibility, in an effort to further improve structure-lead drug design by exposing molecular interactions that might be hidden in the less informative static models.
    [Show full text]
  • Greek) Key to Structures Review of Neural Adhesion Molecules
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector Neuron, Vol. 16, 261±273, February, 1996, Copyright 1996 by Cell Press The (Greek) Key to Structures Review of Neural Adhesion Molecules Daniel E. Vaughn* and Pamela J. Bjorkman*² 1988; Yoshihara et al., 1991). Many neural CAM Ig super- *Division of Biology family members include Fn-III domains arranged in tan- ² Howard Hughes Medical Institute dem with Ig-like domains (Figure 1). Three-dimensional California Institute of Technology structures are available for domains of several classes Pasadena, California 91125 of Ig-like domains and for Fn-III domains; thus, one can mentally (or using computer graphics; e.g., see Figure 4) piece together the likely structures of the extracellular Cells need to adhere specifically to cellular and extracel- regions of many neural CAMs. Cadherins are also impor- lular components of their environment to carry out di- tant neural CAMs, forming homophilic adhesion inter- verse physiological functions. Examples of such func- faces in the presence of calcium (Geiger and Ayalon, tions within the nervous system include neurite 1992). Two recent structures of cadherin domains pro- extension, synapse formation, and the myelination of vide a clue about how the adhesive interface is formed. axons. The ability to recognize multiple environmental The classification of Ig-like domains has evolved since cues and to undergo specific adhesion is critical to each the first description of the Ig superfamily (Williams and of these complex cellular functions. Recognition and Barclay, 1988) because of many recent structure deter- adhesion are mediated by cell adhesion molecules minations.
    [Show full text]
  • BMC Structural Biology Biomed Central
    BMC Structural Biology BioMed Central Research article Open Access Natural history of S-adenosylmethionine-binding proteins Piotr Z Kozbial*1 and Arcady R Mushegian1,2 Address: 1Stowers Institute for Medical Research, 1000 E. 50th St., Kansas City, MO 64110, USA and 2Department of Microbiology, Molecular Genetics, and Immunology, University of Kansas Medical Center, Kansas City, Kansas 66160, USA Email: Piotr Z Kozbial* - [email protected]; Arcady R Mushegian - [email protected] * Corresponding author Published: 14 October 2005 Received: 21 July 2005 Accepted: 14 October 2005 BMC Structural Biology 2005, 5:19 doi:10.1186/1472-6807-5-19 This article is available from: http://www.biomedcentral.com/1472-6807/5/19 © 2005 Kozbial and Mushegian; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract Background: S-adenosylmethionine is a source of diverse chemical groups used in biosynthesis and modification of virtually every class of biomolecules. The most notable reaction requiring S- adenosylmethionine, transfer of methyl group, is performed by a large class of enzymes, S- adenosylmethionine-dependent methyltransferases, which have been the focus of considerable structure-function studies. Evolutionary trajectories of these enzymes, and especially of other classes of S-adenosylmethionine-binding proteins, nevertheless, remain poorly understood. We addressed this issue by computational comparison of sequences and structures of various S- adenosylmethionine-binding proteins. Results: Two widespread folds, Rossmann fold and TIM barrel, have been repeatedly used in evolution for diverse types of S-adenosylmethionine conversion.
    [Show full text]
  • Molecular Modeling 2021 Lecture 3 -- Tues Feb 2
    Molecular Modeling 2021 lecture 3 -- Tues Feb 2 Protein classification SCOP TOPS Contact maps domains Domains To a cell biologist a domain is a sequential unit within a gene, usually with a specific function. To a structural biologist a domain is a compact globular unit within a protein, classified by its 3D structure. 2 A domain is... • ... an autonomously-folding substructure of a protein. • ... > 30 residues, but typically < 200. May be bigger. • ...usually has a single hydrophobic core • ... usually composed of one chain (occasionally composed of multiple chains) • ...is usually composed on one contiguous segment (occasionally made of discontiguous segments of the same chain) 3 SARS-CoV-2 spike protein — a multi domain protein 4 SCOPe -- classification of domains !http://scop.berkeley.edu similar secondary structure (1) class content (2) fold vague structural homology (3) superfamily Clear structural homology (4) family increasing structural similarity structural increasing (5) protein Clear sequence homology (6) species nearly identical sequences individual structures SCOPe -- class 1. all α (289) classes of domains 2. all β (178) 3. α/β (148) 4. α+β (388) 5. multidomain (71) 6. membrane (60) 7. small (98) Not true classes of globular 8. coiled coil (7) protein domains 9. low-resolution (25) 10. peptides (148) 11. designed proteins (44) 12. artifacts (1) Proteins of the same class conserve secondary structure content SCOPe -- fold level within α/β proteins -- Mainly parallel beta sheets (beta-alpha-beta units) TIM-barrel (22) swivelling beta/beta/alpha domain (5) Many folds have historical spoIIaa-like (2) names. “TIM” barrel was flavodoxin-like (10) first seen in TIM.
    [Show full text]
  • 8-Barrel Enzymes John a Gerlt and Frank M Raushely
    252 Evolution of function in (b/a)8-barrel enzymes John A Gerltà and Frank M Raushely The (b/a)8-barrel is the most common fold in structurally closed, parallel b-sheet structure of the (b/a)8-barrel is characterized enzymes. Whether the functionally diverse formed from eight parallel (b/a)-units linked by hydrogen enzymes that share this fold are the products of either divergent bonds that form a cylindrical core. Despite its eightfold or convergent evolution (or both) is an unresolved question that pseudosymmetry, the packing within the interior of the will probably be answered as the sequence databases continue barrel is better described as four (b/a)2-subdomains in to expand. Recent work has examined natural, designed, and which distinct hydrophobic cores are located between the directed evolution of function in several superfamilies of (b/a)8- b-sheets and flanking a-helices [2]. barrel containing enzymes. The active sites are located at the C-terminal ends of the Addresses b-strands. So placed, the functional groups surround the ÃDepartments of Biochemistry and Chemistry, University of Illinois at active site and are structurally independent: the ‘old’ and Urbana-Champaign, 600 South Mathews Avenue, Urbana, ‘new’ enzymes retain functional groups at the ends of Illinois 61801, USA e-mail: [email protected] some b-strands, but others are altered to allow the ‘new’ yDepartment of Chemistry, Texas A&M University, PO Box 30012, activity [3]. With this blueprint, the (b/a)8-barrel is opti- College Station, Texas 77842-3012, USA mized for evolution of new functions.
    [Show full text]
  • Booklet-The-Structures-Of-Life.Pdf
    The Structures of Life U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES NIH Publication No. 07-2778 National Institutes of Health Reprinted July 2007 National Institute of General Medical Sciences http://www.nigms.nih.gov Contents PREFACE: WHY STRUCTURE? iv CHAPTER 1: PROTEINS ARE THE BODY’S WORKER MOLECULES 2 Proteins Are Made From Small Building Blocks 3 Proteins in All Shapes and Sizes 4 Computer Graphics Advance Research 4 Small Errors in Proteins Can Cause Disease 6 Parts of Some Proteins Fold Into Corkscrews 7 Mountain Climbing and Computational Modeling 8 The Problem of Protein Folding 8 Provocative Proteins 9 Structural Genomics: From Gene to Structure, and Perhaps Function 10 The Genetic Code 12 CHAPTER 2: X-RAY CRYSTALLOGRAPHY: ART MARRIES SCIENCE 14 Viral Voyages 15 Crystal Cookery 16 Calling All Crystals 17 Student Snapshot: Science Brought One Student From the Coast of Venezuela to the Heart of Texas 18 Why X-Rays? 20 Synchrotron Radiation—One of the Brightest Lights on Earth 21 Peering Into Protein Factories 23 Scientists Get MAD at the Synchrotron 24 CHAPTER 3: THE WORLD OF NMR: MAGNETS, RADIO WAVES, AND DETECTIVE WORK 26 A Slam Dunk for Enzymes 27 NMR Spectroscopists Use Tailor-Made Proteins 28 NMR Magic Is in the Magnets 29 The Many Dimensions of NMR 30 NMR Tunes in on Radio Waves 31 Spectroscopists Get NOESY for Structures 32 The Wiggling World of Proteins 32 Untangling Protein Folding 33 Student Snapshot: The Sweetest Puzzle 34 CHAPTER 4: STRUCTURE-BASED DRUG DESIGN: FROM THE COMPUTER TO THE CLINIC 36 The Life of an AIDS
    [Show full text]
  • Hidden Sequence Repeats: Additional Evidence for the Origin of TIM-Barrel Family
    J. Biomedical Science and Engineering, 2016, 9, 307-314 Published Online May 2016 in SciRes. http://www.scirp.org/journal/jbise http://dx.doi.org/10.4236/jbise.2016.96025 Hidden Sequence Repeats: Additional Evidence for the Origin of TIM-Barrel Family Xiaofeng Ji, Yuan Zheng, Zhipeng Wang, Jun Sheng* Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao, China Received 19 April 2016; accepted 28 May 2016; published 31 May 2016 Copyright © 2016 by authors and Scientific Research Publishing Inc. This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/ Abstract Most proteins adopt an approximate structural symmetry. However, they have no symmetry de- tectable in their sequences and it is unclear for most of these proteins whether their structural symmetry originates from duplication. As one of the six popular folds (super-folds) possessing an approximate structural symmetry, the triosephosphate isomerase barrel (TIM-barrel) domain has been widely studied. Using modified recurrent quantification analysis of primary sequences, we identified the same 2-, 3-, and 4-fold symmetry pattern as their tertiary structures. This result in- dicates that the symmetry in tertiary structure is coded by symmetry in the primary sequence and that the TIM-barrel adopts a 2-, 3-, or 4-fold repeat pattern during evolution. This discovery will be useful for understanding the evolutionary mechanisms of this protein family and the symmetry pattern that may be a clue into the ancient origin of duplication of half-barrels or the β a unit. Keywords TIM-Barrel, Hidden Symmetry, Primary Sequences, Repeat Pattern, Recurrence Quantification Analysis 1.
    [Show full text]
  • Minireview Visual Methods from Atoms to Cells
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector Structure, Vol. 13, 347–354, March, 2005, ©2005 Elsevier Ltd All rights reserved. DOI 10.1016/j.str.2005.01.012 Visual Methods from Atoms to Cells Minireview David S. Goodsell1,* Structure Department of Molecular Biology Most molecular illustrations are illustrations of molecu- The Scripps Research Institute lar structure. Molecular structure is naturally amenable 10550 N. Torrey Pines Road to visual metaphors. Molecules are physical objects, La Jolla, California 92037 with defined sizes and shapes. The vagaries of quantum indeterminacy only become important at submolecular levels (with a few amazing exceptions, such as reso- nance energy transfer); thus, in many cases, we can Illustrations of molecular models are widely used for treat molecular structures just as we would treat the the study and dissemination of molecular structure structure of a house or a chair, by using familiar meth- and function. Several metaphors are commonly used ods of rendering images of solid objects lit by discrete to create these illustrations, and each captures a rele- light sources. Three metaphors—lines, spheres, and rib- vant aspect of the molecule and omits other aspects. bons (Figure 1)—have shown lasting success because Effective tools are available for rendering atomic they each capture an important structural property of structures by using several standard representations, the molecule, and they are all easily rendered in a vi- and the research community is highly sophisticated sual form. in their use. Molecular properties, such as electrostat- The covalent structure of a molecule is effectively ics, and large complex molecular and cellular sys- displayed through the use of a bond diagram.
    [Show full text]