Improved Prediction of Protein Side-Chain Conformations with SCWRL4 Georgii G

Total Page:16

File Type:pdf, Size:1020Kb

Improved Prediction of Protein Side-Chain Conformations with SCWRL4 Georgii G proteins STRUCTURE O FUNCTION O BIOINFORMATICS Improved prediction of protein side-chain conformations with SCWRL4 Georgii G. Krivov,1,2 Maxim V. Shapovalov,1 and Roland L. Dunbrack Jr.1* 1 Institute for Cancer Research, Fox Chase Cancer Center, 333 Cottman Avenue, Philadelphia, Pennsylvania 19111 2 Department of Applied Mathematics, Moscow Engineering Physics Institute (MEPhI), Kashirskoe Shosse 31, Moscow 115409, Russian Federation INTRODUCTION ABSTRACT The side-chain conformation prediction problem is an integral Determination of side-chain conformations is an component of protein structure determination, protein structure important step in protein structure prediction and protein design. Many such methods have prediction, and protein design. In single-site mutants and in closely been presented, although only a small number related proteins, the backbone often changes little and structure pre- 1 are in widespread use. SCWRL is one such diction can be accomplished by accurate side-chain prediction. In method, and the SCWRL3 program (2003) has docking of ligands and other proteins, taking into account changes remained popular because of its speed, accuracy, in side-chain conformation is often critical to accurate structure pre- and ease-of-use for the purpose of homology dictions of complexes.2–4 Even in methods that take account of modeling. However, higher accuracy at compara- changes in backbone conformation, one step in the process is recal- ble speed is desirable. This has been achieved in culation of side-chain conformation or ‘‘repacking.’’5 Because many a new program SCWRL4 through: (1) a new backbone conformations may be sampled in model refinements, backbone-dependent rotamer library based on side-chain prediction must also be very fast. In protein design, as kernel density estimates; (2) averaging over sam- ples of conformations about the positions in the changes in the sequence are proposed by Monte Carlo steps or other rotamer library; (3) a fast anisotropic hydrogen algorithms, conformations of side chains need to be predicted accu- 6–8 bonding function; (4) a short-range, soft van der rately to determine whether the change is favorable or not. Waals atom–atom interaction potential; (5) fast Most side-chain prediction methods are based on a sample space collision detection using k-discrete oriented poly- that depends on a rotamer library, which is a statistical clustering of topes; (6) a tree decomposition algorithm to observed side-chain conformations in known structures.9 Such solve the combinatorial problem; and (7) optimi- rotamer libraries can be backbone-independent, lumping all side zation of all parameters by determining the chains together regardless of the local backbone conformation,10 or interaction graph within the crystal environment backbone-dependent, such that frequencies and dihedral angles vary using symmetry operators of the crystallographic with the backbone dihedral angles / and w.11,12 An alternative to space group. Accuracies as a function of electron density of the side chains demonstrate that side using statistical rotamer libraries is to use conformer libraries, which chains with higher electron density are easier to are samples of side chains from known structures, usually in the predict than those with low-electron density and form of Cartesian coordinates, thus accounting for bond length, presumed conformational disorder. For a testing bond angle, and dihedral angle variability.13–16 Once a search space set of 379 proteins, 86% of v1 angles and 75% of in the form of rotamers (and samples around rotamers in some v112 angles are predicted correctly within 408 of cases) or conformers is defined, a scoring function is required to the X-ray positions. Among side chains with evaluate the suitability of the sampled conformations. These may higher electron density (25–100th percentile), include the negative logarithm of the observed rotamer library fre- these numbers rise to 89 and 80%. The new pro- quencies,17–20 van der Waals or hard sphere steric interactions of gram maintains its simple command-line inter- side chains with other side chains or the backbone, and sometimes face, designed for homology modeling, and is 20–24 now available as a dynamic-linked library for electrostatic, hydrogen bonding, and solvation terms. Many incorporation into other software programs. Proteins 2009; 00:000–000. Additional Supporting Information may be found in the online version of this article. VC 2009 Wiley-Liss, Inc. Grant sponsor: NIH; Grant numbers: R01 HG02302, R01 GM84453, P20 GM76222. *Correspondence to: Roland L. Dunbrack, Institute for Cancer Research, Fox Chase Cancer Center, Key words: homology modeling; side-chain pre- 333 Cottman Avenue, Philadelphia, PA 19111. E-mail: [email protected]. Received 17 February 2009; Accepted 13 May 2009 diction; protein structure; rotamer library; graph Published online in Wiley InterScience (www.interscience.wiley.com). decomposition; SCWRL. DOI: 10.1002/prot.22488 VC 2009 WILEY-LISS, INC. PROTEINS 1 G.G. Krivov et al. search algorithms have been applied, including cyclic usability of the program for homology modeling and optimization of single residues or pairs of residues,11,16 other purposes. As part of this, we wanted to make sure Monte Carlo,5,18,25 dead-end elimination (DEE),26,27 that the program always solves the structure prediction self-consistent mean field optimization,28 integer pro- problem in a reasonable time, even if the graph is not gramming,29 and graph decomposition.17,30,31 These sufficiently decomposable. This is accomplished with an methods vary in how fast they can solve the combi- approximation that while not guaranteeing a global mini- natorial problem, and whether they guarantee a global mum of the energy function given the rotamer search minimum of the given energy function or instead search space, does complete the calculation quickly in all cases for a low energy without such a guarantee. In general, tested. such a guarantee is not necessary, given the approximate In this article, we describe the development of the nature of the energy functions, and it is the overall pre- SCWRL4 program for prediction of protein side-chain diction accuracy and speed that are more important conformations. We used many different approaches to ac- features of a prediction method. In recent years, it has complish the goals described earlier. We have improved become clear that some flexibility around rotameric the SCWRL energy function using a new backbone-de- positions15,16,32 and more sophisticated energy func- pendent rotamer library (Shapovalov and Dunbrack, in tions20,33 are needed for improved side-chain packing preparation) which uses kernel density estimates and ker- and prediction. nel regressions to provide rotamer frequencies, dihedral SCWRL3 is one of the most widely used programs of angles, and variances that vary smoothly as a function of its type with 2986 licenses in 72 countries as of April 30, the backbone dihedral angles / and w. SCWRL4 also 2009. It uses a backbone-dependent rotamer library,12 a uses a short-range, soft van der Waals interaction poten- simple energy function based on the library rotamer fre- tial between atoms rather than a linear repulsive-only quencies and a purely repulsive steric energy term, and a function used in SCWRL3, as well as an anisotropic graph decomposition to solve the combinatorial packing hydrogen bond function similar to that used in Rosetta36 problem.30 It has a number of features that have made it (but using a different functional form that is faster to widely used. The first of these is speed, which has evaluate). To account for variation of dihedral angles enabled the program to be used on a number of web around the mean values given in the rotamer library, we servers that predict protein structure from sequence- used the approach of Mendes et al.,32 which samples v structure alignments34 and may perform many hundreds angles around the library values and averages the energy of predictions per day. The second is accuracy. At the of interaction between rotamers of different side chains time of its publication, it was one of the most accurate over these samples, resulting in a free-energy-like scoring side-chain prediction methods. However, a number of function. To determine the interaction graph, as used in other methods have appeared claiming higher accu- SCWRL3, we implemented a fast method for detecting racy,15,18,20,35 although often at much longer CPU collisions (i.e., atom–atom interactions less than some times. The third feature of SCWRL3 is usability. The pro- distance) using k-discrete oriented polytopes (kDOPs). gram takes input PDB coordinates for the backbone, kDOPs are three-dimensional shapes with faces perpen- optionally a new sequence, and outputs coordinates for dicular to common fixed axes, such that kDOPs around the structure with predicted side chains using the same two groups of atoms can be rapidly tested for overlap.37 residue numbering and chain identifiers as the input In SCWRL3, we used a graph decomposition method structure. This feature is simple but in fact many if not that broke down the interaction graph of residues into most side-chain prediction programs renumber the resi- biconnected components, which overlap by single resi- dues of the input structure and strip the chain identifiers, dues called articulation points. In most cases, this solves making them difficult to use in homology modeling. the graph quickly. However, with a longer range energy One unfortunate feature of SCWRL3 is that the graph function and sampling about the rotameric dihedral decomposition method used may not always result in a angles, this is no longer true. We therefore implemented combinatorial optimization that can be solved quickly. In our own version of a tree decomposition of the graphs, such cases, the program may go on for many hours as suggested by Xu31 for the side-chain prediction prob- instead of finishing in a few seconds, since it lacks any lem.
Recommended publications
  • Side-Chain Liquid Crystal Polymers (SCLCP): Methods and Materials. an Overview
    Materials 2009, 2, 95-128; doi:10.3390/ma2010095 OPEN ACCESS materials ISSN 1996-1944 www.mdpi.com/journal/materials Review Side-chain Liquid Crystal Polymers (SCLCP): Methods and Materials. An Overview Tomasz Ganicz * and Włodzimierz Stańczyk Centre of Molecular and Macromolecular Studies, Polish Academy of Science, Sienkiewicza 112, 90- 363 Łódź, Poland; E-Mail: [email protected] * Author to whom correspondence should be addressed; E-Mail: [email protected]; Tel. +48-42-684-71-13; Fax: +48-42-684-71-26 Received: 5 February 2009; in revised form: 3 March 2009 / Accepted: 9 March 2009 / Published: 11 March 2009 Abstract: This review focuses on recent developments in the chemistry of side chain liquid crystal polymers. It concentrates on current trends in synthetic methods and novel, well defined structures, supramolecular arrangements, properties, and applications. The review covers literature published in this century, apart from some areas, such as dendritic and elastomeric systems, which have been recently reviewed. Keywords: Liquid crystals, side chain polymers, polymer modification, polymer synthesis, self-assembly. 1. Introduction Over thirty years ago the work of Finkelmann and Ringsdorf [1,2] gave important momentum to synthesis of side chain liquid crystal polymers (SCLCPs), materials which combine the anisotropy of liquid crystalline mesogens with the mechanical properties of polymers. Although there were some earlier attempts [2], their approach of decoupling the motions of a polymer main chain from a mesogen, thus allowed side chain moieties to build up long range ordering (Figure 1). They were able to synthesize polymers with nematic, smectic and cholesteric phases via free radical polymerization of methacryloyl type monomers [3].
    [Show full text]
  • The Role of Side-Chain Branch Position on Thermal Property of Poly-3-Alkylthiophenes
    Polymer Chemistry The Role of Side-chain Branch Position on Thermal Property of Poly-3-alkylthiophenes Journal: Polymer Chemistry Manuscript ID PY-ART-07-2019-001026.R2 Article Type: Paper Date Submitted by the 02-Oct-2019 Author: Complete List of Authors: Gu, Xiaodan; University of Southern Mississippi, School of Polymer Science and Engineering Cao, Zhiqiang; University of Southern Mississippi, School of Polymer Science and Engineering Galuska, Luke; University of Southern Mississippi, School of Polymer Science and Engineering Qian, Zhiyuan; University of Southern Mississippi, School of Polymer Science and Engineering Zhang, Song; University of Southern Mississippi, School of Polymer Science and Engineering Huang, Lifeng; University of Southern Mississippi, School of Polymers and High Performance Materials Prine, Nathaniel; University of Southern Mississippi, School of Polymer Science and Engineering Li, Tianyu; Oak Ridge National Laboratory; University of Tennessee Knoxville He, Youjun; Oak Ridge National Laboratory Hong, Kunlun; Oak Ridge National Laboratory, Center for Nanophase Materials Science; Oak Ridge National Laboratory, Center for Nanophase Materials Science Page 1 of 13 PleasePolymer do not Chemistryadjust margins ARTICLE The Role of Side-chain Branch Position on Thermal Property of Poly-3-alkylthiophenes Received 00th January 20xx, a a a a a a Accepted 00th January 20xx Zhiqiang Cao, Luke Galuska, Zhiyuan Qian, Song Zhang, Lifeng Huang, Nathaniel Prine, Tianyu Li,b, c Youjun He,b Kunlun Hong,*b, c and Xiaodan Gu*a DOI: 10.1039/x0xx00000x Thermomechanical properties of conjugated polymers (CPs) are greatly influenced by both their microstructures and backbone structures. In the present work, to investigate the effect of side-chain branch position on the backbone’s mobility and molecular packing structure, four poly (3-alkylthiophene-2, 5-diyl) derivatives (P3ATs) with different side chains, either branched and linear, were synthesized by a quasi-living Kumada catalyst transfer polymerization (KCTP) method.
    [Show full text]
  • Molecular Modelling Virtual Chemistry Resource Pymol Experiment Answer Sheet Well Done for Completing Our Pymol Experiment! We Really Hope You Enjoyed It!
    #Outtheboxthinking Faculty of Natural & Mathematical Sciences Department of Chemistry Molecular Modelling Virtual Chemistry Resource PyMol Experiment Answer Sheet Well done for completing our PyMol experiment! We really hope you enjoyed it! Please make sure you have completed our post experiment questionnaire. https://kings.onlinesurveys.ac.uk/post-questionnaire-for-outtheboxthinking-resources Send us photos, comments, or thoughts to either our twitter or Instagram accounts using the hashtag #outtheboxthinking. We also encourage you to make a poster of your data and email it to us. Exploring how proteins are formed Press “Amino Acid” Q. What is the R group in the side chain of this amino acid? CH3- Methyl Group Q. Which amino acid is shown here? Alanine Press the button “ALA_ASP” and an additional amino acid will appear Q. What is the chemical formula of the side chain of the new amino acid? CH2COOH Q. What is the name of the second, new amino acid? Aspartic Acid or Aspartate Press the button “Bonded_Ala_Asp” Q. What colour(s) is the amide bond connecting the two amino acids? Blue and green. It is important to highlight the bond between the nitrogen of one amino acid and the carbonyl of the second amino acid and not any other bond (Figure 1). Figure 1: Arrow points to the amide bond formed between two amino acids. Page 1 of 8 Instagram: kclchemoutreach #outtheboxthinking Q. Which elements have been lost from each amino acid following their bonding? Amino acid 1:: Alanine has lost an oxygen and a hydrogen Amino acid 2: Aspartic Acid has lost a hydrogen If you are unsure why this is correct look at figure 2 and open the PyMol document again to visualise the bonding Figure 2: Reaction scheme for amino acid bonding, resulting in the formation of an amide bond and the loss of water.
    [Show full text]
  • Amino Acid Recognition by Aminoacyl-Trna Synthetases
    www.nature.com/scientificreports OPEN The structural basis of the genetic code: amino acid recognition by aminoacyl‑tRNA synthetases Florian Kaiser1,2,4*, Sarah Krautwurst3,4, Sebastian Salentin1, V. Joachim Haupt1,2, Christoph Leberecht3, Sebastian Bittrich3, Dirk Labudde3 & Michael Schroeder1 Storage and directed transfer of information is the key requirement for the development of life. Yet any information stored on our genes is useless without its correct interpretation. The genetic code defnes the rule set to decode this information. Aminoacyl-tRNA synthetases are at the heart of this process. We extensively characterize how these enzymes distinguish all natural amino acids based on the computational analysis of crystallographic structure data. The results of this meta-analysis show that the correct read-out of genetic information is a delicate interplay between the composition of the binding site, non-covalent interactions, error correction mechanisms, and steric efects. One of the most profound open questions in biology is how the genetic code was established. While proteins are encoded by nucleic acid blueprints, decoding this information in turn requires proteins. Te emergence of this self-referencing system poses a chicken-or-egg dilemma and its origin is still heavily debated 1,2. Aminoacyl-tRNA synthetases (aaRSs) implement the correct assignment of amino acids to their codons and are thus inherently connected to the emergence of genetic coding. Tese enzymes link tRNA molecules with their amino acid cargo and are consequently vital for protein biosynthesis. Beside the correct recognition of tRNA features3, highly specifc non-covalent interactions in the binding sites of aaRSs are required to correctly detect the designated amino acid4–7 and to prevent errors in biosynthesis5,8.
    [Show full text]
  • 2. Proteins Have Hierarchies of Structure
    27 2. Proteins have Hierarchies of Structure Protein structure is usually described at four different levels (Fig. II.2.1). The first level, called the primary structure, describes the linear sequence of the amino acids in the chain. The different primary structures correspond to the different sequences in which the amino acids are covalently linked together. The secondary structure describes two common patterns of structural repetition in proteins: the coiling up into helices of segments of the chain, and the pairing together of strands of the chain into β-sheets. The tertiary structure is the next higher level of organization, the overall arrangement of secondary structural elements. The quaternary structure describes how different polypeptide chains are assembled into complexes. Figure II.2.1. Different levels of protein structure. A protein chain’s primary structure is its amino acid sequence. Secondary structure consists of the regular organization of helices and sheets. The example shown is a schematic representation of an α helix. Tertiary structure is a polypeptide chain’s three- dimensional native conformation, which often involves compact packing of secondary structure elements. The example given in the figure is a schematic drawing of one of the four polypeptide chains (subunits) of hemoglobin, the protein that transports oxygen in the blood. α-helices along the chain are represented as cylinders. N is the amino terminus and C is the carboxyl terminus of the polypeptide chain. Quaternary structure is the arrangement of multiple polypeptide chains (subunits) to form a functional biomolecular structure. The figure shows the quaternary arrangement of four subunits to form the functional hemoglobin molecule.
    [Show full text]
  • Prediction of Amino Acid Side Chain Conformation Using a Deep Neural
    Prediction of amino acid side chain conformation using a deep neural network Ke Liu 1, Xiangyan Sun3, Jun Ma3, Zhenyu Zhou3, Qilin Dong4, Shengwen Peng3, Junqiu Wu3, Suocheng Tan3, Günter Blobel2, and Jie Fan1, Corresponding author details: Jie Fan, Accutar Biotechnology, 760 Parkside Ave, Room 213 Brooklyn, NY 11226, USA. [email protected] 1. Accutar Biotechnology, 760 parkside Ave, Room 213, Brooklyn, NY 11226, USA. 2. Laboratory of Cell Biology, Howard Hughes Medical Institute, The Rockefeller University, New York, New York 10065, USA 3. Accutar Biotechnology (Shanghai) Room 307, No. 6 Building, 898 Halei Rd, Shanghai, China 4. Fudan University 220 Handan Rd, Shanghai, China Summary: A deep neural network based architecture was constructed to predict amino acid side chain conformation with unprecedented accuracy. Amino acid side chain conformation prediction is essential for protein homology modeling and protein design. Current widely-adopted methods use physics-based energy functions to evaluate side chain conformation. Here, using a deep neural network architecture without physics-based assumptions, we have demonstrated that side chain conformation prediction accuracy can be improved by more than 25%, especially for aromatic residues compared with current standard methods. More strikingly, the prediction method presented here is robust enough to identify individual conformational outliers from high resolution structures in a protein data bank without providing its structural factors. We envisage that our amino acid side chain predictor could be used as a quality check step for future protein structure model validation and many other potential applications such as side chain assignment in Cryo-electron microscopy, crystallography model auto-building, protein folding and small molecule ligand docking.
    [Show full text]
  • 6 Synthesis of N-Alkyl Amino Acids Luigi Aurelio and Andrew B
    j245 6 Synthesis of N-Alkyl Amino Acids Luigi Aurelio and Andrew B. Hughes 6.1 Introduction Among the numerous reactions of nonribosomal peptide synthesis, N-methylation of amino acids is one of the common motifs. Consequently, the chemical research community interested in peptide synthesis and peptide modification has generated a sizeable body of literature focused on the synthesis of N-methyl amino acids (NMA). That literature is summarized herein. Alkyl groups substituted on to nitrogen larger than methyl are exceedingly rare among natural products. However, medicinal chemistry programs and peptide drug development projects are not limited to N-methylation. While being a much smaller body of research, there is a range of methods for the N-alkylation of amino acids and those reports are also covered in this chapter. The literature on N-alkyl, primarily N-methyl amino acids comes about due to the useful properties that the N-methyl group confers on peptides. N-Methylation increases lipophilicity, which has the effect of increasing solubility in nonaqueous solvents and improving membrane permeability. On balance this makes peptides more bioavailable and makes them better therapeutic candidates. One potential disadvantage is the methyl group removes the possibility of hydro- gen bonding and so binding events may be discouraged. It is notable though that the N-methyl group does not fundamentally alter the identity of the amino acid. Some medicinal chemists have taken advantage of this fact to deliberately discourage binding of certain peptides that can still participate in the general or partial chemistry of a peptide. A series of recent papers relating to Alzheimers disease by Doig et al.
    [Show full text]
  • Amino Acids, Peptides, Proteins: Introduction
    Chem 109 C Bioorganic Compounds Fall 2019 HFH1104 Armen Zakarian Office: Chemistry Bldn 2217 http://labs.chem.ucsb.edu/~zakariangroup/courses.html CLAS Instructor: Dhillon Bhavan [email protected] Amino acids, Peptides, Proteins: Introduction Chapter 21 O R OH NH2 α-amino acid O O + H N R R R OH 2 R N - H2O H peptide bond = amide bond Proteins: Function 21.1 3 Proteins: Structural Histone Protein Structure: DNA packaging 4 Proteins: Protective botulinum toxin (botox) structure most toxic substance known LD50 = 10 ng/kg Amino acids, Peptides, Proteins: Introduction O O R O R O H R R OH R N OH N N OH H NH2 H NH2 O NH2 O R α-amino acid dipeptide tripeptide ! oligopeptide: 3 - 10 amino acids ! polypeptide, or protein: many amino acids Proteins: Amino Acids, Configuration NH2 HO O phenylalanine O + - NH3 natural amino acids C O +H N H same as - have the L configuration (S) 3 O2C R R Amino acids: Classification • Hydrophobic: “water-fearing”, nonpolar side chains – Alkyl side chain • Hydrophilic: “water-loving” side chains – Polar, neutral side chains – Anionic – Cationic - Table 21.1 lists 20 most common natural occurring amino acids - The structures of amino acids will be provided on the tests nonpolar side chains polar neutral (uncharged) side chains Amino acids: Classification polar acidic (anionic) side chains Amino acids: Classification polar basic (cationic) side chains Amino acids: Classification PROBLEM 1 Explain why when the imidazole ring of histidine is protonated, the double-bonded nitrogen is the nitrogen atom that accepts the proton. same for guanidine group in arginine.
    [Show full text]
  • Effect of Side Chain Substituent Volume on Thermoelectric Properties of IDT-Based Conjugated Polymers
    molecules Article Effect of Side Chain Substituent Volume on Thermoelectric Properties of IDT-Based Conjugated Polymers De-Xun Xie 1,2, Tong-Chao Liu 3, Jing Xiao 1,2, Jing-Kun Fang 4,* , Cheng-Jun Pan 3 and Guang Shao 1,2,* 1 School of Chemistry, Sun Yat-sen University, Guangzhou 510275, China; [email protected] (D.-X.X.); [email protected] (J.X.) 2 Shenzhen Research Institute, Sun Yat-sen University, Shenzhen 518057, China 3 Shenzhen Key Laboratory of Polymer Science and Technology, College of Materials Science and Engineering, Shenzhen University, Shenzhen 518060, China; [email protected] (T.-C.L.); [email protected] (C.-J.P.) 4 School of Chemical Engineering, Nanjing University of Science and Technology, Nanjing 210094, China * Correspondence: [email protected] (J.-K.F.); [email protected] (G.S.) Abstract: A p-type thermoelectric conjugated polymer based on indacenodithiophene and benzoth- iadiazole is designed and synthesized by replacing normal aliphatic side chains (P1) with conjugated aromatic benzene substituents (P2). The introduced bulky substituent on P2 is detrimental to form the intensified packing of polymers, therefore, it hinders the efficient transporting of the charge carriers, eventually resulting in a lower conductivity compared to that of the polymers bearing aliphatic side chains (P1). These results reveal that the modification of side chains on conjugated polymers is crucial to rationally designed thermoelectric polymers with high performance. Keywords: organic thermoelectric materials; conjugated polymers; power factor Citation: Xie, D.-X.; Liu, T.-C.; Xiao, J.; Fang, J.-K.; Pan, C.-J.; Shao, G. Effect of Side Chain Substituent Volume on 1.
    [Show full text]
  • Naming Organic Compounds: Alkanes
    Naming Organic Compounds: Alkanes Chemical nomenclature assigns compounds a unique name that allows them to be easily identified and structurally understood. The International Union of Pure and Applied Chemistry (IUPAC) is the organization that assigns names to chemical compounds, and these names generally have three distinct features: a root name indicating either the major carbon chain or the ring of atoms found in the compound; a suffix indicating functional groups that may be present in the compound; and names of substituent groups, other than hydrogen, that may also be present in the compound. This handout will cover how to correctly name alkanes using IUPAC methods. Important Terms When naming alkanes, it is helpful to know the following terms: • Alkanes are organic compounds that only contain single bonds between carbon elements. Alkanes are often referred to as saturated hydrocarbons. Alkane compounds end in –ane. • The longest continuous carbon chain in the compound is the parent chain. Parent chains utilize prefixes to show the amount of carbons in the chain. • A substituent is a side chain group that branches off from the parent chain of the compound. Substituents utilize prefixes to show how many carbons are in the chain • Alkyl groups are substituents that consist of just carbons and hydrogens ( 2 +1). Alkyl groups begin with a prefix, determined by the number of carbons, and end in -yl. Useful Prefixes Number of Carbons Prefix Assigned Number of Carbons Prefix Assigned 1 Meth- 6 Hex- 2 Eth- 7 Hept- 3 Prop- 8 Oct- 4 But- 9 Non- 5 Pent- 10 Dec- Provided by the Academic Center for Excellence 1 Naming Organic Compounds June 2016 Lastly, if the compound is in a ring, use the prefix cyclo-.
    [Show full text]
  • Prediction of Protein Structure by Emphasizing Local Side- Chain/Backbone Interactions in Ensembles of Turn Fragments
    PROTEINS: Structure, Function, and Genetics 53:486–490 (2003) Prediction of Protein Structure by Emphasizing Local Side- Chain/Backbone Interactions in Ensembles of Turn Fragments Qiaojun Fang and David Shortle* Department of Biological Chemistry, The Johns Hopkins University School of Medicine Baltimore, Maryland ABSTRACT The prediction strategy used in energy is known to be minimized over all chemical interac- the CASP5 experiment was premised on the assump- tions involving the protein chain. And usually the most tion that local side-chain/backbone interactions are significant term of the scoring function assesses the ener- the principal determinants of protein structure at getics of long range contacts between side-chains, espe- low resolution. Our implementation of this assump- cially those involving hydrophobic residues. tion made extensive use of a scoring function based As documented in the previous CASP experiments, this on the propensities of the 20 amino acids for 137 standard approach has enjoyed very limited success in the different sub-regions of the Ramachandran plot, prediction of new folds until recently. With the extensive allowing estimation of the quality of fit between a use of large amounts of bioinformatics-type information in sequence segment and a known conformation. New both secondary structure prediction and in fragment selec- folds were predicted in three steps: prediction of tion, real progress in structure prediction has been made .1 secondary structure, threading to isolate fragments The most common explanation for how sequence informa- of protein structures corresponding to one turn plus tion from known homologues can produce such major flanking helices/strands, and recombination of over- improvements in prediction methods is improved sam- lapping fragments.
    [Show full text]
  • 1-1 Amino Acids
    For exclusive use of University of Massachusetts Amherst. Not for distribution. Unique ID IN36425 1-1 Amino Acids The chemical characters of the amino-acid side chains have important consequences for the way they participate in the folding and functions of proteins The amino-acid side chains (Figure 1-3) have different tendencies to participate in interactions with each other and with water. These differences profoundly influence their contributions to protein stability and to protein function. Hydrophobic amino-acid residues engage in van der Waals interactions only. Their tendency to avoid contact with water and pack against each other is the basis for the hydrophobic effect. Alanine and leucine are strong helix-favoring residues, while proline is rarely found in helices because its backbone nitrogen is not available for the hydrogen bonding required for helix formation. The aromatic side chain of phenylalanine can sometimes participate in weakly polar interactions. Hydrophilic amino-acid residues are able to make hydrogen bonds to one another, to the peptide backbone, to polar organic molecules, and to water. This tendency dominates the interactions in which they participate. Some of them can change their charge state depending on their pH or the microenvironment. Aspartic acid and glutamic acid have pKa values near 5 in aqueous solution, so they are usually unprotonated and negatively charged at pH 7. But in the hydrophobic interior of a protein molecule their pKa may shift to 7 or even higher (the same effect occurs if a negative charge is placed nearby), allowing them to func- tion as proton donors at physiological pH.
    [Show full text]