Comparison of Two Variations of Neural Network Approaches to The

Total Page:16

File Type:pdf, Size:1020Kb

Comparison of Two Variations of Neural Network Approaches to The From: ISMB-93 Proceedings. Copyright © 1993, AAAI (www.aaai.org). All rights reserved. Comparison of Two Variations of Neural Network Approach to the Prediction of Protein Folding Pattern Inna Dubchak *, Stephen R. Holbrook # and*# Sung-Hou Kim #Strnctural Biology Division, LawrenceBerkeley Laboratory *Departmentof Chemistry, University of California at Berkeley Berkeley, CA 94720 II.X)UBCHAK @LBL.GOV examplesof functionally similar and dissimilar proteins with the same fold (Finkelstein & Ptitsyn 1987). Most Abstract proteins pack their secondary structures into one of a limited number of basic geometries and chain Wehave designed, trained and tested two types topologies. The definition "folding pattern" (Levitt of neural networksfor the prediction of protein folding pattern from sequence. Here we describe Chotia 1976) or "topologies" (Richardson 1977) ignore the differences in the networksand comparetheir the variations of length and orientation of ¢t- and 13- performance on a variety of proteins. Both segments and describe only their mutual positions. networkrepresentations are generally successful Someof the macrofolding patterns are: the globin fold, in predicting protein fold and can also be used the. immunoglobulin fold, the nucleotide binding together to confirma prediction. (Rossmann) fold, 4o~-helical bundles, parallel t~/~ barrels, and antiparallel ~ barrels (jelly roll fold). On Introduction smaller scale, the Greekkey motif, EF-hand,helix-loop- helix, kringle motifs and others form fragments of The prediction of protein structure from its amino acid protein structure. sequence is a central, unsolved problem in molecular biology. Despite efforts by a large number of We extended the approach which Muskal and Kim laboratories over manyyears, we are still far from a (Muskal & Kim1992) originally applied to prediction solution to this problem. The size and complexity of of secondary structure composition, to the proteins makes a strictly theoretical approach discrimination of protein folding patterns (Dubchak, intractable. A more hopeful direction is to use the Holbrook, & Kim 1993). In this method rapidly growing number of experimentally determined (Representation I) there are 21 real-valued input nodes protein structures and sequences to empirically extract representing the numberof amino acids and amino acid the principles which govern protein folding and then percent compositionof the protein as a whole, or that of apply these rules to the prediction of new structures. the domainof interest. This can be done at manylevels, from the prediction of In the second version of folding class prediction the secondarystructure of an individual aminoacid (Qian (Representation II) we made attempts to reduce the & Sejnowski 1988, Holley & Karplus 1989) to the number of inputs and accordingly variables by using surface exposure of amino acids (Holbrook, Muskal, reasonable physical subdivision of all aminoacids into a Kim1990), and even tertiary interactions (Bohr et al. few classes. In relating sequenceand structure the basic 1990). Unfortunately, as the complexity of the classification of residues is usually in terms of their prediction and the numberof parameters increase, the hydrophilic and hydrophobic character. Both groups of data to parameter ratio decreases. Thus, we have chosen authors (Qian & Sejnowski 1988, Holley & Karplus a top-downpath in whichwe first attempt to predict the 1989) used physicochemicalproperties of the aminoacid overall fold of the protein using very few parameters. residues such as hydrophobicity,charge, side chain bulk, The detailed structure may then be completed using and backbone flexibility as alternative ways of other prediction algorithms and energetic calculations. representing the inputs to the networkfor prediction of Here we study two variations of a neural network secondary structure. Qian and Sejnowski also provided approachto prediction of the overall protein fold. the network with global information such as an average It is becomingclear that there are a limited number hydrophobicity of the protein and the position of the of protein folding patterns to serve as the core or residue in the sequence. All these attempts did not help scaffold around which variations are added to perform to improve the prediction reliability over the basic specific protein functions. In fact, there are nowmany scheme. However Kneller and co-authors (Kneller, 118 ISMB-93 Cohen, & Langridge 1990) used the scheme with In our calculations we used a less simplistic additional helix hydrophobic moment and strand classification than White and Jacob, and made an hydrophobic moment input units which improved the attempt not only to group the residues accordingto their testing statistics slightly in comparison with (Qian relative hydrophobicity, but to take into consideration Sejnowski 1988). their immediateenvironment. Basically there are two opposite viewpoints (with Several scales for classification of hydrophobic- variety of opinions in between) regarding the hydrophilic character of residues have been published. distribution of hydrophobicand hydrophilic sites along Most of these scales differ only in details and so show the sequence of protein. Chothia and Finkelstein high correlation. We used the hydrophobicity groups (Chothia & Finkelstein 1990) proved that this from (Chothia & Finkelshtein 1990), which are very distribution is uniquein the ease of the globin sequence. similar to those of consensus scale (Eisenberg 1982), The formation of the native secondary structure of any which was designated to mitigate the effect of outlying protein brings together sites of the same character to values in any one scale producedby the peculiarities of form hydrophobic and hydrophilic surfaces. Protein the method. interiors are occupied mainly by non-polar residues and occasionally by neutral residues. The polar atoms in neutral residues usually form hydrogen bonds within Methods their ownsecondary structures so the surfaces between Database: A critical problem in classification of secondary structures are almost entirely hydrophobicin protein folding types by neural networks is the limited character. Theconverse is also true: the sites in protein numberof protein examples of knownthree-dimensional that are highly exposedto the solvent are nearly always structure. We have partly overcome this problem by occupied by polar or neutral residues. The same is using the much larger protein sequence database. supposedlytrue for other folds. Since an architecture of Proteins of the same family from different organisms, a definite fold is specific feature it is reasonableto use showing high sequence homology with proteins of this kind of classification for neural net input. knownstructure were assumedto have the same overall Local interactions inside structural segments can folding pattern and were used to greatly increase the often determine the protein secondarystructures despite number of input examples for use in network training the presence of muchstronger long range interactions and testing. Thus, we selected the proteins for whichthe between different segments. Someof the hydrophobicity crystal s~ucture is knownto have the fold of interest scales (Ponnuswamy1993) takes into account protein and then retrieved homologous proteins from the structural class information namelythe environment of SWlSS-PROTsequence database. We chose to begin each amino acid residue when estimating its our studies on four diverse folding patterns for which hydrophobicity. The other scale (optimal matching there exists a relatively large number of known hydrophobicity ) was derived on the assumption that structures and sequence analogs. Other folding motifs families of proteins that fold in the same way, do so can easily be added to the prediction schemeas sufficient because they have the same pattern of residue examplesof knownstrnctttre are characterized. hydrophobicities along their amino acid sequences S011, in some cases the number of variable (Eisenberg 1984). parameters did not exceed the number of independent At the same time, White and Jacobs (White observations whichled to a lack of generalization. This Jacobs 1990) studied the statistical distribution of problem, in turn, led to our experiments with an hydrophobicresidues along the length of protein chains alternate input representation specified by fewer using a binary hydrophobicity scale which assigns parameters. hydrophobic residues a value of one and non- Protein structure information was from the hydrophobes a value of zero. The resulting binary BrookhavenProtein Data Bank (Bernstein et al. 1977) sequences are tested for randomnessusing the standard (PDB)and publications describing proteins of known run test. For the majority of the 5,247 proteins crystal structure but not yet deposited in the Brooldaaven examined,the distribution of hydrophobicresidues along PDB. Sequence information was from the Swiss-Prot a sequence cannot be distinguished from that expected database (SP) Release 20 (Bairoch & Boeckmann1991). for a randomdistribution. The authors suggest that (a) Therefore, databases were compiled for each of the functional proteins may have originated from random following folding patterns 1) 4(x-helical bundles sequences, (b) the folding of proteins into compact (BUNDLE),2) Eight stranded parallel (x/~ barrels structures may be much more permissive with less (BARREL),3) Nucleotide binding or Rossmann (NBF) sequencespecificity than previously thought, and (c) the fold,
Recommended publications
  • The Janus-Like Role of Proline Metabolism in Cancer Lynsey Burke1,Innaguterman1, Raquel Palacios Gallego1, Robert G
    Burke et al. Cell Death Discovery (2020) 6:104 https://doi.org/10.1038/s41420-020-00341-8 Cell Death Discovery REVIEW ARTICLE Open Access The Janus-like role of proline metabolism in cancer Lynsey Burke1,InnaGuterman1, Raquel Palacios Gallego1, Robert G. Britton1, Daniel Burschowsky2, Cristina Tufarelli1 and Alessandro Rufini1 Abstract The metabolism of the non-essential amino acid L-proline is emerging as a key pathway in the metabolic rewiring that sustains cancer cells proliferation, survival and metastatic spread. Pyrroline-5-carboxylate reductase (PYCR) and proline dehydrogenase (PRODH) enzymes, which catalyze the last step in proline biosynthesis and the first step of its catabolism, respectively, have been extensively associated with the progression of several malignancies, and have been exposed as potential targets for anticancer drug development. As investigations into the links between proline metabolism and cancer accumulate, the complexity, and sometimes contradictory nature of this interaction emerge. It is clear that the role of proline metabolism enzymes in cancer depends on tumor type, with different cancers and cancer-related phenotypes displaying different dependencies on these enzymes. Unexpectedly, the outcome of rewiring proline metabolism also differs between conditions of nutrient and oxygen limitation. Here, we provide a comprehensive review of proline metabolism in cancer; we collate the experimental evidence that links proline metabolism with the different aspects of cancer progression and critically discuss the potential mechanisms involved. ● How is the rewiring of proline metabolism regulated Facts depending on cancer type and cancer subtype? 1234567890():,; 1234567890():,; 1234567890():,; 1234567890():,; ● Is it possible to develop successful pharmacological ● Proline metabolism is widely rewired during cancer inhibitor of proline metabolism enzymes for development.
    [Show full text]
  • A Structural Perspective on the Enzyme with Two Rossmann-Fold Domains
    biomolecules Review S-adenosyl-l-homocysteine Hydrolase: A Structural Perspective on the Enzyme with Two Rossmann-Fold Domains Krzysztof Brzezinski Laboratory of Structural Microbiology, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland; [email protected] Received: 16 November 2020; Accepted: 15 December 2020; Published: 16 December 2020 Abstract: S-adenosyl-l-homocysteine hydrolase (SAHase) is a major regulator of cellular methylation reactions that occur in eukaryotic and prokaryotic organisms. SAHase activity is also a significant source of l-homocysteine and adenosine, two compounds involved in numerous vital, as well as pathological processes. Therefore, apart from cellular methylation, the enzyme may also influence other processes important for the physiology of particular organisms. Herein, presented is the structural characterization and comparison of SAHases of eukaryotic and prokaryotic origin, with an emphasis on the two principal domains of SAHase subunit based on the Rossmann motif. The first domain is involved in the binding of a substrate, e.g., S-adenosyl-l-homocysteine or adenosine and the second domain binds the NAD+ cofactor. Despite their structural similarity, the molecular interactions between an adenosine-based ligand molecule and macromolecular environment are different in each domain. As a consequence, significant differences in the conformation of d-ribofuranose rings of nucleoside and nucleotide ligands, especially those attached to adenosine moiety, are observed. On the other hand, the chemical nature of adenine ring recognition, as well as an orientation of the adenine ring around the N-glycosidic bond are of high similarity for the ligands bound in the substrate- and cofactor-binding domains.
    [Show full text]
  • 02 Whole.Pdf (8.685Mb)
    Copyright is owned by the Author of the thesis. Permission is given for a copy to be downloaded by an individual for the purpose of research and private study only. The thesis may not be reproduced elsewhere without the permission of the Author. X-RAY CRYSTALLOGRAPHIC INVESTIGATIONS OF THE STRUCTURES OF ENZYMES OF MEDICAL AND BIOTECHNOLOGICA L IMPORTANCE by Richard Lawrence Kingston A dissertation submitted in partial satisfaction of the requirements fo r the degree of Doctor of Philosophy in the Department of Biochemistry at MASSEY UNIVERSITY, NEW ZEALAND November, 1996 ABSTRACT This thesis is broadly in three parts. In the first, the problem of identifying conditions under which a protein will crystallize is considered. Then structural studies on two enzymes are reported, glucose-fructose oxidoreductase from the bacterium Zymomonas mobilis, and the human bile salt dependent lipase (carboxyl ester hydrolase). The ability of protein crystals to diffract X-rays provides the experimental data required to determine their three dimensional structures at atomic resolution. However the crystalliza­ tion of proteins is not always straightforward. A systematic procedure to search for protein crystallization conditions has been developed. This procedure is based on the use of orthog­ onal arrays (matrices whose columns possess certain balancing properties). The theoretical and practical background to the problem is discussed, and the relationship of the presented procedure to other published search methods is considered. The anaerobic Gram-negative bacterium Zymomonas mobilis occurs naturally in sugar-rich growth media, and has attracted much interest because of its potential for industrial ethanol production. In this organism the periplasmic enzyme glucose-fructose oxidoreductase (GFOR) is involved in a protective mechanism to counter osmotic stress.
    [Show full text]
  • PREDICTION REPORTS Successful Recognition of Protein Folds Using Threading Methods Biased by Sequence Similari
    PROTEINS: Structure, Function, and Genetics Suppl 3:104–111 (1999) FOLD RECOGNITION: PREDICTION REPORTS Successful Recognition of Protein Folds Using Threading Methods Biased by Sequence Similarity and Predicted Secondary Structure David T. Jones,* Michael Tress, Kevin Bryson, and Caroline Hadley Department of Biological Sciences, University of Warwick, Coventry, United Kingdom ABSTRACT Analysis of our fold recognition sequence families includes a member of known 3D struc- results in the 3rd Critical Assessment in Structure ture. Sensitive sequence profile methods can extend the Prediction (CASP3) experiment, using the programs range of comparative modeling to the point where around THREADER 2 and GenTHREADER, shows an en- 12% of the families can be assigned to a known 3D couraging level of overall success. Of the 23 submit- structural superfamily, but in contrast to this, it is esti- ted predictions, 20 targets showed no clear se- mated that the probability of a novel protein having a quence similarity to proteins of known 3D structure. similar fold to a known structure is currently as high as These 20 targets can be divided into 22 domains, of 60–70%. There is, therefore, a great deal to be gained from which, 20 domains either entirely match a previ- attempting to solve the problem of building models for very ously known fold, or partially match a substantial remotely related proteins, and for proteins which have region of a known fold. Of these 20 domains, we merely analogous folds, and so are beyond the scope of correctly assigned the folds in 10 cases. Proteins normal comparative modeling strategies. To date the most Suppl 1999:3:104–111.
    [Show full text]
  • That Is the Question
    polymers Review To Tie or Not to Tie? That Is the Question Pawel Dabrowski-Tumanski 1,2 and Joanna I. Sulkowska 1,2,* 1 Centre of New Technologies, University of Warsaw, Warsaw 02-097, Poland; [email protected] 2 Faculty of Chemistry, University of Warsaw, Warsaw 02-093, Poland * Correspondence: [email protected]; Tel.: +48-22-55-43678 Received: 26 August 2017; Accepted: 12 September 2017; Published: 16 September 2017 Abstract: In this review, we provide an overview of entangled proteins. Around 6% of protein structures deposited in the PBD are entangled, forming knots, slipknots, lassos and links. We present theoretical methods and tools that enabled discovering and classifying such structures. We discuss the advantages and disadvantages of the non-trivial topology in proteins, based on available data about folding, stability, biological properties and evolutionary conservation. We also formulate intriguing and challenging questions on the border of biophysics, bioinformatics, biology and mathematics, which arise from the discovery of an entanglement in proteins. Finally, we discuss possible applications of entangled proteins in medicine and nanotechnology, such as the chance to design super stable proteins, whose stability could be controlled by chemical potential. Keywords: knots; link; lasso; topology; polymer; disulfide 1. Introduction Self-tying of proteins was long considered as impossible, as it requires the threading of a chain through a twisted loop, which in principle is an entropically unfavorable event. However, similar structures have been known to exist in other biopolymers (such as DNA) for many years [1,2], and their self-tying was attributed to, e.g., better packing [3].
    [Show full text]
  • Α+Β 31 37 28 18 16 12 40 3
    class Fold# EC SC HI SS HP MJ MP MG total Fam. PDB Rep. Struc. Name α/β 18 60 46 23 40 19 743 202 16 183 1xel - NAD(P)-binding Rossmann Fold α/β 24 20 69 17 19 17 16 10 11 179 13 132 1gky - P-loop Containing NTP Hydrolases α+β 31 37 28 18 16 12 40 33 157 23 160 1fxd - like Ferrodoxin α/β 01 45 36 13 22 11 10 54 146 37 399 1byb - TIM-barrel α/β 23 18 17 794822 67 5 36 1pyd a:2-181 Thiamin-binding α/β 04 15 11 7 10 1955 63 13 132 2tmd a:490-645 FAD/NAD(P)-binding α+β 55 89789366 56 4 23 1sry a:111-421 Class-II-aaRS/Biotin Synthetases β 27 7 10 884433 47 5 19 1fnb 19-154 Reductase/Elongation Factor Domain β 24 13 7433333 39 18 177 1snc - OB-fold α+β 11 10 8482221 37 11 48 1igd - beta-Grasp β 55 9 10 552222 37 7 19 1bdo - Barrel-sandwich hybrid α/β 15 55445633 35 3 22 2ts1 1-217 ATP pyrophoshatases α/β 05 10 4242223 29 4 35 1zym a: The "swivelling" beta/beta/alpha domain α/β 60 57463211 29 3 18 3pmg a:1-190 Phosphoglucomutase, first 3 domains α+β 68 42364243 28 2 3 1mat - Creatinase/methionine aminopeptidase α+β 39 64344111 24 3 42 1gad o:149-312 like G3P dehydrogenase, Ct-dom α+β 18 54412212 21 3 23 1fkd - FKBP-like α/β 41 33331311 18 3 16 1opr - Phosphoribosyltransferases (PRTases) α 78 19121111 17 1 23 1oel a:(*) GroEL, the ATPase domain α+β 10 22242122 17 2 5 1dar 477-599 Ribosomal protein S5 domain 2-like α+β 43 43221122 17 4 50 3grs 364-478 FAD/NAD-linked reductases, dimer-dom.
    [Show full text]
  • Arabidopsis Transcriptome Analysis Reveals Key Roles of Melatonin in Plant Defense Systems
    Arabidopsis Transcriptome Analysis Reveals Key Roles of Melatonin in Plant Defense Systems Sarah Weeda1, Na Zhang2, Xiaolei Zhao3, Grace Ndip4, Yangdong Guo2*, Gregory A. Buck5, Conggui Fu3, Shuxin Ren1* 1 Agriculture Research Station, Virginia State University, Petersburg, Virginia, United States of America, 2 College of Agriculture and Biotechnology, China Agricultural University, Beijing, P. R. China, 3 Tianjin Crop Science Institute, Tianjin Academy of Agricultural Science, Tianjin, P. R. China, 4 Department of Chemistry, Virginia State University, Petersburg, Virginia, United States of America, 5 Center of the Study of Biological Complexity, Virginia Commonwealth University, Richmond, Virginia, United States of America Abstract Melatonin is a ubiquitous molecule and exists across kingdoms including plant species. Studies on melatonin in plants have mainly focused on its physiological influence on growth and development, and on its biosynthesis. Much less attention has been drawn to its affect on genome-wide gene expression. To comprehensively investigate the role(s) of melatonin at the genomics level, we utilized mRNA-seq technology to analyze Arabidopsis plants subjected to a 16-hour 100 pM (low) and 1 mM (high) melatonin treatment. The expression profiles were analyzed to identify differentially expressed genes. 100 pM melatonin treatment significantly affected the expression of only 81 genes with 51 down-regulated and 30 up-regulated. However, 1 mM melatonin significantly altered 1308 genes with 566 up-regulated and 742 down-regulated. Not all genes altered by low melatonin were affected by high melatonin, indicating different roles of melatonin in regulation of plant growth and development under low and high concentrations. Furthermore, a large number of genes altered by melatonin were involved in plant stress defense.
    [Show full text]
  • Supporting Information
    Supporting Information Rackovsky 10.1073/pnas.0903433106 Table S1. The 10 property factors 1. Alpha-helix/bend preference 2. Side-chain size 3. Extended structure preference 4. Hydrophobicity 5. Double-bend preference 6. Amino acid composition 7. Flat extended preference 8. Occurrence in ␣ region 9. pK 10. Surrounding hydrophobicity The first 4 property factors correspond essentially to single amino acid physical properties. The remaining 6 factors are superpositions of several physical properties and for convenience are identified by the property making the greatest contribution to the factor. Rackovsky www.pnas.org/cgi/content/short/0903433106 1of4 Table S2. The 59 CAT groups C A T Number of sequences CATH group name 1 20 5 39 Single alpha-helices involved in coiled coils or other helix–helix interfaces 1 50 10 31 Glycosyltransferase 1 20 58 38 Methane monooxygenase Hydroxylase, chain G, domain 1 1 10 238 67 Recoverin, domain 1 1 10 490 32 Globin-like 1 10 287 53 Helix hairpins 1 10 8 40 Helicase, ruva protein, domain 3 1 20 1,050 29 Glutathione S-transferase Yfyf (class Pi), chain A, domain 2 1 20 1,250 27 Growth hormone, chain A 1 10 472 20 Cyclin A, domain 1 1 10 760 33 Cytochrome Bc1 complex, chain D, domain 2 1 20 120 54 Four-helix bundle [hemerythrin (Met), subunit A] 1 10 10 150 Arc repressor mutant, subunit A 1 10 150 54 DNA polymerase, domain 1 1 25 40 67 Serine threonine protein phosphatase 5, tetratricopeptide repeat 1 10 510 28 Transferase (phosphotransferase), domain 1 2 40 50 125 OB fold (dihydrolipoamide acetyltransferase,
    [Show full text]
  • Deep Evolutionary Analysis Reveals the Design Principles of Fold A
    RESEARCH ARTICLE Deep evolutionary analysis reveals the design principles of fold A glycosyltransferases Rahil Taujale1,2, Aarya Venkat3, Liang-Chin Huang1, Zhongliang Zhou4, Wayland Yeung1, Khaled M Rasheed4, Sheng Li4, Arthur S Edison1,2,3, Kelley W Moremen2,3, Natarajan Kannan1,3* 1Institute of Bioinformatics, University of Georgia, Athens, Georgia; 2Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia; 3Department of Biochemistry and Molecular Biology, University of Georgia, Athens, Georgia; 4Department of Computer Science, University of Georgia, Athens, Georgia Abstract Glycosyltransferases (GTs) are prevalent across the tree of life and regulate nearly all aspects of cellular functions. The evolutionary basis for their complex and diverse modes of catalytic functions remain enigmatic. Here, based on deep mining of over half million GT-A fold sequences, we define a minimal core component shared among functionally diverse enzymes. We find that variations in the common core and emergence of hypervariable loops extending from the core contributed to GT-A diversity. We provide a phylogenetic framework relating diverse GT-A fold families for the first time and show that inverting and retaining mechanisms emerged multiple times independently during evolution. Using evolutionary information encoded in primary sequences, we trained a machine learning classifier to predict donor specificity with nearly 90% accuracy and deployed it for the annotation of understudied GTs. Our studies provide an evolutionary framework for investigating complex relationships connecting GT-A fold sequence, structure, function and regulation. *For correspondence: [email protected] Introduction Competing interests: The Complex carbohydrates make up a large bulk of the biomass of any living cell and play essential authors declare that no roles in biological processes ranging from cellular interactions, pathogenesis, immunity, quality con- competing interests exist.
    [Show full text]
  • IDH1 Targeting As a New Potential Option for Intrahepatic Cholangiocarcinoma Treatment—Current State and Future Perspectives
    molecules Review IDH1 Targeting as a New Potential Option for Intrahepatic Cholangiocarcinoma Treatment—Current State and Future Perspectives 1, 1, 1 1 Fabiana Crispo y , Michele Pietrafesa y , Valentina Condelli , Francesca Maddalena , Giuseppina Bruno 2, Annamaria Piscazzi 2, Alessandro Sgambato 1, Franca Esposito 3,* and Matteo Landriscina 1,2,* 1 Laboratory of Pre-Clinical and Translational Research, IRCCS, Referral Cancer Center of Basilicata, 85028 Rionero in Vulture (PZ), Italy; [email protected] (F.C.); [email protected] (M.P.); [email protected] (V.C.); [email protected] (F.M.); [email protected] (A.S.) 2 Medical Oncology Unit, Department of Medical and Surgical Sciences, University of Foggia, 71100 Foggia, Italy; [email protected] (G.B.); [email protected] (A.P.) 3 Department of Molecular Medicine and Medical Biotechnology, University of Naples Federico II, 80131 Naples, Italy * Correspondence: [email protected] (F.E.); [email protected] (M.L.); Tel.: +39-081-746-3145 (F.E.); +39-088-173-6426 (M.L.) These authors have contributed equally to this work. y Received: 29 July 2020; Accepted: 17 August 2020; Published: 18 August 2020 Abstract: Cholangiocarcinoma is a primary malignancy of the biliary tract characterized by late and unspecific symptoms, unfavorable prognosis, and few treatment options. The advent of next-generation sequencing has revealed potential targetable or actionable molecular alterations in biliary tumors. Among several identified genetic alterations, the IDH1 mutation is arousing interest due to its role in epigenetic and metabolic remodeling. Indeed, some IDH1 point mutations induce widespread epigenetic alterations by means of a gain-of-function of the enzyme, which becomes able to produce the oncometabolite 2-hydroxyglutarate, with inhibitory activity on α-ketoglutarate-dependent enzymes, such as DNA and histone demethylases.
    [Show full text]
  • Significance of Knotted Structures for Function of Proteins and Nucleic Acids SEPTEMBER 17–21, 2014 | WARSAW, POLAND
    Biophysical Society Thematic Meeting | PROGRAM AND ABSTRACTS Significance of Knotted Structures for Function of Proteins and Nucleic Acids SEPTEMBER 17–21, 2014 | WARSAW, POLAND Ministry of Science and Higher Education Republic of Poland Thank You to the Programming Committee and Local Organizers Program Committee Wilma Olson, Rutgers University, USA Jose Onuchic, Rice University, USA Matthias Rief, Technical University of Munich, Germany Joanna Sulkowska, University of Warsaw, Poland Sarah Woodson, Johns Hopkins University, USA Local Organizers Krsysztof Bryl, University of Warmia and Mazury, Poland Sebastian Kmiecik, University of Warsaw, Poland Andrzej Kolinski, University of Warsaw, Poland Alicja Kurcinska, Secretary, University of Warsaw, Poland Monika Pietrzak, University of Warmia and Mazury, Poland Joanna Sulkowska, University of Warsaw, Poland Mariusz Szabelski, University of Warmia and Mazury, Poland Zbigniew Wieczorek, University of Warmia and Mazury, Poland - 1 - This meeting is financially supported by Ministry of Science and Higher Education University of Warsaw, Center for New Technology (CeNT), Foundation for Polish Science (Inter) and National Science Center Poland (Sonata BIS) Additional support provided by European Molecular Biology Organization (EMBO) Centrum Nowych Technologii - 2 - Significance of Knotted Structures for Function of Proteins and Nucleic Acids Welcome Letter September 2014 Dear Colleagues, We would like to welcome you to the conference on the “Significance of Knotted Structures for Function of Proteins and Nucleic Acids”. During this meeting we will focus on recent progress in identifying a biological role of knots in proteins and nucleic acids. This is a new, interdisciplinary field, which has recently been extensively explored by many groups. One important motivation to study these topics is an increasing number of proteins with knots being discovered.
    [Show full text]
  • Proteins Eugene Shakhnovich Harvard University Boulder, CO July 25-27 2012 Outline
    Proteins Eugene Shakhnovich Harvard University Boulder, CO July 25-27 2012 Outline Lecture 1. Introduction: Principles of structural organization of proteins and Protein Universe. Concepts of convergent and divergent molecular evolution. Lecture 2. Protein folding – Experiment and Theory and Simulations Lecture 3: Bridging scales: From Protein Biophysics to Population Genetics (‘’From Crick to Darwin and back’’) Marriage made in heaven: genomics and protein folding/evolution Proteins are folded on various scales As of now we know hundreds of thousands of sequences (Swissprot) and a few thousand of Structures (protein data bank) Proteins are tightly packed Structural organization of globular proteins Fold, Folding Pattern and Packing Simplified models of protein structures. a), A detailed fold describing positions of secondary structures in the protein hain and in space (see also Fig. 13-1e). (b), The folding pattern of the protein chain with omitted details of loop pathways, the size and exact orientation of α-helices (shown as cylinders) and β-strands (shown as arrows). (c), Packing: a stack of structural segments with no loop shown and omitted details of size, orientation and direction of α-helices and β-strands (which are therefore presented as ribbons rather than as arrows). Similarities at the fold level, differences at the detailed structural level Two close relatives: horse hemoglobin α and horse hemoglobin β (both possessing a heme shown as a wire model with iron in the center). Find similarities and differences. (For tips: they are highly similar although have some differences in details of loop conformations, in details of orientation of some helices, and in one additional helical turn available in the β globin, on the right).
    [Show full text]