Automated Analysis and Validation of Open Chemical Data Nicholas Elliot

Total Page:16

File Type:pdf, Size:1020Kb

Automated Analysis and Validation of Open Chemical Data Nicholas Elliot Automated Analysis and Validation of Open Chemical Data Nicholas Elliot Day Emmanuel College A dissertation submitted to the University of Cambridge for the degree of Doctor of Philosophy Unilever Centre for Molecular Science Informatics Department of Chemistry Lensfield Road, Cambridge, CB2 1EW, United Kingdom. November 20, 2008 Disclaimer This thesis is the result of my own work and includes nothing which is the outcome of work done in collaboration, except where specifically indicated. This thesis does not exceed the specified word limit (60000) as defined by the Chemistry Degree Committee. This thesis has been typeset in 12pt font using LATEX2ε according to the specifications defined by the Board of Graduate Studies and the Chemistry Degree Committee. Abstract Methods to automatically extract Open Data from the chemical literature, validate it, and use it to validate theory are examined. Chemical identifiers which assist the automatic location of chemical struc- tures using commercial Web search engines are investigated. The IUPAC International Chemical Idenfitifer (InChI) gives almost 100% recall and pre- cision, though is shown to be too long for present search engines. A com- bination of InChI and InChIKey, a shorter, fixed-length hash of the InChI string, is concluded to be the best current method of identifying structures. The proportion of published, Open Crystallographic Information Files (CIFs) that are valid with respect to the specification is shown to be im- proving, and is around 99% in 2007. The error rate in the conversion of valid CIFs to Chemical Markup Language (CML) is less than 0.2%. The machine generation of connection tables from CIFs requires many heuristics, and in some cases it is impossible to deduce the exact connection table. CrystalEye, a fully-automated system for the reformulation of the frag- mented crystallographic Web into a structured XML-based repository is de- scribed. Published, Open CIFs can be located and aggregated programmat- ically with almost 100% recall. It is shown that, by converting CIF data to CML, software can be created to use the latest Web standards and tech- nologies to enhance the ability of Web users to browse, find, keep updated, download and reuse the latest published crystallography. A workflow for the high-throughput calculation of solid-state geometry using a semi-empirical method is described. A wide-range of organic and inorganic systems provided by CrystalEye are used to test both the data and the method. Several errors in the method are discovered, many of which can be attributed to the parameterization process. An Open NMR experiment to perform high-throughput prediction of 13C chemical shifts using a GIAO protocol is described. The data and analysis were provided on publicly-available webpages to enable crowdsourcing, which assisted in discovering an error rate of 6.1% in the starting data. The protocol was refined during the work and shown to have an average unsigned error of 2.24ppm for 13C nuclei of small, rigid molecules; comparable to the errors observed elsewhere for general structures using HOSE and Neural Network methods. iii Acknowledgments I would like to thank Dr Peter Murray-Rust for his advice and guidance throughout this work. Dr Joe Townsend, Jim Downing, Dr Yong Zhang, Dr Andrew Walkingshaw, Dr Nico Adams and Dr Simon Tyrrell are thanked for useful discussions and advice throughout the course of my PhD. I would also like to thank the UCC computing staff, particularly Dr Charlotte Bolton, without which this work would not have been possible. The EPSRC is thanked for funding. Cheers also to Dan, Andy, Rob and Jon for providing copious amounts of tea and distracting ‘activities’. As always, thanks most to Mum, Dad, Vick and Anna. iv Contents Disclaimer i Abstract ii Acknowledgements iv Table of contents viii List of tables ix List of figures xv Glossary xvi 1 Introduction 1 1.1 Chemicalidentification . 2 1.2 OpenData............................. 2 1.3 DataandMetadata........................ 4 1.4 Machine-understandable chemical data . 5 1.4.1 TheSemanticWeb .................... 6 1.5 OpenSourcesoftware....................... 8 1.6 e-Science.............................. 8 1.7 Aims................................ 9 2 Locating chemical data on the Web 11 2.1 Introduction............................ 11 2.2 Websearch-engines . .. .. 11 2.2.1 Sitemaps.......................... 12 2.3 Representing chemical entities as strings . 13 2.4 The IUPAC International Chemical Identifier . 16 2.4.1 InChIcreationandstructure. 17 2.4.2 Searching for InChIs on the Web . 22 2.4.3 The Google-InChI Web Service . 25 2.4.4 Staurosporine - an InChI case study . 26 v 2.5 Conclusions ............................ 38 3 Creating and deriving data in CML from CIF 39 3.1 Introduction............................ 39 3.2 WhyconvertCIFtoCML? . .. 40 3.3 The Crystallographic Information Framework . 42 3.3.1 STAR file concepts and syntax . 42 3.3.2 CIFsyntax ........................ 44 3.3.3 CIFdictionaries. 46 3.3.4 checkCIF ......................... 47 3.4 CIFXML: converting CIF to XML . 49 3.4.1 ConformancetoCIF . 52 3.4.2 CIFXML-J functionality . 53 3.4.3 RepresentationofCIFsinXML . 54 3.4.4 CIFXML-J architecture .................. 57 3.4.5 Using CIFXML-J ...................... 60 3.4.6 CIF to CIFXML conversion statistics . 64 3.5 CIFConverter: convertingCIFtoCML. 67 3.5.1 Handling multiple datablocks . 72 3.5.2 BuildingtheCML .................... 72 3.5.3 Adding symmetry elements . 76 3.5.4 Dictionaryvalidation . 76 3.5.5 Using CIFConverter ................... 78 3.5.6 CIFXML to CML conversion statistics . 80 3.6 EnhancingtheCML ....................... 82 3.6.1 Creatingtheconnectiontable . 84 3.6.2 Adding canonical identifiers . 98 3.7 Currentusage........................... 99 3.8 Conclusions ............................ 99 4 Automating the aggregation, creation and dissemination of semantic crystallography 101 4.1 Introduction............................101 4.2 Implementing a workflow system . 103 4.2.1 TheTavernaWorkbench . .103 4.2.2 Component development and workflows . 107 4.2.3 Developing under Taverna . 111 4.3 CrystalEye.............................114 4.3.1 Implementation . .116 4.3.2 Aggregation ........................116 4.3.3 Processingthedata. .129 4.3.4 Thewebsite ........................134 4.3.5 Services ..........................140 vi 4.3.6 Databasedistribution. .151 4.3.7 CrystalEyedatainRDF . .156 4.3.8 Furtherwork .......................158 4.4 Conclusions ............................160 5 High-throughput prediction of solid-state geometry using semi- empirical methods 161 5.1 Introduction............................161 5.2 MOPAC-CML...........................163 5.2.1 FoX .............................163 5.2.2 MOPAC and FoX .....................165 5.2.3 MOPAC-CMLandJmol . .167 5.3 High-throughputcomputing . .167 5.3.1 Condor...........................170 5.3.2 CamGrid .........................170 5.4 Thecalculationworkflow . .171 5.4.1 Structureselection . .171 5.4.2 MOPACinput.......................172 5.4.3 Condorinput .......................175 5.4.4 Job submission and retrieval . 177 5.4.5 Theoverallworkflow . .178 5.5 Thefirstprotocol .........................178 5.5.1 Organicstructures . .181 5.5.2 Inorganicstructures. .182 5.6 Thesecondprotocol . .184 5.6.1 Thedata..........................186 5.6.2 Inorganicstructures. .187 5.6.3 Organicstructures . .198 5.7 Conclusions ............................212 6 High-throughput prediction of 13C NMR chemical shifts by quantum-mechanical GIAO calculations 215 6.1 Introduction............................215 6.2 NuclearMagneticResonance. .215 6.3 ComputationalNMR . .216 6.3.1 TheRzepaProtocol . .218 6.3.2 NMRShiftDB .......................219 6.4 Calculations............................222 6.4.1 Structureselection . .222 6.4.2 Gaussian03input . .223 6.4.3 Condorinput .......................224 6.4.4 TMS............................225 6.5 OpenComputationalNMR . .226 vii 6.6 Analysis ..............................227 6.6.1 Preparingtheoutput . .227 6.6.2 Initialresults . .229 6.6.3 Sourcesoferror . .230 6.6.4 Cleaningthedataset . .233 6.6.5 Conformationalissues . .237 6.6.6 HSR1 ...........................239 6.6.7 Conclusions ........................244 7 Conclusions 246 A Analysis of the efficacy of Web search engines for chemical search 252 A.1 Search-enginesandstrategy . .252 A.2 Searchtermsandmetrics. .256 A.3 SearchingforCASnumbers . .257 A.4 SearchingforInChIs . .259 A.4.1 The InChI architecture and implications . 259 A.4.2 Results...........................260 A.5 SearchingforSMILES . .261 A.6 Searching for InChI strings from the KEGG collection using Google...............................262 B MOPAC calculation references 270 B.1 Second-protocol inorganic calculations . 270 B.1.1 Shortatom-atomdistances . 270 B.1.2 Silicas ...........................271 B.1.3 Errorsinthedata. .272 B.1.4 Errorsinmodeling . .272 B.2 Organiccalculations . .273 B.2.1 Calculations that terminated with controlled errors . 273 B.2.2 Calculations containing radicals that converged suc- cessful ...........................276 B.2.3 Changes in connection table . 276 B.2.4 Densitychangeoutliers. 277 C Published Work 282 Bibliography 284 viii List of Tables 3.1 Example of applying the canonicalization algorithm . 61 3.2 Proportion of error occurrences in CIF to CML conversion . 82 5.1 Table showing the density change mean and standard devia- tion for structures with various minimum translation vectors. 211 6.1 Comparison of the linear fitting statistics
Recommended publications
  • EXPLORING NEW ASYMMETRIC REACTIONS CATALYSED by DICATIONIC Pd(II) COMPLEXES
    EXPLORING NEW ASYMMETRIC REACTIONS CATALYSED BY DICATIONIC Pd(II) COMPLEXES A DISSERTATION FOR THE DEGREE OF DOCTOR OF PHILOSOPHY FROM IMPERIAL COLLEGE LONDON BY ALEXANDER SMITH MAY 2010 DEPARTMENT OF CHEMISTRY IMPERIAL COLLEGE LONDON DECLARATION I confirm that this report is my own work and where reference is made to other research this is referenced in text. ……………………………………………………………………… Copyright Notice Imperial College of Science, Technology and Medicine Department Of Chemistry Exploring new asymmetric reactions catalysed by dicationic Pd(II) complexes © 2010 Alexander Smith [email protected] This publication may be distributed freely in its entirety and in its original form without the consent of the copyright owner. Use of this material in any other published works must be appropriately referenced, and, if necessary, permission sought from the copyright owner. Published by: Alexander Smith Department of Chemistry Imperial College London South Kensington campus, London, SW7 2AZ UK www.imperial.ac.uk ACKNOWLEDGEMENTS I wish to thank my supervisor, Dr Mimi Hii, for her support throughout my PhD, without which this project could not have existed. Mimi’s passion for research and boundless optimism have been crucial, turning failures into learning curves, and ultimately leading this project to success. She has always been able to spare the time to advise me and provide fresh ideas, and I am grateful for this patience, generosity and support. In addition, Mimi’s open-minded and multi-disciplinary approach to chemistry has allowed the project to develop in new and unexpected directions, and has fundamentally changed my attitude to chemistry. I would like to express my gratitude to Dr Denis Billen from Pfizer, who also provided a much needed dose of optimism and enthusiasm in the early stages of the project, and managed to arrange my three month placement at Pfizer despite moving to the US as the department closed.
    [Show full text]
  • Synthesis and Applications of Derivatives of 1,7-Diazaspiro[5.5
    Synthesis and Applications of Derivatives of 1,7-Diazaspiro[5.5]undecane. A Thesis Submitted by Joshua J. P. Almond-Thynne In partial fulfilment of the requirements for the degree of Doctor of Philosophy Department of Chemistry Imperial College London South Kensington London, SW7 2AZ October 2017 1 Declaration of Originality I, Joshua Almond-Thynne, certify that the research described in this manuscript was carried out under the supervision of Professor Anthony G. M. Barrett, Imperial College London and Doctor Anastasios Polyzos, CSIRO, Australia. Except where specific reference is made to the contrary, it is original work produced by the author and neither the whole nor any part had been submitted before for a degree in any other institution Joshua Almond-Thynne October 2017 Copyright Declaration The copyright of this thesis rests with the author and is made available under a Creative Commons Attribution Non-Commercial No Derivatives licence. Researchers are free to copy, distribute or transmit the thesis on the condition that they attribute it, that they do not use it for commercial purposes and that they do not alter, transform or build upon it. For any reuse or redistribution, researchers must make clear to others the licence terms of this work. 2 Abstract Spiroaminals are an understudied class of heterocycle. Recently, the Barrett group reported a relatively mild approach to the most simple form of spiroaminal; 1,7-diazaspiro[5.5]undecane (I).i This thesis consists of the development of novel synthetic methodologies towards the spiroaminal moiety. The first part of this thesis focuses on the synthesis of aliphatic derivatives of I through a variety of methods from the classic Barrett approach which utilises lactam II, through to de novo bidirectional approaches which utilise diphosphate V and a key Horner-Wadsworth- Emmons reaction with aldehyde VI.
    [Show full text]
  • The Role of Conformational Dynamics in Isocyanide Hydratase Catalysis
    University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Theses and Dissertations in Biochemistry Biochemistry, Department of Spring 4-15-2020 THE ROLE OF CONFORMATIONAL DYNAMICS IN ISOCYANIDE HYDRATASE CATALYSIS Medhanjali Dasgupta University of Nebraska - Lincoln, [email protected] Follow this and additional works at: https://digitalcommons.unl.edu/biochemdiss Part of the Biochemistry Commons, Biophysics Commons, Microbiology Commons, and the Structural Biology Commons Dasgupta, Medhanjali, "THE ROLE OF CONFORMATIONAL DYNAMICS IN ISOCYANIDE HYDRATASE CATALYSIS" (2020). Theses and Dissertations in Biochemistry. 29. https://digitalcommons.unl.edu/biochemdiss/29 This Article is brought to you for free and open access by the Biochemistry, Department of at DigitalCommons@University of Nebraska - Lincoln. It has been accepted for inclusion in Theses and Dissertations in Biochemistry by an authorized administrator of DigitalCommons@University of Nebraska - Lincoln. THE ROLE OF CONFORMATIONAL DYNAMICS IN ISOCYANIDE HYDRATASE CATALYSIS by Medhanjali Dasgupta A DISSERTATION Presented to the Faculty of The Graduate College at the University of Nebraska In partial Fulfillment of Requirements For the Degree of Doctor of Philosophy Major: Biochemistry Under the Supervision of Professor Mark A. Wilson Lincoln, Nebraska May 2020 THE ROLE OF CONFORMATIONAL DYNAMICS IN ISOCYANIDE HYDRATASE CATALYSIS Medhanjali Dasgupta, Ph.D. University of Nebraska, 2020 Advisor: Dr. Mark. A. Wilson Post-translational modification of cysteine residues can regulate protein function and is essential for catalysis by cysteine-dependent enzymes. Covalent modifications neutralize charge on the reactive cysteine thiolate anion and thus alter the active site electrostatic environment. Although a vast number of enzymes rely on cysteine modification for function, precisely how altered structural and electrostatic states of cysteine affect protein dynamics, which in turn, affects catalysis, remains poorly understood.
    [Show full text]
  • Activities of the COST D37 Gridchem Computational Chemistry Workflow Group
    Activities of the COST D37 GridChem Computational Chemistry Workflow Group EGEE'07 Conference Budapest 01.10.2007 Thomas Steinke Zuse Institute Berlin (ZIB) <www.zib.de> [email protected] Partners in the CCWF Working Group København • Thomas Steinke, Tim Clark (DE) Cambridge • • Berlin Hans-Peter Lüthi, Martin Brändle (CH) • London Peter Murray-Rust, Henry Rzepa (UK) Antonio Márquez (ES) • Erlangen Kurt Mikkelsen (DK) Zürich • • Manno - CSCS (Manno, CH) - ZIB (Berlin, DE) • Sevilla 2 “Traditional” Workflow in Comppyutational Chemistry WkflWorkflows h ave a l ong t tditiithCCdradition in the CC domai n. start kldb(knowledge base (DB search) automated/manually edited molecular structures molecular simulations method / program A method / program B … properties primary vi suali zati on / qualit y cont rol analysis / archival / DB storage new insights? 3 Databases: Computational protocol (T. Clark, 1998) Complete protocol runs automatically with less than 0.5% failure rate. Cleanup 2D → 3D conversion VAMP optimization Calculate properties ~3,000 compounds per processor day (3 GHz Xeon) Enhanced 3D-Databases: A Fully Electrostatic Database of AM1-Optimized Structures B. Beck, A. Horn, J. E. Carpenter, and T. Clark, J.Chem. Inf. Comput.Sci. 1998, 38, 1214-1217. source: Tim Clark, Uni Erlangen 4 Distributed Computing Environment in the 90 ’s QM packages 5 Distributed Computing Environment in the 90 ’s Example: UniChem distributed environment for quantum-chemical simulations Cray Research Inc. 1991-(2004) 6 CCWF Chemical Illustrator Applications Molecular design of functionalised enzynes Hans-Peter Lüthi, Martin Brändle, Zürich Peter Murray-Rust, Cambridge; Henry Rzepa, London Quantum chemical based QSAR/QSPR Tim Clark, Erlangen; Jon Essex , Southampton High-order dynamic and static electrostatic molecular properties Kurt Mikkelsen, Copenhagen Computational heterogeneous catalysis Antonio M.
    [Show full text]
  • AMBIDENT PROPERTIES of PHOSPHORAMIDAT'es and SULPHONAMIDES a Thesis Submitted by JAMES NICHOLAS ILEY in Partial Fulfillment of T
    Fsci AMBIDENT PROPERTIES OF PHOSPHORAMIDAT'ES AND SULPHONAMIDES A Thesis submitted by JAMES NICHOLAS ILEY in partial fulfillment of the requirements for the Degree of Doctor of Philosophy of the University of London DEPARTMENT OF ORGANIC CHEMISTRY IMPERIAL COLLEGE LONDON, SW7 SEPTEMBER 1979 ACKNOWLEDGEMENTS I wish to thank Dr. Brian Challis for his supervision and advice throughout this project and also for his friendship. I also thank Dr. Henry Rzepa for the use of the MNDO molecular orbital programme and teaching me to use it. The award of a Scholarship by the Salters' Company has enabled me to carry out this work and I acknowledge their generous gift. I greatly appreciate the friendship of all my colleagues throughout the past three years. My wife, Ruth, especially deserves mention for her love and support. Finally, my thanks go to Mrs. Sue Carlile, for typing this manuscript. ABSTRACT The nucleophilic chemistry of amides, phosphoramidates and sulphon- amides is reviewed. Particular reference is paid to the site of substi- tution and its variation with reaction conditions. The phosphorimidate-phōsphoramidate rearrangement is examined. The nature of electrophilic catalysis, particularly by alkyl halides is stud- ied and the mechanism of the reaction is discussed. The relevance of this mechanism to the nucleophilic chemistry of phosphoramidates is outlined. The behaviour of phosphoramidates in aqueous H2SO4, oleum and tri- fluoroacetic acid is briefly examined and the site of protonation in these media discussed. The acylation of phosphoramidates by acyl halides and anhydrides is reported and the effect of base and electrophilic catalysis on the reac- tions is examined.
    [Show full text]
  • 'Molecule of the Month' Website—An Extraordinary Chemistry Educational Resource Online for Over 20 Years
    May, P. , Cotton, S., Harrison, K., & Rzepa, H. (2017). The ‘Molecule of the Month’ Website—An Extraordinary Chemistry Educational Resource Online for over 20 Years. Molecules, 22(4), 549-559. https://doi.org/10.3390/molecules22040549 Publisher's PDF, also known as Version of record License (if available): CC BY Link to published version (if available): 10.3390/molecules22040549 Link to publication record in Explore Bristol Research PDF-document This is the final published version of the article (version of record). It first appeared online via MDPI at http://www.mdpi.com/1420-3049/22/4/549. Please refer to any applicable terms of use of the publisher. University of Bristol - Explore Bristol Research General rights This document is made available in accordance with publisher policies. Please cite only the published version using the reference above. Full terms of use are available: http://www.bristol.ac.uk/red/research-policy/pure/user-guides/ebr-terms/ molecules Article The ‘Molecule of the Month’ Website—An Extraordinary Chemistry Educational Resource Online for over 20 Years Paul W. May 1,*, Simon A. Cotton 2, Karl Harrison 3 and Henry S. Rzepa 4 1 School of Chemistry, University of Bristol, Bristol BS8 1TS, UK 2 School of Chemistry, University of Birmingham, Edgbaston, Birmingham B15 2TT, UK; [email protected] 3 Department of Chemistry, University of Oxford, South Parks Road, Oxford OX1 3QZ, UK; [email protected] 4 Department of Chemistry, Imperial College, South Kensington Campus, London SW7 2AZ, UK; [email protected] * Correspondence: [email protected]; Tel.: +44-0117-928-9927 Academic Editor: Derek J.
    [Show full text]
  • NSF FAIR Chemical Data Publishing Guidelines Workshop on Chemical Structures and Spectra: Major Outcomes and Outlooks for the Chemistry Community
    NSF FAIR Chemical Data Publishing Guidelines Workshop on Chemical Structures and Spectra: Major Outcomes and Outlooks for the Chemistry Community Vincent F. Scalfani – University of Alabama Leah R. McEwen – Cornell University Deposited 04/24/2020 Scalfani, V.F., McEwen, L.R. (2020): NSF FAIR Chemical Data Publishing Guidelines Workshop on Chemical Structures and Spectra: Major Outcomes and Outlooks for the Chemistry Community. Unpublished manuscript. This is the authors' pre-print version that has not yet been submitted for publication. This work is licensed under a Creative Commons Attribution 4.0 International License. NSF FAIR Chemical Data Publishing Guidelines Workshop on Chemical Structures and Spectra: Major Outcomes and Outlooks for the Chemistry Community Vincent F. Scalfani✝* and Leah R. McEwen✝✝* ✝The University of Alabama, Tuscaloosa, AL, USA ✝✝Cornell University, Ithaca, NY, USA *Correspondence to: [email protected] and [email protected] March 24, 2020 ____________________________________________________________________________ Abstract The National Science Foundation Office of Advanced Cyberinfrastructure (NSF-OAC) funded a workshop in March 2019 focused on advancing the sharing of machine-readable chemical structures and spectra. Around 40 stakeholders from the chemistry, chemical information, and software communities took part in the two-day workshop entitled “FAIR Chemical Data Publishing Guidelines for Chemical Structures and Spectra.” Major topics discussed included publishing data workflows and guidelines, FAIR criteria/metadata
    [Show full text]
  • GUIDELINES for the USE of the INTERNET by IUPAC BODIES (Technical Report)
    Pure Appl. Chem., Vol. 71, No. 8, pp. 1587±1591, 1999. Printed in Great Britain. q 1999 IUPAC INTERNATIONAL UNION OF PURE AND APPLIED CHEMISTRY COMMITTEE ON PRINTED AND ELECTRONIC PUBLICATIONS WORKING PARTY ON THE INTERNET AND IUPAC PUBLICATIONS* GUIDELINES FOR THE USE OF THE INTERNET BY IUPAC BODIES (Technical Report) Prepared for publication by: ANTONY N. DAVIES1, STEPHEN R. HELLER2 and JOHN W. JOST3 1ISAS, Institut fuÈr Spektrochemie, Postfach 10 13 52, 44013 Dortmund, Germany <[email protected]> 2Guest Researcher NIST/SRD, Mail Stop: 820/113, 100 Bureau Drive, 820 Diamond Avenue, Room 101, Gaithersburg, MD 20899-2310 USA <[email protected]> 3IUPAC Secretariat, PO Box 13757, Research Triangle Park, NC 27709-3757, USA <[email protected]> *Membership of the Working Party during the preparation of these guidelines (1997±99) was as follows: J.W. Jost (USA), A.N. Davies (Germany), S.R. Heller (USA), H.V. Kehiaian (France), A.D. McNaught (UK). Republication or reproduction of this report or its storage and/or dissemination by electronic means is permitted without the need for formal IUPAC permission on condition that an acknowledgement, with full reference to the source along with use of the copyright symbol q, the name IUPAC and the year of publication are prominently visible. Publication of a translation into another language is subject to the additional condition of prior approval from the relevant IUPAC National Adhering Organization. 1587 1588 COMMITTEE ON PRINTED AND ELECTRONIC PUBLICATIONS Guidelines for the use of the Internet by IUPAC bodies (Technical Report) Abstract: The rapid development of the Internet as a major communication tool between scientists has led to the need for a co-ordinated IUPAC presence.
    [Show full text]
  • Historical Group
    Historical Group NEWSLETTER and SUMMARY OF PAPERS No. 78 Summer 2020 Registered Charity No. 207890 COMMITTEE Chairman: Dr Peter J T Morris ! Dr Christopher J Cooksey (Watford, 5 Helford Way, Upminster, Essex RM14 1RJ ! Hertfordshire) [e-mail: [email protected]] !Prof Alan T Dronsfield (Swanwick) Secretary: Prof. John W Nicholson ! Dr John A Hudson (Cockermouth) 52 Buckingham Road, Hampton, Middlesex, !Prof Frank James (University College) TW12 3JG [e-mail: [email protected]] !Dr Michael Jewess (Harwell, Oxon) Membership Prof Bill P Griffith ! Dr Fred Parrett (Bromley, London) Secretary: Department of Chemistry, Imperial College, ! Prof Henry Rzepa (Imperial College) London, SW7 2AZ [e-mail: [email protected]] Treasurer: Prof Richard Buscall, Exeter, Devon [e-mail: [email protected]] Newsletter Dr Anna Simmons Editor Epsom Lodge, La Grande Route de St Jean, St John, Jersey, JE3 4FL [e-mail: [email protected]] Newsletter Dr Gerry P Moss Production: School of Biological and Chemical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS [e-mail: [email protected]] https://www.qmul.ac.uk/sbcs/rschg/ http://www.rsc.org/historical/ 1 RSC Historical Group Newsletter No. 78 Summer 2020 Contents From the Editor (Anna Simmons) 2 ROYAL SOCIETY OF CHEMISTRY HISTORICAL GROUP NEWS 3 Letter from the Chair (Peter Morris) 3 New “Lockdown” Webinar Series (Peter Morris) 3 RSC 2020 Award for Exceptional Service 3 OBITUARIES 4 Noel G. Coley (1927-2020) (Peter Morris, Jack Betteridge, John Hudson, Anna Simons) 4 Kenneth Schofield (1921-2019), FRSC (W. H. Brock) 5 MEMBERS’ PUBLICATIONS 5 Special Issue of Ambix August 2020 5 PUBLICATIONS OF INTEREST 7 SOCIETY NEWS 8 OTHER NEWS 9 Giessen Celebrates (?) the Centenary of the Liebig Museum (W.
    [Show full text]
  • Grid Workflow Workshop 2011
    Grid Workflow Workshop 2011 GWW 2011 MoSGrid: Progress of Workflow driven Chemical Simulations Georg Birkenheuer1∗, Dirk Blunk2, Sebastian Breuers2, Andre´ Brinkmann1, Gregor Fels3, Sandra Gesing4, Richard Grunzke5, Sonja Herres-Pawlis6, Oliver Kohlbacher4, Jens Kruger¨ 3, Ulrich Lang7, Lars Packschies7, Ralph Muller-Pfefferkorn¨ 5, Patrick Schafer¨ 8, Johannes Schuster1, Thomas Steinke8, Klaus-Dieter Warzecha7, and Martin Wewior7 1Paderborn Center for Parallel Computing, Universitat¨ Paderborn. 2Department fur¨ Chemie, Universitat¨ zu Koln.¨ 3Department Chemie, Universitat¨ Paderborn. 4Zentrum fur¨ Bioinformatik, Eberhard-Karls-Universitat¨ Tubingen.¨ 5Zentrum fur¨ Informationsdienste und Hochleistungsrechnen, Technische Universitat¨ Dresden. 6Fakultat¨ Chemie, Technische Universitat¨ Dortmund. 7Regionales Rechenzentrum, Universitat¨ zu Koln.¨ 8Konrad-Zuse-Institut fur¨ Informationstechnik Berlin. ABSTRACT Parallel to the extension of WS-PGRADE and gUSE for generic Motivation: Web-based access to computational chemistry grid workflows, MoSGrid started to implement intuitive portlets for the resources has proven to be a viable approach to simplify the use orchestration of specific workflows. The workflow application for of simulation codes. The introduction of recipes allows to reuse Molecular Dynamics is described in Section 3, followed by the already developed chemical workflows. By this means, workflows workflow application for Quantum Mechanics in Section 4. Section for recurring basic compute jobs can be provided for daily services. 5 covers the distributed data management for the workflow systems. Nevertheless, the same platform has to be open for active workflow development by experienced users. This paper provides an overview of recent developments of the MoSGrid project on providing tools and instruments for building workflow recipes. 2 THE WORKFLOW-ENABLED GRID PORTAL IN Contact: [email protected] MOSGRID The MoSGrid portal is developed on top of WS-PGRADE [3, 4], a workflow-enabled grid portal.
    [Show full text]
  • COMMITTEE Chairman: Dr Peter J T Morris 5 Helford Way, Upminster, Essex RM14 1RJ [E-Mail: [email protected]] Secretary: Prof
    COMMITTEE Chairman: Dr Peter J T Morris 5 Helford Way, Upminster, Essex RM14 1RJ [e-mail: [email protected]] Secretary: Prof. John W Nicholson 52 Buckingham Road, Hampton, Middlesex, TW12 3JG [e-mail: [email protected]] Membership Prof Bill P Griffith Secretary: Department of Chemistry, Imperial College, London, SW7 2AZ [e-mail: [email protected]] Treasurer: Prof Richard Buscall 34 Maritime Court, Haven Road, Exeter EX2 8GP Newsletter Dr Anna Simmons Editor Epsom Lodge, La Grande Route de St Jean, St John, Jersey, JE3 4FL [e-mail: [email protected]] Newsletter Dr Gerry P Moss Production: School of Biological and Chemical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS [e-mail: [email protected]] Committee: Dr Helen Cooke (Nantwich) Dr Christopher J Cooksey (Watford, Hertfordshire) Prof Alan T Dronsfield (Swanwick, Derbyshire) Dr John A Hudson (Cockermouth) Prof Frank James (University College London) Dr Fred Parrett (Bromley, London) Dr Kathryn Roberts Prof Henry Rzepa (Imperial College) https://www.qmul.ac.uk/sbcs/rschg/ http://www.rsc.org/historical/ -1- RSC Historical Group Newsletter No. 79 Winter 2021 From the Editor Welcome to the winter 2021 RSC Historical Group Newsletter. I Contents sincerely hope it finds you and your loved ones keeping safe and well From the Editor (Anna Simmons) 3 and that it provides some interesting reading during the ongoing crisis. Letter from the Chair (Peter Morris) 4 This issue includes a bumper crop of short articles and I am most ROYAL SOCIETY OF CHEMISTRY
    [Show full text]
  • Herman Skolnik Award Symposium Honoring Henry Rzepa and Peter Murray-Rust ACS National Meeting, Philadelphia, PA, August 21, 2012
    Herman Skolnik Award Symposium Honoring Henry Rzepa and Peter Murray-Rust ACS National Meeting, Philadelphia, PA, August 21, 2012 A report by Wendy Warr ([email protected]) for the ACS CINF Chemical Information Bulletin Introduction This one-day symposium was remarkable for its record number of speakers (23 in all, plus one withdrawn and one replaced by a demonstration). Despite the number of performers, and some unfortunate technical faults, the whole event proceeded on schedule and without serious mishap. Henry Rzepa’s own talk was an opening scene-setter. He told a 1992 tale of some molecular orbitals explaining the course of a chemical reaction in 1992. The color diagram of these lacked semantics, and when it had been sent by fax to Bangor, it even lost its color. Months later the work was published,1 but the supporting information (SI) is not available for this article, and even if it were available electronically, would it be usable? So, how can it be mined for useful data or used as the starting point for further investigation? By 1994 Henry and his colleagues had recognized the opportunities presented by the World Wide Web.2,3 The data for a later article4 do survive in the form of Quicktime and MPEG animations on the Imperial College Gopher+ server but they are semantically poor, i.e., they are interpretable by humans but not by computer. The X-ray crystallography data are locatable using the proprietary identifier HEHXIB allocated by the Cambridge Crystallographic Data Center. Open identifiers such as the IUPAC International Chemical Identifier, InChI, are preferable.
    [Show full text]