The Alexandria Library, a Quantum-Chemical Database of Molecular Properties for Force field Development 9 2017 Received: October 1 1 1 Mohammad M
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Open Babel Documentation Release 2.3.1
Open Babel Documentation Release 2.3.1 Geoffrey R Hutchison Chris Morley Craig James Chris Swain Hans De Winter Tim Vandermeersch Noel M O’Boyle (Ed.) December 05, 2011 Contents 1 Introduction 3 1.1 Goals of the Open Babel project ..................................... 3 1.2 Frequently Asked Questions ....................................... 4 1.3 Thanks .................................................. 7 2 Install Open Babel 9 2.1 Install a binary package ......................................... 9 2.2 Compiling Open Babel .......................................... 9 3 obabel and babel - Convert, Filter and Manipulate Chemical Data 17 3.1 Synopsis ................................................. 17 3.2 Options .................................................. 17 3.3 Examples ................................................. 19 3.4 Differences between babel and obabel .................................. 21 3.5 Format Options .............................................. 22 3.6 Append property values to the title .................................... 22 3.7 Filtering molecules from a multimolecule file .............................. 22 3.8 Substructure and similarity searching .................................. 25 3.9 Sorting molecules ............................................ 25 3.10 Remove duplicate molecules ....................................... 25 3.11 Aliases for chemical groups ....................................... 26 4 The Open Babel GUI 29 4.1 Basic operation .............................................. 29 4.2 Options ................................................. -
Flexible Heuristic Algorithm for Automatic Molecule Fragmentation: Application to the UNIFAC Group Contribution Model Simon Müller*
Müller J Cheminform (2019) 11:57 https://doi.org/10.1186/s13321-019-0382-3 Journal of Cheminformatics RESEARCH ARTICLE Open Access Flexible heuristic algorithm for automatic molecule fragmentation: application to the UNIFAC group contribution model Simon Müller* Abstract A priori calculation of thermophysical properties and predictive thermodynamic models can be very helpful for developing new industrial processes. Group contribution methods link the target property to contributions based on chemical groups or other molecular subunits of a given molecule. However, the fragmentation of the molecule into its subunits is usually done manually impeding the fast testing and development of new group contribution methods based on large databases of molecules. The aim of this work is to develop strategies to overcome the challenges that arise when attempting to fragment molecules automatically while keeping the defnition of the groups as simple as possible. Furthermore, these strategies are implemented in two fragmentation algorithms. The frst algorithm fnds only one solution while the second algorithm fnds all possible fragmentations. Both algorithms are tested to frag- ment a database of 20,000 molecules for use with the group contribution model Universal Quasichemical Func- tional Group Activity Coefcients+ (UNIFAC). Comparison of the results with a reference database shows that both algorithms are capable of successfully fragmenting all the molecules automatically. Furthermore, when applying them on a larger database it is shown, that the newly developed algorithms are capable of fragmenting structures previously thought not possible to fragment. Keywords: Molecule fragmentation, Cheminformatics, RDKit, Property prediction, Group contribution method, UNIFAC, Incrementation Introduction named QSPR methods (Quantitative Structure Property Cheminformatics is a growing feld due to the increas- Relationship). -
Chemical Database Projects Delivered by RSC Escience
Chemical Database Projects Delivered by RSC eScience at the FDA Meeting “Development of a Freely Distributable Data System for the Registration of Substances” Antony Williams RSC eScience . What was once just ChemSpider is much more… . ChemSpider Reactions . Chemicals Validation and Standardization Platform . Learn Chemistry Wiki . National Chemical Database Service . Open PHACTS . PharmaSea . Global Chemistry Hub We are known for ChemSpider… . The Free Chemical Database . A central hub for chemists to source information . >28 million unique chemical records . Aggregated from >400 data sources . Chemicals, spectra, CIF files, movies, images, podcasts, links to patents, publications, predictions . A central hub for chemists to deposit & curate data We Want to Answer Questions . Questions a chemist might ask… . What is the melting point of n-heptanol? . What is the chemical structure of Xanax? . Chemically, what is phenolphthalein? . What are the stereocenters of cholesterol? . Where can I find publications about xylene? . What are the different trade names for Ketoconazole? . What is the NMR spectrum of Aspirin? . What are the safety handling issues for Thymol Blue? I want to know about “Vincristine” Vincristine: Identifiers and Properties Vincristine: Vendors and Sources Vincristine: Articles How did we build it? . We deal in Molfiles or SDF files – with coordinates . Deposit anything that has an InChI – we support what InChI can handle, good and bad . Standardization based on “InChI standardization” . InChIs aggregate (certain) tautomers The InChI Identifier Downsides of InChI . Good for small molecules – but no polymers, issues with inorganics, organometallics, imperfect stereochemistry. ChemSpider is “small molecules” . InChI used as the “deduplicator” – FIRST version of a compound into the database becomes THE structure to deduplicate against… Side Effects of InChI Usage SMILES by comparison… Side Effects of InChI Usage Searches: The INTERNET Search by InChI ChemSpider Google Search http://www.chemspider.com/google/ How did we build it? . -
Chemical File Format Conversion Tools : a N Overview
International Journal of Engineering Research & Technology (IJERT) ISSN: 2278-0181 Vol. 3 Issue 2, February - 2014 Chemical File Format Conversion Tools : A n Overview Kavitha C. R Dr. T Mahalekshmi Research Scholar, Bharathiyar University Principal Dept of Computer Applications Sree Narayana Institute of Technology SNGIST Kollam, India Cochin, India Abstract— There are a lot of chemical data stored in large different chemical file formats. Three types of file format databases, repositories and other resources. These data are used conversion tools are discussed in section III. And the by different researchers in different applications in various areas conclusion is given in section IV followed by the references. of chemistry. Since these data are stored in several standard chemical file formats, there is a need for the inter-conversion of II. CHEMICAL FILE FORMATS chemical structures between different formats because all the formats are not supported by various software and tools used by A chemical is a collection of atoms bonded together the researchers. Therefore it becomes essential to convert one file in space. The structure of a chemical makes it unique and format to another. This paper reviews some of the chemical file gives it its physical and biological characteristics. This formats and also presents a few inter-conversion tools such as structure is represented in a variety of chemical file formats. Open Babel [1], Mol converter [2] and CncTranslate [3]. These formats are used to represent chemical structure records and its associated data fields. Some of the file Keywords— File format Conversion, Open Babel, mol formats are CML (Chemical Markup Language), SDF converter, CncTranslate, inter- conversion tools. -
Predicting Outcomes of Catalytic Reactions Using Machine Learning
Predicting outcomes of catalytic reactions using ma- chine learning† Trevor David Rhone,∗a Robert Hoyt,a Christopher R. O’Connor,b Matthew M. Montemore,b,c Challa S.S.R. Kumar,b,c Cynthia M. Friend,b,c and Efthimios Kaxiras a,c Predicting the outcome of a chemical reaction using efficient computational models can be used to develop high-throughput screening techniques. This can significantly reduce the number of experiments needed to be performed in a huge search space, which saves time, effort and ex- pense. Recently, machine learning methods have been bolstering conventional structure-activity relationships used to advance understanding of chemical reactions. We have developed a model to predict the products of catalytic reactions on the surface of oxygen-covered and bare gold using machine learning. Using experimental data, we developed a machine learning model that maps reactants to products, using a chemical space representation. This involves predicting a chemical space value for the products, and then matching this value to a molecular structure chosen from a database. The database was developed by applying a set of possible reaction outcomes using known reaction mechanisms. Our machine learning approach complements chemical intuition in predicting the outcome of several types of chemical reactions. In some cases, machine learn- ing makes correct predictions where chemical intuition fails. We achieve up to 93% prediction accuracy for a small data set of less than two hundred reactions. 1 Introduction oped to aid in reaction prediction, they have generally involved Efficient prediction of reaction products has long been a major encoding this intuition into a set of rules, and have not been goal of the organic chemistry community 1–3 and the drug discov- widely adopted 2,6–9. -
Umansysprop V1.0: an Online and Open-Source Facility for Molecular Property Prediction and Atmospheric Aerosol Calculations
UManSysProp V1.0: An Online and Open-Source Facility for Molecular Property Prediction and Atmospheric Aerosol Calculations Topping, David and Barley, Mark and Bane, Michael and Higham, Nicholas J. and Aumont, Berbard and Dingle, Nicholas and McFiggans, Gordon 2016 MIMS EPrint: 2016.13 Manchester Institute for Mathematical Sciences School of Mathematics The University of Manchester Reports available from: http://eprints.maths.manchester.ac.uk/ And by contacting: The MIMS Secretary School of Mathematics The University of Manchester Manchester, M13 9PL, UK ISSN 1749-9097 Geosci. Model Dev., 9, 899–914, 2016 www.geosci-model-dev.net/9/899/2016/ doi:10.5194/gmd-9-899-2016 © Author(s) 2016. CC Attribution 3.0 License. UManSysProp v1.0: an online and open-source facility for molecular property prediction and atmospheric aerosol calculations David Topping1,2, Mark Barley2, Michael K. Bane3, Nicholas Higham4, Bernard Aumont5, Nicholas Dingle6, and Gordon McFiggans2 1National Centre for Atmospheric Science, Manchester, M13 9PL, UK 2Centre for Atmospheric Science, University of Manchester, Manchester, M13 9PL, UK 3High End Compute, Manchester, M13 9PL, UK 4School of Mathematics, University of Manchester, Manchester, M13 9PL, UK 5LISA, UMR CNRS 7583, Universite Paris Est Creteil et Universite Paris Diderot, Creteil, France 6Numerical Algorithms Group (NAG), Ltd Peter House Oxford Street, Manchester, M1 5AN, UK Correspondence to: David Topping ([email protected]) Received: 1 September 2015 – Published in Geosci. Model Dev. Discuss.: 3 November 2015 Revised: 3 February 2016 – Accepted: 16 February 2016 – Published: 1 March 2016 Abstract. In this paper we describe the development and opment of a user community. -
Package Name Software Description Project
A S T 1 Package Name Software Description Project URL 2 Autoconf An extensible package of M4 macros that produce shell scripts to automatically configure software source code packages https://www.gnu.org/software/autoconf/ 3 Automake www.gnu.org/software/automake 4 Libtool www.gnu.org/software/libtool 5 bamtools BamTools: a C++ API for reading/writing BAM files. https://github.com/pezmaster31/bamtools 6 Biopython (Python module) Biopython is a set of freely available tools for biological computation written in Python by an international team of developers www.biopython.org/ 7 blas The BLAS (Basic Linear Algebra Subprograms) are routines that provide standard building blocks for performing basic vector and matrix operations. http://www.netlib.org/blas/ 8 boost Boost provides free peer-reviewed portable C++ source libraries. http://www.boost.org 9 CMake Cross-platform, open-source build system. CMake is a family of tools designed to build, test and package software http://www.cmake.org/ 10 Cython (Python module) The Cython compiler for writing C extensions for the Python language https://www.python.org/ 11 Doxygen http://www.doxygen.org/ FFmpeg is the leading multimedia framework, able to decode, encode, transcode, mux, demux, stream, filter and play pretty much anything that humans and machines have created. It supports the most obscure ancient formats up to the cutting edge. No matter if they were designed by some standards 12 ffmpeg committee, the community or a corporation. https://www.ffmpeg.org FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions, of arbitrary input size, and of both real and 13 fftw complex data (as well as of even/odd data, i.e. -
Quantum Chemical Calculations of NMR Parameters
Quantum Chemical Calculations of NMR Parameters Tatyana Polenova University of Delaware Newark, DE Winter School on Biomolecular NMR January 20-25, 2008 Stowe, Vermont OUTLINE INTRODUCTION Relating NMR parameters to geometric and electronic structure Classical calculations of EFG tensors Molecular properties from quantum chemical calculations Quantum chemistry methods DENSITY FUNCTIONAL THEORY FOR CALCULATIONS OF NMR PARAMETERS Introduction to DFT Software Practical examples Tutorial RELATING NMR OBSERVABLES TO MOLECULAR STRUCTURE NMR Spectrum NMR Parameters Local geometry Chemical structure (reactivity) I. Calculation of experimental NMR parameters Find unique solution to CQ, Q, , , , , II. Theoretical prediction of fine structure constants from molecular geometry Classical electrostatic model (EFG)- only in simple ionic compounds Quantum mechanical calculations (Density Functional Theory) (EFG, CSA) ELECTRIC FIELD GRADIENT (EFG) TENSOR: POINT CHARGE MODEL EFG TENSOR IS DETERMINED BY THE COMBINED ELECTRONIC AND NUCLEAR WAVEFUNCTION, NO ANALYTICAL EXPRESSION IN THE GENERAL CASE THE SIMPLEST APPROXIMATION: CLASSICAL POINT CHARGE MODEL n Zie 4 V2,k = 3 Y2,k ()i,i i=1 di 5 ATOMS CONTRIBUTING TO THE EFG TENSOR ARE TREATED AS POINT CHARGES, THE RESULTING EFG TENSOR IS THE SUM WITH RESPECT TO ALL ATOMS VERY CRUDE MODEL, WORKS QUANTITATIVELY ONLY IN SIMPLEST IONIC SYSTEMS, BUT YIELDS QUALITATIVE TRENDS AND GENERAL UNDERSTANDING OF THE SYMMETRY AND MAGNITUDE OF THE EXPECTED TENSOR ELECTRIC FIELD GRADIENT (EFG) TENSOR: POINT CHARGE MODEL n Zie 4 V2,k = 3 Y2,k ()i,i i=1 di 5 Ze V = ; V = 0; V = 0 2,0 d 3 2,±1 2,±2 2Ze V = ; V = 0; V = 0 2,0 d 3 2,±1 2,±2 3 Ze V = ; V = 0; V = 0 2,0 2 d 3 2,±1 2,±2 V2,0 = 0; V2,±1 = 0; V2,±2 = 0 MOLECULAR PROPERTIES FROM QUANTUM CHEMICAL CALCULATIONS H = E See for example M. -
Pipenightdreams Osgcal-Doc Mumudvb Mpg123-Alsa Tbb
pipenightdreams osgcal-doc mumudvb mpg123-alsa tbb-examples libgammu4-dbg gcc-4.1-doc snort-rules-default davical cutmp3 libevolution5.0-cil aspell-am python-gobject-doc openoffice.org-l10n-mn libc6-xen xserver-xorg trophy-data t38modem pioneers-console libnb-platform10-java libgtkglext1-ruby libboost-wave1.39-dev drgenius bfbtester libchromexvmcpro1 isdnutils-xtools ubuntuone-client openoffice.org2-math openoffice.org-l10n-lt lsb-cxx-ia32 kdeartwork-emoticons-kde4 wmpuzzle trafshow python-plplot lx-gdb link-monitor-applet libscm-dev liblog-agent-logger-perl libccrtp-doc libclass-throwable-perl kde-i18n-csb jack-jconv hamradio-menus coinor-libvol-doc msx-emulator bitbake nabi language-pack-gnome-zh libpaperg popularity-contest xracer-tools xfont-nexus opendrim-lmp-baseserver libvorbisfile-ruby liblinebreak-doc libgfcui-2.0-0c2a-dbg libblacs-mpi-dev dict-freedict-spa-eng blender-ogrexml aspell-da x11-apps openoffice.org-l10n-lv openoffice.org-l10n-nl pnmtopng libodbcinstq1 libhsqldb-java-doc libmono-addins-gui0.2-cil sg3-utils linux-backports-modules-alsa-2.6.31-19-generic yorick-yeti-gsl python-pymssql plasma-widget-cpuload mcpp gpsim-lcd cl-csv libhtml-clean-perl asterisk-dbg apt-dater-dbg libgnome-mag1-dev language-pack-gnome-yo python-crypto svn-autoreleasedeb sugar-terminal-activity mii-diag maria-doc libplexus-component-api-java-doc libhugs-hgl-bundled libchipcard-libgwenhywfar47-plugins libghc6-random-dev freefem3d ezmlm cakephp-scripts aspell-ar ara-byte not+sparc openoffice.org-l10n-nn linux-backports-modules-karmic-generic-pae -
A New Jmol Interface for Handling and Visualizing
computer programs Journal of Applied J-ICE: a new Jmol interface for handling and Crystallography visualizing crystallographic and electronic ISSN 0021-8898 properties Received 20 October 2010 a b c Accepted 25 November 2010 Pieremanuele Canepa, * Robert M. Hanson, Piero Ugliengo and Maria Alfredssona aSchool of Physical Sciences, University of Kent, Ingram Building, Canterbury, Kent CT2 7NH, England, bDepartment of Chemistry, St Olaf College, 1520 St Olaf Avenue, Northfield, MN 55057, USA, and cDipartimento di Chimica IFM, Universita` degli Studi di Torino, and NIS (Nanostructured Interfaces and Surfaces) Centre of Excellence, Via P. Giuria 7, Torino 10125, Italy. Correspondence e-mail: [email protected] The growth in complexity of quantum mechanical software packages for modelling the physicochemical properties of crystalline materials may hinder their usability by the vast majority of non-specialized users. Consequently, a free operating-system-independent graphical user interface (GUI) has been devel- oped to drive the most common simulation packages for treating both molecules and solids. In order to maintain maximum portability and graphical efficiency, the popular molecular graphics engine Jmol, written in the portable Java language, has been combined with a specialized GUI encoded in HTML and JavaScript. This framework, called J-ICE, allows users to visualize, build and manipulate complex input or output results (derived from modelling) entirely via a web server, i.e. without the burden of installing complex packages. This solution also dramatically speeds up both the development procedure and bug # 2011 International Union of Crystallography fixing. Among the range of software appropriate for modelling condensed Printed in Singapore – all rights reserved matter, the focus of J-ICE is currently only on CRYSTAL09 and VASP. -
Bringing Open Source to Drug Discovery
Bringing Open Source to Drug Discovery Chris Swain Cambridge MedChem Consulting Standing on the shoulders of giants • There are a huge number of people involved in writing open source software • It is impossible to acknowledge them all individually • The slide deck will be available for download and includes 25 slides of details and download links – Copy on my website www.cambridgemedchemconsulting.com Why us Open Source software? • Allows access to source code – You can customise the code to suit your needs – If developer ceases trading the code can continue to be developed – Outside scrutiny improves stability and security What Resources are available • Toolkits • Databases • Web Services • Workflows • Applications • Scripts Toolkits • OpenBabel (htttp://openbabel.org) is a chemical toolbox – Ready-to-use programs, and complete programmer's toolkit – Read, write and convert over 110 chemical file formats – Filter and search molecular files using SMARTS and other methods, KNIME add-on – Supports molecular modeling, cheminformatics, bioinformatics – Organic chemistry, inorganic chemistry, solid-state materials, nuclear chemistry – Written in C++ but accessible from Python, Ruby, Perl, Shell scripts… Toolkits • OpenBabel • R • CDK • OpenCL • RDkit • SciPy • Indigo • NumPy • ChemmineR • Pandas • Helium • Flot • FROWNS • GNU Octave • Perlmol • OpenMPI Toolkits • RDKit (http://www.rdkit.org) – A collection of cheminformatics and machine-learning software written in C++ and Python. – Knime nodes – The core algorithms and data structures are written in C ++. Wrappers are provided to use the toolkit from either Python or Java. – Additionally, the RDKit distribution includes a PostgreSQL-based cartridge that allows molecules to be stored in relational database and retrieved via substructure and similarity searches. -
PSC-Db: a Structured and Searchable 3D-Database for Plant Secondary Compounds
molecules Article PSC-db: A Structured and Searchable 3D-Database for Plant Secondary Compounds Alejandro Valdés-Jiménez 1,† , Carlos Peña-Varas 2,†, Paola Borrego-Muñoz 3 , Lily Arrue 2 , Melissa Alegría-Arcos 2, Hussam Nour-Eldin 4, Ingo Dreyer 1 , Gabriel Nuñez-Vivanco 1 and David Ramírez 2,* 1 Center for Bioinformatics, Simulations, and Modeling (CBSM), Faculty of Engineering, University of Talca, Talca 3460000, Chile; [email protected] (A.V.-J.); [email protected] (I.D.); [email protected] (G.N.-V.) 2 Instituto de Ciencias Biomédicas, Universidad Autónoma de Chile, Santiago 8900000, Chile; [email protected] (C.P.-V.); [email protected] (L.A.); [email protected] (M.A.-A.) 3 Bioorganic Chemistry Laboratory, Facultad de Ciencias Básicas y Aplicadas, Campus Nueva Granada, Universidad Militar Nueva Granada, Cajicá 250247, Colombia; [email protected] 4 DynaMo Center, Department of Plant and Environmental Sciences, University of Copenhagen, 1017 Copenhagen, Denmark; [email protected] * Correspondence: [email protected]; Tel.: +56-2-230-36667 † These authors equally contributed to this work. Abstract: Plants synthesize a large number of natural products, many of which are bioactive and have practical values as well as commercial potential. To explore this vast structural diversity, we present PSC-db, a unique plant metabolite database aimed to categorize the diverse phytochemical Citation: Valdés-Jiménez, A.; space by providing 3D-structural information along with physicochemical and pharmaceutical Peña-Varas, C.; Borrego-Muñoz, P.; Arrue, L.; Alegría-Arcos, M.; properties of the most relevant natural products. PSC-db may be utilized, for example, in qualitative Nour-Eldin, H.; Dreyer, I.; estimation of biological activities (Quantitative Structure-Activity Relationship, QSAR) or massive Nuñez-Vivanco, G.; Ramírez, D.