Recent Advances in the Application of Data Science and Machine/Deep Learning Methods to Research in Chemical and Biological Sciences : a Review
Total Page:16
File Type:pdf, Size:1020Kb
Load more
										Recommended publications
									
								- 
												  Free and Open Source Software for Computational Chemistry EducationFree and Open Source Software for Computational Chemistry Education Susi Lehtola∗,y and Antti J. Karttunenz yMolecular Sciences Software Institute, Blacksburg, Virginia 24061, United States zDepartment of Chemistry and Materials Science, Aalto University, Espoo, Finland E-mail: [email protected].fi Abstract Long in the making, computational chemistry for the masses [J. Chem. Educ. 1996, 73, 104] is finally here. We point out the existence of a variety of free and open source software (FOSS) packages for computational chemistry that offer a wide range of functionality all the way from approximate semiempirical calculations with tight- binding density functional theory to sophisticated ab initio wave function methods such as coupled-cluster theory, both for molecular and for solid-state systems. By their very definition, FOSS packages allow usage for whatever purpose by anyone, meaning they can also be used in industrial applications without limitation. Also, FOSS software has no limitations to redistribution in source or binary form, allowing their easy distribution and installation by third parties. Many FOSS scientific software packages are available as part of popular Linux distributions, and other package managers such as pip and conda. Combined with the remarkable increase in the power of personal devices—which rival that of the fastest supercomputers in the world of the 1990s—a decentralized model for teaching computational chemistry is now possible, enabling students to perform reasonable modeling on their own computing devices, in the bring your own device 1 (BYOD) scheme. In addition to the programs’ use for various applications, open access to the programs’ source code also enables comprehensive teaching strategies, as actual algorithms’ implementations can be used in teaching.
- 
												  Computational ChemistryG UEST E DITORS’ I NTRODUCTION COMPUTATIONAL CHEMISTRY omputational chemistry has come of chemical, physical, and biological phenomena. age. With significant strides in com- The Nobel Prize award to John Pople and Wal- puter hardware and software over ter Kohn in 1998 highlighted the importance of the last few decades, computational these advances in computational chemistry. Cchemistry has achieved full partnership with the- With massively parallel computers capable of ory and experiment as a tool for understanding peak performance of several teraflops already on and predicting the behavior of a broad range of the scene and with the development of parallel software for efficient exploitation of these high- end computers, we can anticipate that computa- tional chemistry will continue to change the scientific landscape throughout the coming cen- tury. The impact of these advances will be broad and encompassing, because chemistry is so cen- tral to the myriad of advances we anticipate in areas such as materials design, biological sci- ences, and chemical manufacturing. Application areas Materials design is very broad in scope. It ad- dresses a diverse range of compositions of matter that can better serve structural and functional needs in all walks of life. Catalytic design is one such example that clearly illustrates the promise and challenges of computational chemistry. Al- most all chemical reactions of industrial or bio- chemical significance are catalytic. Yet, catalysis is still more of an art (at best) or an empirical fact than a design science. But this is sure to change. A recent American Chemical Society Sympo- sium volume on transition state modeling in catalysis illustrates its progress.1 This sympo- sium highlighted a significant development in the level of modeling that researchers are em- 1521-9615/00/$10.00 © 2000 IEEE ploying as they focus more and more on the DONALD G.
- 
												  NIST Computational Chemistry Comparison and Benchmark DatabaseNIST Computational Chemistry Comparison and Benchmark Database A website which compares computed and experimental ideal-gas thermochemical properties http://srdata.nist.gov/cccbdb Russell D. Johnson III Computational Chemistry Group, NIST http://www.nist.gov/compchem How good is that ab initio calculation? For predicting enthalpies. For predicting geometries. For predicting vibrational frequencies. For this molecule. For these molecules. http://srdata.nist.gov/cccbdb CCCBDB Contents • 680 molecules • Enthalpies of formation, Entropies, Geometries, Vibrational frequencies • Experimental values • Computed values • Comparisons • Auxiliary information http://srdata.nist.gov/cccbdb Properties • Energetics – Enthalpies of formation – Atomization enthalpies – Reaction enthalpies – Barriers to Internal Rotation • Entropies – Heat Capacities – Integrated Heat Capacities http://srdata.nist.gov/cccbdb More Properties • Geometries – Bond lengths and angles – Rotational Constants – Moments of Inertia • Vibrational Frequencies – Intensities – Zero point energies http://srdata.nist.gov/cccbdb Even More Properties • Electrostatics – Dipole Moments – Quadrupole Moments – Polarizabilities – Charges • Mulliken •ESP • ChelpG http://srdata.nist.gov/cccbdb Molecules in CCCBDB • Small, gas-phase • Mostly < 7 heavy atoms - hexane • Mostly no atoms with atomic number >17 (chlorine). A dozen Br-containing molecules. • 90 diatomics, 574 polyatomics • 469 organic molecules, – 125 hydrocarbons – 145 CHO species – 16 amides http://srdata.nist.gov/cccbdb Experimental
- 
												  UNE Postgraduate Conference 2017UNE Postgraduate Conference 2017 ‘Intersections of Knowledge’ 17-18 January 2017 Conference Proceedings Postgraduate Conference Conference Proceedings “Intersections of Knowledge” UNE Postgraduate Conference 2017 17th and 18th January 2017 Resource Management Building University of New England Acknowledgement Phillip Thomas – UNE Research Services Co-chair and convenor Postgraduate Conference Organising Committee: – Stuart Fisher (Co-Chair) Eliza Kent, Grace Jeffery, Emma Lockyer, Bezaye Tessema, Kristal Spreadborough, Kerry Gleeson, Marguerite Jones, Anne O’Donnell-Ostini, Rubeca Fancy, Maximillian Obiakor, Jane Michie, John Cook, Sanaz Alian, Julie Orr, Vivek Nemane, Nadiezhda Ramirez Cabral. Back row left to right: Stuart Fisher, Julie Orr, Vivek Nemane, Philip Thomas, Maximillian Obiakor Front row left to right: Emma Lockyer, Bezaye Tessema, Nadiezhda Ramirez Cabral, Kerry Gleeson, Kristal Spreadborough, Rubeca Fancy, Anne O’Donnell-Ostini, Grace Jeffery. Absent: Eliza Kent, Marguerite Jones, John Cook, Jane Michie, UNE Areas: IT Training, Research Services, Audio-Visual Support, Marketing and Public Relations, Corporate Communications, Strategic Projects Group, School of Science and Technology, VC’s Unit, Workforce Strategy, Information Technology Directorate and Development Unit. Sponsor: UNE Life, University of New England Student Association (UNESA) Research creates knowledge and when we share what we have discovered we create rich intersections that self-generate new thinking, ideas and actions within and across networks. With
- 
												  Energy Technology and ManagementENERGY TECHNOLOGY AND MANAGEMENT Edited by Tauseef Aized Energy Technology and Management Edited by Tauseef Aized Published by InTech Janeza Trdine 9, 51000 Rijeka, Croatia Copyright © 2011 InTech All chapters are Open Access articles distributed under the Creative Commons Non Commercial Share Alike Attribution 3.0 license, which permits to copy, distribute, transmit, and adapt the work in any medium, so long as the original work is properly cited. After this work has been published by InTech, authors have the right to republish it, in whole or part, in any publication of which they are the author, and to make other personal use of the work. Any republication, referencing or personal use of the work must explicitly identify the original source. Statements and opinions expressed in the chapters are these of the individual contributors and not necessarily those of the editors or publisher. No responsibility is accepted for the accuracy of information contained in the published articles. The publisher assumes no responsibility for any damage or injury to persons or property arising out of the use of any materials, instructions, methods or ideas contained in the book. Publishing Process Manager Iva Simcic Technical Editor Teodora Smiljanic Cover Designer Jan Hyrat Image Copyright Sideways Design, 2011. Used under license from Shutterstock.com First published September, 2011 Printed in Croatia A free online edition of this book is available at www.intechopen.com Additional hard copies can be obtained from [email protected] Energy Technology and Management, Edited by Tauseef Aized p. cm. ISBN 978-953-307-742-0 free online editions of InTech Books and Journals can be found at www.intechopen.com Contents Preface IX Part 1 Energy Technology 1 Chapter 1 Centralizing the Power Saving Mode for 802.11 Infrastructure Networks 3 Yi Xie, Xiapu Luo and Rocky K.
- 
												  Mathematical Chemistry! Is It? and If So, What Is It?Mathematical Chemistry! Is It? And if so, What Is It? Douglas J. Klein Abstract: Mathematical chemistry entailing the development of novel mathe- matics for chemical applications is argued to exist, and to manifest an extreme- ly diverse range of applications. Yet further it is argued to have a substantial history of well over a century, though the field has perhaps only attained a de- gree of recognition with a formal widely accepted naming in the last few dec- ades. The evidence here for the broad range and long history is by way of nu- merous briefly noted example sub-areas. That mathematical chemistry was on- ly recently formally recognized is seemingly the result of its having been somewhat disguised for a period of time – sometimes because it was viewed as just an unnamed part of physical chemistry, and sometimes because the rather frequent applications in other chemical areas were not always viewed as math- ematical (often involving somewhat ‘non-numerical’ mathematics). Mathemat- ical chemistry’s relation to and distinction from computational chemistry & theoretical chemistry is further briefly addressed. Keywords : mathematical chemistry, physical chemistry, computational chemistry, theoretical chemistry. 1. Introduction Chemistry is a rich and complex science, exhibiting a diversity of reproduci- ble and precisely describable predictions. Many predictions are quantitative numerical predictions and also many are of a qualitative (non-numerical) nature, though both are susceptible to sophisticated mathematical formaliza- tion. As such, it should naturally be anticipated that there is a ‘mathematical chemistry’, rather likely with multiple roots and with multiple aims. Mathe- matical chemistry should focus on mathematically novel ideas and concepts adapted or developed for use in chemistry (this view being much in parallel with that for other similarly named mathematical fields, in physics, or in biology, or in sociology, etc.
- 
												  Path Optimization in Free Energy CalculationsPath Optimization in Free Energy Calculations by Ryan Muraglia Graduate Program in Computational Biology & Bioinformatics Duke University Date: Approved: Scott Schmidler, Supervisor Patrick Charbonneau Paul Magwene Thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in the Graduate Program in Computational Biology & Bioinformatics in the Graduate School of Duke University 2016 Abstract Path Optimization in Free Energy Calculations by Ryan Muraglia Graduate Program in Computational Biology & Bioinformatics Duke University Date: Approved: Scott Schmidler, Supervisor Patrick Charbonneau Paul Magwene An abstract of a thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in the Graduate Program in Computational Biology & Bioinformatics in the Graduate School of Duke University 2016 Copyright c 2016 by Ryan Muraglia All rights reserved except the rights granted by the Creative Commons Attribution-Noncommercial License Abstract Free energy calculations are a computational method for determining thermodynamic quantities, such as free energies of binding, via simulation. Currently, due to compu- tational and algorithmic limitations, free energy calculations are limited in scope. In this work, we propose two methods for improving the efficiency of free energy calcu- lations. First, we expand the state space of alchemical intermediates, and show that this expansion enables us to calculate free energies along lower variance paths. We use Q-learning, a reinforcement learning technique, to discover and optimize paths at low computational cost. Second, we reduce the cost of sampling along a given path by using sequential Monte Carlo samplers. We develop a new free energy estima- tor, pCrooks (pairwise Crooks), a variant on the Crooks fluctuation theorem (CFT), which enables decomposition of the variance of the free energy estimate for discrete paths, while retaining beneficial characteristics of CFT.
- 
												  Chapter Patterns of Curriculum DesignChapter Patterns of Curriculum Design Douglas Blank & Deepak Kumar Department of Mathematics & Computer Science, Bryn Mawr College, Bryn Mawr, PA 19010 (USA) Email: dblank, [email protected] Abstract We present a perspective on the design of a curriculum for a new computer science program at a womens liberal arts college. The design incorporates lessons learned at the college in its successful implementation of other academic programs, incorporation of best practices in curriculum design at other colleges, results from studies performed on various computer science programs, and a significant number of our own ideas. Several observations and design decisions are presented as curriculum design patterns. The goal of making the design patterns explicit is to encourage a discussion on curriculum design that goes beyond the identification of core knowledge areas and courses. 1. INTRODUCTION In this paper, we present a perspective on the design of a curriculum for a new computer science program at Bryn Mawr College. Founded in 1885, Bryn Mawr College is well known for the excellence of its academic programs. Bryn Mawr combines a distinguished undergraduate college for about 1200 women with two nationally ranked, coeducational graduate schools (Arts and Sciences, and Social Work and Social research) with about 600 students. As a women's college, Bryn Mawr has a longstanding and intrinsic commitment to prepare individuals to succeed in professional fields in which they have been historically underrepresented. In 1999, the 1 2 Chapter college decided to add computer science to the college's academic programs. We are currently engaged in the expansion and design of the program.
- 
												  Managing the Computational Chemistry Big Data Problem: the Iochem-BD Platform MThis document is the Accepted Manuscript version of a Published Work that appeared in final form in Journal of Chemical Article Information and Modeling, copyright © American Chemical Society after peer review and technical editing by the pubs.acs.org/jcim publisher. To access the final edited and published work see http://dx.doi.org/10.1021/ci500593j Managing the Computational Chemistry Big Data Problem: The ioChem-BD Platform M. Álvarez-Moreno,*,†,‡ C. de Graaf,‡,∥ N. Lopez,́ † F. Maseras,†,§ J. M. Poblet,‡ and C. Bo*,†,‡ † Institute of Chemical Research of Catalonia, ICIQ, Av. Països Catalans 16, 43007 Tarragona, Catalonia, Spain ‡ Department of Physical and Inorganic Chemistry, Universitat Rovira i Virgili, C/Marcel·lí Domingo s/n, 43007 Tarragona, Catalonia, Spain § Department of Chemistry, Universitat Autonomà de Barcelona, 08193 Bellaterra, Catalonia, Spain ∥ Catalan Institution for Research and Advanced Studies, ICREA, Passeig Lluis Companys 23, 08010 Barcelona, Catalonia, Spain ABSTRACT: We present the ioChem-BD platform (www. iochem-bd.org) as a multiheaded tool aimed to manage large volumes of quantum chemistry results from a diverse group of already common simulation packages. The platform has an extensible structure. The key modules managing the main tasks are to (i) upload of output files from common computational chemistry packages, (ii) extract meaningful data from the results, and (iii) generate output summaries in user-friendly formats. A heavy use of the Chemical Mark-up Language (CML) is made in the intermediate files used by ioChem-BD. From them and using XSL techniques, we manipulate and transform such chemical data sets to fulfill researchers’ needs in the form of HTML5 reports, supporting information, and other research media.
- 
												  Computational Chemistry As “Voodoo Quantum Mechanics”: Models, Parameterization, and SoftwareView metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Philsci-Archive Computational Chemistry as “Voodoo Quantum Mechanics”: models, parameterization, and software Frédéric Wieber & Alexandre Hocquet LHSP – Archives Henri Poincaré UMR 7117 CNRS & Université de Lorraine 91 avenue de la Libération B.P. 454 54001 NANCY CEDEX FRANCE Abstract Computational chemistry grew in a new era of "desktop modeling", which coincided with a growing demand for modeling software, especially from the pharmaceutical industry. Parameterization of models in computational chemistry is an arduous enterprise, and we argue that this activity leads, in this specific context, to tensions among scientists regarding the lack of epistemic transparency of parameterized methods and the software implementing them. To explicit these tensions, we rely on a corpus which is suited for revealing them, namely the Computational Chemistry mailing List (CCL), a professional scientific discussion forum. We relate one flame war from this corpus in order to assess in detail the relationships between modeling methods, parameterization, software and the various forms of their enclosure or disclosure. Our claim is that parameterization issues are a source of epistemic opacity and that this opacity is entangled in methods and software alike. Models and software must be addressed together to understand the epistemological tensions at stake. Keywords Computational chemistry; models; software; parameterization; epistemic opacity 1 Introduction The quotation in the title (Evleth 1993a), taken from a scientific mailing list, the "Computational Chemistry List", is supposed to illustrate the issues of epistemic opacity and “theoretical tinkering” in computational modeling methods in chemistry, their relations to computers and the development and use of software in a scientific milieu.
- 
												  Combinatorial Chemistry Successful Strategies & Proof of Concept Case StudiesExploiting The Promise of Combinatorial Chemistry Successful Strategies & Proof of Concept Case Studies FRIDAY 25TH JUNE 1999 - THE MERCHANT CENTRE, LONDON Industry has embraced Combinatorial Chemistry totally but has it contributed to the number of NCE’s entering the clinic as promised? “Combinatorial chemistry - a technology for creating molecules This unique strategic forum will look at the potential of en masse and testing them rapidly for desirable properties - Combinatorial Chemistry, in terms of lead identification and continues to branch out rapidly. Compared with conventional lead optimisation, and will allow participants to debate actual one-molecule-at-a-time discovery strategies, many researchers successes and failures of Combinatorial Chemistry see combinatorial chemistry as a better way to discover new and its true worth in the drug discovery process. drugs...” Chemical & Engineering News, April 1998 08.30 Coffee & Registration 13.30 Strucure-Based Computational Screening of Virtual Combinationa Libraries & its Application to the Discovery of Novel Serine 09.00 New Dimensions of Combinatorial Chemistry & It’s Value to the Protease Inhibitors. Drug Discovery Process Dr Stephen C Young, Head of Synthetic Chemistry Proteus Molecular Design Ltd, UK Dr Lutz Weber, Chief Scientific Officer Combinatorial Chemistry and Structure-Based Drug Design have Morphochem AG, Germany shown themselves to be effective methods for finding lead ■ High diversity combinatorial chemistry: varying both backbones compounds but each has limitations. A new approach, which and substituents within one library ■ combines benefits of both methodologies without some of the Evolutionary chemistry – optimisation of compound properties deficiencies, is now coming to the fore. This comprises building a by biological feedback driven synthesis virtual combinatorial library which is screened for complementarit ■ Functional diversity, structure-reactivity and structure-property against a receptor structure.
- 
												  Computational Chemistry Software and Its Advancement As Illustrated Through Three Grand Challenge Cases for Molecular Science Anna Krylov, Theresa LPerspective: Computational chemistry software and its advancement as illustrated through three grand challenge cases for molecular science Anna Krylov, Theresa L. Windus, Taylor Barnes, Eliseo Marin-Rimoldi, Jessica A. Nash, Benjamin Pritchard, Daniel G. A. Smith, Doaa Altarawy, Paul Saxe, Cecilia Clementi, T. Daniel Crawford, Robert J. Harrison, Shantenu Jha, Vijay S. Pande, and Teresa Head-Gordon Citation: J. Chem. Phys. 149, 180901 (2018); doi: 10.1063/1.5052551 View online: https://doi.org/10.1063/1.5052551 View Table of Contents: http://aip.scitation.org/toc/jcp/149/18 Published by the American Institute of Physics THE JOURNAL OF CHEMICAL PHYSICS 149, 180901 (2018) Perspective: Computational chemistry software and its advancement as illustrated through three grand challenge cases for molecular science Anna Krylov,1 Theresa L. Windus,2 Taylor Barnes,3 Eliseo Marin-Rimoldi,3 Jessica A. Nash,3 Benjamin Pritchard,3 Daniel G. A. Smith,3 Doaa Altarawy,3 Paul Saxe,3 Cecilia Clementi,4,5 T. Daniel Crawford,6 Robert J. Harrison,7 Shantenu Jha,8 Vijay S. Pande,9 and Teresa Head-Gordon10,a) 1Department of Chemistry, University of Southern California, Los Angeles, California 90089, USA 2Department of Chemistry, Iowa State University, Ames, Iowa 50011, USA 3Molecular Sciences Software Institute, Blacksburg, Virginia 24061, USA 4Department of Chemistry and Center for Theoretical Biological Physics, Rice University, 6100 Main Street, Houston, Texas 77005, USA 5Department of Mathematics and Computer Science, Freie Universitt Berlin, Arnimallee 6,