Molecular Representations in AI-Driven Drug Discovery

Total Page:16

File Type:pdf, Size:1020Kb

Molecular Representations in AI-Driven Drug Discovery David et al. J Cheminform (2020) 12:56 https://doi.org/10.1186/s13321-020-00460-5 Journal of Cheminformatics REVIEW Open Access Molecular representations in AI-driven drug discovery: a review and practical guide Laurianne David1* , Amol Thakkar1,2 , Rocío Mercado1 and Ola Engkvist1 Abstract The technological advances of the past century, marked by the computer revolution and the advent of high-through- put screening technologies in drug discovery, opened the path to the computational analysis and visualization of bioactive molecules. For this purpose, it became necessary to represent molecules in a syntax that would be readable by computers and understandable by scientists of various felds. A large number of chemical representations have been developed over the years, their numerosity being due to the fast development of computers and the complex- ity of producing a representation that encompasses all structural and chemical characteristics. We present here some of the most popular electronic molecular and macromolecular representations used in drug discovery, many of which are based on graph representations. Furthermore, we describe applications of these representations in AI-driven drug discovery. Our aim is to provide a brief guide on structural representations that are essential to the practice of AI in drug discovery. This review serves as a guide for researchers who have little experience with the handling of chemical representations and plan to work on applications at the interface of these felds. Keywords: Molecular representation, Cheminformatics, Drug discovery, Small molecules, Macromolecules, Linear notation, Molecular graphs, Reaction prediction, Artifcial intelligence Introduction the empirical formula of alanine, C3H7NO2, exempli- Te representation of molecules has been of interest to fes the complexity of building a notation. Indeed, while scientists since the nineteenth century [1, 2]. Tradition- information about the atoms is available, it is not possi- ally, molecules are represented as structure diagrams ble to know how the atoms are linked from the molecular with bonds and atoms, and this is likely the representa- formula which, moreover, does not encode information tion most people think of when they think of molecules. related to the molecular geometry. As such, the molecu- However, other representations are required for the com- lar formula above can be associated to alanine as well as putational processing of chemical structures in chemin- to sarcosine and lactamide. Variants of empirical formu- formatics. Here, we defne a chemical representation as las emphasizing any functional groups also exist but are any encoding of a chemical compound; linear representa- loosely defned. In these group-centric representations, tions are referred to as notations. elements are grouped in the formula as they would be in Over the years, scientists have developed many nota- the molecule so as to highlight any functional groups pre- tions depicting various properties of a compound. A sent e.g. CH3CH(NH2)COOH to represent alanine. classic notation is the empirical formula, a non-standard Te advent of computers led to the development of a form of Hill notation. Seemingly simple at frst glance, wide variety of machine-readable chemical representa- tions. Computers allowed for the rapid digital storage *Correspondence: [email protected] and querying of compounds and their structures, swift 1 Hit Discovery, Discovery Sciences, BioPharmaceuticals R&D, Astrazeneca modifcations of digital information, and greater physi- Gothenburg, Sweden cal storage efciency. Algorithms were implemented Full list of author information is available at the end of the article to visualize compounds as 2D depictions [3, 4] and the © The Author(s) 2020. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creat iveco mmons .org/licen ses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creat iveco mmons .org/publi cdoma in/ zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. David et al. J Cheminform (2020) 12:56 Page 2 of 22 computational visualization of compounds in 3D space could not consider other mappings. In typical graph rep- was popularized with the development of specialized resentations, the nodes are represented using circles or programs [5–7]. spheres, and the edges using lines. In the case of molecu- Many precursors to computer-readable notations were lar graphs, the nodes are instead often represented using introduced between 1947 and 1964 and were dedicated letters indicating the atom type (as on the periodic table), to small organic molecules [2, 8]. At the time, memory or simply using points where the bonds meet (for carbon efciency was an important factor impacting the devel- atoms). opment of chemical notations. Popular representations A molecular graph representation is formally a 2D used nowadays, however, were largely developed in and object that can be used to represent 3D information (e.g. since the 70′s to represent small molecules [9–11], mac- atomic coordinates, bond angles, chirality). However, any romolecules [12–17] and chemical reactions [18–21]. spatial relationships between the nodes must be encoded In this review, we focus on chemical representations as node and/or edge attributes, as nodes in a graph (the in cheminformatics and drug discovery. We frst intro- mathematical object) do not formally have spatial posi- duce the concept of a molecular graph, which is the tions, only pairwise relationships. Tere are of course most common machine-readable representation, and we limitations to this representation, which are discussed in give a brief overview of the main notations which paved a later section. Te 2D and 3D representations of graphs the way for the current cheminformatics notations. We can easily be visualized by many software packages, then focus on the representations that are used nowa- including ChemDraw [30], Mercury [31], Avogadro [32], days in the feld of applying artifcial intelligence (AI) to VESTA [33], PyMOL [34], and VMD [35] (the latter 5 are cheminformatics and drug discovery. Finally, we provide suitable for small- and macro-molecules, and either free examples of AI-related applications using the chemical or open-source). representations discussed in this review. Tis review is intended to provide an overview of basic cheminformat- Mathematical defnition of a graph 1 ics knowledge to practising cheminformaticians, students A graph is formally defned as a tuple G = (V, E) of a set in chemistry, cheminformatics, bioinformatics, and com- of nodes V and a set of edges E, where each edge e ∊ E puter science, and anyone interested to learn more about connects pairs of nodes in V. In a molecular graph, V is molecular representations in drug discovery. While the intuitively the set of all atoms in a molecule, and E is the coverage of representations introduced herein is not set of all bonds linking the atoms, although this does not intended to be exhaustive, we emphasise that the repre- have to be the case. Molecular graphs are generally undi- sentation used to solve a problem is always dependent rected, meaning that the pairs in E are unordered. [36]. on the task. Tus, the coverage is limited to areas where To map a graph from an abstract mathematical con- there is active research in applying machine learning cept to a concrete representation that can be handled (ML) and AI to cheminformatics and drug discovery. For on a computer, one needs to map the sets of nodes and readers interested in further reading on these topics we edges to linear data structures; a common way to do this recommend references [22–29], in addition to the refer- is using data structures such as matrices or arrays. Lin- ences cited in each section. ear data structures are necessary in order to specify the connectivity of the nodes. To do so, an artifcial node- Graph representations for small molecules ordering must frst be calculated for encoding a molecule Introduction to the molecular graph representation using arrays, even though V and E are formally sets and In order to understand the chemical representations pre- the order of elements in sets is irrelevant. Te informa- sented in this review, it is important that the reader frst tion to be mapped can include (1) how the atoms in the has a solid understanding of molecular graphs, as most molecule are connected, (2) the identity of the atoms, and molecular representations discussed in this work are (3) the identity of the bonds. built on the molecular graph representation. However, How the atoms are connected is commonly repre- there is a distinction between the notations and fle for- sented in the form of an adjacency matrix A; given that mats built using molecular graphs (e.g. SMILES strings, Molfles), and the abstract mathematical structure/data structure of a graph itself. Te latter is introduced here. 1 Note that throughout this section, using the typical convention, bold itali- Te idea behind the molecular graph representa- cized symbols are used to represent matrices and vectors, where an uppercase tion lies in mapping the atoms and bonds that make up symbol specifcally denotes a matrix (e.g.
Recommended publications
  • 1 Drug Development Working Group Report: Analysis of Existing
    Drug Development Working Group Report: Analysis of Existing Strengths, Critical Gaps, and Opportunities for Collaboration May 2014 1. Analysis. Biopharma - Focused on innovation with clinical application. Universities will play an increasingly large role in driving innovation in the biomedical industry for the foreseeable future. In New Jersey, there are several major biopharmaceutical and >350 smaller biotechnology and related companies with one or more employees. The recent integration of Rutgers creates a unique opportunity to align the University’s considerable drug discovery and development strengths. A directed effort of this sort would greatly enhance the capabilities of the University and enable University research to synergize with the biopharma industry, broadly defined. Although there have been shifts, mergers, and acquisitions in the New Jersey biopharma industry, the biopharma industry has continued to thrive. In 2003, the NY/NJ biopharma region had 17 companies with valuation >$100 M; in 2013 this region had still had 16 such companies. Moreover, the region is still vibrant: it continues to be home to large biomedical corporations, startups, entrepreneurs, and the world center for finance and capital. New Jersey also has quality industrial laboratory space, an ecosystem populated by a talented and experienced work force that has made New Jersey the “medicine chest of the world,” and an unparalleled opportunity to combine these factors in biopharma ventures that build on partnerships between Rutgers University and industry. Indeed, the primary difference between now and 15 years ago is that the present industry now is willing to reach out in a highly dynamic fashion. Rutgers is uniquely positioned to step forward as the premier university leader in biomedical innovation and development in the region.
    [Show full text]
  • The Chem Access Project from Bitmap Graphics to Fully Accessible Chemical Diagrams
    The Chem Access project From Bitmap Graphics to Fully Accessible Chemical Diagrams Volker Sorge Scientific Document Analysis Group Progressive Accessibility Solutions School of Computer Science Birmingham, UK University of Birmingham progressiveaccess.com 9th European e-Accessibility Forum, 2015, Paris, 8 June 2015 Motivation I Accessibility to STEM material is important issue for inclusive education I Diagrams are an important teaching means in STEM I Chemical diagrams (depictions of molecules) are ubiquitous in teaching from GCSE and A-levels teaching to undergrad curriculum chemistry, biosciences, life sciences. I Previous work on assistive technology for chemical diagrams I Require diagrams to be drawn in particular way or authoring environment I Need for specialist software to access and interact with diagrams I Additional hurdles for both authors and readers Goals I Make regular teaching material accessible I From inaccessible image to support for independent learning I Source independence I Do not rely on the benevolent, educated author I Platform independence I Use standard web technology (HTML5) I Accessible with all browsers, screen readers I Provide a seamless user experience without/very little interface I Support diverse material, for novices and experts alike Examples I Different representations of Aspirin molecule. Displayed formula. Skeletal formula. Structural formula. Examples I Or somewhat more complex. Procedure Input: A bitmap image of a molecule diagram 1. Image analysis and recognition 2. Generation of annotated SVG
    [Show full text]
  • Drug Design Project Tips
    Drug Design Project Information and Tips CHEM 162B (2012) Part of your grade in Drug Design courses will come from a development of a “Drug Design Project”. Students who have taken Chem162A will be able to continue their project development; students who have not taken Chem 162A can pick one of the projects developed in past and take it further. Students will present their projects during the annual Drug Design Poster Event. Students taking Chem 162 during the Winter 2011 will be able accomplish three important milestones toward completing the Project. Part of your grade will be based on your success in completing these milestones. The milestones that you are expected to complete during the Spring 2012 are: 1) Characterize a validated protein/nucleic acid target for the disease that you are working on. If the target is an enzyme, describe in detail the reaction it catalyzes. If the target is not an enzyme, describe in detail its physiological role and mode of operation 2) Obtain or create a virtual library of possible drug candidates that are expected to bind to this target and carry out virtual screening to identify best binders 3) Describe adsorption, distribution, metabolism, excretion, and possible toxicity aspects of your drug candidate(s). If appropriate, propose modifications to improve your drug. You are expected to turn in your work on each milestone by the due dates specified; you’ll automatically receive 3 points per each milestone if you submit a significantly developed work by the due date. You will earn extra 5 points if you submit all milestones by the due date.
    [Show full text]
  • Computational Approaches for Drug Design and Discovery: an Overview
    Review Article Computational Approaches for Drug Design and Discovery: An Overview Baldi A College of Pharmacy, Dr. Shri R.M.S. Institute of Science & Technology, Bhanpura, Dist. Mandsaur (M.P.), India ARTICLE INFO ABSTRACT Article history: The process of drug discovery is very complex and requires an interdisciplinary effort to design Received 12 July 2009 effective and commercially feasible drugs. The objective of drug design is to find a chemical Accepted 21 July 2009 compound that can fit to a specific cavity on a protein target both geometrically and chemically. Available online 04 February 2010 After passing the animal tests and human clinical trials, this compound becomes a drug available Keywords: to patients. The conventional drug design methods include random screening of chemicals Computer-aided drug design found in nature or synthesized in laboratories. The problems with this method are long design Combinatorial chemistry cycle and high cost. Modern approach including structure-based drug design with the help Drug of informatic technologies and computational methods has speeded up the drug discovery Structure-based drug deign process in an efficient manner. Remarkable progress has been made during the past five years in almost all the areas concerned with drug design and discovery. An improved generation of softwares with easy operation and superior computational tools to generate chemically stable and worthy compounds with refinement capability has been developed. These tools can tap into cheminformation to shorten the cycle of drug discovery, and thus make drug discovery more cost-effective. A complete overview of drug discovery process with comparison of conventional approaches of drug discovery is discussed here.
    [Show full text]
  • Chemistry: Empirical & Molecular Formulas
    Chemistry: Empirical & Molecular Formulas Name______________________ 1) What is the empirical formula for C15H30O5H5? 2) A compound has an empirical formula of C2H4O3N. Its molar mass is 359 g. What is the molecular formula? 3) A compound contains 12.7 g iron & 7.29 g oxygen. Its molar mass is 87.85 g. What is its empirical formula? What is its molecular formula? 4) A compound contains 65.6% carbon, 9.45% hydrogen & 25.0% oxygen. Its molar mass is 1922.85 g. What is its empirical formula? What is its molecular formula? 5) A compound contains 92.2% carbon & 7.75% hydrogen. It has a molar mass of 118 g. What is its empirical formula? What is its molecular formula? 6) A compound has an empirical formula of C2H5O3N. Its molar mass is 454 g. What is its molecular formula? 7) What is the empirical formula for C20H32O4N4? 8) A compound contains 69.6% carbon, 9.76% hydrogen & 20.6% oxygen. Its molar mass is 465.72 g. What is its empirical formula? What is its molecular formula? 9) A compound contains 42.044 g carbon & 7.956 g hydrogen. Its molar mass is 114.26 g. What is its empirical formula? What is its molecular formula? 10) A compound contains 30.79 g copper & 19.31 g oxygen. Its molar mass is 207 g. What is its empirical formula? What is its molecular formula? 11) A compound contains 40.9% carbon, 4.58% hydrogen & 54.5% oxygen. Its molar mass is 176.1 g. What is its empirical formula? What is its molecular formula? 12) A compound has an empirical formula of CH2O & a gram molecular mass of 180.16 g.
    [Show full text]
  • Recent Advances in Automated Protein Design and Its Future
    EXPERT OPINION ON DRUG DISCOVERY 2018, VOL. 13, NO. 7, 587–604 https://doi.org/10.1080/17460441.2018.1465922 REVIEW Recent advances in automated protein design and its future challenges Dani Setiawana, Jeffrey Brenderb and Yang Zhanga,c aDepartment of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA; bRadiation Biology Branch, Center for Cancer Research, National Cancer Institute – NIH, Bethesda, MD, USA; cDepartment of Biological Chemistry, University of Michigan, Ann Arbor, MI, USA ABSTRACT ARTICLE HISTORY Introduction: Protein function is determined by protein structure which is in turn determined by the Received 25 October 2017 corresponding protein sequence. If the rules that cause a protein to adopt a particular structure are Accepted 13 April 2018 understood, it should be possible to refine or even redefine the function of a protein by working KEYWORDS backwards from the desired structure to the sequence. Automated protein design attempts to calculate Protein design; automated the effects of mutations computationally with the goal of more radical or complex transformations than protein design; ab initio are accessible by experimental techniques. design; scoring function; Areas covered: The authors give a brief overview of the recent methodological advances in computer- protein folding; aided protein design, showing how methodological choices affect final design and how automated conformational search protein design can be used to address problems considered beyond traditional protein engineering, including the creation of novel protein scaffolds for drug development. Also, the authors address specifically the future challenges in the development of automated protein design. Expert opinion: Automated protein design holds potential as a protein engineering technique, parti- cularly in cases where screening by combinatorial mutagenesis is problematic.
    [Show full text]
  • Protein Structure Analysis: Introducing Students to Rational Drug Design
    INQUIRY & Protein Structure Analysis: INVESTIGATION Introducing Students to Rational Drug Design • AGNIESZKA SZARECKA, CHRISTOPHER DOBSON ABSTRACT the steps involved in computational drug-design pipelines, a process We describe a series of engaging exercises in which students emulate the process that uses structural data for a specific protein target to rationally that researchers use to efficiently develop new pharmaceutical drugs, that of design the best effector molecule – a drug candidate. rational drug design. The activities are taken from a three- to four-hour Unlike most wet-lab experiments in molecular biology and bio- workshop regularly conducted with first-year college students and presented chemistry, these computational experiments do not require expen- here to take place over three to four class periods. Although targeted at college sive equipment or laboratory setups. The tools we describe here students, these activities may be appropriate at the high school level as well, are predominantly free online resources that are accessible via stan- particularly in an AP Biology course. The exercises introduce students to the dard Internet connection. Locally run software can also be used, topics of bioinformatics and computer modeling, in the context of rational drug design, using free online resources such as databases and computer programs. and we will recommend free and easy-to-install programs, capable Through the process of learning about computational drug design and drug of running on the majority of common platforms. The exercises optimization, students also learn content such as elements of protein structure can easily be divided between pre-lab, in-class, and homework and protein–ligand interactions.
    [Show full text]
  • 11.2 Alkanes
    11.2 Alkanes A large number of carbon compounds are possible because the covalent bond between carbon atoms, such as those in hexane, C6H14, are very strong. Learning Goal Write the IUPAC names and draw the condensed structural formulas and skeletal formulas for alkanes and cycloalkanes. Chemistry: An Introduction to General, Organic, and Biological Chemistry, Twelfth Edition © 2015 Pearson Education, Inc. Naming Alkanes Alkanes • are hydrocarbons that contain only C—C and C—H bonds • are formed by a continuous chain of carbon atoms • are named using the IUPAC (International Union of Pure and Applied Chemistry) system • have names that end in ane • use Greek prefixes to name carbon chains with five or more carbon atoms Chemistry: An Introduction to General, Organic, and Biological Chemistry, Twelfth Edition © 2015 Pearson Education, Inc. IUPAC Naming of First Ten Alkanes Chemistry: An Introduction to General, Organic, and Biological Chemistry, Twelfth Edition © 2015 Pearson Education, Inc. 1 Condensed Structural Formulas In a condensed structural formula, • each carbon atom and its attached hydrogen atoms are written as a group • a subscript indicates the number of hydrogen atoms bonded to each carbon atom The condensed structural formula of butane has four carbon atoms. CH3—CH2—CH2—CH3 butane Core Chemistry Skill Naming and Drawing Alkanes Chemistry: An Introduction to General, Organic, and Biological Chemistry, Twelfth Edition © 2015 Pearson Education, Inc. Condensed Structural Formulas Alkanes are written with structural formulas that are • expanded to show each bond • condensed to show each carbon atom and its attached hydrogen atoms Expanded Condensed Expanded Condensed Chemistry: An Introduction to General, Organic, and Biological Chemistry, Twelfth Edition © 2015 Pearson Education, Inc.
    [Show full text]
  • WS 4.2: Empirical Formula Problems
    Name: Answer Key Period: ______ Date: __________ Chem B odds for HW and evens for review WS 4.2: Empirical Formula Problems Directions: Determine the empirical formula for the following substances. If a molecular formula cannot be reduced, write “cannot be reduced in the box for empirical formula” Molecular Formula Empirical Formula Molecular Formula Empirical Formula 1) C6H6 CH 2) C10H18 C5H9 3) C8H18 C4H9 4) C7H15O2 Not reducible 5) WO2 Not reducible 6) N2H4 NH2 7) C2H6O2 CH3O 8) P3F6 PF2 9) X39Y13 X3Y 10) IF5 Not reducible 11) C4H10 C2H5 12) N2O5 Not reducible 13) C6H12O6 CH2O 14) N2O4 NO2 15) P2F4 PF2 16) SF6 Not reducible 17) SO3 Not reducible 18) N2Cl4 NCl2 19) N2Br4 NBr2 20) P2O5 Not reducible Directions: Solve the following problems. You may need to use a separate sheet of paper. 1) What is the empirical formula for a compound which 3) Barry Um has a sample of a compound which weighs 200 contains 0.0134 g of iron, 0.00769 g of sulfur and 0.0115 g of grams and contains only carbon, hydrogen, oxygen and oxygen? nitrogen. By analysis, he finds that it contains 97.56 grams of carbon, 4.878 g of hydrogen, 52.03 g of oxygen and 45.53 g of nitrogen. Find its empirical formula. 3 FeSO C5H3N2O2 2) A compound is found to contain 53.7 % iron and 46.27 % sulfur. Find its empirical formula. 4) What is the empirical formula for a compound which contains 80.3% zinc and the rest is oxygen? Fe2S3 ZnO 5) Rubbing alcohol was found to contain 60.0 % carbon, 13.4 9) Give the name and empirical formula of a compound % hydrogen, and the remaining mass was due to oxygen.
    [Show full text]
  • Imaging Live Drosophila Brain with Two-Photon Fluorescence Microscopy Syeed Ehsan Ahmed University of Texas at El Paso, [email protected]
    University of Texas at El Paso DigitalCommons@UTEP Open Access Theses & Dissertations 2017-01-01 Imaging Live Drosophila Brain With Two-Photon Fluorescence Microscopy Syeed Ehsan Ahmed University of Texas at El Paso, [email protected] Follow this and additional works at: https://digitalcommons.utep.edu/open_etd Part of the Biophysics Commons, and the Optics Commons Recommended Citation Ahmed, Syeed Ehsan, "Imaging Live Drosophila Brain With Two-Photon Fluorescence Microscopy" (2017). Open Access Theses & Dissertations. 397. https://digitalcommons.utep.edu/open_etd/397 This is brought to you for free and open access by DigitalCommons@UTEP. It has been accepted for inclusion in Open Access Theses & Dissertations by an authorized administrator of DigitalCommons@UTEP. For more information, please contact [email protected]. IMAGING LIVE DROSOPHILA BRAIN WITH TWO-PHOTON FLUORESCENCE MICROSCOPY SYEED EHSAN AHMED Master’s Program in Physics APPROVED: Chunqiang Li, Ph.D., Chair Cristian E. Botez, Ph.D. Chuan Xiao, Ph.D. Charles Ambler, Ph.D. Dean of the Graduate School Copyright © by Syeed Ehsan Ahmed 2017 Dedication Dedicated to my beloved family and friends. Thank you for believing in me and helping me throughout my journey. IMAGING LIVE DROSOPHILA BRAIN WITH TWO-PHOTON FLUORESCENCE MICROSCOPY by SYEED EHSAN AHMED, B.Sc. in Physics THESIS Presented to the Faculty of the Graduate School of The University of Texas at El Paso in Partial Fulfillment of the Requirements for the Degree of MASTER OF SCIENCE Department of Physics THE UNIVERSITY OF TEXAS AT EL PASO August 2017 ACKNOWLEDGEMENTS All praise and worthiness goes to Almighty Merciful Allah who has given me the opportunity, strength and ability to complete this work.
    [Show full text]
  • Empirical and Molecular Formula Notes • the Empirical Formula of A
    Empirical and Molecular Formula Notes The empirical formula of a compound shows the ratio of elements present in a compound The molecular formula of a compound shows how many atoms of each element are present in a molecule of a compound. o Example: the compound butene has a molecular formula of C4H8. The empirical formula of butene is CH2 because there is a 1:2 ratio of carbon atoms to hydrogen atoms. Finding the molecular formula when given the empirical formula The molecular formula is just a multiple of the empirical formula. Take a look at butene: o CH2 x n = C4H8 so n=4. What is the molar mass of butene? . If the empirical formula of a compound is CH2, then you can find the molar mass of the empirical formula: 12.0+ 2(1.0) = 14.0 g/mol . 14.0 g/mol x 4 = 56 g/mol What if I have a compound with an empirical formula of CH2 that has a molar mass of 84 g/mol. What is the molecular formula of this compound? o 14.0 g/mol x n = 84 g/mol so n = 6 o CH2 x 6 = C6H12 Try a few: 1. What is the empirical formula of C8H10? C6H12O6? 2. What is the molecular formula of a molecule that has an empirical formula of NO2 and a molar mass of 92 g/mol? 3. Consider a molecule that has an empirical formula of SO3 and a molecular formula of S3O9. What is the molar mass of the molecule? Calculating empirical formula from percent composition Answer this question: What is the empirical formula of a compound that contains 25.53% Mg and 74.47% Cl? Steps: 1.
    [Show full text]
  • Arxiv:2001.11604V2 [Cs.PL] 6 Sep 2020 of Trigger Conditions, Having an in Situ Infrastructure That Simplifies Results They Desire
    DIVA: A Declarative and Reactive Language for in situ Visualization Qi Wu* Tyson Neuroth† Oleg Igouchkine‡ University of California, Davis, University of California, Davis, University of California, Davis, United States United States United States Konduri Aditya§ Jacqueline H. Chen¶ Kwan-Liu Ma|| Indian Institute of Science, India Sandia National Laboratories, University of California, Davis, United States United States ABSTRACT for many applications. VTK [61] also partially adopted this ap- The use of adaptive workflow management for in situ visualization proach, however, VTK is designed for programming unidirectional and analysis has been a growing trend in large-scale scientific simu- visualization pipelines, and provides limited support for highly dy- lations. However, coordinating adaptive workflows with traditional namic dataflows. Moreover, the synchronous dataflow model is procedural programming languages can be difficult because system somewhat difficult to use and does not always lead to modular pro- flow is determined by unpredictable scientific phenomena, which of- grams for large scale applications when control flows become com- ten appear in an unknown order and can evade event handling. This plicated [19]. Functional reactive programming (FRP) [19,24,48,53] makes the implementation of adaptive workflows tedious and error- further improved this model by directly treating time-varying values prone. Recently, reactive and declarative programming paradigms as first-class primitives. This allowed programmers to write reac- have been recognized as well-suited solutions to similar problems in tive programs using dataflows declaratively (as opposed to callback other domains. However, there is a dearth of research on adapting functions), hiding the mechanism that controls those flows under these approaches to in situ visualization and analysis.
    [Show full text]