Information Extraction from Chemical Patents

Total Page:16

File Type:pdf, Size:1020Kb

Information Extraction from Chemical Patents Information Extraction from Chemical Patents David Matthew Jessop Fitzwilliam College This dissertation is submitted for the degree of Doctor of Philosophy Preface This dissertation is the result of my own work and includes nothing which is the outcome of work done in collaboration except where specifically indicated in the text This dissertation does not exceed the word limit (60000) set by the Degree Committee i Abstract Information Extraction from Chemical Patents David Matthew Jessop The automated extraction of semantic chemical data from the existing literature is demonstrated. For reasons of copyright, the work is focused on the patent literature, though the methods are expected to apply equally to other areas of the chemical literature. Hearst Patterns are applied to the patent literature in order to discover hyponymic relations describing chemical species. The acquired relations are manually validated to determine the precision of the determined hypernyms (85.0%) and of the asserted hyponymic relations (94.3%). It is demonstrated that the system acquires relations that are not present in the ChEBI ontology, suggesting that it could function as a valuable aid to the ChEBI curators. The relations discovered by this process are formalised using the Web Ontology Language (OWL) to enable re-use. PatentEye – an automated system for the extraction of reactions from chemical patents and their conversion to Chemical Markup Language (CML) – is presented. Chemical patents published by the European Patent Office over a ten-week period are used to demonstrate the capability of PatentEye – 4444 reactions are extracted with a precision of 78% and recall of 64% with regards to determining the identity and amount of reactants employed and an accuracy of 92% with regards to product identification. NMR spectra are extracted from the text using OSCAR3, which is developed to greatly increase recall. The resulting system is presented as a significant advancement towards the large- scale and automated extraction of high-quality reaction information. Extended Polymer Markup Language (EPML), a CML dialect for the description of Markush structures as they are presented in the literature, is developed. Software to exemplify and to enable substructure searching of EPML documents is presented. Further work is recommended to refine the language and code to publication-quality before they are presented to the community. ii Acknowledgments I would like to thank Prof. Robert Glen and Prof. Peter Murray-Rust for supervision. I would also like to thank all those who have contributed to the creation of the software that has made this project possible – most notably Dr Peter Corbett for his work on OSCAR3, Dr Lezan Hawizy for her work on ChemicalTagger and Daniel Lowe for his work on OPSIN. Further thanks go to all those too numerous to name at the Unilever Centre, past and present, who have contributed to discussions and supported me in my work. Special thanks go to Dr Sam Adams, who volunteered to proof read this thesis, and to Jo for her love and support. I am grateful to Unilever for funding. iii Contents Preface ..................................................................................................................................................... i Abstract ................................................................................................................................................... ii Acknowledgments .................................................................................................................................. iii Contents ................................................................................................................................................. iv List of Figures ........................................................................................................................................ vii Glossary .................................................................................................................................................. ix 1. Introduction .................................................................................................................................... 1 1.1 Open and Closed Data ............................................................................................................ 1 1.2 The Semantic Web .................................................................................................................. 2 1.3 Semanticizing Chemistry ......................................................................................................... 5 1.4 Information Extraction from Chemical Documents ................................................................ 5 1.5 Development Environment ..................................................................................................... 7 2. Sources of Chemical Documents & Technologies for their Semantic Enrichment ......................... 8 2.1 Availability of Documents ....................................................................................................... 8 2.1.1 Journal Articles ................................................................................................................ 9 2.1.2 Theses ............................................................................................................................. 9 2.1.3 Patents .......................................................................................................................... 10 2.2 Key Technologies .................................................................................................................. 14 2.2.1 XML & XPath ................................................................................................................. 15 2.2.2 Regular Expressions ...................................................................................................... 16 2.2.3 Machine-Understandable Chemical Formats ............................................................... 17 2.2.4 Chemical Markup Language .......................................................................................... 21 2.2.5 CMLXOM & JUMBO ....................................................................................................... 25 2.2.6 OSCAR3 ......................................................................................................................... 26 2.2.7 ChemicalTagger ............................................................................................................. 39 2.2.8 OSRA .............................................................................................................................. 45 2.3 Conclusions ........................................................................................................................... 47 3. Representation and Manipulation of Markush Structures ........................................................... 48 3.1 Markush Structures ............................................................................................................... 49 3.2 Polymer Markup Language ................................................................................................... 50 3.2.1 Representation of Polyethylene Oxide ......................................................................... 50 iv 3.2.2 Representation of Polystyrene ..................................................................................... 53 3.2.3 Representing Variability in PML .................................................................................... 55 3.2.4 The Cambridge Polymer Builder ................................................................................... 58 3.3 Extension of PML for Markush Structures ............................................................................ 61 3.3.1 Frequency Variation ...................................................................................................... 63 3.3.2 Homology Variation ...................................................................................................... 63 3.3.3 Position Variation .......................................................................................................... 66 3.3.4 Position and Count Variation ........................................................................................ 67 3.3.5 Inline Connection Tables ............................................................................................... 68 3.4 Building Representative Examples of a Markush Structure.................................................. 70 3.5 Substructure Searching of Markush Structures .................................................................... 75 3.5.1 Implementing Extended Connection Tables ................................................................. 76 3.5.2 Building Extended Connection Tables ........................................................................... 78 3.5.3 The Relaxation Algorithm.............................................................................................. 82 3.5.4 Examples ....................................................................................................................... 87 3.6 Conclusions ........................................................................................................................... 91 4. Automatic Acquisition of Hyponymic Relations from the Chemical Literature ............................ 93 4.1 Hyponymic Relations ...........................................................................................................
Recommended publications
  • Downloaded in MS Word Format (.Docx)
    Wang and Tseng J Cheminform (2019) 11:78 https://doi.org/10.1186/s13321-019-0401-4 Journal of Cheminformatics SOFTWARE Open Access IntelliPatent: a web-based intelligent system for fast chemical patent claim drafting Pei‑Hua Wang1 and Yufeng Jane Tseng1,2* Abstract The frst step of automating composition patent drafting is to draft the claims around a Markush structure with sub‑ stituents. Currently, this process depends heavily on experienced attorneys or patent agents, and few tools are avail‑ able. IntelliPatent was created to accelerate this process. Users can simply upload a series of analogs of interest, and IntelliPatent will automatically extract the general structural scafold and generate the patent claim text. The program can also extend the patent claim by adding commonly seen R groups from historical lists of the top 30 selling drugs in the US for all R substituents. The program takes MDL SD fle formats as inputs, and the invariable core structure and variable substructures will be identifed as the initial scafold and R groups in the output Markush structure. The results can be downloaded in MS Word format (.docx). The suggested claims can be quickly generated with IntelliPatent. This web‑based tool is freely accessible at https ://intel lipat ent.cmdm.tw/. Keywords: Web server, Markush structure, Pharmaceutical patent Introduction of visualizing and analyzing Markush structures from a Claims are the most important sections in composition composition patent [7]. Its web based application, iMa- patents [1]. In the pharmaceutical industry, claims should rVis, revised the underlying R group numbering system include key compounds and all structural derivatives that to deal with nested R group presentation [8].
    [Show full text]
  • The Impact of Academic Patenting on the Rate, Quality, and Direction of (Public) Research Output
    Methodology Data & Measurement Results Summary The Impact of Academic Patenting on the Rate, Quality, and Direction of (Public) Research Output Pierre Azoulay1 Waverly Ding2 Toby Stuart3 1Sloan School of Management MIT & NBER [email protected] 2Haas School of Business University of California — Berkeley 3Graduate School of Business Harvard University January 23, 2007 — Northwestern University Azoulay, Ding, Stuart The Impact of Academic Patenting Methodology Data & Measurement Results Summary Outline 1 Motivation(s) 2 Methodology Problems with existing approaches Selection on observables with staggered treatment decisions Implementing IPTCW estimation 3 Data & Measurement Data sources Measuring “patentability” Descriptive statistics 4 Results The determinants of selection into patenting The impact of academic patenting on the rate of publications The impact of academic patenting on the quality of publications The impact of academic patenting on the content of publications 5 Caveats, Summary & Future Directions Azoulay, Ding, Stuart The Impact of Academic Patenting Methodology Data & Measurement Results Summary Outline 1 Motivation(s) 2 Methodology Problems with existing approaches Selection on observables with staggered treatment decisions Implementing IPTCW estimation 3 Data & Measurement Data sources Measuring “patentability” Descriptive statistics 4 Results The determinants of selection into patenting The impact of academic patenting on the rate of publications The impact of academic patenting on the quality of publications The impact
    [Show full text]
  • Construing Patentability of Chemical Technology Inventions, with Focus
    Construing Patentability of Chemical Technology Inventions, with Focus on their Patent Eligibility and Industrial Applicability: A Comparison on the Patent Examination Approach in the Philippines and in Japan By Anthea Kristine Y. Paculan Intellectual Property Office of the Philippines Supervised by Yorimasa Suwa, PhD., MBA Senior Researcher, Asia-Pacific Industrial Property Center, Japan Institution of Promoting Invention and Innovation Advised by Dr. Kazukiyo Nagai Professor, Department of Applied Chemistry, School of Science and Technology, Meiji University Tokiko Mizuochi, Ph.D Project Assistant Professor, Keio University Office for Open Innovation Final Report In fulfillment of the Study Cum Research fellowship program Sponsored by Japan Patent Office 27 June – 25 October 2019 The views and findings in this report are those of the author and do not necessarily reflect the views and policy of the organization or sponsor of this study. i ○c JPO 2020 Abstract The Intellectual Property Office of the Philippines (IPOPHL) must cope up with the emerging challenges in the patent examination of various fields of technology. The Chemical Technology field at IPOPHL covers a wide array of chemical-related subject-matters which in turn has resulted in handling concerns to the examiners assigned to perform substantive examination on such diverse technologies. Japan Patent Office (JPO) has provided comprehensive guidelines addressing various patentability issues, especially that of patent eligibility and industrial applicability of subject matters in the chemical field. By introducing conceptual aspects of the Japanese patent system as a model, this study allowed the investigation of the similarities and differences in JPO’s and IPOPHL’s examination procedure and assessment of patentability requirements, with focus on patent eligibility and industrial applicability of chemical technology inventions.
    [Show full text]
  • Intellectual Property in the Context of the WTO TRIPS Agreement: Challenges for Public Health MINISTER of HEALTH Humberto Costa
    Intellectual Property in the Context of the WTO TRIPS Agreement: challenges for public health MINISTER OF HEALTH Humberto Costa PRESIDENT OF OSWALDO CRUZ FOUNDATION Paulo Marchiori Buss DIRECTOR OF NATIONAL SCHOOL OF PUBLIC HEALTH Jorge A. Z. Bermudez COORDINATOR OF NUCLEUS FOR PHARMACEUTICAL POLICIES Jorge A. Z. Bermudez Intellectual Property in the Context of the WTO TRIPS Agreement: challenges for public health EDITORS: Jorge A. Z. Bermudez Maria Auxiliadora Oliveira WHO/PAHO Collaborating Center for Pharmaceutical Policies National School of Public Health Sergio Arouca Oswaldo Cruz Foundation A S L U O T R E P O P A P H S O N I O D V I M U N Rio de Janeiro, September 2004 Copyrigth © 2004 ENSP/WHO – Oswaldo Cruz Foundation – FIOCRUZ, Jorge A.Z. Bermudez. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without prior permission of the National School of Public Health Sergio Arouca. ISBN: 85-88026-16-3 Cover and Graphic Design: Lucia ReginaPantojo de Brito English Translation: Laura Anne Krech Revisions: Claudia Garcia Serpa Osório de Castro Gabriela Costa Chaves Thiago Botelho Azeredo NOTE FROM THE AUTHORS This book presents information produced or compiled by Nucleus for Pharmaceutical Policies professionals and external collaborators regarding the process of implementation of the TRIPS Agreement in developing countries, focusing on its implications for public health policies, particularly those related to access to medicines. Interpretations and views expressed in this publication are solely the responsibility of the authors, not representing any official or institutional position, as to the issues here approached.
    [Show full text]
  • Dulaneyspr15.Pdf (409.1Kb)
    Adventures in Organophosphorus Chemistry James W. Dulaney, III (Faculty Mentor: David E. Lewis) Department of Chemistry, University of Wisconsin-Eau Claire, Eau Claire, WI 54702-4004 Background Challenges with the Mitsunobu reaction What is required in a replacement to make the reaction Azodicarboxylates, an important component of the re- “green”? Organophosphorus compounds have been an increasingly important part of organic synthesis since the discovery action, are very hazardous: of the Arbuzov rearrangement in 1905. In the century that followed this discovery, the applications of phosphorus- O Green? based reagents in synthesis has grown to incorporate the Wittig and Horner-Wadsworth-Emmons reactions for OEt • They are highly explosive (especially diethyl azodi- N alkene formation, as well as the Mitsunobu inversion reaction, which is used to convert alcohols to a wide range carboxylate). N Green reactions are run under more environmentally sustainable conditions with a view to of products with inversion of configuration. O minimizing environmental impact: • They are highly toxic (especially diethyl azodicar- OEt boxylate). diethyl azodicarboxylate • They are highly regulated. (DEAD) • Wherever possible, renewable sources are used • Wherever possible, hazardous materials are eliminated completely or replaced by The Wittig reaction Ph Ph Ph BuLi Ph less hazardous materials P R P R Regulations have been put into place by the DOT that O Ph Ph O-i-Pr • Wherever possibler, catalytic reactions are used Reaction between an ylide and an aldehyde or ketone to give an alkene. make azodicarboxylic esters even more difficult to ob- N • Hazardous waste is minimized wherever possible (e.g. organic solvents can be re- Reaction gives mainly the Z isomer with aldehydes O tain and use.
    [Show full text]
  • Elsevier Research Platforms
    | 1 | Scopus Trainer : Nattaphol Sisuruk Elsevier Training Consultant, Research Solutions E-mail : [email protected] | 2 | ELSEVIER is a leading Science & Health Information Provider CONTENTPROVISION ‘E’ CONTENT PROVISION RESEARCH MGMT SEARCH & DISCOVERY /PROMOTION TOOLS Niels Louis Alexander Albert George F. John C. Roger D. Craig C Mello Smoot Medicine Bohr Physics Pasteur Fleming Einstein Mather Kornberg Physics Physics (Chemistry) Medicine Physics Chemistry 2 | 3 | Globally recognised high impact content Disseminate Global Elsevier Citations Total STM reference, publication & citations Elsevier share Coverage: Approximately 5,000 publishers Other Get cited Publisher A Publisher B CertifyCertify Investigate Global Elsevier Publications Global References to Elsevier Elsevier Publish Cite Elsevier Other Publisher A Other Publisher A Publisher B 24 Citations Per Paper: Publisher B 27% of all references Global team 2010-2014 74 offices in 24 countries Publisher References Publications Citations 7,000 Journal Editors Elsevier 56,304,346 1,888,115 45,990,748 Publisher A 15,738,334 1,221,036 22,374,220 70,000 Editorial Board Members Publisher B 23,064,330 747,976 18,298,048 600,000 authors Other 116,371,011 5,261,600 95,192,376 Totals 211,478,021 9,118,727 181,855,392 | 4 | Elsevier Research Platforms : Researchers seek a digital environment where ideas can be exchanged, examined, and applied with tools that empower STM knowledge. To find and analyze data from over 5000 publishers Access the leading eBooks and journal articles published by Elsevier Manage your research and showcase your profile via free services :These platforms make data and content easier to search, access, analyze, and share.
    [Show full text]
  • The Concept of Novel Compositions of Matter: a Theoretical Analysis
    rty Right e s: op O r p P e l n a Robson, Intel Prop Rights 2013, 2:1 u A t c c c e l e DOI: 10.4172/2375-4516.1000108 l s e t s n Intellectual Properties Rights: Open Access I ISSN: 2375-4516 ReviewResearch Article Article OpenOpen Access Access The Concept of Novel Compositions of Matter: A Theoretical Analysis Barry Robson* University Director of Research, St Matthews University School of Medicine, The Dirac Foundation, Oxford shire UK and Quantal Semantics Inc., North Carolina, USA Abstract Here is discussed in the manner of a review the nature and uses of information measures in the discipline of patenting. From one perspective, the information content in a patent diminishes rapidly as the broadness of the claims increases. Claims made by Markush representations facilitate the quantification of that. The equations will approach yielding zero information if a massive number of chemical themes were implied. Importantly, a more detailed examination of these equations have implications that allow discussion of various aspects of novelty, reasonable consistency with a specific purpose, and perhaps even how many arguments and counterarguments there should be between examiner and assignee. Keywords: Novel compositions; Patents; Similarity a biological target molecule that provides a complementary surface (albeit influenced by the surrounding environment); if favorable, it Introduction allows specific binding. In principle, the focus on formula hugely helps the discipline of patenting. It reflects the fact that adding, removing, Information from patents changing, or rearranging atoms in a molecule occurs in discrete jumps, By serving to protect intellectual property in exchange for disclosing computationally a matter of integers employed in a clear cut graph- new insight and data for the benefit of science and engineering, patents theoretic description (in contrast, the above field involves continuous are well recognized to be a rich information source [1].
    [Show full text]
  • Bulk Processing of Molecule Patent Associations“
    DIPLOMARBEIT / DIPLOMA THESIS Titel der Diplomarbeit / Title of the Diploma Thesis „Bulk Processing of Molecule Patent Associations“ verfasst von / submitted by Patrick Penner angestrebter akademischer Grad / in partial fulfilment of the requirements for the degree of Magister der Pharmazie (Mag. Pharm.) Wien, 2017 / Vienna, 2017 Studienkennzahl lt. Studienblatt / A 449 degree programme code as it appears on the student record sheet: Studienrichtung lt. Studienblatt / Diplomstudium Pharmazie degree programme as it appears on the student record sheet: Betreut von / Supervisor: Univ.-Prof. Mag. Dr. Thierry Langer Mitbetreut von / Co-Supervisor: Acknowledgements First and foremost, I would like to thank Prof. Dr. Thierry Langer for the opportunity of this project and the many things I learned in the course of it. It was an enlightening experience and a very welcome chance to apply knowledge. Furthermore, I would like to thank Gökhan Ibis for his continued support and guidance in software development. I would also like to thank Dr. Thomas Seidel for his expertise in cheminformatics and the specific pointers he gave me along the way. Miriam Penner deserves mentioning for her graphical design work on the KNIME Node icon. Lastly I would like to thank my patient proofreaders: Katharina Penner, Miriam Penner, Arthur Garon, Clara van Hoey and Markus Wieder. iii Kurzfassung Die Suche nach Molekülen in chemischen Patenten ist schon seit Jahrzehnten eine Heraus- forderung. Die Ungenauigkeit von Moleküldarstellungen, die Extraktion von Strukturen aus Patenten und die große Anzahl veröffentlichter Patente erschweren dieses Unterfangen. Dieses Projekt widmet sich einem bis jetzt wenig beachteten Aspekt der Patentsuche, nämlich einer automatisierten Suche nach Patenten für größere Molekülmengen.
    [Show full text]
  • Working with Hazardous Chemicals
    A Publication of Reliable Methods for the Preparation of Organic Compounds Working with Hazardous Chemicals The procedures in Organic Syntheses are intended for use only by persons with proper training in experimental organic chemistry. All hazardous materials should be handled using the standard procedures for work with chemicals described in references such as "Prudent Practices in the Laboratory" (The National Academies Press, Washington, D.C., 2011; the full text can be accessed free of charge at http://www.nap.edu/catalog.php?record_id=12654). All chemical waste should be disposed of in accordance with local regulations. For general guidelines for the management of chemical waste, see Chapter 8 of Prudent Practices. In some articles in Organic Syntheses, chemical-specific hazards are highlighted in red “Caution Notes” within a procedure. It is important to recognize that the absence of a caution note does not imply that no significant hazards are associated with the chemicals involved in that procedure. Prior to performing a reaction, a thorough risk assessment should be carried out that includes a review of the potential hazards associated with each chemical and experimental operation on the scale that is planned for the procedure. Guidelines for carrying out a risk assessment and for analyzing the hazards associated with chemicals can be found in Chapter 4 of Prudent Practices. The procedures described in Organic Syntheses are provided as published and are conducted at one's own risk. Organic Syntheses, Inc., its Editors, and its Board of Directors do not warrant or guarantee the safety of individuals using these procedures and hereby disclaim any liability for any injuries or damages claimed to have resulted from or related in any way to the procedures herein.
    [Show full text]
  • Patenting Nature: a Problem of History Christopher Beauchamp Brooklyn Law School, [email protected]
    Brooklyn Law School BrooklynWorks Faculty Scholarship Winter 2013 Patenting Nature: A Problem of History Christopher Beauchamp Brooklyn Law School, [email protected] Follow this and additional works at: https://brooklynworks.brooklaw.edu/faculty Part of the Intellectual Property Law Commons, Legal History Commons, and the Litigation Commons Recommended Citation 16 Stan. Tech. L. Rev. 257 (2013) This Article is brought to you for free and open access by BrooklynWorks. It has been accepted for inclusion in Faculty Scholarship by an authorized administrator of BrooklynWorks. STANFORD TECHNOLOGY LAW REVIEW VOLUME 16, NUMBER 2 WINTER 2013 PATENTING NATURE: A PROBLEM OF HISTORY Christopher Beauchamp* CITE AS: 16 STAN. TECH. L. REV. 257 (2013) http://stlr.stanford.edu/pdf/patentingnature.pdf ABSTRACT The practice of patenting genetic material is currently under sharp attack. Recent litigation has forced the courts to grapple with the doctrinal basis for patenting DNA sequences identical to those found in nature. Faced with conflicting authorities and difficult policy questions, courts have leaned heavily on history to guide—or at least to justify—their decisions. This article explores the history in question. It traces the patent law’s changing treatment of “products of nature” in an attempt to untangle the origins of present-day patentability arguments. The evidence suggests that the historical foundations of the bar on patenting products of nature are surprisingly shaky. The article also reveals how isolated biological materials first came to be patented. This task, I argue, requires looking not only to court decisions, but also to the history of patent practice. My principal vehicle for doing so is the case of Parke-Davis & Co.
    [Show full text]
  • Organic Synthesis Using Carbon Dioxide As Phosgene-Free Carbonyl Reagent*
    Pure Appl. Chem., Vol. 84, No. 3, pp. 581–602, 2012. http://dx.doi.org/10.1351/PAC-CON-11-05-04 © 2011 IUPAC, Publication date (Web): 1 September 2011 Organic synthesis using carbon dioxide as phosgene-free carbonyl reagent* An-Hua Liu, Yu-Nong Li, and Liang-Nian He‡ State Key Laboratory and Institute of Elemento-Organic Chemistry, Nankai University, Tianjin 300071, China Abstract:CO2 is very attractive as a typical renewable feedstock for manufacturing com- modity chemicals, fuel, and materials since it is an abundant, nontoxic, nonflammable, and easily available C1 resource. The development of greener chemical methodologies for replac- ing the utility of hazardous and environmentally undesirable phosgene largely relies on ingenious activation and incorporation of CO2 into valuable compounds, which is of para- mount importance from a standpoint of green chemistry and sustainable development. Great efforts have been devoted to constructing C–C, C–O, and C–N bond on the basis of CO2 acti- vation through molecular catalysis owing to its kinetic and thermodynamic stability. The aim of this article is to demonstrate the versatile use of CO2 in organic synthesis as the alterna- tive carbonyl source of phosgene, with the main focus on utilization of CO2 as phosgene replacement for the synthesis of value-added compounds such as cyclic carbonates, oxa - zolidinones, ureas, isocyanates, and polymers, affording greener pathways for future chemi- cal processes. Keywords: atom economy; aziridines; carbon dioxide; carbonylation; catalysis; green chemistry; ionic liquids; organic carbonate; phosgene-free process; urea. INTRODUCTION CO2 as an abundant, nontoxic, easily available, and typical renewable C1 source as well as an impor- tant “greenhouse” gas has been drawing more and more attention in line with the need for development of green chemistry and a sustainable society.
    [Show full text]
  • Cheminformatics and the Semantic Web: Adding Value with Linked Data and Enhanced Provenance
    Advanced Review Cheminformatics and the Semantic Web: adding value with linked data and enhanced provenance Jeremy G. Frey∗ and Colin L. Bird Cheminformatics is evolving from being a field of study associated primarily with drug discovery into a discipline that embraces the distribution, management, ac- cess, and sharing of chemical data. The relationship with the related subject of bioinformatics is becoming stronger and better defined, owing to the influence of Semantic Web technologies, which enable researchers to integrate hetero- geneous sources of chemical, biochemical, biological, and medical information. These developments depend on a range of factors: the principles of chemical identifiers and their role in relationships between chemical and biological enti- ties; the importance of preserving provenance and properly curated metadata; and an understanding of the contribution that the Semantic Web can make at all stages of the research lifecycle. The movements toward open access, open source, and open collaboration all contribute to progress toward the goals of integration. C " 2013 John Wiley & Sons, Ltd. How to cite this article: WIREs Comput Mol Sci 2013. doi: 10.1002/wcms.1127 INTRODUCTION cipline of bioinformatics evolved more recently, in heminformatics is usually defined in terms of response to the vast amount of data generated by the application of computer science and infor- molecular biology, applying mathematical, and com- mationC technology to problems in the chemical sci- putational techniques not only to the management ences. Brown1 introduced the term chemoinformatics of that data but also to understanding the biological in 1998, in the context of drug discovery, although processes, pathways, and interactions involved.
    [Show full text]