
Blue Obelisk - Interoperability in chemical informatics Citation for published version (APA): Guha, R., Howard, MT., Hutchison, GR., Murray-Rust, P., Rzepa, H., Steinbeck, C., Wegner, J., & Willighagen, EL. (2006). Blue Obelisk - Interoperability in chemical informatics. Journal of Chemical Information and Modeling, 46(3), 991-998. https://doi.org/10.1021/ci050400b Document status and date: Published: 01/05/2006 DOI: 10.1021/ci050400b Document Version: Publisher's PDF, also known as Version of record Document license: Taverne Please check the document version of this publication: • A submitted manuscript is the version of the article upon submission and before peer-review. There can be important differences between the submitted version and the official published version of record. People interested in the research are advised to contact the author for the final version of the publication, or visit the DOI to the publisher's website. • The final author version and the galley proof are versions of the publication after peer review. • The final published version features the final layout of the paper including the volume, issue and page numbers. Link to publication General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain • You may freely distribute the URL identifying the publication in the public portal. If the publication is distributed under the terms of Article 25fa of the Dutch Copyright Act, indicated by the “Taverne” license above, please follow below link for the End User Agreement: www.umlib.nl/taverne-license Take down policy If you believe that this document breaches copyright please contact us at: [email protected] providing details and we will investigate your claim. Download date: 23 Sep. 2021 This is an open access article published under an ACS AuthorChoice License, which permits copying and redistribution of the article or any adaptations for non-commercial purposes. J. Chem. Inf. Model. 2006, 46, 991-998 991 The Blue ObelisksInteroperability in Chemical Informatics Rajarshi Guha,† Michael T. Howard,‡ Geoffrey R. Hutchison,§ Peter Murray-Rust,| Henry Rzepa,⊥ Christoph Steinbeck,*,# Jo¨rg Wegner,∇ and Egon L. WillighagenO Pennsylvania State University, University Park, Pennsylvania 16804-3000, Jmol Project, U. S. A., Cornell University, Ithaca, New York 14853, Cambridge University, Cambridge CB2 1TN, Great Britain, Imperial College, London SW7 2AZ, Great Britain, Cologne University Bioinformatics Center (CUBIC), Zu¨lpicher Str. 47, D-50674 Ko¨ln, Germany, University of Tu¨bingen, Tu¨bingen, Germany, and Jmol project, The Netherlands Received September 12, 2005 The Blue Obelisk Movement (http://www.blueobelisk.org/) is the name used by a diverse Internet group promoting reusable chemistry via open source software development, consistent and complimentary chemoinformatics research, open data, and open standards. We outline recent examples of cooperation in the Blue Obelisk group: a shared dictionary of algorithms and implementations in chemoinformatics algorithms drawing from our various software projects; a shared repository of chemoinformatics data including elemental properties, atomic radii, isotopes, atom typing rules, and so forth; and Web services for the platform- independent use of chemoinformatics programs. 1. INTRODUCTION working on another machine. But the 1980s saw the value of chemical informatics and the need to “productize” it. Much While the past 20 or 30 years of development in chemo- of this was meritorious, as it brought informatics into the informatics has created a plethora of published software classroom and the research lab and helped pay for some systems and algorithms for solving chemical problems, little chemistry research, but it also had hidden costs, which we effort has been spent in providing the community with open are now facing today. In particular, costs include non- components and data, to be reused and improved by interoperability and centralized control of informatics. communal efforts. Bioinformatics, with its much younger history, adopted the principles taught by success stories of Now, several open chemistry and chemoinformatics projects the open source movement in general, and Linux in (Table 1) have pooled forces to enhance interoperability particular, from the very beginning. Recent years, however, between these tools in a movement we call “The Blue have seen the emergence of open tools and databases also Obelisk” (BO). The name originates from an informal in chemical informatics.1-4 These draw on the existing ideas meeting place in San Diego, California, during the American of independent peer review and scientific collaboration, Chemical Society 2005 Spring National Meeting (see Figure mixed with “open source” software development paradigms. 1) and was coined by one of the authors. Because contribu- Community involvement, including assessments, suggestions, tors to the component projects live around the world, few s critiques, and rapid evolution, is a core component of these had met in person instead collaborating and meeting via efforts. The benefits of open source software have been the Internet. We identify three core areas for the Blue Obelisk Move- Downloaded via MAASTRICHT UNIV on June 30, 2021 at 18:25:52 (UTC). discussed in great detail by Eric Raymond in his seminal work The Cathedral and the Bazaar and following works.5 ment: The Open Source Initiative (OSI) summarizes: “Open source • Open Source. One can use other people’s code without promotes software reliability and quality by supporting further permission, including changing it for one’s own use See https://pubs.acs.org/sharingguidelines for options on how to legitimately share published articles. independent peer review and rapid evolution of source code. and distributing it again. To be OSI certified, the software must be distributed under • Open Standards. One can find visible community mech- a license that guarantees the right to read, redistribute, anisms for protocols and communicating information. The modify, and use the software freely.”6 mechanisms for creating and maintaining these standards In the beginning, most scientific software was free. It was cover a wide spectrum of human organizations, including so difficult to port that scientists did not bother about various degrees of consent. We have been heavily influenced licensessone was delighted if someone else could get it by the mantra of the Internet Engineering Task Force: “rough consensus and running code”. * Corresponding author phone: +49 (0)221 470-7426; fax: +49 (0) • 221 470-7786; e-mail: [email protected]. Open Data. One can obtain all data in the public domain † Pennsylvania State University. when wanted and reuse it for whatever purpose. This is an ‡ Jmol project, http://www.jmol.org. underused term, which we are resurrecting. It is independent § Cornell University. | Cambridge University. of “open access” and has relevance to “closed access” as ⊥ Imperial College. well. # Cologne University Bioinformatics Center. ∇ University of Tu¨bingen. As outlined above, these areas are independent of the O Jmol project, http://www.jmol.org. concept of “open access” to read publications freely. Instead, 10.1021/ci050400b CCC: $33.50 © 2006 American Chemical Society Published on Web 02/22/2006 992 J. Chem. Inf. Model., Vol. 46, No. 3, 2006 GUHA ET AL. Table 1. Current Blue Obelisk Projects project URL principal authors CML, JUMBO12 http://cml.sf.net/ P.M.-R., H.R. JChemPaint13 http://jchempaint.sf.net/ C.S., E.L.W. Jmol http://jmol.sf.net/ M.T.H., E.L.W. NMRShiftDB3 http://www.nmrshiftdb.org/ C.S. JOElib http://joelib.sf.net/ J.W. Kalzium http://edu.kde.org/kalzium/ Carsten Niehaus Octet http://octet.sf.net/ Rich Apodaca Open Babel http://openbabel.sf.net/ G.R.H. QSAR http://qsar.sf.net/ E.L.W., R.G., C.S., J.W. The Chemistry Development Kit1 http://cdk.sf.net/ E.L.W., C.S. WWMM http://wwmm.sf.net/ P.M.-R. the three points focus on access to the scientific data, coding,10 and atom typing,11 however, are indispensable in algorithms, and implementations themselves, rather than the academic chemoinformatics research to build better, more formatted manuscript. In particular, we believe that these stable, and more reproducible chemical information systems. concepts strongly continue the spirit of communal peer In this contribution, we outline several examples for how review and reproducibility at the heart of modern scientific the Blue Obelisk projects address this need: a shared research. dictionary of algorithms and implementations in chemo- It is well-known in software development that 80% of the informatics algorithms drawing from our various software costs are caused by maintaining software and not by the projects and a shared repository of chemoinformatics data initial implementation.7 This holds both for the in-house including elemental properties, atomic radii, isotopes, atom development in pharmaceutical companies and the develop- typing rules, a set of Web-based chemoinformatics services, ment for commercial chemoinformatics suppliers. Besides
Details
-
File Typepdf
-
Upload Time-
-
Content LanguagesEnglish
-
Upload UserAnonymous/Not logged-in
-
File Pages9 Page
-
File Size-