1 Agenda XML Generics

Total Page:16

File Type:pdf, Size:1020Kb

1 Agenda XML Generics XML for Java Developers G22.3033-002 Session 1 - Main Theme Markup Language Technologies (Part I) Dr. Jean-Claude Franchitti New York University Computer Science Department Courant Institute of Mathematical Sciences 1 Agenda n XML Generics n Course Logistics, Structure and Objectives n History of Meta-Markup Languages n XML Applications: Markup Languages n XML Information Modeling Applications n XML-Based Architectures n XML and Java n XML Development Tools n Summary n Class Project n Readings n Assignment #1a 2 XML Generics n XML means eXtensible Markup Language n XML expresses the structure of information (i.e., document content) separately from its presentation n XSL style sheets are used to convert documents to a presentation format that can be processed by a target presentation device (e.g., HTML in the case of legacy browsers) n Document Type Definitions (DTDs) or XML schemas are used to describe and validate content 3 1 Course Logistics n Course Web Site n http://cs.nyu.edu/courses/summer01/G22.3033- 002/index.htm n http://www.nyu.edu/classes/jcf/g22.3033-002/ n Password: xmljava n Review syllabus n Textbooks n XML Development with Java 2 n Professional Java XML n XML and Java, Developing Web Applications 4 Knowledge Required n Know how to ftp files, send e-mail, and load URLs into a Web browser n Reasonable knowledge of HTML n Be able to write a basic HTML page including links, images, and text using a text editor n Be able to place a page on a web server n Advanced Java programming experience 5 Knowledge Not Required n SGML n Programming in Perl, or any language other than Java n Cascading Style Sheets and other HTML advanced features 6 2 Other Useful Knowledge n Web server configuration and the HTTP protocol n JavaScript n C programming n Database theory (normalization rules) n SGML (basic knowledge) n Some knowledge of international character sets 7 History of Meta-Markup Languages n SGML (see http://www.w3c.org) n Complex and expensive n HTML n Tim Berners-Lee (Cern), 1990 n Simple replacement to SGML n Lacked the flexibility and adaptability of XML n W3C XML working group n Jon Bosak (Sun), 1996 n SGML strengths + HTML simplicity 8 History of Meta-Markup Languages (continued) n XML 1.0 Specification n Tim Bray n Language simple enough for programmers to implement n Language not limited to English n Documents easy for search engine n CML, Jumbo, MathML n XLL (1997), CDF (1997), XSL (1997), MSXSL (1998), XML-Data, Mozilla, etc. 9 3 Why XML? n Domain specific markup language n Common data format n Simple, pure Unicode text n Well-documented n Data interchange n Open Financial Exchange (OFX) n XML non copyrighted n Support of portable structured text data as most electronic information is text-based 10 Related Technologies n JavaSoft’s Java API for XML parsing (JAXP) n Java bindings to SAX, and DOM APIs n HTML n CSS, CSS2, and CSS3 n XSL n well formed XML documents n URLs and URIs n resource v.s. location n Linking (XLL) n Unicode 11 Applications of XML: Markup Languages n XML helps define sets of rules for forming semantic tags that break documents into parts n XML is a Metamarkup language n Not a markup language restricted to a fixed set of tags n Can make up tags as you go along n XML Tags can be documented in a DTD n i.e., a domain specific syntax and vocabulary (e.g., CML, MathML, etc.) 12 4 HTML Example <dt>Java <dd>by Sun MicroSystem <ul> <li>Publisher: O’Reilly <li>ISBN: 0-777-88888-9 <li>Pages: 900 </ul> <P>The complete documentation.<P> 13 XML Corresponding Structure <book> <title>Java</title> <author>Sun Microsystems</author> <publisher>O’Reilly</publisher> <isbn>0-777-88888-9</isbn> <pages>900</pages> <blurb>The complete documentation</blurb> </book> 14 XML Enables Semantic/Structured Markup Languages n XML describes a document’s structure and meaning n XML does not describe the formatting of the elements n Formatting is added through style sheets n XML document only contains tags describing the content (not the appearance) of the document 15 5 XML Information Modeling Applications (Focus on Complex Structured Document Specification) n Custom and Industry specific Markup Languages with own syntax and vocabulary n CML (Chemical Markup Language) n MathML (Mathematical Markup Language) n CML (Channel Definition Format) n Classic Literature n SMIL (Synchronized Multimedia Integration Language n OSD (Open Software Description) 16 XML-Based Architectures n Presentation-Oriented Publishing Frameworks n XML-Based Application Servers n Message-Oriented Middleware Frameworks n Web Services Frameworks n Peer to Peer (P2P) Computing n XML-Based eServices Protocols and Architectures 17 XML POP Frameworks (Focus on Content Management and Publishing) n Requirements: n XML parser n XSL processor n Document repository n Collection of document DTDs or schema n Collection of XSL style sheets n Parsing & processing can be client or server-based n Samples server-side XML technology: n Perl & CGI (http://www.perl.com/CPAN) n Java servlets (http://www.alphaworks.ibm.com , www.apache.org) n ASPs (http://www.beyondhtml.com/rocket) 18 6 XML POP Packaged Solutions n Enterprise Information Portal (EIP) toolkits n e.g., DataChannel’s XML framework n Structured content management and retrieval systems n e.g., n Vignette’s StoryServer n Gauss Interprise’s VIP’XML-Gateway n Poet’s Content Management Suite n Chrystal Software’s Astoria 19 XML MOM Applications n Component-based modeling n e.g., XML Metadata Interchange (XMI) n Enterprise Application Integration n Business-to-Customer (B2C) Applications n e.g., e-Commerce n Business-to-Business (B2B) Applications n e.g., n WebMethods n Bluestone’s XML-Server, n eXcelon 2.0 n XMLSolutions’ ExeterXML server 20 Java-enabled XML Technologies n XML provides a universal syntax for Java semantics (behavior) n Portable, reusable data descriptions in XML n Portable Java code that makes the data behave in various ways n XML standard extension n Basic plumbing that translates XML into Java n parser, namespace support in the parser, simple API for XML (SAX), and document object model (DOM) n XML data binding standard extension 21 7 XML and Java Standards n XML includes is a family of technologies n XSL, XML Schema, XML Query, XPath, XPointer, XLink, DOM, RDF, CSS, XSL, XHTML, XML Signature, MathML, SMIL, SVG, etc. n Review the current state of the XML standards at http://www.w3c.org/XML n Review the current state of Java Technology and XML (JAXP) standards at http://java.sun.com/XML n Review the Java binding to DOM 2.0 at http://www.w3.org/TR/2000/REC-DOM-Level-2- Core-20001113/java-binding.zip 22 XML Development Tools n Read and write XML documents n XML Editors n e.g., Wattle Software’s XMLWriter, SoftQuad’s Xmetal, Vervet Logic’s XML Pro, eXcelon Stylus, XML Spy n XML well-formedness and/or validation Processors n e.g., IBM’s XML4J, Apache’s XercesJ, Oracle Parser, Java Project X processor, James Clark’s Expat parser, Microsoft MSXML n XML Browsers n e.g., IE5 23 Summary n XML is a metamarkup language n XML tags describe the structure and semantics of a document’s content (not the format of the content) n XML addresses SGML’s complexity and HTML’s inadequacies n Applications of XML for information modeling exist in various domains (chemistry, math, multimedia, etc.) 24 8 Summary (continued) n XML supports MOM and POP applications n XML supports the Web Services Infrastructure n XML and Java complements each other n Java-enabled XML technology is available in the market today n Standards for integrating XML and Java are being developed 25 Class Project n Project Description n The project will focus on “multi-channel online community platforms”, and will consist of providing custom XML-based services to support the various aspects of a chosen eBusiness application implemented on such platforms (e.g., “community -based shopping”). Examples of useful services to support these platforms may include synchronized multimedia presentation viewing, and “offline” chat capabilities. A sample specification of an online community platform for a virtual university eBusiness application will be provided for illustration purpose. 26 Readings n Readings n XML Development with Java 2: Chapter 1 n Professional Java XML: Chapter 1 n XML and Java: Chapter 1, Appendix C n Handouts posted on the course web site n Explore brief review of XML at XML in 10 points n Read article at XMLJ2EE n Review XML 1.0 (Second edition), Namespaces, and the XML activity statement at W3C n Project Frameworks Setup (ongoing) n Apache’s Web Server, TomCat, and Cocoon 27 n Apache’s Xerces, Xalan 9 Assignment n Assignment #1a: n Explore the textbooks’ CDs, and the textbook references to XML tools n Install and experiment with XML editors. Settle on a tool you feel comfortable using, and come up with a small XML application that demonstrates the use of that tool. Write a short report that uses your sample application to document your findings and recommendations with respect to selection criteria in support of XML editors 28 Next Session: Markup Language Technologies (Part II) n Current state of the XML standard n Advanced XML applications n XML style Specification Languages 29 10.
Recommended publications
  • Designing Universal Chemical Markup (UCM) Through the Reusable Methodology Based on Analyzing Existing Related Formats
    Designing Universal Chemical Markup (UCM) through the reusable methodology based on analyzing existing related formats Background: In order to design concepts for a new general-purpose chemical format we analyzed the strengths and weaknesses of current formats for common chemical data. While the new format is discussed more in the next article, here we describe our software s t tools and two stage analysis procedure that supplied the necessary information for the n i r development. The chemical formats analyzed in both stages were: CDX, CDXML, CML, P CTfile and XDfile. In addition the following formats were included in the first stage only: e r P CIF, InChI, NCBI ASN.1, NCBI XML, PDB, PDBx/mmCIF, PDBML, SMILES, SLN and Mol2. Results: A two stage analysis process devised for both XML (Extensible Markup Language) and non-XML formats enabled us to verify if and how potential advantages of XML are utilized in the widely used general-purpose chemical formats. In the first stage we accumulated information about analyzed formats and selected the formats with the most general-purpose chemical functionality for the second stage. During the second stage our set of software quality requirements was used to assess the benefits and issues of selected formats. Additionally, the detailed analysis of XML formats structure in the second stage helped us to identify concepts in those formats. Using these concepts we came up with the concise structure for a new chemical format, which is designed to provide precise built-in validation capabilities and aims to avoid the potential issues of analyzed formats.
    [Show full text]
  • Linking Nwchem and Avogadro with the Syntax and Semantics of Chemical Markup Language Wibe a De Jong1*†, Andrew M Walker2† and Marcus D Hanwell3†
    de Jong et al. Journal of Cheminformatics 2013, 5:25 http://www.jcheminf.com/content/5/1/25 RESEARCH ARTICLE Open Access From data to analysis: linking NWChem and Avogadro with the syntax and semantics of Chemical Markup Language Wibe A de Jong1*†, Andrew M Walker2† and Marcus D Hanwell3† Abstract Background: Multidisciplinary integrated research requires the ability to couple the diverse sets of data obtained from a range of complex experiments and computer simulations. Integrating data requires semantically rich information. In this paper an end-to-end use of semantically rich data in computational chemistry is demonstrated utilizing the Chemical Markup Language (CML) framework. Semantically rich data is generated by the NWChem computational chemistry software with the FoX library and utilized by the Avogadro molecular editor for analysis and visualization. Results: The NWChem computational chemistry software has been modified and coupled to the FoX library to write CML compliant XML data files. The FoX library was expanded to represent the lexical input files and molecular orbitals used by the computational chemistry software. Draft dictionary entries and a format for molecular orbitals within CML CompChem were developed. The Avogadro application was extended to read in CML data, and display molecular geometry and electronic structure in the GUI allowing for an end-to-end solution where Avogadro can create input structures, generate input files, NWChem can run the calculation and Avogadro can then read in and analyse the CML output produced. The developments outlined in this paper will be made available in future releases of NWChem, FoX, and Avogadro. Conclusions: The production of CML compliant XML files for computational chemistry software such as NWChem can be accomplished relatively easily using the FoX library.
    [Show full text]
  • XML for Java Developers G22.3033-002 Course Roadmap
    XML for Java Developers G22.3033-002 Session 1 - Main Theme Markup Language Technologies (Part I) Dr. Jean-Claude Franchitti New York University Computer Science Department Courant Institute of Mathematical Sciences 1 Course Roadmap Consider the Spectrum of Applications Architectures Distributed vs. Decentralized Apps + Thick vs. Thin Clients J2EE for eCommerce vs. J2EE/Web Services, JXTA, etc. Learn Specific XML/Java “Patterns” Used for Data/Content Presentation, Data Exchange, and Application Configuration Cover XML/Java Technologies According to their Use in the Various Phases of the Application Development Lifecycle (i.e., Discovery, Design, Development, Deployment, Administration) e.g., Modeling, Configuration Management, Processing, Rendering, Querying, Secure Messaging, etc. Develop XML Applications as Assemblies of Reusable XML- Based Services (Applications of XML + Java Applications) 2 1 Agenda XML Generics Course Logistics, Structure and Objectives History of Meta-Markup Languages XML Applications: Markup Languages XML Information Modeling Applications XML-Based Architectures XML and Java XML Development Tools Summary Class Project Readings Assignment #1a 3 Part I Introduction 4 2 XML Generics XML means eXtensible Markup Language XML expresses the structure of information (i.e., document content) separately from its presentation XSL style sheets are used to convert documents to a presentation format that can be processed by a target presentation device (e.g., HTML in the case of legacy browsers) Need a
    [Show full text]
  • Matml: XML for Information Exchange with Materials Property Data
    MatML: XML for Information Exchange with Materials Property Data Aparna S. Varde1,2 Edwin F. Begley Sally Fahrenholz-Mann 1. Department of Computer Science Computer Integrated Building Electronic Publishing and Content 2. Materials Science Program Processes Group Management Worcester Polytechnic Institute (WPI) National Institute of Standards and ASM International Worcester, MA, USA Technology (NIST) Materials Park, OH, USA [email protected] Gaithersburg, MD, USA sally.fahrenholz- [email protected] [email protected] ABSTRACT 1. INTRODUCTION This paper describes the development and use of MatML, the XML, the eXtensible Markup Language is today a widely Materials Markup Language. MatML is an emerging XML accepted standard for storage and exchange of information. It standard intended primarily for the exchange of materials serves a twofold purpose of providing a data model for storing property information. It provides a medium of communication for information and a medium of communication for exchanging users in materials science and related fields such as manufacturing information over the worldwide web [29]. and aerospace. It sets the stage for the development of semantic Domain-specific markup languages are often defined within the web standards to enhance knowledge discovery in materials context of XML to serve as the means for storing and exchanging science and related areas. MatML has been used in applications information in the respective domains. Some examples of such such as the development of materials digital libraries and analysis markup languages include [14, 9]: of contaminant emissions data. Data mining applications of MatML include statistical process control and failure analysis. • AniML: Analytical Information Markup Language Challenges in promoting MatML involve satisfying a broad range • ChemML: Chemical Markup Language of constituencies in the international engineering and materials science community and also adhering to other related standards in • femML: Finite Element Modeling Markup Language web data exchange.
    [Show full text]
  • The Semantics of Chemical Markup Language (CML
    Murray-Rust et al. Journal of Cheminformatics 2011, 3:43 http://www.jcheminf.com/content/3/1/43 RESEARCH ARTICLE Open Access The semantics of Chemical Markup Language (CML): dictionaries and conventions Peter Murray-Rust1*, Joe A Townsend1, Sam E Adams1, Weerapong Phadungsukanan2 and Jens Thomas3 Abstract The semantic architecture of CML consists of conventions, dictionaries and units. The conventions conform to a top-level specification and each convention can constrain compliant documents through machine-processing (validation). Dictionaries conform to a dictionary specification which also imposes machine validation on the dictionaries. Each dictionary can also be used to validate data in a CML document, and provide human-readable descriptions. An additional set of conventions and dictionaries are used to support scientific units. All conventions, dictionaries and dictionary elements are identifiable and addressable through unique URIs. Introduction semantics through dictionaries. These have ranged from From an early stage, Chemical Markup Language (CML) a formally controlled ontology (ChemAxiom [2]) which was designed so that it could accommodate an indefi- is consistent with OWL2.0 [3] and the biosciences’ nitely large amount of chemical and related concepts. Open Biological and Biomedical Ontologies (OBO)[4] This objective has been achieved by developing a dic- framework, to uncontrolled folksonomy-like tagging. tionary mechanism where many of the semantics are Although we have implemented ChemAxiom and it is added not through hard-coded elements and attributes part of the bioscientists’ description of chemistry, we but by linking to semantic dictionaries. CML has a regard it as too challenging for the current practice of number of objects and object containers which are chemistry and unnecessary for its communication.
    [Show full text]
  • Xmetal® Author Enterprise - SP Edition Bring the Efficiency of XML and DITA to Your Sharepoint Implementation
    PRODUCT OVERVIEW SP Edition XMetaL® Author Enterprise - SP Edition Bring the Efficiency of XML and DITA to your SharePoint Implementation Microsoft® SharePoint® helps accelerate and more clearly define shared business processes and facilitates more productive use and sharing of information, using a platform familiar to a company’s IT professionals and developers.With XMetaL® Author Enterprise, you can use XML to manage your content even more effectively. Content Efficiency Drives Your Competitive • More control over your global brand Advantage A Proven Authoring Tool with Seamless Today’s globally-focused, customer-driven Connectivity organizations face extraordinary challenges. Under growing expectations from customers, businesses Repeatedly voted as the top choice of end users, are increasingly expected to deliver product, service XMetaL’s flexibility and usability makes it easy for and support information in each customer’s native anyone in your organization to create, share, reuse, language, in the format they want, when they need and deliver XML-based content, including less it. Businesses are also under pressure to deliver frequent content contributors such as engineers, professional content quickly, to reduce publishing and managers, and customer service staff. A single localization costs, and to keep publications consistent seamlessly integrated interface for authoring, and up-to-date. searching, and managing valid XML content will enable front-line staff to more effectively manage To meet these challenges, many companies are content, helping you reach new levels of efficiency turning to XML, and in particular to a variety of XML and quality. named the Darwin Information Typing Architecture (DITA). Among the available technologies for creating Together, XMetaL and SharePoint offer a and storing content, XML is the gold standard comprehensive content lifecycle solution that enables for scalable content reuse, robust support for all you to harness the value of your organization’s languages, and clean output to multiple deliverables.
    [Show full text]
  • Archive and Compressed [Edit]
    Archive and compressed [edit] Main article: List of archive formats • .?Q? – files compressed by the SQ program • 7z – 7-Zip compressed file • AAC – Advanced Audio Coding • ace – ACE compressed file • ALZ – ALZip compressed file • APK – Applications installable on Android • AT3 – Sony's UMD Data compression • .bke – BackupEarth.com Data compression • ARC • ARJ – ARJ compressed file • BA – Scifer Archive (.ba), Scifer External Archive Type • big – Special file compression format used by Electronic Arts for compressing the data for many of EA's games • BIK (.bik) – Bink Video file. A video compression system developed by RAD Game Tools • BKF (.bkf) – Microsoft backup created by NTBACKUP.EXE • bzip2 – (.bz2) • bld - Skyscraper Simulator Building • c4 – JEDMICS image files, a DOD system • cab – Microsoft Cabinet • cals – JEDMICS image files, a DOD system • cpt/sea – Compact Pro (Macintosh) • DAA – Closed-format, Windows-only compressed disk image • deb – Debian Linux install package • DMG – an Apple compressed/encrypted format • DDZ – a file which can only be used by the "daydreamer engine" created by "fever-dreamer", a program similar to RAGS, it's mainly used to make somewhat short games. • DPE – Package of AVE documents made with Aquafadas digital publishing tools. • EEA – An encrypted CAB, ostensibly for protecting email attachments • .egg – Alzip Egg Edition compressed file • EGT (.egt) – EGT Universal Document also used to create compressed cabinet files replaces .ecab • ECAB (.ECAB, .ezip) – EGT Compressed Folder used in advanced systems to compress entire system folders, replaced by EGT Universal Document • ESS (.ess) – EGT SmartSense File, detects files compressed using the EGT compression system. • GHO (.gho, .ghs) – Norton Ghost • gzip (.gz) – Compressed file • IPG (.ipg) – Format in which Apple Inc.
    [Show full text]
  • Understanding
    Implementing UBL Mark Crawford UBL Vice Chair XML 2003 9 December 2003 Why Are We Talking About UBL • UBL fulfils the promise of XML for business by defining a standard cross-industry vocabulary • UBL is the ebXML missing link • UBL plus ebXML enables the next generation of eBusiness exchanges – Cheaper, easier, Internet-ready – Extends benefits of EDI to small businesses – Fits existing legal and trade concepts – Allows re-use of data • UBL can provide the XML payload for a wide variety of other web-based business frameworks Overview 1 What and Why of UBL 2 The Design of UBL ebXML Core Components Naming and Design Rules Document Engineering Customizing UBL 3 The Content of UBL 1.0 What is Normative What is non-Normative Availability 4 Making UBL Happen 5 UBL Phase 2 6 Summary The promise of XML for e-business • Plug ‘n’ play electronic commerce – Spontaneous trade – No custom programming • Ubiquity on the Internet – Dirt-cheap tools – Complete platform independence – Enable true global market availability • Enable universal interoperability – Abandon existing EDI systems – Handle both "publication" document types and "transactional" documents Goals for Successful eBusiness Services • Web-enable existing fax- and paper-based business practices • Allow businesses to upgrade at their own pace • Preserve the existing investment in electronic business exchanges • Integrate small and medium-size businesses into existing electronic data exchange-based supply chains The standardization of XML business documents is the easiest way to accomplish
    [Show full text]
  • SDL Tridion Docs Release Notes
    SDL Tridion Docs Release Notes SDL Tridion Docs 14 SP1 December 2019 ii SDL Tridion Docs Release Notes 1 Welcome to Tridion Docs Release Notes 1 Welcome to Tridion Docs Release Notes This document contains the complete Release Notes for SDL Tridion Docs 14 SP1. Customer support To contact Technical Support, connect to the Customer Support Web Portal at https://gateway.sdl.com and log a case for your SDL product. You need an account to log a case. If you do not have an account, contact your company's SDL Support Account Administrator. Acknowledgments SDL products include open source or similar third-party software. 7zip Is a file archiver with a high compression ratio. 7-zip is delivered under the GNU LGPL License. 7zip SFX Modified Module The SFX Modified Module is a plugin for creating self-extracting archives. It is compatible with three compression methods (LZMA, Deflate, PPMd) and provides an extended list of options. Reference website http://7zsfx.info/. Akka Akka is a toolkit and runtime for building highly concurrent, distributed, and fault tolerant event- driven applications on the JVM. Amazon Ion Java Amazon Ion Java is a Java streaming parser/serializer for Ion. It is the reference implementation of the Ion data notation for the Java Platform Standard Edition 8 and above. Amazon SQS Java Messaging Library This Amazon SQS Java Messaging Library holds the Java Message Service compatible classes, that are used for communicating with Amazon Simple Queue Service. Animal Sniffer Annotations Animal Sniffer Annotations provides Java 1.5+ annotations which allow marking methods which Animal Sniffer should ignore signature violations of.
    [Show full text]
  • An XML Format for Proteomics Database to Accelerate Collaboration Among Researchers
    AO HUPO workshop in Tsukuba (Dec. 19, 2002) An XML Format For Proteomics Database to Accelerate Collaboration Among Researchers HUP-ML: Human Proteome Markup Language K. Kamijo, H. Mizuguchi, A. Kenmochi, M. Sato, Y. Takaki and A. Tsugita Proteomics Res. Center, NEC Corp. 1 Overview (HUP-ML) z Introduction z XML features z XMLs in life science z HUP-ML concept z Example of HUP-ML z Details of HUP-ML z Demonstration of “HUP-ML Editor” 2 Introduction z Download entries from public DBs as a flat-file z easy for a person to read z different formats for every DB z sometimes needs special access methods and special applications for each format z Needs machine-readable formats for software tools z To boost studies by exchanging data among researchers Activates standardization 3 XML format z XML (eXtensible Markup Language) W3C: World Wide Web Consortium (inception in 1996) Attribute Key Value Example Element Start Tag <tag_element_source attribute_growth=“8 weeks”> rice leaf </tag_element_source> End Tag z Highly readable for machine and person z Can represent information hierarchy and relationships z Details can be added right away z Convenient for exchanging data z Easy to translate to other formats 4 z Logical-check by a Document Type Definition (DTD) Overview (HUP-ML) z Introduction z XML features z XMLs in life science z HUP-ML concept z Example of HUP-ML z Details of HUP-ML z Demonstration of “HUP-ML Editor” 5 XML features z Exchangeable z Inter-operability: z OS (Operating System): Windows, Linux, etc. z Software Development Languages
    [Show full text]
  • For Computational Chemistry : Compchem
    The Semantics of Chemical Markup Language (CML) for Computational Chemistry : CompChem Preprint Cambridge Centre for Computational Chemical Engineering ISSN 1473 { 4273 The Semantics of Chemical Markup Language (CML) for Computational Chemistry : CompChem Weerapong Phadungsukanan 1, Markus Kraft 1, Joe Townsend 2, Peter Murray-Rust 2 released: May 22, 2012 1 Department of Chemical Engineering and 2 Department of Chemistry Biotechnology University of Cambridge University of Cambridge Lensfield Road New Museums Site Cambridge, CB2 1EW Pembroke Street UK Cambridge, CB2 3RA UK E-mail: [email protected] Preprint No. 117 Keywords: CML, CompChem, RDF, computational chemistry, thermochemistry, control vocab- ulary, rule-based validation Edited by Computational Modelling Group Department of Chemical Engineering and Biotechnology University of Cambridge New Museums Site Pembroke Street Cambridge CB2 3RA CoMo United Kingdom GROUP Fax: + 44 (0)1223 334796 E-Mail: [email protected] World Wide Web: http://como.cheng.cam.ac.uk/ Abstract This paper introduces a subdomain chemistry format for storing computa- tional chemistry data called CompChem. It has been developed based on the design, concepts and methodologies of Chemical Markup Language (CML) by adding computational chemistry semantics on top of the CML Schema. The format allows a wide range of ab initio quantum chemistry calculations of individual molecules to be stored. These calculations include, for exam- ple, single point energy calculation, molecular geometry optimization, and vibrational frequency analysis. The paper also describes the supporting in- frastructure, such as processing software, dictionaries, validation tools and database repositories. In addition, some of the challenges and difficulties in developing common computational chemistry dictionaries are discussed.
    [Show full text]
  • SDL Contenta 5.7.1 Release Notes
    SDL Contenta Release Notes SDL Contenta 5.7.1 September 2019 Legal notice Copyright and trademark information relating to this product release. Copyright © 2009–2019 SDL Group. SDL Group means SDL PLC. and its subsidiaries and affiliates. All intellectual property rights contained herein are the sole and exclusive rights of SDL Group. All references to SDL or SDL Group shall mean SDL PLC. and its subsidiaries and affiliates details of which can be obtained upon written request. All rights reserved. Unless explicitly stated otherwise, all intellectual property rights including those in copyright in the content of this website and documentation are owned by or controlled for these purposes by SDL Group. Except as otherwise expressly permitted hereunder or in accordance with copyright legislation, the content of this site, and/or the documentation may not be copied, reproduced, republished, downloaded, posted, broadcast or transmitted in any way without the express written permission of SDL. Contenta is a registered trademark of SDL Group. All other trademarks are the property of their respective owners. The names of other companies and products mentioned herein may be the trademarks of their respective owners. Unless stated to the contrary, no association with any other company or product is intended or should be inferred. This product may include open source or similar third-party software, details of which can be found by clicking the following link: “Acknowledgments” on page 55. Although SDL Group takes all reasonable measures to provide accurate and comprehensive information about the product, this information is provided as-is and all warranties, conditions or other terms concerning the documentation whether express or implied by statute, common law or otherwise (including those relating to satisfactory quality and fitness for purposes) are excluded to the extent permitted by law.
    [Show full text]