Electronic Document Technol

Total Page:16

File Type:pdf, Size:1020Kb

Electronic Document Technol Abstract Electronic Document Technology Standards PDF, PDF/A, OOXML, OpenDocument. What is the alphabet soup? In recent years technologists and Signatures have been attempting to make electronic documents more transportable across systems and displays as well as improving their usability. This session will explain these various document E-Courts 2008, Las Vegas formats and how your court can use the technology to improve data capture, display, and information security. Erik Wilde, UC Berkeley School of Information December 9, 2008 This work is licensed under a CC Attribution 3.0 Unported License About this Presentation About Me Outline Outline 1. About this Presentation [6] 1. About this Presentation [6] 1. About Me [1] 1. About Me [1] 2. About ISD [5] 2. About ISD [5] 2. Document Standards [18] 2. Document Standards [18] 3. Document Security [7] 3. Document Security [7] About Me About ISD Computer Science at Technical University of Berlin (TUB) (88-91) Ph.D. at ETH Zürich (92-97) Post-Doc at ICSI, Berkeley (97/98) Outline Various activities back in Switzerland (98-06) teaching at ETH Zürich and FHNW 1. About this Presentation [6] working as independent consultant (training, courses, consulting) 1. About Me [1] research in various XML-related areas 2. About ISD [5] Professor at the School of Information (since Fall 2006) 2. Document Standards [18] Technical Director of the Information and Service Design (ISD) program 3. Document Security [7] Information and Service Design (ISD) Information-Intensive Applications Part of UC Berkeley's School of Information Traditional enterprise IT solutions have limits Connecting our students with real-world scenarios and projects built for long life-cycles of deployed system architectures “Building Stuff That Actually Works” built for integration of existing systems into a unified landscape getting involved in project management and associated challenges Many enterprise IT solutions cannot keep up very well understanding the real-world challenges of information modeling by definition, they never completely fail Focus on open information systems and open information access they dictate the shape and direction of information flows “usability” and “accessibility” should become terms beyond the UI realm The Web is by far the biggest information system that ever existed Example areas of ISD interest built around an astonishingly primitive data model e-Books beyond “iTunes for books”: open formats, flexible reuse the simplicity is not a deficiency, it is a feature open data for field researchers: sharing information as simply as possible everybody can cooperate as long as there is minimal agreement location on the Web: how to turn the Web into a location-aware system the Web's architectural principle can be reused for enterprise IT Project: Environmental Data Project: Justice and the Criminal Record Government agencies collect and manage a lot of environmental data Criminal records are important for background checks some of it is accessible in historical or current archives companies collect information and are re-sellers some of it is permanently produced by sensors there is no expiration date for this information Large-scale data aggregation presents various challenges Criminal record information changes in important ways implementation issues of sensor deployment and management new entry: important for background check (false negative) organization issues of classifying and grouping sensors expunged entry: not critical for background check (false positive) access issues of being able to access subsets of the available data little business incentives for companies to properly delete entries policy issues of sensible data and possible access restrictions Information accessibility can introduce new challenges Web architecture presents a proven path for large-scale systems how to hold people accountable for providing outdated data built on loose coupling and cooperation rather than integration how to create incentives for properly updating data built on a different architecture than traditional enterprise IT Dream Project: Services, not Sites Document Standards Government agencies should provide services, not sites Sites are hard to build and hard to maintain often built with specific use cases in mind Outline technology evolves and sites must be maintained to keep up Sites get in the way of services 1. About this Presentation [6] often service access is possible only through a site 2. Document Standards [18] Services provide all the necessary information 1. Application-Independent Formats [10] exposing what the public has paid for 2. Application-Specific Formats [6] not spending public money for building interfaces 3. Document Security [7] Policy issues around service design and information usage information licenses must be developed to avoid data rot “eat your own dogfood” is a good start, but not sufficient (tastes differ) REST Document Exchange as Business The Web is built on Representational State Transfer (REST) Interactions resources are the “units of interest” in any REST design peers interact by exchanging representations of resources Traditional enterprise IT is based on integration interactions can only use a small number of predefined verbs (4 in HTTP) model the complete system as one big distributed program state transitions are using hypertext as the engine of application state implement the system using some distributed programming environment Documents often are the core part of a RESTful system architecture programming is based on the abstraction of building one big system the only absolute core part of REST is identification (URIs) Web architecture is based on cooperation communications are often based on HTTP(-S) there is no overarching model, there are only local models representations often use HTML or some XML vocabulary peers can interact by exchanging information about resources representations have primacy over functions or interactions cooperation is achieved by agreeing on representations of resources there should be no assumptions about availability, links can always break Names for the debate: “REST vs. SOAP” or “REST vs. WS-*” This is an ongoing debate and will not go away anytime soon Application-Independent Formats History of Document Interchange Plain text and structured text plain text only needs agreement on a common character set (e.g., ASCII or Unicode) Outline first data formats were comma-delimited or tab-delimited structures SGML (Standard Generalized Markup Language) was the first open document format 1. About this Presentation [6] XML (Extensible Markup Language) streamlined SGML to become usable on the Web 2. Document Standards [18] Document formats vs. data formats 1. Application-Independent Formats [10] data formats represent database-like structures (e.g., UML or ER) 2. Application-Specific Formats [6] document formats represent narrative documents structures 3. Document Security [7] many existing document collections use something in the middle many applications need something in the middle Structured Documents HTML Most real-world data is semi-structured or unstructured HTML is the standard document format on the Web documents use titles, paragraphs, lists, and tables Microformats can be used to improve document semantics documents do not mark up person names, place names, … earlier microformats were not based on a common syntax Natural Language Processing (NLP) tries to extract structures RDFa (October 2008) provides a standardized syntax IT people want structured data, users often don't like forms Why HTML often is not even considered as a document format building good UIs is one of the core tasks for acceptance designed for logical structures, so rendering depends on clients badly designed data entry is sabotaged and produces garbage designed for continuous display, so paged content is not a natural fit provide feedback about the benefits of good data poor print support in regular browsers (problem of CSS and bad browser support) XML is a language for building languages, but don't do it Why HTML should be considered as a document format XML does not define any semantics (i.e., it only defines structures) focus on content structures rather than rendering XML supports semi-structured data (supporting incremental refinement) easy to adapt to a wide variety of clients vocabularies define structure and semantics of XML document types printing problem can be solved with custom print processing vocabularies may provide/use modules, thereby allowing flexible reuse PDF PDF Data Portable Document Format (PDF) evolved from a printer language PDF has evolved into a multimedia container format based on PostScript, a page description language for printers support for various media types such as images, audio, and video removed some programming features, added a lot of file format features PDF forms allow interactive forms to be created and filled out Acrobat Reader as a free product made PDF successful scripting can be used to further support interactive PDF the “give away the reader, charge for the writer” strategy extensions allow 3D models to be embedded into PDF PDF has become a complex and complicated specification Text can also appear in a variety of ways successful commercial products add features, which add data format complexity embedded images from scanning processes may only show text images backwards compatibility almost always means that no features will be removed Optical Character Recognition (OCR) may result in poorly recognized characters
Recommended publications
  • Electronic Document Distribution Why I Can't Read Microsoft Word
    Electronic document distribution or Why I can’t read Microsoft Word attachments By Tim Bell (HOD, Computer Science) 23 July 1999 We are entering an exciting era when documents can conveniently be stored and exchanged electronically, with the promise of increased efficiency and reduced costs. However, if done incorrectly, electronic documents can cause frustration and inconvenience. In the last few months a number of people have taken the bold step into the electronic world only to receive a negative response from people including myself. This document presents arguments for why some methods of document exchange are appropriate and efficient, and others are not. I am not arguing that Word should not be used; in fact this document was prepared in Microsoft Word by choice. I am discussing the method of distribution. This document can be read on any system that has normal access to the World Wide Web. It was prompted when today I received two emails that contained nothing but a Word attachment. The attachments took me about 5 minutes to decode, and turned out to be identical. Furthermore, the information in the documents could easily have been sent as a plain text email. What’s wrong with Microsoft Word attachments? Microsoft Word is fine; it is its use in disseminating documents that is the problem. A number of university documents have been distributed using Microsoft Word, usually as attachments to email. Word's "Send To" command is beguiling (as one user described it), as it instantly emails the document to many people with very little effort. For many of the recipients, only one click is required to view the document instantly.
    [Show full text]
  • A Personal Research Agent for Semantic Knowledge Management of Scientific Literature
    A Personal Research Agent for Semantic Knowledge Management of Scientific Literature Bahar Sateli A Thesis in the Department of Computer Science and Software Engineering Presented in Partial Fulfillment of the Requirements For the Degree of Doctor of Philosophy (Computer Science) at Concordia University Montréal, Québec, Canada February 2018 c Bahar Sateli, 2018 Concordia University School of Graduate Studies This is to certify that the thesis prepared By: Bahar Sateli Entitled: A Personal Research Agent for Semantic Knowledge Management of Scientific Literature and submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy (Computer Science) complies with the regulations of this University and meets the accepted standards with respect to originality and quality. Signed by the final examining commitee: Chair Dr. Georgios Vatistas External Examiner Dr. Guy Lapalme Examiner Dr. Ferhat Khendek Examiner Dr. Volker Haarslev Examiner Dr. Juergen Rilling Supervisor Dr. René Witte Approved by Dr. Volker Haarslev, Graduate Program Director 9 April 2018 Dr. Amir Asif, Dean Faculty of Engineering and Computer Science Abstract A Personal Research Agent for Semantic Knowledge Management of Scientific Literature Bahar Sateli, Ph.D. Concordia University, 2018 The unprecedented rate of scientific publications is a major threat to the productivity of knowledge workers, who rely on scrutinizing the latest scientific discoveries for their daily tasks. Online digital libraries, academic publishing databases and open access repositories grant access to a plethora of information that can overwhelm a researcher, who is looking to obtain fine-grained knowledge relevant for her task at hand. This overload of information has encouraged researchers from various disciplines to look for new approaches in extracting, organizing, and managing knowledge from the immense amount of available literature in ever-growing repositories.
    [Show full text]
  • Electronic Document Preparation Pocket Primer
    Electronic Document Preparation Pocket Primer Vít Novotný December 4, 2018 Creative Commons Attribution 3.0 Unported (cc by 3.0) Contents Introduction 1 1 Writing 3 1.1 Text Processing 4 1.1.1 Character Encoding 4 1.1.2 Text Input 12 1.1.3 Text Editors 13 1.1.4 Interactive Document Preparation Systems 13 1.1.5 Regular Expressions 14 1.2 Version Control 17 2 Markup 21 2.1 Meta Markup Languages 22 2.1.1 The General Markup Language 22 2.1.2 The Extensible Markup Language 23 2.2 Markup on the World Wide Web 28 2.2.1 The Hypertext Markup Language 28 2.2.2 The Extensible Hypertext Markup Language 29 2.2.3 The Semantic Web and Linked Data 31 2.3 Document Preparation Systems 32 2.3.1 Batch-oriented Systems 35 2.3.2 Interactive Systems 36 2.4 Lightweight Markup Languages 39 3 Design 41 3.1 Fonts 41 3.2 Structural Elements 42 3.2.1 Paragraphs and Stanzas 42 iv CONTENTS 3.2.2 Headings 45 3.2.3 Tables and Lists 46 3.2.4 Notes 46 3.2.5 Quotations 47 3.3 Page Layout 48 3.4 Color 48 3.4.1 Theory 48 3.4.2 Schemes 51 Bibliography 53 Acronyms 61 Index 65 Introduction With the advent of the digital age, typesetting has become available to virtually anyone equipped with a personal computer. Beautiful text documents can now be crafted using free and consumer-grade software, which often obviates the need for the involvement of a professional designer and typesetter.
    [Show full text]
  • Semantic Web Technologies and Legal Scholarly Publishing Law, Governance and Technology Series
    Semantic Web Technologies and Legal Scholarly Publishing Law, Governance and Technology Series VOLUME 15 Series Editors: POMPEU CASANOVAS, Institute of Law and Technology, UAB, Spain GIOVANNI SARTOR, University of Bologna (Faculty of Law – CIRSFID) and European, University Institute of Florence, Italy Scientific Advisory Board: GIANMARIA AJANI, University of Turin, Italy; KEVIN ASHLEY, University of Pittsburgh, USA; KATIE ATKINSON, University of Liverpool, UK; TREVOR J.M. BENCH-CAPON, University of Liv- erpool, UK; V. RICHARDS BENJAMINS, Telefonica, Spain; GUIDO BOELLA, Universita’degli Studi di Torino, Italy; JOOST BREUKER, Universiteit van Amsterdam, The Netherlands; DANIÈLE BOUR- CIER, University of Paris 2-CERSA, France; TOM BRUCE, Cornell University, USA; NURIA CASEL- LAS, Institute of Law and Technology, UAB, Spain; CRISTIANO CASTELFRANCHI, ISTC-CNR, Italy; JACK G. CONRAD, Thomson Reuters, USA; ROSARIA CONTE, ISTC-CNR, Italy; FRAN- CESCO CONTINI, IRSIG-CNR, Italy; JESÚS CONTRERAS, iSOCO, Spain; JOHN DAVIES, British Telecommunications plc, UK; JOHN DOMINGUE, The Open University, UK; JAIME DELGADO, Uni- versitat Politécnica de Catalunya, Spain; MARCO FABRI, IRSIG-CNR, Italy; DIETER FENSEL, Uni- versity of Innsbruck, Austria; ENRICO FRANCESCONI, ITTIG-CNR, Italy; FERNANDO GALINDO, Universidad de Zaragoza, Spain; ALDO GANGEMI, ISTC-CNR, Italy; MICHAEL GENESERETH, Stanford University, USA; ASUNCIÓN GÓMEZ-PÉREZ, Universidad Politécnica de Madrid, Spain; THOMAS F. GORDON, Fraunhofer FOKUS, Germany; GUIDO GOVERNATORI, NICTA, Australia; GRAHAM
    [Show full text]
  • Proceedings of Balisage: the Markup Conference 2012
    Published in: Proceedings of Balisage: The Markup Conference 2012. at: http://www.balisage.net/Proceedings/vol6/print/Witt01/BalisageVol6- Witt01.html (downloaded on: 15.02.2017/14:19). Balisage: The Markup Conference 2012 Proceedings A standards-related web-based The Markup Conference M information system Maik Stührenberg Institut für Deutsche Sprache (IDS) Mannheim Universität Bielefeld <maik.Stuehrenberg0uni-bielefeld.de> Oliver Schonefeld Institut für Deutsche Sprache (IDS) Mannheim <[email protected]> Andreas Witt Institut für Deutsche Sprache (IDS) Mannheim <witt0 ids-mannheim.de> Bcilisage: The Markup Conference 2012 August 7 - 10, 2012 Copyright © 2012 by the authors. Used with permission. Howto cite this paper Stührenberg, Maik, Oliver Schonefeld and Andreas Witt. “A standards-related web-based information system.” Presented at Balisage: The Markup Conference 2012, Montreal, Canada, August 7-10, 2012. In Proceedings ofBalisage: The Markup Conference 2012. Balisage Series on Markup Technologies, vol. 8 (2012). DOI: io.4242/Baiisagevoi8.stuhrenbergoi. Abstract This late breaking proposal introduces the prototype of a web-based information system dealing with standards in the field of annotation. Table of Contents Introduction The problem with standards Providing guidance Information structure Representation Current state and future work Introduction This late breaking proposal is based on an ongoing effort started in the CLARIN project and which was presented briefly at the LREC 2012 Workshop on Collaborative Resource Development and Delivery. Initial point was the development of an easy to use information system for the description of standards developed in ISO/IEC TC37/SC4 Language Resources Management. Since these standards are heavily related to each other it is usually not feasible to adopt only a single standard for one's work but to dive into the standards jungle in full.
    [Show full text]
  • Towards the Unification of Formats for Overlapping Markup
    Towards the unification of formats for overlapping markup Paolo Marinelli∗ Fabio Vitali∗ Stefano Zacchiroli∗ [email protected] [email protected] [email protected] Abstract Overlapping markup refers to the issue of how to represent data structures more expressive than trees|for example direct acyclic graphs|using markup (meta-)languages which have been designed with trees in mind|for example XML. In this paper we observe that the state of the art in overlapping markup is far from being the widespread and consistent stack of standards and technologies readily available for XML and develop a roadmap for closing the gap. In particular we present in the paper the design and implementation of what we believe to be the first needed step, namely: a syntactic conversion framework among the plethora of overlapping markup serialization formats. The algorithms needed to perform the various conversions are presented in pseudo-code, they are meant to be used as blueprints for re- searchers and practitioners which need to write batch translation programs from one format to the other. 1 Introduction This paper is about markup, one of the key technological ingredient of hypertext. The particular aspects of markup we are concerned with are the limits of its expressivity. XML-based markup requires that the identified features of a document are organized hierarchically as a single tree, whereby each fragment of the content of the document is contained in one and only one XML element, each of which is contained within one and only one parent element all the way up to the single root element at the top.
    [Show full text]
  • The MLCD Overlap Corpus (MOC)
    Digital Humanities 2010 The MLCD Overlap Corpus semantic openness of SGML and XML to supply non-hierarchical interpretation of what (MOC) are often thought to be inescapably hierarchical notations. Huitfeldt, Claus The SGML/XML-based approaches to overlap [email protected] fall, roughly, into three groups: milestones, Department of Philosophy, University of fragmentation-and-reconstitution, and stand- Bergen off annotation. Milestones (described as early as Sperberg-McQueen, C. M. (Barnard et al. 1988), and used in (Sperberg- [email protected] McQueen and Burnard 1990) and later versions Black Mesa Technologies LLC, USA of the TEI Guidelines) use empty elements to Marcoux, Yves mark the boundaries of regions which cannot be marked simply as elements because they [email protected] overlap the boundaries of other elements. More Université de Montréal, Canada recently, approaches to milestone markup have been generalized in the Trojan Horse and CLIX markup idioms (DeRose 2004). For some time, theorists and practitioners Fragmentation is the technique of dividing of descriptive markup have been aware that a logical unit which overlaps other units the strict hierarchical organization of elements into several smaller units, which do not; the provided by SGML and XML represents a consuming application can then re-aggregate potentially problematic abstraction. The nesting the fragments. structures of SGML and XML capture an important property of real texts and represent Stand-off annotation addresses the overlap a successful combination of expressive power problem by removing the markup from the main and tractability. But not all textual phenomena data stream of the document, at the same time appear in properly nested form, and for more adding pointers back into the base data.
    [Show full text]
  • XML Prague 2019
    XML Prague 2019 Conference Proceedings University of Economics, Prague Prague, Czech Republic February 7–9, 2019 XML Prague 2019 – Conference Proceedings Copyright © 2019 Jiří Kosek ISBN 978-80-906259-6-9 (pdf) ISBN 978-80-906259-7-6 (ePub) Table of Contents General Information ..................................................................................................... vii Sponsors .......................................................................................................................... ix Preface .............................................................................................................................. xi Task Abstraction for XPath Derived Languages – Debbie Lockett and Adam Retter ........................................................................................ 1 A novel approach to XSLT-based Schematron validation – David Maus .............. 57 Authoring DSLs in Spreadsheets Using XML Technologies – Alan Painter ......... 67 How to configure an editor – Martin Middel ........................................................... 103 Discover the Power of SQF – Octavian Nadolu and Nico Kutscherauer .................. 117 Tagdiff: a diffing tool for highlighting differences in text-oriented XML – Cyril Briquet .................................................................................................................. 143 Merge and Graft: Two Twins That Need To Grow Apart – Robin La Fontaine and Nigel Whitaker ........................................................................
    [Show full text]
  • Journal of the Text Encoding Initiative, Issue 8
    Journal of the Text Encoding Initiative Issue 8 | December 2014 - December 2015 Selected Papers from the 2013 TEI Conference Arianna Ciula and Fabio Ciotti (dir.) Electronic version URL: http://journals.openedition.org/jtei/1025 DOI: 10.4000/jtei.1025 ISSN: 2162-5603 Publisher TEI Consortium Electronic reference Arianna Ciula and Fabio Ciotti (dir.), Journal of the Text Encoding Initiative, Issue 8 | December 2014 - December 2015, « Selected Papers from the 2013 TEI Conference » [Online], Online since 28 December 2014, connection on 23 September 2020. URL : http://journals.openedition.org/jtei/1025 ; DOI : https://doi.org/10.4000/jtei.1025 This text was automatically generated on 23 September 2020. TEI Consortium 2014 (Creative Commons Attribution-NoDerivs 3.0 Unported License) 1 TABLE OF CONTENTS Editorial Introduction to Issue 8 of the Journal of the Text Encoding Initiative Arianna Ciula and Fabio Ciotti TEI in Relation to Other Semantic and Modeling Formalisms TEI, Ontologies, Linked Open Data: Geolat and Beyond Fabio Ciotti, Maurizio Lana and Francesca Tomasi The Linked Fragment: TEI and the Encoding of Text Reuses of Lost Authors Monica Berti, Bridget Almas, David Dubin, Greta Franzini, Simona Stoyanova and Gregory R. Crane TEI in LMNL: Implications for Modeling Wendell Piez Ontologies, Data Modeling, and TEI Øyvind Eide TEI to Express Domain Specific Text and Data Models Texts and Documents: New Challenges for TEI Interchange and Lessons from the Shelley- Godwin Archive Trevor Muñoz and Raffaele Viglianti Manus OnLine and the Text
    [Show full text]
  • Standoff Properties As an Alternative to XML for Digital Historical Editions
    Standoff properties as an alternative to XML for digital historical editions Desmond Schmidt eResearch, ITEE, University of Queensland, Australia 5 [email protected] Abstract In the past few years digital humanists have recognised that the development of tools for the collaborative creation of shareable, reusable and archivable transcriptions of historical documents is being held back by the complexity and lack of interoperability of customisable XML encoding 5 standards. To fix this problem transcriptions must be split into their constituent parts: a plain text and individual sets of markup. The plain text is easily processable by text-analysis software or search engines, and may be divided into layers representing different states of each document. Each markup set may also represent a separate layer of information to enrich an underlying plain text, and may be freely combined with others. Unlike in XML, textual properties in markup sets can 10 freely overlap. Plain text versions and their corresponding markup can also be reliably and efficiently converted into HTML, the language of the Web. Unlike previous attempts at separating markup from text, the proposed data representation is designed to be fully editable, and development of a suitable Web-based editor is at an advanced stage. To solve the problems we now face will require a radical rethink of how we mark up texts, and the adoption of globally 15 interoperable standards to which these internal representations can be easily converted, such as HTML, plain text and RDFa. Standoff properties as an alternative to XML for digital historical editions Introduction Most digital humanists are familiar with markup, or the use of named tags to indicate properties that 5 text should possess in addition to its basic character data.
    [Show full text]
  • Risoluzione E Linearizzazione Di Overlap in Linguaggi Di Markup
    Alma Mater Studiorum - Università di Bologna Facoltà di Scienze Matematiche, Fisiche e Naturali Corso di Laurea Specialistica in Informatica Materia: Tecnologie Web Risoluzione e linearizzazione di overlap in linguaggi di markup Tesi di Laurea di Francesco Poggi Relatore Chiar.mo Prof. Fabio Vitali Correlatore Dott. Silvio Peroni Parole chiave: EARMARK, Overlapping markup, Conversione di documenti, OWL, XML Sessione I Anno Accademico 2010-2011 Risoluzione e linearizzazione di overlap in linguaggi di markup Indice INDICE ....................................................................................................................... 1 INTRODUZIONE ...................................................................................................... 3 OVERLAPPING MARKUP ................................................................................... 11 2.1 OHCO thesis e modello ad albero ............................................................... 13 2.2 Overlapping markup .................................................................................... 14 2.2.1 Caratteristiche complesse legate all'overlap ................................................... 17 2.3 Approcci attuali al problema dell'overlap ................................................... 18 2.3.1 Soluzioni a livello di linguaggio di markup .................................................... 19 2.3.1.1 TEI milestone ...................................................................................... 21 2.3.1.2 Flat Milestone ....................................................................................
    [Show full text]
  • Towards a Formal Ontology for the Text Encoding Initiative DOI
    Umanistica Digitale - ISSN:2532-8816 - n.3, 2018 F. Ciotti – owa"ds a Fo"mal Ontolog% fo" t'e e(t Encoding Initiati*e DOI: htt+:/,doi.o"g,10.6092,issn.2532-8816/8174 o!a"#s a Fo"mal $ntolog% &o" t'e e(t )nco#ing Initiati*e Fabio Ciotti Uni*ersit1 #i 2oma 3 or 4ergata5 [email protected] 60st"act. In 78esto a"ticolo viene +"esentata 8na +"o+osta +"elimina"e #i ontologia +e" la ra++"esentazione #ello schema ext )nco#ing Initiati*e : )I;. <e motivazioni e i bene&ici #i 8na *ersione semantica e machine-"ea#able #ella )I sono molte+lici: :1) l=inte"o+erabilit1 semantica tra co#i&iche #i testi com+o"te"ebbe la &acilitazione #ell=inte""ogazione cross-co"+ora e l=integrazione #i <in>e# $+en Data? :2; la &o"malizzazione #el )I abstract model a++o"te"ebbe migliorie signi&icati*e in te"mini #i consistenza e soundness. Data la com+lessit1 #ello schema )I, viene 78i +"eso in consi#erazione 8n consistente ma rist"etto sottoinsieme #i elementi e attri08ti. In secon#o l8ogo la meta-ontolog% )62@62A viene esaminata. In 8ltimo, *engono &orniti alcuni #ettagli im+lementativi. 'is article +"esents t'e rationale an# t'e +"o+osal o& a +"elimina"% a"c'itect8"e o& a &o"mal ontolog% o& t'e ext )nco#ing Initiati*e ma">8+ lang8age. 'e "easons to 'a*e a &o"mal an# machine-"ea#a0le semantics &o" )I a"e mani&ol#.
    [Show full text]