Semantic Web Technologies and Legal Scholarly Publishing Law, Governance and Technology Series

Total Page:16

File Type:pdf, Size:1020Kb

Semantic Web Technologies and Legal Scholarly Publishing Law, Governance and Technology Series Semantic Web Technologies and Legal Scholarly Publishing Law, Governance and Technology Series VOLUME 15 Series Editors: POMPEU CASANOVAS, Institute of Law and Technology, UAB, Spain GIOVANNI SARTOR, University of Bologna (Faculty of Law – CIRSFID) and European, University Institute of Florence, Italy Scientific Advisory Board: GIANMARIA AJANI, University of Turin, Italy; KEVIN ASHLEY, University of Pittsburgh, USA; KATIE ATKINSON, University of Liverpool, UK; TREVOR J.M. BENCH-CAPON, University of Liv- erpool, UK; V. RICHARDS BENJAMINS, Telefonica, Spain; GUIDO BOELLA, Universita’degli Studi di Torino, Italy; JOOST BREUKER, Universiteit van Amsterdam, The Netherlands; DANIÈLE BOUR- CIER, University of Paris 2-CERSA, France; TOM BRUCE, Cornell University, USA; NURIA CASEL- LAS, Institute of Law and Technology, UAB, Spain; CRISTIANO CASTELFRANCHI, ISTC-CNR, Italy; JACK G. CONRAD, Thomson Reuters, USA; ROSARIA CONTE, ISTC-CNR, Italy; FRAN- CESCO CONTINI, IRSIG-CNR, Italy; JESÚS CONTRERAS, iSOCO, Spain; JOHN DAVIES, British Telecommunications plc, UK; JOHN DOMINGUE, The Open University, UK; JAIME DELGADO, Uni- versitat Politécnica de Catalunya, Spain; MARCO FABRI, IRSIG-CNR, Italy; DIETER FENSEL, Uni- versity of Innsbruck, Austria; ENRICO FRANCESCONI, ITTIG-CNR, Italy; FERNANDO GALINDO, Universidad de Zaragoza, Spain; ALDO GANGEMI, ISTC-CNR, Italy; MICHAEL GENESERETH, Stanford University, USA; ASUNCIÓN GÓMEZ-PÉREZ, Universidad Politécnica de Madrid, Spain; THOMAS F. GORDON, Fraunhofer FOKUS, Germany; GUIDO GOVERNATORI, NICTA, Australia; GRAHAM GREENLEAF, The University of New South Wales, Australia; MARKO GROBELNIK, Josef Stefan Institute, Slovenia; SERGE GUTWIRTH, Vrije Universiteit Brussels; JAMES HENDLER, Rens- selaer Polytechnic Institute, USA; RINKE HOEKSTRA, Universiteit van Amsterdam, The Netherlands; ETHAN KATSH, University of Massachusetts Amherst, USA; MARC LAURITSEN, Capstone Practice Systems, Inc., USA; RONALD LEENES, Tilburg Institute for Law, Technology, and Society, Tilburg University, The Netherlands; PHILIP LIETH, Queen’s University Belfast, UK; ARNO LODDER, VU University Amsterdam, The Netherlands; JOSÉ MANUEL LÓPEZ COBO, Playence, Austria; PIERRE MAZZEGA, LMTG-UMR5563 CNRS/IRD/UPS, France; MARIE-FRANCINE MOENS, Katholieke Universiteit Leuven, Belgium; PABLO NORIEGA, IIIA-CSIC, Spain; ANJA OSKAMP, Open Univer- siteit, The Netherlands; SASCHA OSSOWSKI, Universidad Rey Juan Carlos, Spain; UGO PAGALLO, Università degli Studi di Torino, Italy; MONICA PALMIRANI, Università di Bologna, Italy; ABDUL PALIWALA, University of Warwick, UK; ENRIC PLAZA, IIIA-CSIC, Spain; MARTA POBLET, RMIT University, Melbourne, Australia; DANIEL POULIN, Lexum informatique juridique Inc., Canada; HENRY PRAKKEN, Universiteit Utrecht and The University of Groningen, The Netherlands; HAIBIN QI, Huazhong University of Science and Technology, P.R. China; DORY REILING, Amsterdam District Court, The Netherlands; PIER CARLO ROSSI, Italy; EDWINA L. RISSLAND, University of Mas- sachusetts, Amherst, USA; COLIN RULE, University of Massachusetts, USA; MARCO SCHORLEM- MER, IIIA-CSIC, Spain; CARLES SIERRA, IIIA-CSIC, Spain; MIGEL ANGEL SICILIA, Universidad de Alcalá, Spain; RONALD W. STAUDT, Chicago-Kent College of Law, USA; RUDI STUDER, Karl- sruhe Institute of Technology, Germany; DANIELA TISCORNIA, ITTIG-CNR, Italy; JOAN-JOSEP VALLBÈ, Universitat de Barcelon, Spain; TOM VAN ENGERS, Universiteit van Amsterdam, The Neth- erlands; FABIO VITALI, Università di Bologna, Italy; MARY-ANNE WILLIAMS, The University of Technology, Sydney, Australia; RADBOUD WINKELS, University of Amsterdam, The Netherlands; ADAM WYNER, University of Liverpool, UK; HAJIME YOSHINO, Meiji Gakuin University, Japan; JOHN ZELEZNIKOW, University of Victoria, Australia For further volumes: http://www.springer.com/series/8808 Silvio Peroni Semantic Web Technologies and Legal Scholarly Publishing 2123 Silvio Peroni Department of Computer Science and Engineering University of Bologna Mura Anteo Zamboni 7 40126 Bologna (BO) Italy ISSN 2352-1902 ISSN 2352-1910 (electronic) ISBN 978-3-319-04776-8 ISBN 978-3-319-04777-5 (eBook) DOI 10.1007/978-3-319-04777-5 Springer Cham Heidelberg New York Dordrecht London Library of Congress Control Number: 2014934451 © Springer International Publishing Switzerland 2014 This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. Exempted from this legal reservation are brief excerpts in connection with reviews or scholarly analysis or material supplied specifically for the purpose of being entered and executed on a computer system, for exclusive use by the purchaser of the work. Duplication of this publication or parts thereof is permitted only under the provisions of the Copyright Law of the Publisher’s location, in its current version, and permission for use must always be obtained from Springer. Permissions for use may be obtained through RightsLink at the Copyright Clearance Center. Violations are liable to prosecution under the respective Copyright Law. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. While the advice and information in this book are believed to be true and accurate at the date of publication, neither the authors nor the editors nor the publisher can accept any legal responsibility for any errors or omissions that may be made. The publisher makes no warranty, express or implied, with respect to the material contained herein. Printed on acid-free paper Springer is part of Springer Science+Business Media (www.springer.com) Foreword What you are holding in your hand is a multi-faceted book. From where I stand, there are at least three different points of view that can be taken to read it and enjoy its ideas and conceptual framework. First of all, this is a book proposing innovative ideas about Legal Publishing.As such, positioning it within the Law, Governance and Technology Series of Springer Verlag is proper and true to its nature and aims: Peroni’s work contributes new concepts and solutions for managing legal texts (doctrine as well as legislation and jurisprudence), placing them as the core cultural infrastructure upon which the uni- verse of legal activities is based, whose workflow (from ideation to drafting, to evaluation, to publication, to evolution, to oblivion or removal) is instrumental to the correct understanding of legal processes, and whose semantic representation is a compelling and exemplary application of Semantic Web technologies that provides solid bases for innovative tools helping professionals and laymen in cataloguing, searching, and reasoning upon legal knowledge. And yet, a second main viewpoint emerges, that of Semantic Publishing, consti- tuting an explicit, confessed, and transparent focus drift, whereby legal publishing is held as but one instance of many in the larger class of publishing contexts, where similarities are more important than differences, so that they can be studied and described as one discipline. In this sense, document publishing can be seen as the main theme of the book, and the legal domain as just the frontmost example. Several topics of great timeliness and complexity are tackled and relevant contributions are provided under one encom- passing paradigm giving them integrity and uniformity: from the issue of managing multiple, reciprocally inconsistent structures over the same content (a well-known problem in the markup community since the early days of TEI) to the problem of the accurate description of the domain of scholarly publishing, to the issue of providing informative and easily understandable presentation of metadata records of publica- tions to a lay audience, we are shown a full catalogue of tough problems whose solutions in literature are, at present, hardly satisfactory. Yet Peroni not only tackles them providing more advanced proposals and ideas than in literature, but does so by introducing to its full power just one new approach, that of Semantic Web tech- nologies, showing how powerful and elegant they are in addressing complex issues. v vi Foreword Namely, multiple hierarchies are in fact managed through overlapping markup as suggested long ago in literature, but through the use of Semantic Web ontologies Peroni gives life to a fully new meta-markup language, EARMARK, that success- fully and automatically replaces XML without loosing the generality and availability of its tools. Similarly, metadata for scholarly publishing are in fact managed through ontologies, as suggested in innumerable publications of the last twenty years, but Peroni suggests a unifying framework of eight different ontologies, SPAR, integrat- ing in the same conceptualization issues as different as the FRBR-based description of the relationships between versions of the same work, the finest distinction of roles and stages in scholarly publishing, the rhetorical roles of sections and fragments in scholarly articles. Also and consequently, the familiarity of Peroni to Semantic
Recommended publications
  • A Personal Research Agent for Semantic Knowledge Management of Scientific Literature
    A Personal Research Agent for Semantic Knowledge Management of Scientific Literature Bahar Sateli A Thesis in the Department of Computer Science and Software Engineering Presented in Partial Fulfillment of the Requirements For the Degree of Doctor of Philosophy (Computer Science) at Concordia University Montréal, Québec, Canada February 2018 c Bahar Sateli, 2018 Concordia University School of Graduate Studies This is to certify that the thesis prepared By: Bahar Sateli Entitled: A Personal Research Agent for Semantic Knowledge Management of Scientific Literature and submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy (Computer Science) complies with the regulations of this University and meets the accepted standards with respect to originality and quality. Signed by the final examining commitee: Chair Dr. Georgios Vatistas External Examiner Dr. Guy Lapalme Examiner Dr. Ferhat Khendek Examiner Dr. Volker Haarslev Examiner Dr. Juergen Rilling Supervisor Dr. René Witte Approved by Dr. Volker Haarslev, Graduate Program Director 9 April 2018 Dr. Amir Asif, Dean Faculty of Engineering and Computer Science Abstract A Personal Research Agent for Semantic Knowledge Management of Scientific Literature Bahar Sateli, Ph.D. Concordia University, 2018 The unprecedented rate of scientific publications is a major threat to the productivity of knowledge workers, who rely on scrutinizing the latest scientific discoveries for their daily tasks. Online digital libraries, academic publishing databases and open access repositories grant access to a plethora of information that can overwhelm a researcher, who is looking to obtain fine-grained knowledge relevant for her task at hand. This overload of information has encouraged researchers from various disciplines to look for new approaches in extracting, organizing, and managing knowledge from the immense amount of available literature in ever-growing repositories.
    [Show full text]
  • Digital Publications Beyond Digital Communication
    Erschienen in der Reihe Herausgeber der Reihe Digital Publications beyond Digital Communication Niels-Oliver Walkowski Berlin-Brandenburg Academy of Sciences and Humanities Working Papers Niels-Oliver Walkowski: “Digital Publications beyond Digital Communication”. DARIAH-DE Working Papers No. 17. Göttingen: DARIAH-DE, 2016. URN: urn:nbn:de:gbv:7-dariah-2016-3-4. This article is published under the licence Creative-Commons Attribution 4.0 (CC-BY). The DARIAH-DE Working Papers are published by Mirjam Blümm, Thomas Kollatz, Stefan Schmunk and Christof Schöch. Abstract Since more than twenty years different stakeholders involved in scholarly publishing have tried to fundamentally rethink the shape of publications in a digital environment. In contrast to these abundant activities with their highly experimental character the same stakeholders continuously regret that the dominant form of digital publications is still the PDF. In the research literature, the situation of digital publications is often compared with the era of the printing press. While this comparison might be helpful to create awareness about the dimension in which changes are taking place it certainly also blurs significant differences. In the situation that was described before it is even more important to identify the peculiarities of the process of change. Fortunately, the story of digital scholarly publications is long enough to tell it in a way in which these peculiarities become more transparent. The study at hand is an attempt to do so. At its end, it becomes clear that the major issue in digital publishing is not so much connected to the questions how publications look digitally but what we may perceive as a publication in a digital environment.
    [Show full text]
  • What Is Meaning? : Fundamentals of Formal Semantics Ebook, Epub
    WHAT IS MEANING? : FUNDAMENTALS OF FORMAL SEMANTICS PDF, EPUB, EBOOK Paul H. Portner | 246 pages | 22 Apr 2009 | John Wiley and Sons Ltd | 9781405109185 | English | Chicester, United Kingdom What is Meaning? : Fundamentals of Formal Semantics PDF Book Portner covers all kinds of topics that aren't very well known in philosophy, and does so in such a way that they sound interesting, especially the topics towards the end of the book on varieties of pragmatic theories including dynamic semantics, which is something I'd like to find out more about. I wrote this when I was first wading into the subject and noticed that a lot of the stuff I was learning was not easy to find online in a concise beginner-level essay. This function is usually inductively defined. New books! Share this: Twitter Facebook. W3C has developed open specifications for Semantic Technology developers to follow and has identified, via open source development, the infrastructure parts that will be needed to scale in the Web and be applicable elsewhere. Other editions. Desalegn rated it really liked it Jun 07, Rating details. The work is presented extremely clearly and attractively, with very full notes at the end of each chapter, and it cannot but be, overall, an extremely useful and far-reaching presentation. Oxford: Blackwell Publishing, Emelie Vandenberg rated it really liked it Dec 11, Inference rules are functions that take in some set of strings and produce new ones. Participants will be introduced to the main theories that have influenced work in Formal Semantics, particularly Montague Grammar and the logical work of Frege, as well as the basics of Model Theory.
    [Show full text]
  • Scientific Publishing
    LOGOS Peer Review: Scientifi c Publishing: Disruption and Semantic Build-Up Frank Hellwig Abstract A new technology paired with a viable business model will have disruptive impact on incumbent companies in a specifi c market, if they do not re- evaluate and update their business models accord- ingly. As the Internet matures, Semantic Web technologies enable applications for meaning- based and dynamic fi ltering and processing of information, which has a disruptive impact on sci- Frank Hellwig is studying Publishing at Oxford entifi c publishing. Brookes University (MA) and Leipzig University of This article calls for publishers to adopt seman- Applied Sciences with a focus on scientifi c publish- tic technologies and emphasises the “need to in- ing. He has worked in digital publishing, for a journal clude a semantic strategy in their business models” subscription agency, in book retail, event manage- (Hawkins 2009). With a focus on journals as the ment and development aid in Germany, UK, France ‘cash cow’ of scientifi c publishers, it assembles de- and Bangladesh. bates about disruption and general tendencies in scientifi c publishing. An introduction to Semantic E-mail: [email protected] Web, Text Mining and Semantic Publishing is given a s well a s va riou s examples of product development s, company partnerships and acquisitions related to semantic technologies. Finally, different ways of acquiring semantic annotation data and fi nancial aspects of semantic enhancements are discussed. Keywords: scientifi c publishing, science, semantic web, linked data, natural language processing, text mining, business, disruption Disruptive Innovation According to Clayton Christensen (2008), dis- ruptive innovation makes a product simpler and more affordable.
    [Show full text]
  • Proceedings of Balisage: the Markup Conference 2012
    Published in: Proceedings of Balisage: The Markup Conference 2012. at: http://www.balisage.net/Proceedings/vol6/print/Witt01/BalisageVol6- Witt01.html (downloaded on: 15.02.2017/14:19). Balisage: The Markup Conference 2012 Proceedings A standards-related web-based The Markup Conference M information system Maik Stührenberg Institut für Deutsche Sprache (IDS) Mannheim Universität Bielefeld <maik.Stuehrenberg0uni-bielefeld.de> Oliver Schonefeld Institut für Deutsche Sprache (IDS) Mannheim <[email protected]> Andreas Witt Institut für Deutsche Sprache (IDS) Mannheim <witt0 ids-mannheim.de> Bcilisage: The Markup Conference 2012 August 7 - 10, 2012 Copyright © 2012 by the authors. Used with permission. Howto cite this paper Stührenberg, Maik, Oliver Schonefeld and Andreas Witt. “A standards-related web-based information system.” Presented at Balisage: The Markup Conference 2012, Montreal, Canada, August 7-10, 2012. In Proceedings ofBalisage: The Markup Conference 2012. Balisage Series on Markup Technologies, vol. 8 (2012). DOI: io.4242/Baiisagevoi8.stuhrenbergoi. Abstract This late breaking proposal introduces the prototype of a web-based information system dealing with standards in the field of annotation. Table of Contents Introduction The problem with standards Providing guidance Information structure Representation Current state and future work Introduction This late breaking proposal is based on an ongoing effort started in the CLARIN project and which was presented briefly at the LREC 2012 Workshop on Collaborative Resource Development and Delivery. Initial point was the development of an easy to use information system for the description of standards developed in ISO/IEC TC37/SC4 Language Resources Management. Since these standards are heavily related to each other it is usually not feasible to adopt only a single standard for one's work but to dive into the standards jungle in full.
    [Show full text]
  • Websig: a Digital Signature Framework for the Web
    WEBSIG: A DIGITAL SIGNATURE FRAMEWORK FOR THE WEB By James P. McCusker A Dissertation Submitted to the Graduate Faculty of Rensselaer Polytechnic Institute in Partial Fulfillment of the Requirements for the Degree of DOCTOR OF PHILOSOPHY Major Subject: COMPUTER SCIENCE Examining Committee: Deborah L. McGuinness, Dissertation Adviser James Hendler, Member Peter Fox, Member Michel Dumontier, Member Rensselaer Polytechnic Institute Troy, New York July 2015 (For Graduation August 2015) c Copyright 2015 by James P. McCusker All Rights Reserved ii CONTENTS LIST OF TABLES . vi LIST OF FIGURES . vii ACKNOWLEDGMENT . viii ABSTRACT . .x 1. Introduction . .1 1.1 Background . .4 1.2 Legal Framework . .7 1.3 Use Case . .8 1.4 Introducing WebSig . .8 1.5 Organization . 10 2. Repudiation, Trust, and Computability in Web Documents . 11 2.1 Trusting Computable Documents on the Web . 14 2.1.1 Sufficient Qualities for Verifiable Computable Signature Schemes 16 2.2 Repudiation of a Signature . 18 2.2.1 Signatory Identification . 18 2.2.2 Intention to Sign . 19 2.2.3 Adoption of Document . 19 2.3 Minimizing Repudiation . 20 2.3.1 Signatory Identification . 21 2.3.2 Intention to Sign . 21 2.3.3 Adoption of Document . 22 3. Related Digital Signature Schemes . 23 3.1 Non-Cryptographic Signatures . 24 3.2 Basic Digital Signatures . 25 3.3 XML Digital Signatures . 26 3.4 RDF Digital Signatures . 27 3.5 Conclusions . 28 iii 4. Parallel Identities for Managing Open Government Data . 29 4.1 Introduction . 29 4.1.1 Use case: Trusting Integrated Data . 32 4.2 Related Work .
    [Show full text]
  • Desirable Features and Current Challenges for a Cloud-Based Academic Ebook Infrastructure
    eBooks in the Cloud: Desirable Features and Current Challenges for a Cloud-based Academic eBook Infrastructure Wei Shen GESIS – Leibniz Institute for the Social Sciences, Lennéstr. 30, 53113 Bonn, Germany. Email: [email protected]. Ute Koch GESIS – Leibniz Institute for the Social Sciences, Schiffbauerdamm 19, 10117 Berlin, Germany. Email: [email protected]. Abstract: With the rapid development of the mobile technology, the use of multifarious mobile devices, such as tablet PCs, eReader devices and mobile phones for electronic reading, has become an important part of everyday life. In December 2010, Google launched its Google eBookstore with more than 3 million e-books. The most significant feature of the new Google application is that it stores the book content in the cloud, which enables the reader to access the book content at any time on any mobile device in a seamless manner. A cloud-based eBook infrastructure has the characteristics of providing a vast amount of publication information, storing and processing the data in the cloud and displaying the results on demand on diverse desktops or mobile services. Taking into account the characteristics of a cloud-based eBook infrastructure, the paper examines the current eBook distribution lifecycle consisting of eBook publishing, cloud computing and mobile reading technologies. Potential features necessary for a cloud-based academic eBook infrastructure which will better support publishing, searching and reading on the web, as well as promote communication in the scientific community, are outlined. Keywords: Cloud-based eBook infrastructure; semantic publishing; mobile devices; mobile reading; electronic books; collaboration; information search. Introduction Neither mobile technologies nor electronic book publishing is a newly emerging field, but with the success of Amazon’s Kindle and the rapid growth of Amazon eBook store, they have received increasing attention in recent years and are becoming the craze fields of innovations.
    [Show full text]
  • Towards the Unification of Formats for Overlapping Markup
    Towards the unification of formats for overlapping markup Paolo Marinelli∗ Fabio Vitali∗ Stefano Zacchiroli∗ [email protected] [email protected] [email protected] Abstract Overlapping markup refers to the issue of how to represent data structures more expressive than trees|for example direct acyclic graphs|using markup (meta-)languages which have been designed with trees in mind|for example XML. In this paper we observe that the state of the art in overlapping markup is far from being the widespread and consistent stack of standards and technologies readily available for XML and develop a roadmap for closing the gap. In particular we present in the paper the design and implementation of what we believe to be the first needed step, namely: a syntactic conversion framework among the plethora of overlapping markup serialization formats. The algorithms needed to perform the various conversions are presented in pseudo-code, they are meant to be used as blueprints for re- searchers and practitioners which need to write batch translation programs from one format to the other. 1 Introduction This paper is about markup, one of the key technological ingredient of hypertext. The particular aspects of markup we are concerned with are the limits of its expressivity. XML-based markup requires that the identified features of a document are organized hierarchically as a single tree, whereby each fragment of the content of the document is contained in one and only one XML element, each of which is contained within one and only one parent element all the way up to the single root element at the top.
    [Show full text]
  • Authoring and Publishing of Units and Quantities in Semantic Documents
    Authoring and Publishing Units and Quantities in Semantic Documents Mihai Cîrlănaru, Deyan Ginev, Christoph Lange Computer Science, Jacobs University Bremen, Germany {m.cirlanaru,d.ginev,ch.lange}@jacobs-university.de Abstract. This paper shows how an explicit representation of units and quantities can improve the experience of semantically published documents, and provides a first authoring method in this respect. To exemplify the potential and practical advantages of encoding explicit semantics regarding units w.r.t. user experience, we demonstrate a unit system preference service, which enables the user to choose the system of units for the displayed paper. By semantically publishing units, we obtain a basis for a wide range of applications and services such as unknown unit lookup, unit and quantity semantic search and unit and quantity manipulation. Enabling semantic publishing for units is also presented in the context of a large collection of legacy scientific documents (the arXMLiv corpus), where our approach allows to non-invasively enrich legacy publications. 1 Motivation Units and quantities, although widely spread, lack a formal standard represen- tation for semantic publishing. A multitude of problems [Usm] arise from the different flavors (country specific unit standards) and formats (abbreviations, special cases of occurrence) of units, making it hard for the untrained reader to fully understand the information provided. Semantic publishing solves most such problems by disambiguating the unit and quantity occurrences, which, further on, will enable a wide range of applications and services to interact with them. A unit is any determinate quantity, dimension, or magnitude adopted as a basis or standard of measurement for other quantities of the same kind and in terms of which their magnitude is calculated or expressed [Oxf], but from the top-most level of perception, it simply provides information on a wide range of quantifiable aspects.
    [Show full text]
  • The MLCD Overlap Corpus (MOC)
    Digital Humanities 2010 The MLCD Overlap Corpus semantic openness of SGML and XML to supply non-hierarchical interpretation of what (MOC) are often thought to be inescapably hierarchical notations. Huitfeldt, Claus The SGML/XML-based approaches to overlap [email protected] fall, roughly, into three groups: milestones, Department of Philosophy, University of fragmentation-and-reconstitution, and stand- Bergen off annotation. Milestones (described as early as Sperberg-McQueen, C. M. (Barnard et al. 1988), and used in (Sperberg- [email protected] McQueen and Burnard 1990) and later versions Black Mesa Technologies LLC, USA of the TEI Guidelines) use empty elements to Marcoux, Yves mark the boundaries of regions which cannot be marked simply as elements because they [email protected] overlap the boundaries of other elements. More Université de Montréal, Canada recently, approaches to milestone markup have been generalized in the Trojan Horse and CLIX markup idioms (DeRose 2004). For some time, theorists and practitioners Fragmentation is the technique of dividing of descriptive markup have been aware that a logical unit which overlaps other units the strict hierarchical organization of elements into several smaller units, which do not; the provided by SGML and XML represents a consuming application can then re-aggregate potentially problematic abstraction. The nesting the fragments. structures of SGML and XML capture an important property of real texts and represent Stand-off annotation addresses the overlap a successful combination of expressive power problem by removing the markup from the main and tractability. But not all textual phenomena data stream of the document, at the same time appear in properly nested form, and for more adding pointers back into the base data.
    [Show full text]
  • A Semantic Model for Scholarly Electronic Publishing
    A semantic model for scholarly electronic publishing Carlos H. Marcondes University Federal Fluminense, Department of Information Science, R. Lara Vilela, 126, 24210-590, Niterói, Rio de Janeiro, Brazil, [email protected] Abstract. Despite numerous advancements in information technology, electronic publishing is still based on the print text model. The natural language textual format prevents programs from semantically processing article content. A semantic model for scholarly electronic publishing is proposed, in which the article conclusion is specified by the author and recorded in a machine- understandable format, enabling semantic retrieval and identification of traces of scientific discoveries and knowledge misunderstandings. 89 biomedical articles were analyzed for this purpose. A prototype system that partially implements the proposed model was developed. Four patterns of reasoning and sequencing of semantic elements were identified in the analyzed articles. A content model comprising semantic elements and their sequences in articles is proposed. The development and testing of a prototype of a Web submission interface to an electronic journal system that implements the proposed model are reported. Key words: electronic publishing, scientific methodology, scientific communication, knowledge representation, ontologies, semantic content processing, e-Science 1 Introduction Before the advent of the World Wide Web (hereafter referred to as “Web”), man’s body of scientific knowledge was fuzzy and distributed across publications in libraries worldwide. The Web is fast becoming a universal platform for the disposal, exchange, and access of knowledge records. An increasing amount of records of human culture—from text, static and motion images, and sound, to multimedia—are now being created directly in a digital format.
    [Show full text]
  • Linked Research on the Decentralised Web
    Linked Research on the Decentralised Web Dissertation zur Erlangung des Doktorgrades (Dr. rer. nat.) der Mathematisch-Naturwissenschaftlichen Fakultät der Rheinischen Friedrich-Wilhelms-Universität Bonn vorgelegt von Sarven Capadisli aus Istanbul, Turkey Bonn, 2019-07-29 Angefertigt mit Genehmigung der Mathematisch-Naturwissenschaftlichen Fakultät der Rheinischen Friedrich-Wilhelms-Universität Bonn 1. Gutachter: Prof. Dr. Sören Auer 2. Gutachter: Dr. Herbert Van de Sompel Tag der Promotion 2020-03-03 Erscheinungsjahr 2020 Abstract This thesis is about research communication in the context of the Web. I analyse literature which reveals how researchers are making use of Web technologies for knowledge dissemination, as well as how individuals are disempowered by the centralisation of certain systems, such as academic publishing platforms and social media. I share my findings on the feasibility of a decentralised and interoperable information space where researchers can control their identifiers whilst fulfilling the core functions of scientific communication: registration, awareness, certification, and archiving. The contemporary research communication paradigm operates under a diverse set of sociotechnical constraints, which influence how units of research information and personal data are created and exchanged. Economic forces and non-interoperable system designs mean that researcher identifiers and research contributions are largely shaped and controlled by third-party entities; participation requires the use of proprietary systems. From a technical standpoint, this thesis takes a deep look at semantic structure of research artifacts, and how they can be stored, linked and shared in a way that is controlled by individual researchers, or delegated to trusted parties. Further, I find that the ecosystem was lacking a technical Web standard able to fulfill the awareness function of research communication.
    [Show full text]