XML Transformations, Views and Updates Based on Xquery Fragments

Total Page:16

File Type:pdf, Size:1020Kb

XML Transformations, Views and Updates Based on Xquery Fragments Faculteit Wetenschappen Informatica XML Transformations, Views and Updates based on XQuery Fragments Proefschrift voorgelegd tot het behalen van de graad van doctor in de wetenschappen aan de Universiteit Antwerpen, te verdedigen door Roel VERCAMMEN Promotor: Prof. Dr. Jan Paredaens Antwerpen, 2008 Co-promotor: Dr. Ir. Jan Hidders XML Transformations, Views and Updates based on XQuery Fragments Roel Vercammen Universiteit Antwerpen, 2008 http://www.universiteitantwerpen.be Permission to make digital or hard copies of portions of this work for personal or classroom use is granted, provided that the copies are not made or distributed for profit or commercial advantage and that copies bear this notice. Copyrights for components of this work owned by others than the author must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission of the author. Research funded by a Ph.D. grant of the Institute for the Promotion of Innovation through Science and Technology in Flan- ders (IWT-Vlaanderen). { Onderzoek gefinancierd met een specialisatiebeurs van het Instituut voor de Aanmoediging van Innovatie door Wetenschap en Technologie in Vlaanderen (IWT-Vlaanderen). Grant number / Beurs nummer: 33581. http://www.iwt.be Typesetting by LATEX Acknowledgements This thesis is the result of the contributions of many friends and colleagues to whom I would like to say \thank you". First and foremost, I want to thank my advisor Jan Paredaens, who gave me the opportunity to become a researcher and teached me how good research should be performed. I had the honor to write several papers in collaboration with him and will always remember the discussions and his interesting views on research, politics and gastronomy. I am greatly indebted to my co-advisor, Jan Hidders, who helped me assessing what good research questions are and whose expertise helped me a lot. Moreover, I will also remember him for non-scientific reasons, such as the badminton games and his fascination and broad knowledge of history. Philippe Michiels deserves a special thank you for the many vivid discussions we had and the interesting research questions on XQuery optimization he came up with and that we tried to tackle together. Even though the research we did together is not reflected in this thesis, it broadened my view on the field of database research and he brought me in touch with many great researchers, such as Mary Fern´andez,J´er^omeSim´eonand Maurice van Keulen. I am also very thankful to the rest of the ADReM team for giving me a nice stimulating environment the past four years. I want to acknowledge Toon Calders, Serge Demeyer, Bart Goethals and Maarten Marx, members of my PhD Jury, for giving feed-back on this thesis, which helped to improve this thesis. I also want to thank Stefania Marrara, who was in Antwerp for six months and with whom I had the honor to collaborate on parts of the research that resulted in Chapter 3. My family deserves my gratitude as well, for the continuous support and encouragement they gave me and the nice and fun environment which I am privileged to call home. Last but not least, I want to say thank you to the Institute for the Promotion of Innovation by Science and Technology in Flanders (IWT) for making it possible for me to do the research that resulted in this thesis. Not only their financial support, but also the frequent reporting and the interesting project interviews have helped me a lot into the direction of this thesis. 3 4 Abstract During the past decade, XML has become ubiquitous when it comes to sharing data and publishing information on the Web. Because of the ever increasing amount of data available in XML format and the data model which differs from previous work in database research, this new standard has drawn the attention of the database community. Several query and updating languages have been proposed, which eventually led to the W3C recommendation for XQuery 1.0 and update facilities for this query language. We define a formal framework called LiXQuery to study and investigate properties of this language and its constructs. More precisely, we compare the relative expressive power of some of the most important features of XQuery in terms of performing transformations. Data integration and sharing is an interesting problem which can benefit from XML research. XML views can be defined to transform data that several parties want to share to a common schema. It is desirable that such virtual XML trees behave as much as possible as ordinary XML documents. We introduce the concept of projection views based on XPath expressions and show how we can translate XPath queries on the view documents to XPath queries on the base document. Moreover, we introduce three simple XPath- based update operations, i.e., insertion, attribute update and deletion, and show that it is decidable whether propagating the primitive update operations to the base tree introduces view side-effects or not. 5 6 Contents 1 Introduction 13 1.1 XML . 13 1.2 Querying XML Documents . 15 1.3 Updating XML Documents . 18 1.4 Updating XML Views . 20 1.4.1 XML Views . 20 1.4.2 Views as Access Control Mechanism . 21 1.4.3 Updating Views . 23 1.4.4 Existing Approaches in RDBMS and OODBMS . 24 1.4.5 Existing Work on XML View Updates . 25 1.5 Contributions and Organization . 26 2 A Formal Model for XQuery 29 2.1 Syntax and Informal Semantics . 29 2.1.1 Non-Updating Expressions . 30 2.1.2 Updating Expressions . 33 2.2 Some Examples and Syntactic Sugar . 35 2.3 Formal Framework . 39 2.3.1 XML Store . 40 2.3.2 Evaluation Environment . 42 2.3.3 Auxiliary Notions . 42 2.3.4 List of Pending Updates . 44 2.3.5 Program Semantics . 48 2.4 Expression Semantics . 49 2.5 Conclusion . 56 7 8 CONTENTS 3 Expressing Transformations in XQuery 59 3.1 LiXQuery Fragments . 59 3.2 Expressiveness Relationships between Fragments . 61 3.3 Properties of the Fragments . 63 3.3.1 Reachable Substores from Input/Output . 63 3.3.2 Set Equivalence and Bag Equivalence . 64 3.3.3 Relationships between Input and Output Values and Length . 66 3.3.4 Depth of Transformations . 71 3.3.5 Termination of Programs . 72 3.4 Expressibility Results . 73 3.4.1 Expression Simulations . 73 3.4.2 Program Simulations . 77 3.5 Proving the Relationships between the Fragments . 87 3.6 Conclusion . 88 4 Views and Updates with XPath Transformations 91 4.1 Introduction . 91 4.2 Preliminaries . 93 4.2.1 Data Model . 94 4.2.2 View-Update Problem . 95 4.3 Queries, Updates and Views . 95 4.3.1 Queries . 96 4.3.2 Properties . 97 4.3.3 Updates . 98 4.3.4 Views . 99 4.3.5 Additional Notations . 100 4.4 Simple Propagation Update Strategy . 100 4.4.1 Adding an Escape to Path Expressions . 101 4.4.2 View Composition . 102 4.5 Deciding Well-Behavedness . 103 4.5.1 Configurations . 103 4.5.2 Configuration Trees . 107 4.5.3 Checking Configuration Trees . 109 4.5.4 Checking for Conflicts in Configuration Trees . 112 4.5.5 Complexity of Deciding Well-Behavedness . 112 4.6 Conclusion . 114 5 Discussion 117 5.1 LiXQuery as a Framework . 117 5.2 Studying the Expressive Power of LiXQuery . 118 5.2.1 Separating Queries and Updates . 118 5.2.2 Other Measures of Expressive Power . 119 5.3 Updating XML Views . 121 CONTENTS 9 5.3.1 Complexity for Deciding in a Positive Fragment of P ........ 121 5.3.2 Dealing with a Larger Class of Update Strategies . 123 5.4 Updating XML Views with LiXQuery . 125 A Dutch Summary 127 Publications by the Author 129 Bibliography 131 10 CONTENTS List of Figures 1.1 Example of an XML document for a virtual learning environment. 16 1.2 Example of an XML view for the document for Figure 1.1. 22 2.1 Syntax of LiXQuery . 31 2.2 Simulation of the LISP list ((b c) d) .................... 39 2.3 XML tree for the Example of Definition 2.1 . 41 2.4 Semantics of the Primitive Update Operations. 45 3.1 Equivalence classes of XQuery fragments . 62 4.1 General Setting for the XML View-Update Problem . 95 4.2 Example Document Trees for illustrating update operations . 98 4.3 Rewriting P} to P expressions. 102 4.4 Node-creation independent and attribute-minimal configuration. 104 4.5 A Configuration Tree for the Configuration of Figure 4.4 . 108 5.1 XML Document containing Members of an Organization . 124 11 12 LIST OF FIGURES CHAPTER 1 Introduction haring data has been one of the main reasons why the Web has become so popular S the past decade. The HyperText Markup Language (HTML) and graphical browsers have been killer applications that caused a tremendous increase of Internet connections during the nineties. One of the major challenges posed by this evolution is managing all this available information. In this thesis we investigate one of the many facets of integrating and managing the loosely structured data on the Web. This introduction highlights the topics that are studied in this thesis. First, we briefly discuss in Section 1.1 the use of the eXtensible Markup Language (XML) to represent data on the Web. Then, we talk about querying and updating XML documents in Section 1.2 and Section 1.3. In Section 1.4 we introduce the notion of XML Views and point out some of the problems that this raises with respect to performing updates.
Recommended publications
  • Bibliography of Erik Wilde
    dretbiblio dretbiblio Erik Wilde's Bibliography References [1] AFIPS Fall Joint Computer Conference, San Francisco, California, December 1968. [2] Seventeenth IEEE Conference on Computer Communication Networks, Washington, D.C., 1978. [3] ACM SIGACT-SIGMOD Symposium on Principles of Database Systems, Los Angeles, Cal- ifornia, March 1982. ACM Press. [4] First Conference on Computer-Supported Cooperative Work, 1986. [5] 1987 ACM Conference on Hypertext, Chapel Hill, North Carolina, November 1987. ACM Press. [6] 18th IEEE International Symposium on Fault-Tolerant Computing, Tokyo, Japan, 1988. IEEE Computer Society Press. [7] Conference on Computer-Supported Cooperative Work, Portland, Oregon, 1988. ACM Press. [8] Conference on Office Information Systems, Palo Alto, California, March 1988. [9] 1989 ACM Conference on Hypertext, Pittsburgh, Pennsylvania, November 1989. ACM Press. [10] UNIX | The Legend Evolves. Summer 1990 UKUUG Conference, Buntingford, UK, 1990. UKUUG. [11] Fourth ACM Symposium on User Interface Software and Technology, Hilton Head, South Carolina, November 1991. [12] GLOBECOM'91 Conference, Phoenix, Arizona, 1991. IEEE Computer Society Press. [13] IEEE INFOCOM '91 Conference on Computer Communications, Bal Harbour, Florida, 1991. IEEE Computer Society Press. [14] IEEE International Conference on Communications, Denver, Colorado, June 1991. [15] International Workshop on CSCW, Berlin, Germany, April 1991. [16] Third ACM Conference on Hypertext, San Antonio, Texas, December 1991. ACM Press. [17] 11th Symposium on Reliable Distributed Systems, Houston, Texas, 1992. IEEE Computer Society Press. [18] 3rd Joint European Networking Conference, Innsbruck, Austria, May 1992. [19] Fourth ACM Conference on Hypertext, Milano, Italy, November 1992. ACM Press. [20] GLOBECOM'92 Conference, Orlando, Florida, December 1992. IEEE Computer Society Press. http://github.com/dret/biblio (August 29, 2018) 1 dretbiblio [21] IEEE INFOCOM '92 Conference on Computer Communications, Florence, Italy, 1992.
    [Show full text]
  • Refactoring XSLT
    XSLT and XQuery September 19, 2019 Refactoring XSLT Priscilla Walmsley, Datypic, Inc. Class Outline Introduction ......................................................................................................................................2 Cleaning Up......................................................................................................................................9 Improving Code Quality..................................................................................................................14 Other Improvements.......................................................................................................................21 Introduction 2 Obligatory Wikipedia Quote 3 Code refactoring is the process of restructuring existing computer code - changing the factoring - without changing its external behavior. Refactoring improves nonfunctional attributes of the software. Advantages include improved code readability and reduced complexity; these can improve source code maintainability and create a more expressive internal architecture or object model to improve extensibility. Typically, refactoring applies a series of standardised basic micro-refactorings, each of which is (usually) a tiny change in a computer program's source code that either preserves the behaviour of the software, or at least does not modify its conformance to functional requirements. Many development environments provide automated support for performing the mechanical aspects of these basic refactorings. If done extremely well, code
    [Show full text]
  • Supporting SPARQL Update Queries in RDF-XML Integration *
    Supporting SPARQL Update Queries in RDF-XML Integration * Nikos Bikakis1 † Chrisa Tsinaraki2 Ioannis Stavrakantonakis3 4 Stavros Christodoulakis 1 NTU Athens & R.C. ATHENA, Greece 2 EU Joint Research Center, Italy 3 STI, University of Innsbruck, Austria 4 Technical University of Crete, Greece Abstract. The Web of Data encourages organizations and companies to publish their data according to the Linked Data practices and offer SPARQL endpoints. On the other hand, the dominant standard for information exchange is XML. The SPARQL2XQuery Framework focuses on the automatic translation of SPARQL queries in XQuery expressions in order to access XML data across the Web. In this paper, we outline our ongoing work on supporting update queries in the RDF–XML integration scenario. Keywords: SPARQL2XQuery, SPARQL to XQuery, XML Schema to OWL, SPARQL update, XQuery Update, SPARQL 1.1. 1 Introduction The SPARQL2XQuery Framework, that we have previously developed [6], aims to bridge the heterogeneity issues that arise in the consumption of XML-based sources within Semantic Web. In our working scenario, mappings between RDF/S–OWL and XML sources are automatically derived or manually specified. Using these mappings, the SPARQL queries are translated on the fly into XQuery expressions, which access the XML data. Therefore, the current version of SPARQL2XQuery provides read-only access to XML data. In this paper, we outline our ongoing work on extending the SPARQL2XQuery Framework towards supporting SPARQL update queries. Both SPARQL and XQuery have recently standardized their update operation seman- tics in the SPARQL 1.1 and XQuery Update Facility, respectively. We have studied the correspondences between the update operations of these query languages, and we de- scribe the extension of our mapping model and the SPARQL-to-XQuery translation algorithm towards supporting SPARQL update queries.
    [Show full text]
  • Access Control Models for XML
    Access Control Models for XML Abdessamad Imine Lorraine University & INRIA-LORIA Grand-Est Nancy, France [email protected] Outline • Overview on XML • Why XML Security? • Querying Views-based XML Data • Updating Views-based XML Data 2 Outline • Overview on XML • Why XML Security? • Querying Views-based XML Data • Updating Views-based XML Data 3 What is XML? • eXtensible Markup Language [W3C 1998] <files> "<record>! ""<name>Robert</name>! ""<diagnosis>Pneumonia</diagnosis>! "</record>! "<record>! ""<name>Franck</name>! ""<diagnosis>Ulcer</diagnosis>! "</record>! </files>" 4 What is XML? • eXtensible Markup Language [W3C 1998] <files>! <record>! /files" <name>Robert</name>! <diagnosis>! /record" /record" Pneumonia! </diagnosis> ! </record>! /name" /diagnosis" <record …>! …! </record>! Robert" Pneumonia" </files>! 5 XML for Documents • SGML • HTML - hypertext markup language • TEI - Text markup, language technology • DocBook - documents -> html, pdf, ... • SMIL - Multimedia • SVG - Vector graphics • MathML - Mathematical formulas 6 XML for Semi-Structered Data • MusicXML • NewsML • iTunes • DBLP http://dblp.uni-trier.de • CIA World Factbook • IMDB http://www.imdb.com/ • XBEL - bookmark files (in your browser) • KML - geographical annotation (Google Maps) • XACML - XML Access Control Markup Language 7 XML as Description Language • Java servlet config (web.xml) • Apache Tomcat, Google App Engine, ... • Web Services - WSDL, SOAP, XML-RPC • XUL - XML User Interface Language (Mozilla/Firefox) • BPEL - Business process execution language
    [Show full text]
  • Web and Semantic Web Query Languages: a Survey
    Web and Semantic Web Query Languages: A Survey James Bailey1, Fran¸coisBry2, Tim Furche2, and Sebastian Schaffert2 1 NICTA Victoria Laboratory Department of Computer Science and Software Engineering The University of Melbourne, Victoria 3010, Australia http://www.cs.mu.oz.au/~jbailey/ 2 Institute for Informatics,University of Munich, Oettingenstraße 67, 80538 M¨unchen, Germany http://pms.ifi.lmu.de/ Abstract. A number of techniques have been developed to facilitate powerful data retrieval on the Web and Semantic Web. Three categories of Web query languages can be distinguished, according to the format of the data they can retrieve: XML, RDF and Topic Maps. This ar- ticle introduces the spectrum of languages falling into these categories and summarises their salient aspects. The languages are introduced us- ing common sample data and query types. Key aspects of the query languages considered are stressed in a conclusion. 1 Introduction The Semantic Web Vision A major endeavour in current Web research is the so-called Semantic Web, a term coined by W3C founder Tim Berners-Lee in a Scientific American article describing the future of the Web [37]. The Semantic Web aims at enriching Web data (that is usually represented in (X)HTML or other XML formats) by meta-data and (meta-)data processing specifying the “meaning” of such data and allowing Web based systems to take advantage of “intelligent” reasoning capabilities. To quote Berners-Lee et al. [37]: “The Semantic Web will bring structure to the meaningful content of Web pages, creating an environment where software agents roaming from page to page can readily carry out sophisticated tasks for users.” The Semantic Web meta-data added to today’s Web can be seen as advanced semantic indices, making the Web into something rather like an encyclopedia.
    [Show full text]
  • Towards Quality Analysis for Document Oriented Bases Paola Gómez, Claudia Roncancio, Rubby Casallas
    Towards quality analysis for document oriented bases Paola Gómez, Claudia Roncancio, Rubby Casallas To cite this version: Paola Gómez, Claudia Roncancio, Rubby Casallas. Towards quality analysis for document oriented bases. International Conference on Conceptual Modeling (ER), Oct 2018, Xi’an, China. hal-01908112 HAL Id: hal-01908112 https://hal.archives-ouvertes.fr/hal-01908112 Submitted on 29 Oct 2018 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Towards quality analysis for document oriented bases Paola Gomez´ 1, Claudia Roncancio1, and Rubby Casallas2 1 Univ. Grenoble Alpes, CNRS, Grenoble INP* paola.gomez-barreto,[email protected] 2 TICSw, Universidad de los Andes,Bogota´ - Colombia, [email protected] Abstract. Document-oriented bases allow high flexibility in data representation which facilitates a rapid development of applications and enables many possi- bilities for data structuring. Nevertheless, the structural choices remain crucial because of their impact on several aspects of the document base and application quality, e.g, memory print, data redundancy, readability and maintainability. Our research is motivated by quality issues of document-oriented bases. We aim at facilitating the study of the possibilities of data structuring and providing objective metrics to better reveal the advantages and disadvantages of each solution with respect to user needs.
    [Show full text]
  • Declarative Access to Filesystem Data New Application Domains for XML Database Management Systems
    Declarative Access to Filesystem Data New application domains for XML database management systems Alexander Holupirek Dissertation zur Erlangung des akademischen Grades Doktor der Naturwissenschaften (Dr. rer. nat.) Fachbereich Informatik und Informationswissenschaft Mathematisch-Naturwissenschaftliche Sektion Universität Konstanz Referenten: Prof. Dr. Marc H. Scholl Prof. Dr. Marcel Waldvogel Tag der mündlichen Prüfung: 17. Juli 2012 Abstract XML and state-of-the-art XML database management systems (XML-DBMSs) can play a leading role in far more application domains as it is currently the case. Even in their basic configuration, they entail all components necessary to act as central systems for complex search and retrieval tasks. They provide language-specific index- ing of full-text documents and can store structured, semi-structured and binary data. Besides, they offer a great variety of standardized languages (XQuery, XSLT, XQuery Full Text, etc.) to develop applications inside a pure XML technology stack. Benefits are obvious: Data, logic, and presentation tiers can operate on a single data model, and no conversions have to be applied when switching in between. This thesis deals with the design and development of XML/XQuery driven informa- tion architectures that process formerly heterogeneous data sources in a standardized and uniform manner. Filesystems and their vast amounts of different file types are a prime example for such a heterogeneous dataspace. A new XML dialect, the Filesystem Markup Language (FSML), is introduced to construct a database view of the filesystem and its contents. FSML provides a uniform view on the filesystem’s contents and allows developers to leverage the complete XML technology stack on filesystem data.
    [Show full text]
  • XSL-FO by Dave Pawson Publisher
    XSL-FO By Dave Pawson Publisher : O'Reilly Pub Date : August 2002 ISBN : 0-596-00355-2 Pages : 282 Table of • Contents • Index • Reviews Reader • Reviews Extensible Style Language-Formatting Objects, or XSL-FO, is a set of tools developers and web designers use to describe page printouts of their XML (including XHTML) documents. XSL-FO teaches you how to think about the formatting of your documents and guides you through the questions you'll need to ask to ensure that your printed documents meet the same high standards as your computer-generated content. 777 Copyright Preface Who Should Read This Book? What Does This Book Cover? Motivation Organization of This Book What Else Do You Need? Conventions Used in This Book How to Contact Us Acknowledgments Chapter 1. Planning for XSL-FO Section 1.1. XML and Document Processing Section 1.2. Choosing Your Print Production Approach Section 1.3. Choosing Tools Section 1.4. The Future for XSL-FO Chapter 2. A First Look at XSL-FO Section 2.1. An XSL-FO Overview Section 2.2. Related Stylesheet Specifications Section 2.3. Using XSL-FO as Part of XSL Section 2.4. Shorthand, Short Form, and Inheritance Chapter 3. Pagination Section 3.1. Document Classes Section 3.2. The Main Parts of an XSL-FO Document Section 3.3. Simple Page Master Section 3.4. Complex Pagination Section 3.5. Page Sequences Chapter 4. Areas Section 4.1. Informal Definition of an Area Section 4.2. Area Types Section 4.3. Components of an Area Section 4.4.
    [Show full text]
  • How Can the DDI Community Take Advantage of the Native XML Database Exist-Db EDDI 2011, Gothenburg
    How can the DDI community take advantage of the native XML database eXist-db EDDI 2011, Gothenburg Olof Olsson, SND Leif-Jöran Olsson, Språkbanken Johan Fihn, SND New features in eXist since IASSIST ● 1.4.1 ○ Better Referential Integrity, revised Database Shutdown and Crash Protection, new WebDAV implementation based on the Milton WebDAV library. The Lucene Full-Text Indexing has been made more robust and performant. It now also includes configurable analyzers and parameters, better match highlighting and additional functions for working with Lucene document fields. The betterFORM server-side XForms 1.1 engine is integrated and enabled by default. New features in eXist since IASSIST ● 1.5 ○ Lucene indexing of binary files, e.g. pdf ○ eXide editing, app development, package management (xar archives) and repository installation (hot deploy) of these packages ○ svn extension, working copy in db etc ○ new authentication and access restriction model, allowing openid, oauth, ldap etc ○ Some XQuery 3 functions e.g. switch, try/catch, hof Will be reelased as new stable release 1.6 in beginning of 2012. OAI-PMH ● Provider ○ configuration per datatype available e.g. ddi, cmdi, meta-share etc ● Harvester Next steps ● Package as apps DDI Stylesheets News in the DDI 3.1 stylesheet ○ Translatable labels (re-use from icpsr stylesheet) ○ Support for more elements To do: ● Better navigation (javascript) ● More parameters for different types of views ● Separation of the stylesheet More info and download: https://code.google.com/p/ddixslt Call for stylesheets
    [Show full text]
  • Standardised Xquery 3.0 Annotations for REST
    RESTful XQuery Standardised XQuery 3.0 Annotations for REST Adam Retter Adam Retter Consulting <[email protected]> January, 2012 Abstract Whilst XQuery was originally envisaged and designed as a query language for XML, it has been adopted by many as a language for application development This, in turn, has encouraged additional and diverse extensions, many of which could not easily have been foreseen. This paper examines how XQuery has been used for Web Application development, current implementation approaches for executing XQuery in a Web context, and subsequently presents a proposal for a standard approach to RESTful XQuery through the use of XQuery 3.0 Annotations. Keywords: XQuery 3.0, Annotations, REST, HTTP, Standard 1. Introduction 1.1 Background XML Query Language (XQuery) was originally born from several competing query languages for XML[1]. All of these languages had in common the noble yet limited goal of querying XML. They focused on XML as a read-only store for data. In addition, whilst several of these predecessors recognised the Web as a critical factor, like their successor XQuery, none of them attempted to implement constructs in the language that supported use as a (Web) server-side processing language. With the adoption and use of XQuery, because of its functional nature and module system which permit the organisation of code units, people attempted to write complex processing applications in XQuery. As the limits of what was achievable in XQuery were tested, real world scenarios emerged which called for additional XQuery facilities, resulting in extension standards: XPath and XQuery Update[2] and XQuery Full-Text[3].
    [Show full text]
  • Xpath, Xquery and XSLT
    XPath, XQuery and XSLT Timo Lilja February 25, 2010 Timo Lilja () XPath, XQuery and XSLT February 25, 2010 1 / 24 Introduction When processing, XML documents are organized into a tree structure in the memory All XML Query and transformation languages operate on this tree XML Documents consists of tags and attributes which form logical constructs called nodes The basic operation unit of these query/processing languages is the node though access to substructures is provided Timo Lilja () XPath, XQuery and XSLT February 25, 2010 2 / 24 XPath XPath XPath 2.0 [4] is a W3C Recomendation released on January 2007 XPath provides a way to query the nodes and their attributes, tags and values XPath type both dynamic and static typing if the XML Scheme denition is present, it is used for static type checking before executing a query otherwise nodes are marked for untyped and they are coerced dynamically during run-time to suitable types Timo Lilja () XPath, XQuery and XSLT February 25, 2010 3 / 24 XPath Timo Lilja () XPath, XQuery and XSLT February 25, 2010 4 / 24 XPath Path expressions Path expressions provide a way to choose all matching branches of the tree representation of the XML Document Path expressions consist of one or more steps separated by the path operater/. A step can be an axis which is a way to reference a node in the path. a node-test which corresponds to a node in the actual XML document zero or more predicates Syntax for the path expressions: /step/step Syntaxc for a step: axisname::nodestest[predicate] Timo Lilja () XPath, XQuery and XSLT February 25, 2010 5 / 24 XPath Let's assume that we use the following XML Docuemnt <bookstore> <book category="COOKING"> <title lang="it">Everyday Italian</title> <author>Giada De Laurentiis</author> <year>2005</year> <price>30.00</price> </book> <book category="CHILDREN"> <title lang="en">Harry Potter</title> <author>J K.
    [Show full text]
  • Database Architecture Evolution: Mammals Flourished Long Before Dinosaurs Became Extinct
    Database Architecture Evolution: Mammals Flourished long before Dinosaurs became Extinct Stefan Manegold Martin L. Kersten Peter Boncz CWI, Amsterdam, The Netherlands {manegold,mk,boncz}@cwi.nl ABSTRACT building block for realizing more complex structures (such as ta- The holy grail for database architecture research is to find a solution bles, but also objects and even trees or graphs). that is Scalable & Speedy, to run on anything from small ARM In this paper, we summarize the quest to renovate database ar- processors up to globally distributed compute clusters, Stable & chitecture with the goal of addressing data management needs with Secure, to service a broad user community, Small & Simple, to be a common core rather than bolting on new subsystems [38], and comprehensible to a small team of programmers, Self-managing, to pushing through the functionality plateau, especially using self- let it run out-of-the-box without hassle. adaptation, exploiting resource optimization opportunities as they In this paper, we provide a trip report on this quest, covering appear dynamically, rather than by static prediction. This quest both past experiences, ongoing research on hardware-conscious al- covers the major design space choices, experiences gained and on- gorithms, and novel ways towards self-management specifically fo- going research around the MonetDB system. cused on column store solutions. In all, this story demonstrates the viability of new self-adaptive life-forms that may be very successful yet are very different from the long existing species. In this, the MonetDB strain forms an 1. INTRODUCTION open platform available for all who want to explore further evolu- Forty years of relational database technology seemingly con- tion in the data management ecosystem.
    [Show full text]