Converting Into Pattern-Based Schemas: a Formal Approach
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Using Tree Automata and Regular Expressions to Manipulate
Using Tree Automata and Regular Expressions to Manipulate Hierarchically Structured Data Nikita Schmidt and Ahmed Patel University College Dublin∗ Abstract Information, stored or transmitted in digital form, is often structured. Individual data records are usually represented as hierarchies of their elements. Together, records form larger structures. Information processing applications have to take account of this structuring, which assigns different semantics to different data elements or records. Big variety of structural schemata in use today often requires much flexibility from applications—for example, to process information coming from different sources. To ensure application interoperability, translators are needed that can convert one structure into another. This paper puts forward a formal data model aimed at supporting hierarchical data pro- cessing in a simple and flexible way. The model is based on and extends results of two classical theories, studying finite string and tree automata. The concept of finite automata and regular languages is applied to the case of arbitrarily structured tree-like hierarchical data records, represented as “structured strings.” These automata are compared with classical string and tree automata; the model is shown to be a superset of the classical models. Regular grammars and expressions over structured strings are introduced. Regular expression matching and substitution has been widely used for efficient unstruc- tured text processing; the model described here brings the power of this proven technique to applications that deal with information trees. A simple generic alternative is offered to replace today’s specialised ad-hoc approaches. The model unifies structural and content transforma- tions, providing applications with a single data type. -
Custom XML Attribute
ISO/IEC JTC 1/SC 34/WG 4 N 0050 IS 29500:2008 Defect Report Log Rex Jaeschke (project editor) ([email protected]) 2009-05-27 DRs 08-0001 through 08-0015 DRs 09-0001 through 09-0230 IS 29500:2008 Defect Report Log Introduction ........................................................................................................................................................................ 1 Revision History ................................................................................................................................................................... 3 DR Status at a Glance .......................................................................................................................................................... 5 1. DR 08-0001 — DML, Framework: Removal of ST_PercentageDecimal from the strict schema .............................. 6 2. DR 08-0002 — Primer: Format of ST_PositivePercentage values in strict mode examples .................................... 9 3. DR 08-0003 — DML, Main: Format of ST_PositivePercentage values in strict mode examples ........................... 12 4. DR 08-0004 — DML, Diagrams: Type for prSet attributes ................................................................................... 14 5. DR 08-0005 — PML, Animation: Description of hsl attributes Lightness and Saturation ..................................... 20 6. DR 08-0006 — PML, Animation: Description of rgb attributes Blue, Green and Red ........................................... 22 7. DR 08-0007 — DML, Main: Format -
Taxonomy of XML Schema Languages Using Formal Language Theory
Taxonomy of XML Schema Languages using Formal Language Theory MAKOTO MURATA IBM Tokyo Research Lab DONGWON LEE Penn State University MURALI MANI Worcester Polytechnic Institute and KOHSUKE KAWAGUCHI Sun Microsystems On the basis of regular tree grammars, we present a formal framework for XML schema languages. This framework helps to describe, compare, and implement such schema languages in a rigorous manner. Our main results are as follows: (1) a simple framework to study three classes of tree languages (“local”, “single-type”, and “regular”); (2) classification and comparison of schema languages (DTD, W3C XML Schema, and RELAX NG) based on these classes; (3) efficient doc- ument validation algorithms for these classes; and (4) other grammatical concepts and advanced validation algorithms relevant to XML model (e.g., binarization, derivative-based validation). Categories and Subject Descriptors: H.2.1 [Database Management]: Logical Design—Schema and subschema; F.4.3 [Mathematical Logic and Formal Languages]: Formal Languages— Classes defined by grammars or automata General Terms: Algorithms, Languages, Theory Additional Key Words and Phrases: XML, schema, validation, tree automaton, interpretation 1. INTRODUCTION XML [Bray et al. 2000] is a meta language for creating markup languages. To represent an XML based language, we design a collection of names for elements and attributes that the language uses. These names (i.e., tag names) are then used by application programs dedicated to this type of information. For instance, XHTML [Altheim and McCarron (Eds) 2000] is such an XML-based language. In it, permissible element names include p, a, ul, and li, and permissible attribute names include href and style. -
Introducing Regular Expressions
Introducing Regular Expressions wnload from Wow! eBook <www.wowebook.com> o D Michael Fitzgerald Beijing • Cambridge • Farnham • Köln • Sebastopol • Tokyo Introducing Regular Expressions by Michael Fitzgerald Copyright © 2012 Michael Fitzgerald. All rights reserved. Printed in the United States of America. Published by O’Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472. O’Reilly books may be purchased for educational, business, or sales promotional use. Online editions are also available for most titles (http://my.safaribooksonline.com). For more information, contact our corporate/institutional sales department: 800-998-9938 or [email protected]. Editor: Simon St. Laurent Indexer: Lucie Haskins Production Editor: Holly Bauer Cover Designer: Karen Montgomery Proofreader: Julie Van Keuren Interior Designer: David Futato Illustrator: Rebecca Demarest July 2012: First Edition. Revision History for the First Edition: 2012-07-10 First release See http://oreilly.com/catalog/errata.csp?isbn=9781449392680 for release details. Nutshell Handbook, the Nutshell Handbook logo, and the O’Reilly logo are registered trademarks of O’Reilly Media, Inc. Introducing Regular Expressions, the image of a fruit bat, and related trade dress are trademarks of O’Reilly Media, Inc. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and O’Reilly Media, Inc., was aware of a trademark claim, the designations have been printed in caps or initial caps. While every precaution has been taken in the preparation of this book, the publisher and authors assume no responsibility for errors or omissions, or for damages resulting from the use of the information con- tained herein. -
Developerworks : XML : Abstracting the Interface, Part II
developerWorks : XML : Abstracting the interface, Part II IBM Home Products Consulting Industries News About IBM IBM : developerWorks : XML : XML articles Abstracting the interface, Part II Extensions to the basic framework Search Advanced Help Martin Gerlach ([email protected]) Software Engineer IBM Almaden Research Center March 2001 This article, a sequel to the author's December article on Web application front ends, describes extensions to the basic framework for XML data and XSL style sheets. It Contents: focuses on the back end this time, covering National Language Support (NLS), What happened so far enhancements of the view structure, and performance issues. While the first article demonstrated the basic architecture for building Web applications using XML and Where we are going now XSLT, this article shows you how to make the applications ready to go online. National Language This article is the second in a series on "Abstracting the interface". In order to follow the Support concepts presented here, you will want to read the first article, Building an adaptable Web app front end with XML and XSL, which discusses the steps necessary to complete the Enhancing the application application. Studying the sample application code also will help you grasp the concepts structure presented here. As in the first installment, this article assumes you are familiar with Web applications based on the HTTP protocol and the HTTP request-response mechanism, and Performance: A look at that you have built Web pages using HTML and possibly JavaScript. You should know the caching Java programming language and how to use Java servlets as access points for Web applications. -
XML Reasoning Solver User Manual Pierre Genevès, Nabil Layaïda
XML Reasoning Solver User Manual Pierre Genevès, Nabil Layaïda To cite this version: Pierre Genevès, Nabil Layaïda. XML Reasoning Solver User Manual. [Research Report] RR-6726, 2008. inria-00339184v1 HAL Id: inria-00339184 https://hal.inria.fr/inria-00339184v1 Submitted on 17 Nov 2008 (v1), last revised 22 Aug 2011 (v2) HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET EN AUTOMATIQUE XML Reasoning Solver User Manual Pierre Genevès — Nabil Layaïda N° 6726 November 17, 2008 Thème SYM apport de recherche ISSN 0249-6399 ISRN INRIA/RR--6726--FR+ENG XML Reasoning Solver User Manual Pierre Genev`es∗, Nabil Laya¨ıda Th`emeSYM — Syst`emessymboliques Equipes-Projets´ Wam Rapport de recherche n° 6726 — November 17, 2008 — 17 pages Abstract: This manual provides documentation for using the logical solver introduced in [Genev`es,2006; Genev`es et al., 2007]. Key-words: Static Analysis, Logic, Satisfiability Solver, XML, Schema, XPath, Queries ∗ CNRS Centre de recherche INRIA Grenoble – Rhône-Alpes 655, avenue de l’Europe, 38334 Montbonnot Saint Ismier Téléphone : +33 4 76 61 52 00 — Télécopie +33 4 76 61 52 52 XML Reasoning Solver User Manual R´esum´e: Ce manuel documente l’utilisation du solveur logique d´ecritdans[Genev`es, 2006; Genev`es et al., 2007]. -
XML Access Control Using Static Analysis
XML Access Control Using Static Analysis Makoto Murata Akihiko Tozawa Michiharu Kudo IBM Tokyo Research Lab/IUJ IBM Tokyo Research Lab IBM Tokyo Research Lab Research Institute 1623-14, Shimotsuruma, 1623-14, Shimotsuruma, 1623-14, Shimotsuruma, Yamato-shi, Yamato-shi, Yamato-shi, Kanagawa-ken 242-8502, Kanagawa-ken 242-8502, Kanagawa-ken 242-8502, Japan Japan Japan [email protected] [email protected] [email protected] ABSTRACT 1. INTRODUCTION Access control policies for XML typically use regular path XML [5] has become an active area in database research. expressions such as XPath for specifying the objects for ac- XPath [6] and XQuery [4] from the W3C have come to be cess control policies. However such access control policies widely recognized as query languages for XML, and their are burdens to the engines for XML query languages. To implementations are actively in progress. In this paper, relieve this burden, we introduce static analysis for XML we are concerned with fine-grained (element- and attribute- access control. Given an access control policy, query expres- level) access control for XML database systems rather than sion, and an optional schema, static analysis determines if document-level or collection-level access control. We be- this query expression is guaranteed not to access elements lieve that access control plays an important role in XML or attributes that are permitted by the schema but hidden database systems, as it does in relational database systems. by the access control policy. Static analysis can be per- Some early experiences [21, 10, 3] with access control for formed without evaluating any query expression against an XML documents have been reported already. -
Advanced Techniques in XSLT Jean-Michel Hufflen
Advanced Techniques in XSLT Jean-Michel Hufflen To cite this version: Jean-Michel Hufflen. Advanced Techniques in XSLT. Biuletyn , 2006, pp.69–75. hal-00663119 HAL Id: hal-00663119 https://hal.archives-ouvertes.fr/hal-00663119 Submitted on 26 Jan 2012 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Advanced Techniques in xslt∗ Jean-Michel HUFFLEN LIFC (FRE CNRS 2661) University of Franche-Comté 16, route de Gray 25030 BESANÇON CEDEX FRANCE [email protected] http://lifc.univ-fcomte.fr/~hufflen Abstract This talk focus on some advanced techniques used within xslt, such as sort procedures, keys, interface with identifier management, and priority rules among templates matching an xml node. We recall how these features work and propose some examples, some being related to bibliography styles. A short comparison between xslt and nbst, the language used within MlBibTEX for bibliography styles, is given, too. Keywords xslt, sort, keys, invoking rules, nbst. Streszczenie W prezentacji skupimy się na zaawansowanych technikach używanych w xslt, ta- kich jak procedury sortowania, klucze, interfejs do zarządzania identyfikatorami i reguły pierwszeństwa szablonów dopasowanych do konkretnego węzła. Przypo- mnimy jak te własności działają i pokażemy kilka przykładów, w tym niektóre związane ze stylami bibliograficznymi. -
Towards Conceptual Modeling for XML
Towards Conceptual Modeling for XML Erik Wilde Swiss Federal Institute of Technology Zurich¨ (ETHZ) Abstract: Today, XML is primarily regarded as a syntax for exchanging structured data, and therefore the question of how to develop well-designed XML models has not been studied extensively. As applications are increasingly penetrated by XML technologies, and because query and programming languages provide native XML support, it would be beneficial to use these features to work with well-designed XML models. In order to better focus on XML-oriented technologies in systems engineering and programming languages, an XML modeling language should be used, which is more focused on modeling and structure than typical XML schema languages. In this paper, we examine the current state of the art in XML schema languages and XML modeling, and present a list of requirements for a XML conceptual modeling language. 1 Introduction Due to its universal acceptance as a way to exchange structured data, XML is used in many different application domains. In many scenarios, XML is used mainly as an exchange format (basically, a way to serialize data that has a model defined in some non- XML way), but an increasing number of applications is using XML as their native data model. Developments like the XML Query Language (XQuery) [BCF+05] are a proof of the fact that XML becomes increasingly visible on higher layers of the software architec- ture, and in this paper we argue that current XML modeling approaches are not sufficient to support XML in this new context. Rather than treating XML as a syntax for some non-XML data model, it would make more sense to embrace XML’s structural richness by using a modeling language with built-in XML support. -
Word Add Xml Schema
Word Add Xml Schema Gauzy and nominalistic Normie aspirating her misbelief drubs while Sutherland reflex some Cherenkov bolt. Welbie beep opprobriously while par Colin fordoes crucially or confounds understandably. Unorthodoxy and ornate Jeremie never shipped genuinely when Valentin puckers his annas. Access to the annotations for xml schemas and attributes and you will take full wxs notation would add schema inferred from one compare two serialized form again, support building an Transform XML into HTML or raw ASCII text. Data Interchange Standards Association Develops New Software Tool to Ease XML Schema Documentation. Any text data can be stored to an UNTYPED XML column or variable as long as it is in XML format. PDFelement lets you approve as well as sign documents digitally. Modify the data in the custom XML parts while the document is closed. Though having a schema is not a necessary condition, but it becomes simpler to work with the document if you apply a schema to it. The cls_Monitor supports the demos as you will discover when you try them out. XML language that allows to link XSDs to XML files. Xml schema is xml word schema! In the XML Structure task pane, click on the prefix element to apply it to the document. Wild card components allow adding content that is not known at the time of schema design. Let us assume that we wish to identify within an anthology only poems, their headings, and the stanzas and lines of which they are composed. XML Data Reduced to XSD. XML is ready for the next big step to worldwide deployment. -
The Extensible XML Information Set
The Extensible XML Information Set Erik Wilde Computer Engineering and Networks Laboratory Swiss Federal Institute of Technology, Z¨urich TIK Report 160 (February 2003) Abstract XML and its data model, the XML Information Set, are used for a large number of applications. These applications have widely varying data models, ranging from very sim- ple regular trees to irregularly structured graphs using many different types of nodes and vertices. While some applications are sufficiently supported by the data model provided by the XML Infoset itself, others could benefit from extensions of the data model and assistance for these extensions in supporting XML technologies (such as the DOM API or the XSLT programming language). In this paper, we describe the Extensible XML Information Set (EXIS), which is a reformulation of the XML Infoset targeted at making the Infoset easier to extend and to make these extensions usable in higher-level XML technologies. EXIS provides a framework for defining extensions to the core XML Infoset, and for identifying these extensions (using namespace names). Higher-level XML tech- nologies (such as DOM or XPath) can then support EXIS extensions through additional interfaces, such as a dedicated DOM module, or XPath extension mechanisms (extension axes and/or functions). In order to make EXIS work, additional efforts are required in these areas of higher-level XML technologies, but EXIS itself could be used rather quickly to provide a foundation for well-defined Infoset extensions, such as XML Schema’s PSVI contributions, or the reformulation of XLink as being based on a data model (rather than a syntax). -
XML and Content Management
XML and Content Management Lecture 4: More on XML Schema and patterns in schema design Maciej Ogrodniczuk MIMUW, 22 October 2012 Lecture 4: More on XML Schema and patterns in schema design XML and Content Management 1 XML Schema: last week summary XML Schema is a formalism for defining document schemata. It has (at least what we know so far): XML syntax, types: assigned to elements and attributes, set of predefined types, means for defining own types, more flexible definition of content models: occurrences, order in mixed content, constraints on uniqueness and references, ... Lecture 4: More on XML Schema and patterns in schema design XML and Content Management 2 Schema-document binding The binding is composed of 3 elements: declaration of the namespace for XML Schema-compatible document instance: xmlns:xsi= "http://www.w3.org/2001/XMLSchema-instance", binding the schema for elements not in any namespace — by specifying the schema URL in xsi:noNamespaceSchemaLocation attribute, binding the list of namespaces with URLs of schemata used for validation of elements which names are belong to namespaces used in the document — in xsi:schemaLocation attribute. Lecture 4: More on XML Schema and patterns in schema design XML and Content Management 3 Schema-document binding <?xml version="1.0"?> <text xmlns:xsi="http://www.w3.org/2001/ XMLSchema-instance" xsi:noNamespaceSchemaLocation="text.xsd" xsi:schemaLocation="http://www.example.org/equations equations.xsd http://www.example.org/graphs graphs.xsd"> ... </text> Lecture 4: More on XML Schema and patterns in schema design XML and Content Management 4 Validation model Multilevel validation: 1 check (recursively) whether schema documents are valid (according to a schema for XML Schema), 2 check whether the document satisfies the requirements specified in the schema.