Usage of XSL Stylesheets for the Annotation of the Sámi Language Corpora

Total Page:16

File Type:pdf, Size:1020Kb

Usage of XSL Stylesheets for the Annotation of the Sámi Language Corpora Usage of XSL Stylesheets for the annotation of the Sami´ language corpora Saara Huhmarniemi Sjur N. Moshagen Trond Trosterud University of Tromsø Norwegian Sami´ Parliament University of Tromsø [email protected] [email protected] [email protected] Abstract texts. The small amount of those texts forced us to take into account a wide variety of sources and This paper describes an annotation system formats, and also to include texts that were of low for Sami´ language corpora, which consists technical quality. That introduced problems for the of structured, running texts. The annotation automatic processing of the texts. The solution to of the texts is fully automatic, starting from this problem was the document-specific processing the original documents in different formats. instructions that were implemented in XSLT. The texts are first extracted from the origi- nal documents preserving the original struc- 2 The Project tural markup. The markup is enhanced by a document-specific XSLT script which con- The corpus described here is the first structurally an- tains document-specific formatting instruc- notated text corpus for any Sami´ language. The cor- tions. The overall maintenance is achieved pus database was developed in parallel with the spell by system-wide XSLT scripts. checker and the syntactic analyzer projects for North 1 1 Introduction and Lule Sami´ . The new texts became test mate- rial for these two applications as soon as they were Corpus building for a specific language is consid- added to the corpus database. The requirements for ered to require much human effort and time. To the markup were constantly being re-evaluated dur- overcome this difficulty, there is a recent develop- ing the project. The infrastructure was designed ment of applications for automatic corpus building flexible so that it would accomodate to the differ- using often the Web as a resource e.g. (Baroni and ent needs of the two projects in different phases of Bernardini eds., 2006; Sharoff, 2006). For minority the application development. languages, the resources for building a text corpus At the moment, the corpus database consists of al- are often limited. Automatic tools for building cor- most 6 million words for North Sami´ and some 240 pus database specifically for the minority languages 000 for Lule Sami.´ Even though the system was pri- are developed e.g. by (Ghani et al., 2005; Scannell, marily designed for the Sami´ languages, there are 2004). no strictly language-dependent sections in the sys- The requirement to have the corpus building pro- tem; it has already been tested with Norwegian and cess automatized as much as possible was also cen- Finnish, among others. tral in the Sami´ language corpora project. How- One of the main applications of the text corpus ever, the collection of texts is done in a ”traditional” database is the syntactically annotated and fully dis- manner: the files are gathered and classified man- ambiguated corpus database for Sami´ languages. ually. For North Sami,´ there are texts available in The syntactic annotation is done automatically us- electronic form which can be exploited in a cor- pus database, mainly administrative and newspaper 1http://www.divvun.no/, http://giellatekno.uit.no/ 45 Proceedings of the Linguistic Annotation Workshop, pages 45–48, Prague, June 2007. c 2007 Association for Computational Linguistics ing the tools developed in the syntactic analyzer result is the final XML-document with structural project, but the process is out of the scope of this markup, see Fig. 1. paper. There is also some parallel texts with Norwe- gian, and plans for extending parallel text corpora to different Sami´ languages and Finnish and Swedish. The corpus database is freely available for research purposes. There will be a web-based corpus inter- face for the syntactically annotated corpus and a re- stricted access to the system for examining the cor- pus data directly. 3 XSLT and corpus maintenace Flexibility and reusability are in general the design requirements of annotated text corpora. XML has become the standard annotation system in physical storage representation. XML Transformation Lan- guage (XSLT) (Clark ed., 1999) provides an easy Figure 1: The overall architecture of the conversion data transformation between different formats and process. applications. XSLT is commonly used in the con- temporary corpus development. The power of XSLT The conversion of a document always starts from mainly comes from its sublanguage XPath (Clark the original file, which makes it possible to adapt and DeRose eds., 1999). XPath provides an access for the latest versions of the text extraction tools and to the XML structure, elements, attributes and text other tools used in the process as well as the changes through concise path expressions. in XML-markup. In the Sami´ language corpora, XSLT is used in The annotation process is fully automatic and corpus establishment and maintenance. The raw can be rerun at will. Some documents may con- structural format is produced by text extraction tools tain errors or formatting that are not taken into ac- and coverted to a preliminary XML-format using count by the automatic tools. On the other hand, XSLT. The markup is further enhanced by document the automatized annotation process does not allow specific information and a system-wide processing manual correction of the texts, nor manual XML- instruction, both implemented in XSLT. markup. Those exceptions can be taken into account 4 The Sami´ corpus database by document-specific processing instructions, which are implemented using XSLT. The script can be used 4.1 Overall architecture for adding XML-annotation for specific parts of the The corpus database is organized so that the original document, fixing smaller errors in the document, or text resources, which are the documents in various even to rescue a corrupted file that would be other- formats (Word, PDF, HTML, text) form the source wise unusable. This is a useful feature when build- base. The text is extracted from the original docu- ing a corpus for a minority language with diverse ments using various freely available text extraction and often limited text resources. tools, such as antiword and HTML Tidy. They al- ready provide a preliminary structural markup: anti- 4.2 XML-annotation word produces DocBook and HTML Tidy provides In the Sami´ language corpora, markup of running output in XHTML. There are XSLT scripts for con- text is simple, containing no more structural infor- verting the different preliminary formats the to an mation than what is generally available in the orig- intermediate document format. inal text. The body text can contain sections and The intermediate format is further processed to paragraphs and each section can contain sections the desired XML-format using XSLT-scripts. The and paragraphs. There are four paragraph types: ti- 46 tle, text, table and list. The paragraphs are classified tion of wrongly encoded characters. whenever the information is available in the origi- Another example of a template that can be called nal document. Lists and especially tables contain in- from the document-specific XSLT script is the complete sentences and in many cases numeric data. string-replacement, that can be used for marking When conducting e.g. syntactic analysis, it might spelling errors in the text. Due to the variety of con- be better to leave tables and even lists or titles out, ventions of writing Sami,´ the texts tend to contain whereas for e.g. terminological work the tables are lot of strings that are classified as spelling errors. highly relevant. Tagging for paragraph type makes it The errors disturb the testing of the analyzer, but are possible to include or exclude the relevant paragraph on the other hand interesting from the point of view types at will. of the spell checker project. When a spelling error Inside a paragraph, there is a possibility to is discovered in the text, the erroneous strings and add emphasis markup and other span information, their corrections are added to the document-specific such as quotes. The sentence-level and word-level metafile from where they are picked by the con- markup is not included in the text corpus. The version process. The errors are thus preserved in markup is added when the text corpus is moved to the XML-format but with a correction which can be the syntactically annotated corpus database. used instead of the erroneous string. This is achieved The XML-annotation does not follow any stan- by a special markup: dardized XML-format, but it is, in essence, a subset <error correct="text">tetx</error> of the XCES (Ide et al., 2000) format. Furthermore, In this way the original documents stay intact and the system is designed so that changing the XML- the information is preserved also when the file is re- annotation and moving to a standardized format is a converted. If the error is not just a single word but straightforward process. involves more context, it is possible to add the con- text to the error string. 4.3 XSLT processing In addition, the document-specific XSLT script Each original document in the corpus database is contains variables that may be used already in the paired with an XSLT script. The document-specific text extraction phase. An example would be the text XSLT script contains processing instructions that are alignment information of a pdf-file. applied to the document during the conversion from the preliminary document format to the final XML- 4.4 Language identification format (see Fig. 1.). The XPath expressions are pow- Most documents in the Sami´ corpus database con- erful tools for accessing portions of text in a docu- tain sections of text that are not in the document’s ment and modifying the XML-markup without edit- main language.
Recommended publications
  • Introducing Regular Expressions
    Introducing Regular Expressions wnload from Wow! eBook <www.wowebook.com> o D Michael Fitzgerald Beijing • Cambridge • Farnham • Köln • Sebastopol • Tokyo Introducing Regular Expressions by Michael Fitzgerald Copyright © 2012 Michael Fitzgerald. All rights reserved. Printed in the United States of America. Published by O’Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472. O’Reilly books may be purchased for educational, business, or sales promotional use. Online editions are also available for most titles (http://my.safaribooksonline.com). For more information, contact our corporate/institutional sales department: 800-998-9938 or [email protected]. Editor: Simon St. Laurent Indexer: Lucie Haskins Production Editor: Holly Bauer Cover Designer: Karen Montgomery Proofreader: Julie Van Keuren Interior Designer: David Futato Illustrator: Rebecca Demarest July 2012: First Edition. Revision History for the First Edition: 2012-07-10 First release See http://oreilly.com/catalog/errata.csp?isbn=9781449392680 for release details. Nutshell Handbook, the Nutshell Handbook logo, and the O’Reilly logo are registered trademarks of O’Reilly Media, Inc. Introducing Regular Expressions, the image of a fruit bat, and related trade dress are trademarks of O’Reilly Media, Inc. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and O’Reilly Media, Inc., was aware of a trademark claim, the designations have been printed in caps or initial caps. While every precaution has been taken in the preparation of this book, the publisher and authors assume no responsibility for errors or omissions, or for damages resulting from the use of the information con- tained herein.
    [Show full text]
  • Developerworks : XML : Abstracting the Interface, Part II
    developerWorks : XML : Abstracting the interface, Part II IBM Home Products Consulting Industries News About IBM IBM : developerWorks : XML : XML articles Abstracting the interface, Part II Extensions to the basic framework Search Advanced Help Martin Gerlach ([email protected]) Software Engineer IBM Almaden Research Center March 2001 This article, a sequel to the author's December article on Web application front ends, describes extensions to the basic framework for XML data and XSL style sheets. It Contents: focuses on the back end this time, covering National Language Support (NLS), What happened so far enhancements of the view structure, and performance issues. While the first article demonstrated the basic architecture for building Web applications using XML and Where we are going now XSLT, this article shows you how to make the applications ready to go online. National Language This article is the second in a series on "Abstracting the interface". In order to follow the Support concepts presented here, you will want to read the first article, Building an adaptable Web app front end with XML and XSL, which discusses the steps necessary to complete the Enhancing the application application. Studying the sample application code also will help you grasp the concepts structure presented here. As in the first installment, this article assumes you are familiar with Web applications based on the HTTP protocol and the HTTP request-response mechanism, and Performance: A look at that you have built Web pages using HTML and possibly JavaScript. You should know the caching Java programming language and how to use Java servlets as access points for Web applications.
    [Show full text]
  • XML Reasoning Solver User Manual Pierre Genevès, Nabil Layaïda
    XML Reasoning Solver User Manual Pierre Genevès, Nabil Layaïda To cite this version: Pierre Genevès, Nabil Layaïda. XML Reasoning Solver User Manual. [Research Report] RR-6726, 2008. inria-00339184v1 HAL Id: inria-00339184 https://hal.inria.fr/inria-00339184v1 Submitted on 17 Nov 2008 (v1), last revised 22 Aug 2011 (v2) HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET EN AUTOMATIQUE XML Reasoning Solver User Manual Pierre Genevès — Nabil Layaïda N° 6726 November 17, 2008 Thème SYM apport de recherche ISSN 0249-6399 ISRN INRIA/RR--6726--FR+ENG XML Reasoning Solver User Manual Pierre Genev`es∗, Nabil Laya¨ıda Th`emeSYM — Syst`emessymboliques Equipes-Projets´ Wam Rapport de recherche n° 6726 — November 17, 2008 — 17 pages Abstract: This manual provides documentation for using the logical solver introduced in [Genev`es,2006; Genev`es et al., 2007]. Key-words: Static Analysis, Logic, Satisfiability Solver, XML, Schema, XPath, Queries ∗ CNRS Centre de recherche INRIA Grenoble – Rhône-Alpes 655, avenue de l’Europe, 38334 Montbonnot Saint Ismier Téléphone : +33 4 76 61 52 00 — Télécopie +33 4 76 61 52 52 XML Reasoning Solver User Manual R´esum´e: Ce manuel documente l’utilisation du solveur logique d´ecritdans[Genev`es, 2006; Genev`es et al., 2007].
    [Show full text]
  • Advanced Techniques in XSLT Jean-Michel Hufflen
    Advanced Techniques in XSLT Jean-Michel Hufflen To cite this version: Jean-Michel Hufflen. Advanced Techniques in XSLT. Biuletyn , 2006, pp.69–75. hal-00663119 HAL Id: hal-00663119 https://hal.archives-ouvertes.fr/hal-00663119 Submitted on 26 Jan 2012 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Advanced Techniques in xslt∗ Jean-Michel HUFFLEN LIFC (FRE CNRS 2661) University of Franche-Comté 16, route de Gray 25030 BESANÇON CEDEX FRANCE [email protected] http://lifc.univ-fcomte.fr/~hufflen Abstract This talk focus on some advanced techniques used within xslt, such as sort procedures, keys, interface with identifier management, and priority rules among templates matching an xml node. We recall how these features work and propose some examples, some being related to bibliography styles. A short comparison between xslt and nbst, the language used within MlBibTEX for bibliography styles, is given, too. Keywords xslt, sort, keys, invoking rules, nbst. Streszczenie W prezentacji skupimy się na zaawansowanych technikach używanych w xslt, ta- kich jak procedury sortowania, klucze, interfejs do zarządzania identyfikatorami i reguły pierwszeństwa szablonów dopasowanych do konkretnego węzła. Przypo- mnimy jak te własności działają i pokażemy kilka przykładów, w tym niektóre związane ze stylami bibliograficznymi.
    [Show full text]
  • Word Add Xml Schema
    Word Add Xml Schema Gauzy and nominalistic Normie aspirating her misbelief drubs while Sutherland reflex some Cherenkov bolt. Welbie beep opprobriously while par Colin fordoes crucially or confounds understandably. Unorthodoxy and ornate Jeremie never shipped genuinely when Valentin puckers his annas. Access to the annotations for xml schemas and attributes and you will take full wxs notation would add schema inferred from one compare two serialized form again, support building an Transform XML into HTML or raw ASCII text. Data Interchange Standards Association Develops New Software Tool to Ease XML Schema Documentation. Any text data can be stored to an UNTYPED XML column or variable as long as it is in XML format. PDFelement lets you approve as well as sign documents digitally. Modify the data in the custom XML parts while the document is closed. Though having a schema is not a necessary condition, but it becomes simpler to work with the document if you apply a schema to it. The cls_Monitor supports the demos as you will discover when you try them out. XML language that allows to link XSDs to XML files. Xml schema is xml word schema! In the XML Structure task pane, click on the prefix element to apply it to the document. Wild card components allow adding content that is not known at the time of schema design. Let us assume that we wish to identify within an anthology only poems, their headings, and the stanzas and lines of which they are composed. XML Data Reduced to XSD. XML is ready for the next big step to worldwide deployment.
    [Show full text]
  • The Extensible XML Information Set
    The Extensible XML Information Set Erik Wilde Computer Engineering and Networks Laboratory Swiss Federal Institute of Technology, Z¨urich TIK Report 160 (February 2003) Abstract XML and its data model, the XML Information Set, are used for a large number of applications. These applications have widely varying data models, ranging from very sim- ple regular trees to irregularly structured graphs using many different types of nodes and vertices. While some applications are sufficiently supported by the data model provided by the XML Infoset itself, others could benefit from extensions of the data model and assistance for these extensions in supporting XML technologies (such as the DOM API or the XSLT programming language). In this paper, we describe the Extensible XML Information Set (EXIS), which is a reformulation of the XML Infoset targeted at making the Infoset easier to extend and to make these extensions usable in higher-level XML technologies. EXIS provides a framework for defining extensions to the core XML Infoset, and for identifying these extensions (using namespace names). Higher-level XML tech- nologies (such as DOM or XPath) can then support EXIS extensions through additional interfaces, such as a dedicated DOM module, or XPath extension mechanisms (extension axes and/or functions). In order to make EXIS work, additional efforts are required in these areas of higher-level XML technologies, but EXIS itself could be used rather quickly to provide a foundation for well-defined Infoset extensions, such as XML Schema’s PSVI contributions, or the reformulation of XLink as being based on a data model (rather than a syntax).
    [Show full text]
  • XML and Content Management
    XML and Content Management Lecture 4: More on XML Schema and patterns in schema design Maciej Ogrodniczuk MIMUW, 22 October 2012 Lecture 4: More on XML Schema and patterns in schema design XML and Content Management 1 XML Schema: last week summary XML Schema is a formalism for defining document schemata. It has (at least what we know so far): XML syntax, types: assigned to elements and attributes, set of predefined types, means for defining own types, more flexible definition of content models: occurrences, order in mixed content, constraints on uniqueness and references, ... Lecture 4: More on XML Schema and patterns in schema design XML and Content Management 2 Schema-document binding The binding is composed of 3 elements: declaration of the namespace for XML Schema-compatible document instance: xmlns:xsi= "http://www.w3.org/2001/XMLSchema-instance", binding the schema for elements not in any namespace — by specifying the schema URL in xsi:noNamespaceSchemaLocation attribute, binding the list of namespaces with URLs of schemata used for validation of elements which names are belong to namespaces used in the document — in xsi:schemaLocation attribute. Lecture 4: More on XML Schema and patterns in schema design XML and Content Management 3 Schema-document binding <?xml version="1.0"?> <text xmlns:xsi="http://www.w3.org/2001/ XMLSchema-instance" xsi:noNamespaceSchemaLocation="text.xsd" xsi:schemaLocation="http://www.example.org/equations equations.xsd http://www.example.org/graphs graphs.xsd"> ... </text> Lecture 4: More on XML Schema and patterns in schema design XML and Content Management 4 Validation model Multilevel validation: 1 check (recursively) whether schema documents are valid (according to a schema for XML Schema), 2 check whether the document satisfies the requirements specified in the schema.
    [Show full text]
  • Data-Centric XML
    © 2006 Martin v. Löwis Other SchemaLanguages Data-centric XML Problems of XML Schema • According to Schematron docs: – No support for entities – idiomatic or localized data types (date, time) not supported – limited support for element content models • e.g. start with one fixed element, then allow arbitrary order – limited support for idiomatic attribute usage • e.g. using attributes to contain or select element content – no notion of “document type” • in particular, no way of specifying root element – complexity – implementability • “persistent problems with buggy implementations, outside the conservative use of features” . Löwis © 2006 Martin v Datenorientiertes XML 2 Problems with XML Schema Everybody who actually touches the technology has known the truth for years, and it’s time to stop sweeping it under the rug. W3C XML Schemas (XSD) suck. They are hard to read, hard to write, hard to understand, have interoperability problems, and are unable to describe lots of things you want to do all the time in XML. ... It’s a pity; when XSD came out people thought that since it came from the W3C, same as XML, it must be the way to go, and it got baked into a bunch of other technology before anyone really had a chance to think it over. So now lots of people say “Well, yeah, it sucks, but we’re stuck with it.” Wrong! The time has come to declare it a worthy but failed experiment, tear down the shaky towers with XSD in their foundation, and start using RELAX for all significant XML work. Löwis Tim Bray, 2006/11/28 © 2006 Martin v Datenorientiertes XML 3 Relax NG • Schema language for XML – Designed by James Clark • Design goals: – simple – easy to learn – both with XML syntax, and compact non-XML syntax – support for namespaces – treat attributes uniformly with elements as far as possible – unrestricted support for mixed content – solid theoretical basis – can partner with separate data type language (e.g.
    [Show full text]
  • XML, CSS, and HTML-Based Presentation Graphics System
    Rochester Institute of Technology RIT Scholar Works Theses 2006 XPoint: XML, CSS, and HTML-based presentation graphics system Sudhanshu Sood Follow this and additional works at: https://scholarworks.rit.edu/theses Recommended Citation Sood, Sudhanshu, "XPoint: XML, CSS, and HTML-based presentation graphics system" (2006). Thesis. Rochester Institute of Technology. Accessed from This Master's Project is brought to you for free and open access by RIT Scholar Works. It has been accepted for inclusion in Theses by an authorized administrator of RIT Scholar Works. For more information, please contact [email protected]. XPOINT: XML, CSS and HTML based presentation graphics system Master’s Project Proposal By Sudhanshu Sood Department of Computer Science Rochester Institute of Technology Rochester, NY Chairman: Dr. Axel T. Schreiner Reader: Dr. Hans-Peter Bischof Observer: Dr. Fereydoun Kazemian Table of Contents SUMMARY................................................................................................................................................... 1 1. INTRODUCTION............................................................................................................................... 1 2. XPOINT DESCRIPTION .................................................................................................................. 2 3. ARCHITECTURE.............................................................................................................................. 3 4. TOOLS AND TECHNOLOGY ........................................................................................................
    [Show full text]
  • XML Schemas RELAX NG (Regular Language
    VALIDATION • An outline of various validation tools • XML DTD (Document Type Definition) • XML Schemas • RELAX NG (REgular LAnguage for XML Next Generation) • Schematron • "Absolute" comparison • NeuroML specifications according to the XML Schemas proposed • A RELAX NG alternative • A Schematron contribution • Other helpful validation tools An outline of different validation tools XML probably is one of the main data-exchange formats of Internet. Because of its variety and complexity, data must often be validated before being exploited. A large number of validation tools exist, each with its own grammar and peculiar features. Since there are different aspects which may be in need of a check (such as data structure, format constraints, datatyping, etc.), each validation tool may be more appropriated for a specific task (or it could simply make it easier). In this document only a part of the validation appliance ensemble will be studied as a possible alternative to XML Schemas for NeuroML specifications. XML DTD (Document Type Definition) It is the standard XML schema language of the "past". If compared to other validation tools, it shows limited capabilities. DTD possesses a non-XML syntax and a Document Type Definition Sheet typically consist of a set of declaration blocks that describe a class or a type (datatyping is very limited). XML Schemas XML Schema is much more expressive than DTD and it's well recognized by a wide variety of applications. This object-oriented schema language is capable of defining structure, content and semantics of XML documents and possesses many useful features, like inheritance (for elements and attributes) or user-datatype definition.
    [Show full text]
  • FOP: an Open-Source XSL Formatter and Renderer A) What Is FOP? B) Downloading FOP C) Running
    This is not the latest Fop documentation, but just an fo example. FOP - p. 1 FOP: An Open-Source XSL Formatter and Renderer A) What is FOP? FOP is the world's first print formatter driven by XSL formatting objects. It is a Java 1.1 application that reads a formatting object tree and then turns it into a PDF document. The formatting object tree, can be in the form of an XML document (output by an XSLT engine like XT or Xalan) or can be passed in memory as a DOM Document or (in the case of XT) SAX events. FOP is part of Apache's XML Graphics Project. The homepage of FOP is http:// xmlgraphics.apache.org/fop B) Downloading FOP The latest release version is FOP 0.20.5 () . NOTE: you do not have to unjar or unzip this jar file. Documentation can be downloaded here as HMTL file () or as PDF file () . To run FOP from the command line, see Running FOP. If you are interested in embedding FOP in a Java application of your own, see Embedding FOP. You can also download the source code v. 0.20.5 () as jar file C) Running FOP 1) Prerequisites Following software must be installed: a) Java 1.1.x or later For the fo-file viewer mode of FOP (see below) you must have the swing classes installed. From Java 1.2 on (aka Java 2) they are part of the standard java distribution. If you use Java 1.1.x you must seperately include the swing classes, which can be found at the Sun website (http://java.sun.com/products/jfc/#download-swing) .
    [Show full text]
  • XML Path Language (Xpath) Version 1.0 W3C Recommendation 16 November 1999
    XML Path Language (XPath) XML Path Language (XPath) Version 1.0 W3C Recommendation 16 November 1999 This version: http://www.w3.org/TR/1999/REC-xpath-19991116 (available in XML or HTML) Latest version: http://www.w3.org/TR/xpath Previous versions: http://www.w3.org/TR/1999/PR-xpath-19991008 http://www.w3.org/1999/08/WD-xpath-19990813 http://www.w3.org/1999/07/WD-xpath-19990709 http://www.w3.org/TR/1999/WD-xslt-19990421 Editors: James Clark <[email protected]> Steve DeRose (Inso Corp. and Brown University) <[email protected]> Copyright © 1999 W3C® (MIT, INRIA, Keio), All Rights Reserved. W3C liability, trademark, document use and software licensing rules apply. Abstract XPath is a language for addressing parts of an XML document, designed to be used by both XSLT and XPointer. Status of this document This document has been reviewed by W3C Members and other interested parties and has been endorsed by the Director as a W3C Recommendation. It is a stable document and may be used as reference material or cited as a normative reference from other documents. W3C's role in making the Recommendation is to draw attention to the specification and to promote its widespread deployment. This enhances the functionality and interoperability of the Web. http://www.w3.org/TR/1999/REC-xpath-19991116 (1 of 30) [7/19/2001 5:31:03 PM] XML Path Language (XPath) The list of known errors in this specification is available at http://www.w3.org/1999/11/REC-xpath-19991116-errata. Comments on this specification may be sent to [email protected]; archives of the comments are available.
    [Show full text]