Information Retrieval of Text, Structure and Sequential Data in Heterogeneous XML Document Collections Eugen Popovici
Total Page:16
File Type:pdf, Size:1020Kb
Recommended publications
-
Recommended Formats Statement 2019-2020
Library of Congress Recommended Formats Statement 2019-2020
For online version, see https://www.loc.gov/preservation/resources/rfs/index.html
Introduction to the 2019-2020 revision
I. Textual Works and Musical Compositions
    i. Textual Works – Print
    ii. Textual Works – Digital
    iii. Textual Works – Electronic serials
    iv. Digital Musical Compositions (score-based representations)
II. Still Image Works
    i. Photographs – Print
    ii. Photographs – Digital
    iii. Other Graphic Images – Print
    iv. Other Graphic Images – Digital
    v. Microforms
-
The Music Encoding Initiative As a Document-Encoding Framework
12th International Society for Music Information Retrieval Conference (ISMIR 2011)
THE MUSIC ENCODING INITIATIVE AS A DOCUMENT-ENCODING FRAMEWORK
Andrew Hankinson1 Perry Roland2 Ichiro Fujinaga1
1CIRMMT / Schulich School of Music, McGill University 2University of Virginia
[email protected], [email protected], [email protected]
ABSTRACT
Recent changes in the Music Encoding Initiative (MEI) have transformed it into an extensible platform from which new notation encoding schemes can be produced. This paper introduces MEI as a document-encoding framework, and illustrates how it can be extended to encode new types of notation, eliminating the need for creating specialized and potentially incompatible notation encoding standards.
1. INTRODUCTION
The Music Encoding Initiative (MEI) is a community-driven effort to define guidelines for encoding musical documents in a machine-readable structure. The MEI closely mirrors work done by text scholars in the Text [...] duplication of effort that comes with building entire encoding schemes from the ground up.
In this paper we introduce the new tools and techniques available in MEI 2011. We start with a look at the current state of music document encoding techniques. Then, we discuss the theory and practice behind the customization techniques developed by the TEI community and how their application to MEI allows the development of new extensions that leverage the existing music document-encoding platform developed by the MEI community. We also introduce a new initiative for sharing these customizations, the MEI Incubator. Following this, we present a sample customization to illustrate how MEI can be extended to more accurately capture new and unique music notation sources.
-
Dynamic Generation of Musical Notation from MusicXML Input on an Android Tablet
Dynamic Generation of Musical Notation from MusicXML Input on an Android Tablet THESIS Presented in Partial Fulfillment of the Requirements for the Degree Master of Science in the Graduate School of The Ohio State University By Laura Lynn Housley Graduate Program in Computer Science and Engineering The Ohio State University 2012 Master's Examination Committee: Rajiv Ramnath, Advisor Jayashree Ramanathan Copyright by Laura Lynn Housley 2012 Abstract For the purpose of increasing accessibility and customizability of sheet music, an application on an Android tablet was designed that generates and displays sheet music from a MusicXML input file. Generating sheet music on a tablet device from a MusicXML file poses many interesting challenges. When a user is allowed to set the size and colors of an image, the image must be redrawn with every change. Instead of zooming in and out on an already existing image, the positions of the various musical symbols must be recalculated to fit the new dimensions. These changes must preserve the relationships between the various musical symbols. Other topics include the laying out and measuring of notes, accidentals, beams, slurs, and staffs. In addition to drawing a large bitmap, an application that effectively presents sheet music must provide a way to scroll this music across a small tablet screen at a specified tempo. A method for using animation on Android is discussed that accomplishes this scrolling requirement. Also a generalized method for writing text-based documents to describe notations similar to musical notation is discussed. This method is based off of the knowledge gained from using MusicXML. -
XML a New Web Site Architecture
XML A New Web Site Architecture Jim Costello Derek Werthmuller Darshana Apte Center for Technology in Government University at Albany, SUNY 1535 Western Avenue Albany, NY 12203 Phone: (518) 442-3892 Fax: (518) 442-3886 E-mail: [email protected] http://www.ctg.albany.edu September 2002 © 2002 Center for Technology in Government The Center grants permission to reprint this document provided this cover page is included. Table of Contents XML: A New Web Site Architecture .......................................................................................................................... 1 A Better Way? ......................................................................................................................................................... 1 Defining the Problem.............................................................................................................................................. 1 Partial Solutions ...................................................................................................................................................... 2 Addressing the Root Problems .............................................................................................................................. 2 Figure 1. Sample XML file (all code simplified for example) ...................................................................... 4 Figure 2. Sample XSL File (all code simplified for example) ....................................................................... 6 Figure 3. Formatted Page Produced -
ScanScore 2 Manual
ScanScore 2 Manual
Copyright © 2020 by Lugert Verlag. All Rights Reserved.
Table of Contents
Welcome to ScanScore 2
    Overview
    Quickstart
    What ScanScore is not
Scanning and importing scores
    Importing files
    Using a scanner
    Using a smartphone
    Open ScanScore project
    Multipage import
Working with ScanScore
    The menu bar
        The File Menu
        The
-
Techniques for Authoring Complex XML Documents Vincent Quint, Irène Vatton
Techniques for Authoring Complex XML Documents
Vincent Quint, Irène Vatton
To cite this version: Vincent Quint, Irène Vatton. Techniques for Authoring Complex XML Documents. Proceedings of the 2004 ACM symposium on Document Engineering, DocEng 2004, Oct 2004, Milwaukee, WI, United States. pp.115-123, 10.1145/1030397.1030422. inria-00423365
HAL Id: inria-00423365 https://hal.inria.fr/inria-00423365 Submitted on 9 Oct 2009
HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
Vincent Quint, INRIA Rhône-Alpes, 655 avenue de l’Europe, 38334 Saint Ismier Cedex, France, [email protected]
Irène Vatton, INRIA Rhône-Alpes, 655 avenue de l’Europe, 38334 Saint Ismier Cedex, France, [email protected]
ABSTRACT
This paper reviews the main innovations of XML and considers their impact on the editing techniques for structured documents. Namespaces open the way to compound documents; well-formedness brings more freedom in the editing task; CSS allows style to be associated easily with structured
1. INTRODUCTION
Authoring techniques for structured documents constituted an active research area during the second half of the 80’s and the early 90’s [10]. Several experimental systems such as Grif [7] and Rita [6] were developed and a few production tools resulted from that work.
-
A Wiki-Based Authoring Tool for Collaborative Development of Multimedial Documents
MEDIA2MULT – A WIKI-BASED AUTHORING TOOL FOR COLLABORATIVE DEVELOPMENT OF MULTIMEDIAL DOCUMENTS
Author Name * Affiliation * Address * Author Name * Affiliation * Address *
* Only for Final Camera-Ready Submission
ABSTRACT
media2mult is an extension for PmWiki developed at our university. It provides functionality for embedding various media files and script languages in wiki pages. Furthermore, media2mult comes with a cross-media publishing component that converts arbitrary wiki page sequences to print-oriented formats like PDF. This article gives an overview of the offered extensions, their functionality and implementation concepts.
KEYWORDS
wiki, multimedia, cross-media-publishing, authoring tool, XML
1. INTRODUCTION
At least since the founding of the free web encyclopedia Wikipedia and its rise in popularity, wiki web, wiki-wiki, or just wiki have been widely known terms in the context of Web 2.0. However, their exact meaning often remains unclear. Sometimes wiki and Wikipedia are actually used synonymously. The crucial functionality of every wiki system is the possibility to edit wiki web pages directly inside a browser by entering an easy-to-learn markup language. Thus, manual uploads of previously edited HTML files are superfluous here. The user doesn't even have to know anything about HTML or external HTML editors. The browser- and server-based concept makes it possible for several authors to edit and revise common documents without having to exchange independently written and updated versions. Because most wiki systems offer an integrated version management system, authors can easily merge their changes and revert selected passages to former stages. Thus, accidental or deliberate changes to protected or publicly accessible wiki pages can be taken back in a second.
-
Methodology and Technical Methods
Recordare MusicXML: Methodology and Technical Methods
Michael Good, Recordare LLC, Los Altos, California, USA
[email protected]
17 November 2006 Copyright © 2006 Recordare LLC

Outline
• Personal introduction
• What is MusicXML?
• Design methodology
• Technical methods
• MusicXML use today
• Suitability for digital music editions
• Recommendations
• Future directions

My Background
• B.S. and M.S. in computer science from Massachusetts Institute of Technology
• B.S. thesis on representing scores in Music11
• Trumpet on MIT Symphony Orchestra recordings available on Vox/Turnabout
• Opera and symphony chorus tenor; have performed for Alsop, Nagano, Ozawa
• Worked in software usability at SAP and DEC before founding Recordare in 2000

What is MusicXML?
• The first standard computer format for common Western music notation
• Covers 17th century onwards
• Available via a royalty-free license
• Supported by over 60 applications, including Finale, Sibelius, capella, and music scanners
• Useful for music display, performance, retrieval, and analysis applications
• Based on industry standard XML technology

The Importance of XML
• XML is a language for developing specialized formats like MusicXML, MathML, and ODF
• XML files can be read in any computer text editor
• Fully internationalized via Unicode
• The files are human readable as well as machine readable
• Each specialized format can use standard XML tools
• Allows musicians to leverage the large
-
Proposals for MusicXML 3.2 and MNX Implementation from the DAISY Music Braille Project
Proposals for MusicXML 3.2 and MNX Implementation from the DAISY Music Braille Project
11 March 2019
Authors: Haipeng Hu, Matthias Leopold
With contributions from: Jostein Austvik Jacobsen, Lori Kernohan, Sarah Morley Wilkins.
This document provides general suggestions on the implementation of the MusicXML 3.2 and MNX formats, in the hope of making them more friendly to braille transcription. It is not limited to the requirements of music braille transcription, however: some of the points may also help make common transformations of musical data more accurate and clear.
Note: The word “text” in this document specifically means all non-lyrics texts, such as titles, composers, credit page information, as well as expression / technique / instructional texts applied to a staff or the whole system.
A. MusicXML 3.2 Suggestions
1. If possible, implement a flow to include all system-wide texts (such as tempo texts, stage indications), lines (such as a line with text “poco a poco rit.”) and symbols (such as a big fermata or breath mark above the whole system), like the "global" section of MNX, so that everything in it can affect the whole score instead of only the topmost staff or (in orchestral scores) the piccolo and first violin staves. This can also ease part extraction. The position of the elements can be set using the forward command. (For example, in SATB choir pieces, breathing marks, dynamics and wedges are sometimes written only once in the Soprano but are meant to apply to all voices; in orchestral scores, tempo texts and overall indications are written above the piccolo and first violin, and some big fermatas or breath marks appear above every instrument category.)
-
Introduction to Information Retrieval
DRAFT! © April 1, 2009 Cambridge University Press. Feedback welcome.
1 Boolean retrieval
The meaning of the term information retrieval can be very broad. Just getting a credit card out of your wallet so that you can type in the card number is a form of information retrieval. However, as an academic field of study, information retrieval might be defined thus:
Information retrieval (IR) is finding material (usually documents) of an unstructured nature (usually text) that satisfies an information need from within large collections (usually stored on computers).
As defined in this way, information retrieval used to be an activity that only a few people engaged in: reference librarians, paralegals, and similar professional searchers. Now the world has changed, and hundreds of millions of people engage in information retrieval every day when they use a web search engine or search their email. Information retrieval is fast becoming the dominant form of information access, overtaking traditional database-style searching (the sort that is going on when a clerk says to you: “I’m sorry, I can only look up your order if you can give me your Order ID”).
IR can also cover other kinds of data and information problems beyond that specified in the core definition above. The term “unstructured data” refers to data which does not have clear, semantically overt, easy-for-a-computer structure. It is the opposite of structured data, the canonical example of which is a relational database, of the sort companies usually use to maintain product inventories and personnel records.
-
Access Control Models for XML
Access Control Models for XML
Abdessamad Imine
Lorraine University & INRIA-LORIA Grand-Est, Nancy, France
[email protected]

Outline
• Overview on XML
• Why XML Security?
• Querying Views-based XML Data
• Updating Views-based XML Data

What is XML?
• eXtensible Markup Language [W3C 1998]
    <files>
      <record>
        <name>Robert</name>
        <diagnosis>Pneumonia</diagnosis>
      </record>
      <record>
        <name>Franck</name>
        <diagnosis>Ulcer</diagnosis>
      </record>
    </files>
• The same document viewed as a tree:
    /files
      /record
        /name: Robert
        /diagnosis: Pneumonia
      /record
        …

XML for Documents
• SGML
• HTML - hypertext markup language
• TEI - Text markup, language technology
• DocBook - documents -> html, pdf, ...
• SMIL - Multimedia
• SVG - Vector graphics
• MathML - Mathematical formulas

XML for Semi-Structured Data
• MusicXML
• NewsML
• iTunes
• DBLP http://dblp.uni-trier.de
• CIA World Factbook
• IMDB http://www.imdb.com/
• XBEL - bookmark files (in your browser)
• KML - geographical annotation (Google Maps)
• XACML - XML Access Control Markup Language

XML as Description Language
• Java servlet config (web.xml)
• Apache Tomcat, Google App Engine, ...
• Web Services - WSDL, SOAP, XML-RPC
• XUL - XML User Interface Language (Mozilla/Firefox)
• BPEL - Business process execution language
-
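The tree view of the `<files>`/`<record>` sample in the slides above can be reproduced with a few lines of Python. This is only an illustrative sketch using the standard library's `xml.etree.ElementTree`; the helper name `leaf_paths` and the `SAMPLE` constant are hypothetical and do not come from the slides.

```python
import xml.etree.ElementTree as ET

# The sample document from the slides, with whitespace normalized.
# SAMPLE and leaf_paths are illustrative names, not part of the slides.
SAMPLE = """<files>
  <record>
    <name>Robert</name>
    <diagnosis>Pneumonia</diagnosis>
  </record>
  <record>
    <name>Franck</name>
    <diagnosis>Ulcer</diagnosis>
  </record>
</files>"""


def leaf_paths(xml_text):
    """Return (path, text) pairs for every leaf element, in document order."""
    root = ET.fromstring(xml_text)
    pairs = []

    def walk(node, prefix):
        # Build a slash-separated path such as /files/record/name.
        path = f"{prefix}/{node.tag}"
        children = list(node)
        if not children:
            pairs.append((path, (node.text or "").strip()))
        for child in children:
            walk(child, path)

    walk(root, "")
    return pairs


if __name__ == "__main__":
    for path, text in leaf_paths(SAMPLE):
        print(path, "=", text)
```

Each leaf is reported with the path from the root, which mirrors the /files, /record, /name, /diagnosis labels drawn on the tree slide.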
ISO/IEC 19757-8:2008(E)
INTERNATIONAL STANDARD ISO/IEC 19757-8
First edition 2008-12-15
Information technology — Document Schema Definition Languages (DSDL) — Part 8: Document Semantics Renaming Language (DSRL)
Technologies de l'information — Langages de définition de schéma de documents (DSDL) — Partie 8: Langage pour renommer une sémantique de documents (DSRL)
Reference number ISO/IEC 19757-8:2008(E) © ISO/IEC 2008
PDF disclaimer
This PDF file may contain embedded typefaces. In accordance with Adobe's licensing policy, this file may be printed or viewed but shall not be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In downloading this file, parties accept therein the responsibility of not infringing Adobe's licensing policy. The ISO Central Secretariat accepts no liability in this area. Adobe is a trademark of Adobe Systems Incorporated. Details of the software products used to create this PDF file can be found in the General Info relative to the file; the PDF-creation parameters were optimized for printing. Every care has been taken to ensure that the file is suitable for use by ISO member bodies. In the unlikely event that a problem relating to it is found, please inform the Central Secretariat at the address given below.
COPYRIGHT PROTECTED DOCUMENT
© ISO/IEC 2008 All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or ISO's member body in the country of the requester.