Downloads/PDF for Archive.Pdf

Total Page:16

File Type:pdf, Size:1020Kb

Downloads/PDF for Archive.Pdf European Space Agency Symposium “Ensuring Long-Term Preservation and Adding Value to Scientific and Technical Data”, 5 - 7 October 2004, Frascati, Italy Providing Authentic Long-term Archival Access to Complex Relational Data Stephan Heuscher a, Stephan J¨armann b, Peter Keller-Marxer c, and Frank M¨ohle d Swiss Federal Archives, Archivstrasse 24, CH-3003 Bern, Switzerland [email protected], [email protected], [email protected], [email protected] (Dated: July 19, 2004) We discuss long-term preservation of and access to relational databases. The focus is on national archives and science data archives which have to ingest and integrate data from a broad spectrum of vendor-specific relational database management systems (RDBMS). Furthermore, we present our solution SIARD which analyzes and extracts data and data logic from almost any RDBMS. It enables, to a reasonable level of authenticity, complete detachment of databases from their vendor-specific environment. The user can add archival descriptive metadata according to a customizable schema. A SIARD database archive integrates data, data logic, technical metadata, and archival descriptive information in one archival information package, independent of any specific software and hardware, based upon plain text files and the standardized languages SQL and XML. For usage purposes, a SIARD archive can be reloaded into any current or future RDBMS which supports standard SQL. In addition, SIARD contains a client that enables ‘on demand’ reload of archives into a target RDBMS, and multi-user remote access for querying and browsing the data together with its technical and descriptive metadata in one graphical user interface. Keywords: Long-Term Digital Preservation, Digital Archives, Relational Databases, Value of Information The urgency of deep-infrastructure solutions for long- tific activities has drastically increased during the past term digital preservation and archiving was clearly for- decade. In particular, a recent international workshop mulated in the late 90’s of the past century by national on the long-term preservation of databases (16) revealed archives and libraries (1; 2; 3; 4; 5; 6; 7) as well as space that many archives do have a long-standing practice and agencies and institutions in earth observation, oceanog- experience in ingesting and preserving relational data, raphy, and astronomy (8; 9). The finding of problem but their daily work is constrained to the treatment of statements and strategies is still in progress and was re- rather simply structured data sets, requires extensive cently supported by a charta of the United Nations (10). manual work by archives personnel, and does not allow Deep and lasting solutions are still not available, and for for smooth and standardized integration of data and de- national archives and libraries (faced with very heteroge- scriptive meta data on a level required to ingest, preserve, neous types of content) it is broadly accepted that digital and provide access to complex relational databases. collections are growing at a rate that outpaces their abil- In this paper we present a method and applica- ity to manage and preserve them (11). tion named “Software-Invariant Archiving of Relational There is an ongoing process of recognition that long- Databases” (SIARD), developed at the Swiss Federal term digital preservation poses similar problems in di- Archives. It completely detaches typed relational data verse disciplines. Despite of different vocabularies used from almost any relational database management sys- by different communities, research and development in tem, while still retaining most of the original data logic arXiv:cs/0408054v1 [cs.DL] 24 Aug 2004 the field can only be successful in a joint effort. During and integrating data and metadata in one archival infor- the past decade, however, decisive progress was achieved mation package that is based on text files and standard- in analytical and conceptual work (12; 13; 14). Fur- ized technologies. In Section II we discuss the technical thermore, the Open Archival Information System (OAIS) and intellectual complexity of relational data in modern reference model became widely accepted in diverse dis- database systems, the resulting problems for long-term ciplines and covers a full range of archival information preservation, and its relevance to archives. The objec- preservation functions including ingest, archival storage, tives in the development of SIARD are described in Sec- data management, access, and dissemination. It has be- tion III, while Section IV covers SIARD’s system archi- come the international standard ISO 14721:2003 (15) and tecture, workflow, features, and development platform. may be applicable to any archive as it does not refer to specific implementations or archival strategies but merely provides a common terminology and functional frame- I. INTRODUCTION work to discuss different implementation approaches. Many current development projects focus on the Relational data is one of the oldest forms of structured archiving of digital images (digitized photographs or pa- information representation, intuitively used already cen- per documents) and sound recordings, while more com- turies before the “digital age”. With the rise of com- plex types of digital information have been neglected, puter technology, the introduction of mathematical for- even though their relevance for governmental and scien- mulations of the relational data model in the mid-20th 2 century, and the international standardization of a corre- widely used in modern databases. As for the linking of ta- sponding data definition and query language, relational bles, the connection between the data and its data types data has become an omnipresent method to organize data and domain definitions is not preserved when table data for electronic data processing in almost every field of is trivially exported to external plain text files. work, form business activities to scientific research and In addition to those entities and features mentioned government administration. above, modern RDBMS include, for example: During the past two decades, usage has developed from • processing single-table data files with specific applica- Check constraints and assertions: For a single col- tion software to generic relational database management umn of a table or a set of entire tables, assure that systems (RDBMS). These have internal mechanisms for changing or entering data does never violate de- logical and physical organization of arbitrary relational fined data types, quantitative restrictions, or value data models, are able to physically store terabytes of domains. In particular, it can be assured that data data, cover rich data types (including internal procedural items in a table column will never be empty. code), enable multi-user transactions, and provide inter- • Views: Assemble selected parts of several tables nal data life-cycle management. The definition, repre- and operate on them as customized, virtual tables. sentation, management, and query of relational data was thereby standardized and separated from specific appli- • Triggers: Force the RDBMS to initiate timed op- cation logic and application software that operates on the erations on data when user-specified conditions are database. As a result, RDBMS have become core com- met, for example log and audit user activities. ponents of almost any type of digital information system. • Functions (basic and user-defined): Perform nu- It is obvious that this development has decisive impacts merical calculations, conversions, or character op- on the work of those institutions which are charged to erations on data items or sets. collect or accept digital data from various data sources, to make it broadly accessible, and to preserve it over • Stored Procedures: Store and execute programs in- decades: national archives and libraries, science data side the RDBMS to perform common or critical archives, or business companies being under special legal tasks which are not part of the specific application regulations for long-term data retention (like, for exam- software outside the RDBMS. ple, the pharmaceutical sector (19)). • Foreign Keys: Ensure referential data integrity, i.e., automatically prevent that values can be stored in rows of one table if there are no corresponding val- II. COMPLEXITY AND RELEVANCE ues in referencing entries within the database. A. Technical Complexity of Relational Data • Grants and Roles: Define user profiles and assign or withdraw privileges, for example to create new One consequence of the progress in database technol- tables or access certain parts of the database. ogy is that relational data and relational databases be- come highly complex. They often consist of hundreds of For long-term preservation in national archives, rela- linked tables (i.e. physical representations of relational tional data is collected from many different database sys- entities1), which makes it impossible to handle and query tems and has to be retained and kept processable and table data outside an RDBMS if these links become bro- accessible for decades. It is therefore essential to store ken or cannot be managed automatically anymore. and maintain the databases independent of any specific Furthermore, any data item in a RDBMS has a pre- and short-lived products (or at least transferring them all cisely defined data type and domain. (Entire tables or into only one preferred product). In fact, most RDBMS integral parts of them may likewise have
Recommended publications
  • Features and Functionality History Mimer SQL, Features and Functionality History © Copyright Mimer Informationtechnology AB, August 2017
    Mimer SQL Features and Functionality History Mimer SQL, Features and Functionality History © Copyright Mimer InformationTechnology AB, August 2017 The contents of this manual may be printed in limited quantities for use at a Mimer SQL installation site. No parts of the manual may be reproduced for sale to a third party. Information in this document is subject to change without notice. All registered names, product names and trademarks of other companies mentioned in this documentation are used for identification purposes only and are acknowledged as the property of the respective company. Companies, names and data used in examples herein are fictitious unless otherwise noted. Produced and published by Mimer InformationTechnology AB, Uppsala, Sweden. P.O. Box 1713, SE-751 47 Uppsala, Sweden. Tel +46(0)18-780 92 00. Fax +46(0)18-780 92 40. Mimer Web Sites: http://developer.mimer.com http://www.mimer.com Mimer SQL i Features and Functionality History Contents Chapter 1 New Features and Functions ........................................................... 1 The Mimer SQL Database Server..................................................................................1 Internal Databank Identifier Invalid (V9.3.8B) ............................................................... 1 Background Dbcheck Hang (V9.3.8).............................................................................. 1 LOGDB Restart (V9.3.8).................................................................................................. 1 Improved Control of LOB References
    [Show full text]
  • Database Access Via Programming Languages 5DV119 — Introduction to Database Management Ume˚Auniversity Department of Computing Science Stephen J
    Database Access via Programming Languages 5DV119 | Introduction to Database Management Ume˚aUniversity Department of Computing Science Stephen J. Hegner [email protected] http://www.cs.umu.se/~hegner Database Access via Programming Languages 20150213 Slide 1 of 20 The Limitations of Stand-Alone SQL • SQL is primarily a language for data definition, retrieval, and update. • It is not designed for complex computation. • Enhancements such as OLAP are useful for certain specific tasks, but still leave many important tasks difficult or impossible to achieve. Theoretical shortcoming: Unlike most programming languages, it is not Turing complete. • There are computations which cannot be expressed in SQL at all. Interoperability shortcoming: Stand-alone SQL clients are generally vendor specific. • Concurrent access to databases of different vendors is not possible with a single client. • Access to multiple databases via the same client is usually awkward, requiring vendor-specific directives. Database Access via Programming Languages 20150213 Slide 2 of 20 The Limitations of Stand-Alone SQL: 2 Practical shortcomings: There is also a host of practical reasons why stand-alone SQL does not suffice: Accessibility: Most users of databases are not computer scientists. • They need a custom interface for non-experts. • Even experts can often work more effectively via custom interfaces. Simplicity: Real-world database schemata are often very large and complex. • Users often need to work with custom views which present what they need to know and are allowed to know. Security: The correct management of access rights is a very complex task. • It is often easier to manage access by admitting access via specific interfaces.
    [Show full text]
  • Openvms Guide
    Mimer SQL OpenVMS Guide Version 11.0 Mimer SQL, OpenVMS Guide, Version 11.0, January 2018 © Copyright Mimer Information Technology AB. The contents of this manual may be printed in limited quantities for use at a Mimer SQL installation site. No parts of the manual may be reproduced for sale to a third party. Information in this document is subject to change without notice. All registered names, product names and trademarks of other companies mentioned in this documentation are used for identification purposes only and are acknowledged as the property of the respective company. Companies, names and data used in examples herein are fictitious unless otherwise noted. Produced and published by Mimer Information Technology AB, Uppsala, Sweden. Mimer SQL Web Sites: https://developer.mimer.com https://www.mimer.com Contents i Contents .............................................................................................................................. i Chapter 1 Introduction .......................................................................................1 About Mimer SQL for OpenVMS...................................................................................1 The Mimer SQL Database Server .................................................................................. 1 Embedded SQL................................................................................................................. 1 Module SQL......................................................................................................................
    [Show full text]
  • Sql Plus Download Mac
    Sql plus download mac LINK TO DOWNLOAD oracle sql plus free download - Insight Developer for Oracle, SQLite Database, Orac, and many more programs. SQL*Plus Instant Client can be installed in two ways: Download the packages from the Oracle Technology Network (OTN). Copy the same files that are in the packages from an Oracle Database 10 g Client Administrator installation. Both the SQL*Plus and OCI packages must be from the same Oracle Database version, for example, SQL*PLus on Mac. December 1st, Goto comments Leave a comment. I would think installing SQL*Plus on the Mac would be point, click download, point click, install bam it works. Nah. It did install mostly straight forward on my old Mac. Got a new Mac and no dice. Jul 30, · Download the (free) Docker Community Edition for Mac (unless you’ve already got it installed on your system). This will enable you to run SQL Server from within a Docker container. To download, visit the Docker CE for Mac download page and click Get Docker. Find "Macintosh OSX" and download both the Basic and SQL*Plus packages from the Instant Client downloads page. Move the contents of both into a single convenient folder (I used /Applications/Application_folders/instantclient). Add the location to your PATH variable, and also set DYLD_LIBRARY_PATH to point to it; for example. That could affect running SQL*Plus from a shell script, for example. There are workarounds for the 11g instant client. The installation notes at the bottom of the download page have changed since I last did this, and it now says to hard link the library files to the user's ~/lib directory to avoid that issue.
    [Show full text]
  • Squirrel SQL Client
    Pagina: 1 di 22 Versione: 01.09 Data agg.: 08/02/2021 RM Squirrel SQL client guida rapida Autori: Marco Riva Revisori: g:\installazioni\squirrel sql client 4.1.0\squirrel sql client - guida rapida.docx Squirrel SQL client Pagina: 2 di 22 guida rapida Versione: 01.09 Data agg.: 08/02/2021 RM Sommario 1. Introduzione ........................................ 3 1.1. Fast startup .......................................... 3 2. Installazione ........................................ 4 2.1. Requisiti ............................................... 4 2.2. Download ............................................. 4 2.3. Installazione .......................................... 4 2.4. Aggiornamento ...................................... 4 3. Overview ............................................ 5 4. Configurazione ..................................... 5 4.1. Configurazione driver per DB2 for i ........... 5 4.2. Configurazione alias ............................... 7 5. Utilizzo .............................................. 12 5.1. Connessione ad un database .................. 12 5.2. Maggiori informazioni sull’area object ...... 14 5.2.1. Filtro e ricerca oggetti ..................... 14 5.2.2. Scheda content .............................. 15 5.2.3. Esportazione contenuti.................... 15 5.2.4. Modifica diretta in tabella ................ 16 5.3. Editor SQL .......................................... 17 5.3.1. Scorciatoie da tastiera .................... 17 5.3.2. Regole sintassi .............................. 17 5.3.3. Barra dei pulsanti ..........................
    [Show full text]
  • Database Proxies: a Data Management Approach for Component-Based Real-Time Systems∗
    Database Proxies: A Data Management approach for Component-Based Real-Time Systems¤ Andreas Hjertström, Dag Nyström, Mikael Sjödin Mälardalen Real-Time Research Centre, Västerås, Sweden {andreas.hjertstrom, dag.nystrom, mikael.sjodin}@mdh.se Abstract ture to share global data safely and efficiently by provid- ing concurrency-control, temporal consistency, and over- We present a novel concept, database proxies, which en- load and transaction management. Furthermore the typical able the fusion of two disjoint productivity-enhancement interface provided by the RTDBMS opens up for a whole techniques; Component Based Software Engineering new range of possibilities, much needed by industry, such (CBSE) and Real-Time Database Management Systems as dynamic run-time queries which could be a welcome (RTDBMS). This fusion is neither obvious nor intuitive contribution to aid in logging, diagnostics, monitoring [18], since CBSE and RTDBMS promotes opposing design goals; compared to the pre defined static data access often used by CBSE promotes encapsulation and decoupling of compo- developers today. nent internals from the component environment, whilst RT- CBSE promotes encapsulation of functionality into DBMS provide mechanisms for efficient and safe global reusable software entities that communicates through well data sharing. Database proxies decouple components from defined interfaces and that can be mounted together as an underlying database thus retaining encapsulation and building blocks. This enable a more efficient and structured component reuse, while providing temporally predictable development where available components can be reused or access to data maintained in database. We specifically tar- COTS components effectively can be integrated in the sys- get embedded systems with a subset of functionality with tem to save cost and increase quality.
    [Show full text]
  • Datasheet: Rapidrep® Test Suite
    Datasheet: RapidRep® Test Suite Overview: Test automation for the back-end The RapidRep® Test Suite is an innovative solution Test objects: Programs (non-GUI), data storage, data for the automated testing of objects that do not require processing, Web services, processes any user interaction (back-end). Test types: Function tests, migration tests, data com- parison tests, regression tests, and others RapidRep® is universally applicable and complies with all relevant ISO/IEC/IEEE testing standards. Further information: www.rapidrep.com/test-suite Important product features ● Rule Engine for the determination of expected re- XPath or Javascript sults (model-based testing with the help of sets of ● Integration of MS Excel as tool for test case design, rules) result presentation, management of rules ● Configurable solution for rule-based data quality ● Seamless connection to various test and defect evaluations and automated data comparison (with a management systems (see below) wizard) ● Version control of all states of development in a ● Integrated, efficient development environment for repository SQL and 2 script languages, incl. syntax highligh- ● Central administration of all settings for authentica- ting, proposal lists, business API tion and authorization ● Wizard for the connection to and further processing ● Documentation, Help, program surface are all avail- of Web services by means of SQL, XQuery, XSLT, able completely in German and English Supported systems Test management systems Repository databases ● HP Quality Center / ALM:
    [Show full text]
  • Presto: the Definitive Guide
    Presto The Definitive Guide SQL at Any Scale, on Any Storage, in Any Environment Compliments of Matt Fuller, Manfred Moser & Martin Traverso Virtual Book Tour Starburst presents Presto: The Definitive Guide Register Now! Starburst is hosting a virtual book tour series where attendees will: Meet the authors: • Meet the authors from the comfort of your own home Matt Fuller • Meet the Presto creators and participate in an Ask Me Anything (AMA) session with the book Manfred Moser authors + Presto creators • Meet special guest speakers from Martin your favorite podcasts who will Traverso moderate the AMA Register here to save your spot. Praise for Presto: The Definitive Guide This book provides a great introduction to Presto and teaches you everything you need to know to start your successful usage of Presto. —Dain Sundstrom and David Phillips, Creators of the Presto Projects and Founders of the Presto Software Foundation Presto plays a key role in enabling analysis at Pinterest. This book covers the Presto essentials, from use cases through how to run Presto at massive scale. —Ashish Kumar Singh, Tech Lead, Bigdata Query Processing Platform, Pinterest Presto has set the bar in both community-building and technical excellence for lightning- fast analytical processing on stored data in modern cloud architectures. This book is a must-read for companies looking to modernize their analytics stack. —Jay Kreps, Cocreator of Apache Kafka, Cofounder and CEO of Confluent Presto has saved us all—both in academia and industry—countless hours of work, allowing us all to avoid having to write code to manage distributed query processing.
    [Show full text]
  • Openvms Guide
    Mimer SQL OpenVMS Guide Version 10.1 Mimer SQL, OpenVMS Guide, Version 10.1, December 2017 © Copyright Mimer Information Technology AB. The contents of this manual may be printed in limited quantities for use at a Mimer SQL installation site. No parts of the manual may be reproduced for sale to a third party. Information in this document is subject to change without notice. All registered names, product names and trademarks of other companies mentioned in this documentation are used for identification purposes only and are acknowledged as the property of the respective company. Companies, names and data used in examples herein are fictitious unless otherwise noted. Produced and published by Mimer Information Technology AB, Uppsala, Sweden. P.O. Box 1713, SE-751 47 Uppsala, Sweden. Tel +46(0)18-780 92 00. Fax +46(0)18-780 92 40. Mimer SQL Web Sites: http://developer.mimer.com http://www.mimer.com Contents i Contents .............................................................................................................................. i Chapter 1 Introduction ....................................................................................... 1 About Mimer SQL for OpenVMS...................................................................................1 The Mimer SQL Database Server .................................................................................. 1 Embedded SQL................................................................................................................. 1 JDBC Driver......................................................................................................................
    [Show full text]
  • What Is Mimer SQL 11 C
    WHAT IS MIMER SQL 11 Mimer is a company with a strong foundation in standards and innovation. The Mimer SQL dataBase server has from the start been built with scalability and flexiBility in mind, which is why it is, in the true sense of the word, the Swiss Army Knife of dataBases. With Mimer SQL you can freely mix between various technologies for replication, database partitioning, distribution, physical hardware, in-memory operation, operating systems, all depending on your applications’ needs. Thanks to this flexiBility, and an extremely compact code Base, Mimer SQL has an unsurpassed support for multiple computing platforms where we can provide the exact same functions and capabilities, regardless if Mimer SQL runs in an emBedded system such as an IoT device, or a vehicle, in the cloud, or in the datacenter of an enterprise. Being a true 64-bit solution, the only thing that limits the size of data that Mimer SQL can handle is the hardware. Our codeBase is designed with this flexiBility in mind which explains the Broad range of operating systems and architectures Mimer SQL is ported to, and the range of computing environments we support. The way we can offer a complete, Core SQL-2016 compliant dataBase over the whole range of products, is unique in the market. Mimer SQL is used for enterprise, cloud, desktop, emBedded systems, and real time applications, and we have versions for operating systems such as Windows, Linux, macOS, and VMS as well as support for emBedded and real time operating systems such as INTEGRITY from Green Hills Software, QNX from BlackBerry, VxWorks, and Android.
    [Show full text]
  • Mimer SQL on Openvms Present and Future
    Mimer SQL on OpenVMS Present and Future Bengt Gunne, CTO 1 Agenda . Background . Mimer customers . Platforms . Technical features . Mimer SQL on OpenVMS . Ongoing development . Q&A 2 Bengt Gunne . Started working with Mimer in 1981 – Have worked with many parts of the system such as: . Database kernel . Transaction handling . Client/server communications . Clients such as ODBC and ADO.NET . Overall design . Multiuser systems for PDP, VMS, HP-UX, Windows etc. Most recently written a new SQL optimizer – Head of development since 1991 – Chief Technical Officer 3 Who was Mimer? Mimer was a giant in the Norse Mythology who guarded the well of wisdom. The gods came to Mimer for advice. When they looked into the well they could see everything that happened in the world. The god Oden even took out one of his eyes and put it in the well to be able to see everything at all times. Today, ordinary people come to Mimer for advice… 4 Mimer Information Technology AB . HQ in Uppsala, Sweden – Office in Stockholm and Beijing – Partners in China, Japan, Korea, Central Europe, and USA . World class experts in relational database technology . Developer of the Mimer SQL product family – Enterprise Solutions – Industrial/Automotive/Embedded Solutions – Mobile Solutions . Mimer SQL used in mission critical systems world wide since the 1970s 5 What is Mimer SQL? . Relational database management system . Standard SQL . Runs on many platforms – Tight integration with OpenVMS 6 Company focus . The company focuses solely on Mimer SQL – Tools are through third party integrations . How is this possible/facilitated? – Standard SQL – Standard programming interfaces Requires a truly open system 7 Mimer SQL background .
    [Show full text]
  • Data Management in Component-Based Embedded Real-Time Systems
    Mälardalen University Doctoral Thesis No. 125 Data Management in Component-Based Embedded Real-Time Systems Andreas Hjertström June 2012 School of Innovation, Design and Engineering Copyright ⃝c Andreas Hjertström, 2012 ISSN 1651-4238 ISBN 978-91-7485-064-2 Printed by Mälardalen University, Västerås, Sweden Distribution: Mälardalen University Press Abstract This thesis presents new data management techniques for run-time data in component-based embedded real-time systems. These techniques enable data to be modeled, analyzed and structured to improve data management dur- ing system development, maintenance, and execution. The foundation of our work is a case-study that identifies a number of problems with current state-of- practice in data management for industrial embedded real-time systems. We introduce two novel concepts: the data entity and the database proxy. The data entity is a design-time concept that allows designers to manage data objects throughout different design and maintenance activities. It includes data-type specification, documentation, specification of timing and quality pro- perties, tracing of dependencies between data objects, and enables analysis and automated validation. The database proxy is a run-time concept designed to allow the use of state- of-the-art database technologies in contemporary software-component tech- nologies for embedded systems. Database proxies decouple components from an underlying database residing in the component framework. This allows components to remain encapsulated and reusable, while providing temporally predictable access to data maintained in a database, thus enabling the use of database technologies, which has previously excluded, in these systems. To validate our proposed techniques, we present a tool implementation of the data entity as well as implementations of the database proxy approach, using commercial tools, the AUTOSAR standardized automotive software ar- chitecture, and automotive hardware.
    [Show full text]