2. Common HOPE Metadata Structure
Total Page:16
File Type:pdf, Size:1020Kb
HOPE Heritage of the People’s Europe Grant agreement No. 250549 Deliverable D2.2 The Common HOPE Metadata Structure, including the Harmonisation Specifications Deliverable number D2.2 Version 1.1 Authors Bert Lemmens – Amsab-ISG Joris Janssens – Amsab-ISG Ruth Van Dyck – Amsab-ISG Alessia Bardi – CNR-ISTI Paolo Manghi – CNR-ISTI Eric Beving – UPIP Kathryn Máthé – KEE-OSA Katalin Dobó – KEE-OSA Armin Straube – FES-A Delivery Date 31/05/2011 Dissemination Level Public HOPE is co-funded by the European Union through the ICT Policy Support Programme D2.2 Common HOPE Metadata Structure & Harmonisation Requirements V1.1 – 31/05/2011 Page 2 of 266 HOPE is co-funded by the European Union through the ICT Policy Support Programme D2.2 Common HOPE Metadata Structure & Harmonisation Requirements V1.1 – 31/05/2011 Page 3 of 266 Table of Contents Table of Contents 3 1. Introduction 8 1.1 Task Description 8 1.2 Approach 10 1.2.1 Task breakdown 10 1.2.2. Dublin Core Application Profile Guidelines 14 1.2.3. Used Terminology 15 1.2.4. Structure of the Document 17 1.3 Glossary 18 1.3.1 Hope glossary 18 1.3.2 Technical Glossary 18 2. Common HOPE Metadata Structure 24 2.1 Functional requirements 24 2.1.1 General 24 2.1.1.1 What does HOPE want to accomplish with the HOPE Aggregator? 24 2.1.1.2 What the HOPE Aggregator will not attempt to do? 26 2.1.2. Aggregation 26 2.1.2.1 What are the key characteristics of the HOPE metadata? 27 2.1.2.2 How do the characteristics of the HOPE metadata affect the design of the Common HOPE Metadata Structure? 28 2.1.3. Dissemination 30 2.1.3.1 What are the key characteristics of the HOPE users? 30 2.1.3.2 How does the HOPE System want to serve its users? 31 2.1.3.3 Listing the Functional Requirements of the Common HOPE Metadata Structure 33 2.2 Transformation Procedure 36 2.2.1. Definitions 36 2.2.2 Transformation Procedure 37 2.2.3. Transformation Issues regarding Semantic Interoperability 40 2.3 HOPE Data Model 42 2.3.1 Introduction 42 HOPE is co-funded by the European Union through the ICT Policy Support Programme D2.2 Common HOPE Metadata Structure & Harmonisation Requirements V1.1 – 31/05/2011 Page 4 of 266 2.3.2 HOPE Data Model 55 2.3.2.1 Diagram 55 2.3.2.2 Entities 56 2.4 HOPE Domain Profiles 119 2.4.1 Introduction 119 2.4.1.1 Purpose of domain profiles 119 2.4.1.2 Definition of domain profiles 120 2.4.2 HOPE Domain Profiles 122 2.4.2.1 HOPE archive profile 124 2.4.2.2 HOPE library profile 127 2.4.2.3 HOPE visual profile 131 2.4.2.4 HOPE audio-visual profile 135 2.4.2.5 HOPE Dublin Core profile 138 2.5 HOPE XML Schema 141 2.5.1 Introduction 141 2.5.2 XML Schema 142 2.5.2.1 Enumerations 142 2.5.2.2 Domain 143 2.5.2.3 Bi-/one-directional relationships 143 2.5.3 Structure of the XML Schema 144 2.5.3.1 Entity level 144 2.5.3.2 Complex types 145 3. Harmonisation Requirements 153 3.1 Local and Persistent Identifiers 153 3.1.1 Introduction 153 3.1.1.1 Objective 153 3.1.1.2 Survey Results 154 3.1.2. General requirements 154 3.1.2.1 General requirement 01: identifiers and URLs 154 3.1.2.2. General requirement 02 : persistence of identifiers and URLs 155 3.1.2.3. General requirement 03: uniqueness of identifiers 155 3.1.2.4. General requirement 04: identifiers (except PIDs) should not be URLs 156 3.1.3. PID services and resolver services 156 HOPE is co-funded by the European Union through the ICT Policy Support Programme D2.2 Common HOPE Metadata Structure & Harmonisation Requirements V1.1 – 31/05/2011 Page 5 of 266 3.1.3.1. What is a PID service? 156 3.1.3.2 Requirements for the usage of PIDs in the HOPE System - Glossary 159 3.1.3.3 Requirements for the HOPE PID Service 161 3.1.4. Scenarios and requirements for creation, supply and update of identifiers by content providers 161 3.1.4.1 Structure of scenarios 162 3.1.4.2 Requirements for metadata records (i.e. descriptive units) and landing pages 162 3.1.4.3 Requirements for digital files 164 3.1.4.4 Requirements for CP authority records 172 3.2. Granularity 175 3.2.1 Survey Results 175 3.2.2 Europeana and Granularity 176 3.2.3 Recording Granularity in the Common HOPE Metadata Structure 177 3.2.4 Hierarchical Descriptions in the Descriptive Unit Entity 178 3.2.4.1 Elements 178 3.2.4.2 Hierarchical structures 182 3.2.5 Compound Digital Objects in the Digital Resource Entity 194 3.2.5.1 Elements 198 3.2.5.2 Acceptance Levels 200 3.3. Descriptive and Administrative Metadata 203 3.3.1 Mandatory, Recommended & Optional Elements 203 3.3.1.1 HOPE Aggregator 204 3.3.1.2 Europeana 205 3.3.1.3 HOPE@Europeana: 207 3.3.1.4 IALHI Portal 207 3.3.2. Metadata Normalisation 213 3.3.2.1 Dates 214 3.3.2.2 Languages 216 3.3.2.3 Scripts 218 3.3.2.4 Description Level 219 3.3.2.5 Rights 220 3.3.2.6 Type 221 4. Case Studies Semantic Enrichment 223 HOPE is co-funded by the European Union through the ICT Policy Support Programme D2.2 Common HOPE Metadata Structure & Harmonisation Requirements V1.1 – 31/05/2011 Page 6 of 266 4.1. Common HOPE Authorities 223 4.1.1.Survey results 223 4.1.1.1 Content Providers Survey 223 4.1.1.2 Additional Survey 225 4.1.2. Mapping and merging procedure 227 4.1.2.1 HOPE Authority File Manager Tool 227 4.1.2.2 Agents 230 4.1.2.3 Places 231 4.1.2.4 Concepts 232 4.1.2.5 Events 232 4.1.3 HOPE Themes 233 4.2. Multilingual Access 235 4.2.1 Introduction 235 4.2.1.1 Sources of information 235 4.2.1.2 Purpose of this document 235 4.2.1.3 Building a multilingual controlled vocabulary 237 4.2.2 Hope Metadata: Language and Scripts, Translations and Transliterations 238 4.2.2.1 Languages / translations 238 4.2.2.2 Scripts / transliterations 239 4.2.3 Resources, Tools and Services 241 4.2.3.1 Introduction 241 4.2.3.2 Availability of controlled vocabularies and tools 242 4.2.3.3 MultiMatch project 243 4.2.3.4 EuropeanaConnect 244 4.2.3.5 Hitime 246 4.2.3.6 Multilingual vocabularies for subjects / topical terms 247 4.2.3.7 Conclusion 249 4.2.4 Recording Multilingual HOPE Metadata 252 4.2.4.1 Translation issues 252 4.2.4.2 Transliteration issues 255 4.2.4.3 Issues related to the use of special characteristics and diacritics 259 4.2.5 Enrichment: Building Multilingual HOPE Metadata 262 Labels 262 HOPE is co-funded by the European Union through the ICT Policy Support Programme D2.2 Common HOPE Metadata Structure & Harmonisation Requirements V1.1 – 31/05/2011 Page 7 of 266 Linked Data 262 Online Translation Tools 262 Transliterations 262 Multilingual vocabularies 263 Persons & Places 263 HOPE Historical Themes 264 BIBLIOGRAPHY 266 APPENDIX A, B, C 266 HOPE is co-funded by the European Union through the ICT Policy Support Programme D2.2 Common HOPE Metadata Structure & Harmonisation Requirements V1.1 – 31/05/2011 Page 8 of 266 1. Introduction This chapter includes the description of the tasks 2.3 and 2.4 as presented in the HOPE Description of Work and the way these tasks have been approached. 1.1 Task Description This deliverable includes the specifications for the Common HOPE Metadata Structure and the subsequent harmonization requirements. These specifications and requirements have been drafting within the scope Tasks 2.3 and 2.4 of the HOPE Description of Work. The task description of these two tasks reads as follows: T2.3 Specification of the common metadata structure This task deals with the HOPE data model at the more syntactical level. T2.3.1 An inventory will be made of the metadata standards and schemes applied locally by the participants, the level of granularity of the descriptive units and the local practices for producing previews/thumbnails (low res scans, video stills, etc.) and the use of unique identifiers for referential integrity. These will be matched with the Europeana specifications (metadata scheme, previews/thumbnails sizes and formats and the object link) in order to assess the range of interoperability issues and the extent of the harmonization and enrichment work that needs to be done. T2.3.2 This task will then specify a common metadata structure for interoperability within HOPE and full-interoperability with Europeana. This will include: Specification of the overall metadata framework Agreed standards for metadata (descriptive, admin and structural) Agreed levels of descriptive granularity per content object type (collection, archive, book, periodical, image, etc.) Agreed standards for digital content representations (thumbnails, preview, low-res, etc.) per content object type Agreed common format for cross-searching Requirements for resolvable identifiers (incl. requirements for ensuring that the identifiers are compatible with different resolution solutions in case of migration from one solution to another) The specifications of the common HOPE metadata structure (incl.