
DASISH
Data Service Infrastructure for the Social Sciences and Humanities
EC FP7 Grant Agreement Number: 283646

Deliverable Report
Deliverable: D5.2A & D5.2B
Deliverable Name Part A: Metadata Quality Improvement
Deliverable Name Part B: Portal Progress report

Responsible Part A: DANS
Authors: Hervé L'Hours (UEssex - UK Data Archive), Lene Offersgaard (UCPH), Marion Wittenberg (KNAW-DANS), Bartholomäus Wloka (OEAW)
Contributing: Lucy Bell (UEssex), Emily Ekstrand-Brummer (KNAW-DANS), Tom Ensom (UEssex), Mike Priddy (KNAW-DANS)

Responsible Part B: MPG-PL
Contributing: Catharina Wasner (GESIS), Matej Durco, Bartholomäus Wloka (OEAW), Stephanie Roth, Olof Olsson (UGOT), Przemek Lenckiewic, Kees Jan van de Looij, Binyam Gebreke, Daan Broeder (MPG-PL)

Work Package Leader: Daan Broeder (MPG-PL)
www.dasish.eu

Table of Contents

Executive Summary – Part A
Executive Summary – Part B

PART A – METADATA QUALITY IMPROVEMENT
Guide to the Reader
  Metadata lifecycle
  Metadata strategies of CLARIN, DARIAH and CESSDA
  Background information on metadata
1. Introduction
2. Metadata and metadata quality
  2.1. The Research Data Lifecycle and Metadata Lifecycle
  2.2. Types of Metadata
  2.3. Metadata Quality
3. Metadata lifecycle
  3.1. Lifecycles Referenced
  3.2. Actors and Communications across the Lifecycle
  3.3. Full Lifecycle Planning
  3.4. Recurrent Actions and Events
  3.5. Sequential Actions
4. Research Infrastructure Model
5. Metadata Strategies of CLARIN
  5.1. Organisation of CLARIN
  5.2. CLARIN metadata strategies
  5.3. Metadata in the infrastructure
  5.4. Initiatives to ensure metadata quality in the infrastructure
6. DARIAH’s strategies for metadata
  6.1. Organisation of DARIAH
  6.2. DARIAH standardisation strategies
  6.3. DARIAH metadata strategies
  6.4. Particular Initiatives in the infrastructure
  6.5. Metadata in the infrastructure
  6.6. Initiatives to ensure metadata quality in the infrastructure
7. Metadata strategies of CESSDA
  7.1. Organisation of CESSDA
  7.2. CESSDA’s metadata strategies
  7.3. Metadata in the infrastructure
  7.4. Initiatives to ensure metadata quality in the infrastructure
8. Cross Fertilisation between CESSDA, CLARIN, and DARIAH
  8.1. Sharing lifecycle models, descriptions, and diagrams of infrastructures
  8.2. Mandatory or recommended metadata profiles
  8.3. Sharing of knowledge and linking of resources
  8.4. Discussion on metadata quality aspects between and within infrastructures
9. Using the DASISH Joint Metadata Repository Prototype to exemplify challenges on Metadata Quality
  9.1. CreationDate
  9.2. Creator
  9.3. Language
  9.4. Discipline
  9.5. Summing up
10. Conclusion

PART B: PORTAL PROGRESS REPORT
11. Introduction
12. The Use of Interdisciplinary Metadata Catalogues
13. Implementation
  13.1. The SSH Metadata Providers
  13.2. SSH Metadata Frameworks and Schemas
  13.3. The Metadata Catalogue Software and Workflow
    13.3.1 UI Modifications
    13.3.2 Metadata Mapping Module
    13.3.3 CMDI Mapping Generator
    13.3.4 CKAN Performance Issues
  13.4. Facets for the DASISH Catalogue
  13.5. Mapping Metadata to Facets and Fields
  13.6. Normalization
14. Metadata Quality Improvement
  14.1. Suggestions on Metadata Improvement from Task 5.3
  14.2. Improving the Catalogue
15. Findings
16. Future of the DASISH Catalogue

References
Glossary

PART A APPENDICES
Appendix A: Background information about metadata
  Metadata Standards and Schemas
  Choosing a Metadata Schema
  Metadata Schemas
  Metadata Interoperability
  Structural Interoperability
  Controlled Vocabularies
  Metadata Schema Registries
  ISO/IEC 11179
  Types of metadata
  Descriptive Metadata
  Contextual Metadata
  Technical Metadata
  Preservation Metadata
  Administrative Metadata
  Structural Metadata
  Saving Time and Money with Quality Metadata
  Some Tips for Creating Quality Metadata