First report of content metadata alignment with EUROPEANA requirements

ECP-2008-DILI-538025

JUDAICA Europeana

First report of content metadata alignment with EUROPEANA requirements

Deliverable number D2.2 Dissemination level Public Delivery date 31 December 2010

Status 1st Version

Rachel Heuberger UB-FFM, Elizabeth Selby JML, Anastasia Loudarou JMG, Giuliana De Francesco, Maria-Teresa Natale MIBAC, Author(s) Jean-Claude Kuperminc AIU and Gilles Rozier Medem-MCY, Zsuzsanna Toronyi MZSML, Pier Giacomo Sola AMITIE, Dov Winer, Lena Stanley-Clamp EAJC

eContentplus

This project is funded under the eContentplus programme1, a multiannual Community programme to make digital content in Europe more accessible, usable and exploitable.

1 OJ L 79, 24.3.2005, p. 1.

1/26 First report of content metadata alignment with EUROPEANA requirements

Table of Contents

1. INTRODUCTION...... 3 1.1 THE PURPOSE OF WORK PACKAGE 2 3 1.2 OVERVIEW OF THE DELIVERABLE 3 2. CONTENT IDENTIFICATION AND SELECTION...... 5 2.1 IMPLEMENTATION OF THE SURVEY 5 2.2 THE SURVEY AND GUIDELINES 5 2.3 SURVEY OF THE CONTENT TO BE DIGITIZED 6 2.4 SURVEY OF THE COLLECTIONS CONTENT ALREADY DIGITIZED 7 3. METADATA SURVEY AND ALIGNMENT...... 8 3.1 THE SURVEY AND GUIDELINES 8 3.2 RESULTS OF METADATA SURVEY 9 3.3 USE OF VOCABULARIES FOR SUBJECT DESCRIPTION 10 3.4 EXPORT FORMATS 10 4. STANDARDS OF METADATA SCHEMA...... 12 4.1 DESCRIBING STANDARDS 12 4.2 INFORMATION SCHEMES (METADATA) 12 5. CONCLUSIONS...... 18 5.1 CONTENT ALREADY DIGITIZED 18 5.2 CONTENT TO BE DIGITIZED 18 5.3 METADATA SURVEY 18 5.4 USE OF CONTROLLED VOCABULARY 19 5.5 FUTURE WORK OF WP 2 19 ANNEX 1 CONTENT TO BE DIGITIZED BY THE PROVIDERS...... 21 ANNEX 2 CONTENT ALREADY DIGITIZED BY THE PROVIDERS...... 24

2/26 First report of content metadata alignment with EUROPEANA requirements

1. Introduction

1.1 The Purpose of Work Package 2 Judaica Europeana will document the Jewish contribution to the European urban development by identifying content related to the Jewish presence and heritage in the cities of Europe. It will work together with European cultural institutions and provide access to a large quantity of European Jewish cultural heritage at the level of the cultural object.

In this context, Work Package 2 of the Judaica Europeana project (WP2) is tasked with:

1. Content identification and selection by means of auditing, assessing and selecting content to be digitised at the partner institutions collections and auditing in detail the available digitised resources. Establishing an advisory group of thematic domain experts that will support the process of content selection according to set criteria; 2. Surveying the existing metadata schema used currently by the partners and facilitating the mapping of those standards to a common metadata standard; 3. Assessing the requirements for the adoption of controlled vocabularies for Judaica purposes; 4. Producing tools to support the conversion of the partners’ data into the common harvesting format for ingestion into the main Europeana service. 5. Establishing a pilot knowledge management system to support the community of practice of scholars and cultural heritage professionals in the thematic domain area.

WP2 is in constant cooperation with other work packages in the project. In particular WP2 works closely together with WP3 and WP4: feeding information about standards for their work. The survey which is the basis of this deliverable was extended to include collecting information related to WP3 on digitization of the resources and on IPR issues for use within WP4.

1.2 Overview of the deliverable This deliverable is the outcome of the ongoing work processes within Judaica Europeana and especially between the content providing partners. It is based on the Audit report on Judaica content including metadata, first version of April 2010.1 The deliverable is the result of content identification and selection of resources that belong to the thematic domain “Jews in European Cities” and have been identified by the Judaica Europeana partners in their respective collections according to the jointly agreed guidelines and on the basis of a survey carried out. These resources will provide online access to the widest possible range of objects, documents, books and other artefacts on the theme of “Jews in European Cities”.

These collections are described in the grant agreement for the targeted project Judaica Europeana, Jewish Urban Digital European Integrated Cultural Archive, in chapter 4.1, Description of Work and are described in this deliverable under No.2.3 and No. 2.4 (pp. 6-7). 2

The first part of this report shows the results of the Judaica Europeana survey that was carried out by the partners in order to provide an accurate appraisal of the current position concerning

1 Audit report on Judaica content including metadata, 1st version, 30. April 2010, public, by Rachel Heuberger et al. 2 Grant agreement Number ECP-2008-DILI-538025, version 1. 12. 2009, pp. 10-33.

3/26 First report of content metadata alignment with EUROPEANA requirements

Judaicacontent identification and metadata alignment. The survey shows in detail the resources to be digitized in the course of the project, the material already digitized and selected for ingestion into Europeana and the metadata alignment. Thus the survey gives detailed and accurate information about the metadata schema used in the partners’ institutions and their adaptation to Europeana requirements.

The second part of this report lists the existing metadata standards that are in use today to catalogue and index different kinds of material. It deals with their compatibility within the Web architecture of Europeana. The exposition of key standards being used by various types of institutions represented by the partners, reflects the multiplicity of standards and wide range of metadata alignment being practiced. Thus the partners’ institutions, being a mix of archives, libraries and museums with Judaica content, are characterised by a large diversity of standards.

Intensive exchanges have taken place between the partners during Work Package meetings and in the intervals between meetings, regarding the evaluation of proposed metadata systems and the ongoing work experience in identifying the content for digitization as well as aligning the metadata according to standards and adapting them to Europeana requirements.

4/26 First report of content metadata alignment with EUROPEANA requirements

2. Content Identification and Selection

2.1 Implementation of the survey Content identification and selection was carried out according to the survey distributed to the partners, which was based on the earlier survey for the Audit Report on Judaica Content including Metadata delivered on April 30, 2010. The information was obtained from the responses to this survey of content providers to Judaica Europeana. Selection strategies were applied by the partners according to the survey and digitization procedures were established. The questions of the survey were formulated according to the list of elements defined in the Metadata Mapping & Normalisation Guidelines for the Europeana Prototype, Version 1.2.1, 18/01/2010, Europeana v1.0, chapter 2.1. 1 The content providers were asked to provide detailed information on their collections to be digitized as well as those that are already digitized. The questionnaire with explanations for the content providers as well as the results of this survey are presented below. The results and the relevant content within the partners’ institutions surveyed for this deliverable, are described here in detail.

2.2 The survey and guidelines The content providers were asked to answer the following questions:

1) Which collections will be digitized in this project? Fill in Chart I – Content to be digitized

Chart I: Content to be digitized

Category Description Fill In Provider Name of institution Type of Object in Text - printed or manuscript, in book, detail periodical, as single page, or Image like photos and postcards, or Artefact like textiles, coins, instruments , or sound records, videos, films …. Quantity State clearly if you mean books, pages, film Example: 1) 1000 postcards clips etc Type a separate line for each: 1, 2, 3, 2) 45 pages of letters … Format and In relevant cases like images, films or sound Quality records specify: JPEG, MPEG. HTML etc. Quality: Resolution, sampling rate, colour/grey scale, etc IPR - Only public domain

1 http://version1.europeana.eu/c/document_library/get_file?uuid=58e2b828-b5f3-4fe0-aa46- 3dcbc0a2a1f0&groupId=10602, p. 5.

5/26 First report of content metadata alignment with EUROPEANA requirements

2) Which collections are already digitized? Fill in Chart II – Content already digitized

Chart II : Content already digitized

Category Description Fill In Provider Name of institution Type of Object in Text: printed or manuscript, in book, detail periodical, as single page, or Image like photos and postcards, or Artefact like textiles, coins, instruments, or sound records, videos, films ….

Quantity State clearly if you mean books, pages, film Example: 1) 1000 postcards clips, etc. For each, type a separate line 2) 45 pages of letters … 1, 2, 3, Format and In relevant cases like images, films or sound Quality records, specify: JPEG, MPEG. HTML, etc. Quality: Resolution, sampling rate, colour/grey scale, etc IPR - Only public domain

2.3 Survey of the content to be digitized Audit, assessment and selection of content to be digitized by the partners

The detailed results of the survey concerning content to be digitized by the partners are listed in Annex 1.1 They show the vast range of material to be digitized in this project. The institutions represented are libraries, archives and museums and thus the material comprises all categories of objects according to the 4 types of Europeana classification: Images, text, sound and video. Text is prevalent, as all the institutions, including the museums, possess precious manuscripts, books and documents that constitute primary resources for documenting the Jewish past in European cities.

Thus not only the libraries (The Judaica Collection of the Frankfurt University Library, the Library of the Alliance Israélite in Paris and the Palatine Library in Parma) and archives (Hungarian Jewish Archive, Jewish Historical Institute in Warsaw, Venice State Archives), but also the museums have texts that will be digitized.

Photographic collections, art objects, paintings and illustrations – counted as images - are the second largest category of objects being digitized. They are part of the collections in the museums, as well as in the archives.

Audio material is being held by the Medem Library who have a large collection of records of Yiddish songs, as well as by the Jewish museum in London. The category of video is being represented by 8 films in the collection of the Alliance Israélite Universelle.

1 See Annex 1, p. 21 The information has already been partly digitized in the course of this year.

6/26 First report of content metadata alignment with EUROPEANA requirements

2.4 Survey of the collections content already digitized The detailed results of the survey in regard to content already digitized by the partners are listed in Annex 2.1 They give an account of the progress made in the course of this year, showing that all the institutions, with the exception of the State Archives of Venice, have already carried out substantial digitization work. Thus more than one million of pages of texts and thousands of images, with an impressive number of photos, are in digital form, ready be uploaded to Europeana at a chosen date. They include books, valuable manuscripts and archival material as well as a wide range of objects, including textiles, coins, personal and religious objects. These texts and images have all been digitized according to Europeana standards.

1 See Annex 2, p. 24.

7/26 First report of content metadata alignment with EUROPEANA requirements

3. Metadata Survey and Alignment

3.1 The Survey and Guidelines The content providers were asked to answer the following questions:

3) What Metadata Scheme is currently applied in your institution? Does the Metadata Scheme fulfil the minimum set of Metadata Elements? Fill in Chart III – Minimum Set of Metadata

Chart III – Minimum Set of Metadata

Category Description Fill in

Provider Name of Institution Standard Metadata Do Standard Metadata (like Dublin Core, CDWA, MARC) exist? If you do not use Standard Metada, please specify if the mandatory categories are provided Object Basic information about the object: Identification - Library : Title of book Title Museum: Name of object Archive: Title of item or file Alternative Title Example: original title of translated book: Le If applicable vent des Khazars (Der Messias Code) by M. Halter Creator Author or editor or association, community Contributor Could also be producer/ manufacturer Date Publication or production date

Additional Europeana Elements

4) Do you provide additional Metadata? please specify in the following chart.

Chart IV Expanded elements – recommended

Coverage Geographical or periodical definition – similar to subject e.g: photo of Rome Description Text: number of pages of book, number of illustrations, size, etc Object: Material, colours,

8/26 First report of content metadata alignment with EUROPEANA requirements

Is Part of Relation - e.g. letter that is part of collection language Language of the text of the digital object Publisher Source Content holder like British Library, Louvre … The institution that aggregates=submits the data to Europeana is held under provider. If source and provider are identical, than use provider Subject Topic, people, places Type Information about type of object – carving, pressing, or material like wood Record ID Signature of the book, files Inventory Number of objects…

Additional Elements Format File Format of digitised object like text Extent = refinement of format, e.g. size of original object Medium Material of original object, similar to type Provenance History of ownership or custody

5) Which Vocabularies, if any, do you use for Subject description, for ex. LoC Subject Heading, Name Authority Files …?

6) Does your Metadata System have the capability to export files into XML files?

3.2 Results of Metadata Survey The following chart shows the Metadata that exist at the partners’ institutions referring to the real objects (books, archival material, museum artefacts) in their collection to be digitized. The categories are based on the list of elements defined in the Metadata Mapping & Normalisation Guidelines for the Europeana Prototype, Version 1.2.1, 18/01/2010, Europeana v1.0, chapter 2.1, as referred to on page 5 of this report.1

The existing metada will enable efficient mapping by the content providers to Europeana according to standards.

Minimum Set of Metadata Expanded Elements Additional Elements

Category UB-FFm AIU JM Buda- Parma Venice Warsaw JM Greece pest London

Standard Dublin Dublin Dublin Dublin Dublin Dublin In house In house Metadata Core – Core - Core Core Core Core Dublin Dublin MAB 2 MARC OAI MAG MAG Core Core

1 http://version1.europeana.eu/c/document_library/get_file?uuid=58e2b828-b5f3-4fe0-aa46- 3dcbc0a2a1f0&groupId=10602, p. 5.

9/26 First report of content metadata alignment with EUROPEANA requirements

21 Title X X X X X X X X Altern. X X - - - - - X Title Creator X X X X X X X X Contributor X X X - - - - X Date X X X X X X X X Recommended Elements Coverage X X X X X X X X Description X X X X X X X X Is part of X X X X X X - X

Language X X X X X X X X

Publisher X X X - - - - X Source X X X X X X X X Subject X X X X X X X X Type X X X - - - - X Record X X X - X X X X Id Addition. Elements Format X X X - - - X X Extent - - X - - X - - Medium - - X - - X - - Provenance X X X - X X X X

3.3 Use of Vocabularies for Subject Description In order to assess effectively the requirements for the adaptation of controlled vocabularies for Judaica purposes, one of the tasks of WP2, the partners were asked to submit the type of vocabularies that are in use in their institutions.

The result was as follows:

Institution Type of Vocabulary UB FFM German National Subject Thesaurus (RSWK), Authors authority according to LoC / VIAF. AIU Thesaurus “Rameau” with adaption to Jewish vocabulary, Authors authority files taken from Bibliothèque Nationale de France and Israel National Library /7 VIAF JM Greece UNESCO Vocabulary and ICOM Vocabularies Budapest Getty Thesaurus MiBAC / Parma MAG (Metadati Amministrativi Gestionali/ Administrative Metadata Management) MiBAC/ Venice MAG (Metadati Amministrativi Gestionali/ Administrative Metadata Management) Jewish Historical Institute/ Warsaw In-house system Jewish Museum London In-house system

10/26 First report of content metadata alignment with EUROPEANA requirements

3.4 Export Formats UB FFM: YES AIU / Medem YES JM Greece YES Budapest YES MiBAC / Parma YES MiBAC/ Venice YES Jewish Historical Insitute/ Warsaw YES Jewish Museum London YES

11/26 First report of content metadata alignment with EUROPEANA requirements

4. Standards of Metadata Schema

The purpose of this chapter is to present the range of relevant standards in existence in the different types of institutions (libraries, archives and museums) that take part in Judaica Europeana. These metadata standards differ in their structure, but are all based on the general format of Describing Standard Dublin Core (DC) and thus will all provide metadata for Europeana and enable access to cultural heritage in Europe. The need for adaptation of these accepted metadata schemes, the alignment of the metadata schemes to the digital objects, their management and implantation will be integrated in the regular workflow. Furthermore, the use of controlled vocabularies underlying the metadata schemes will support the visibility of Jewish cultural heritage in Europeana and improve substantially the results provided by Judaica Europeana.

4.1 Describing Standards The general format of describing standard is a Dublin Core (DC) derived format .9 out of the 15 DC elements used in the descriptions.

These elements are:

Title The name (or names) under which the standard is known. In most cases both the abbreviated and the full name is listed. Creator The name of the organisation or individual who originally created the standard. Publisher The name of the organisation that makes the standard publicly available. Date The date on which the standard was originally published. Identifier A number or other identifier under which standard is published or a URL which points to the definition of the standard. Rights Whether rights restrictions, e.g. patents, apply to the standard. Description A textual description explaining the standard and its usage. Subject Keywords that identify the nature of the standard. Relation Other standards that this standard relates to and associated websites.

4.2 Information schemes (metadata) Most of the standards listed below define descriptive metadata. They focus on recording information about resources: manuscripts, books, archival material, museum objects such as art objects, costumes, coins etc. in the different types of institutions

The descriptions have been divided according to the types of institutions from a particular cultural domain:  Library specific;  Archive specific;  Museum specific.

4.2.1 Library specific MAB2

12/26 First report of content metadata alignment with EUROPEANA requirements

Title MAB2 Maschinelles Austauschformat für Bibliotheken (Automated Library Exchange Format) Creator Deutsche Nationalbibliothek Publisher Deutsche Nationalbibliothek Date 2001 Identifier http://www.d-nb.de/standardisierung/txt/titelmab.txt [bibliographic data (MAB- TITEL)] Rights Deutsche Nationalbibliothek Description The machine exchange all data generated within a library environment (bibliographic, authority and local data). Subject bibliographic description Relation http://www.d-nb.de/eng/standardisierung/formate/mab.htm (German National Library MAB web page)

MARC Title MARC

Machine-Readable Cataloguing

MARC 21 Concise Format for Bibliographic Data Creator Network Development and MARC Standards Office of the Library of Congress Publisher Library of Congress Date 2002 (updated) Identifier http://www.loc.gov/marc/bibliographic/ecbdhome.html Rights [Open Standard] Description Standard for the representation and communication of bibliographic information in machine-readable form. Subject bibliographic description Relation http://www.loc.gov/marc/authority/ecadhome.html (authority information) http://www.loc.gov/marc/holdings/echdhome.html (holdings data) http://www.loc.gov/marc/classification/eccdhome.html (classification data) http://www.loc.gov/marc/community/eccihome.html (community data) ISO 2709

13/26 First report of content metadata alignment with EUROPEANA requirements

4.2.2 Archive specific EAD Title EAD Encoded Archival Description Creator Encoded Archival Description Working Group of the Society of American Archivists Network Development and MARC Standards Office of the Library of Congress Publisher Society of American Archivists Date 2002 Identifier ftp://ftp.loc.gov/pub/ead/ead.dtd (DTD) http://www.loc.gov/ead/ead.xsd (W3C schema) Rights Copyright Society of American Archivists. [Open Standard] Description DTD and schema for the encoding archival finding aids. Also used to describe collections (collection description). Subject archive description collection description document encoding Relation http://www.loc.gov/ead (EAD website)

ISAD(G) Title ISAD(G) ISAD(G): General International Standard Archival Description, Second Edition Creator Committee on Descriptive Standards (ICA/CDS) (adopter) Publisher International Congress on Archives (ICA) Date 2000 Identifier 0-9696035-5-X (ISBN) http://www.ica.org/sites/default/files/isad_g_2e.pdf Rights Copyright International Congress on Archives Description General rules for archival description that may be applied irrespective of the form or medium of the archival material. The rules accomplish these purposes by

14/26 First report of content metadata alignment with EUROPEANA requirements

identifying and defining twenty-six (26) elements that may be combined to constitute the description of an archival entity. Subject archive description Relation http://www.ica.org (ica website)

ISAAR (CPF) Title ISAAR (CPF) ISAAR (CPF): International standard archival authority record for corporate bodies, persons and families, Second Edition Creator ICA Committee on Descriptive Standard (preparer) Publisher International Congress on Archives (ICA) Date 2004 Identifier 2-9521932-2-3 (ISBN) http://www.ica.org/sites/default/files/ISAAR2EN.pdf Rights Copyright International Congress on Archives Description General rules for the establishment of archival authority records that describe the corporate bodies, persons, and families that may be named as creators in descriptions of archival documents. Subject archive description Relation http://www.ica.org (ica website)

4.2.3 Museum specific CDWA Title CDWA Categories for the Description of works of Art Creator Art Information Task Force Publisher Getty Research Institute Date 1990 Identifier http://www.getty.edu/research/institute/standards/cdwa/index.html Rights Getty Research Institute Description Describes the content of art databases by articulating a conceptual framework for describing and accessing information about objects and images. They identify vocabulary resources and descriptive practices that will make information residing

15/26 First report of content metadata alignment with EUROPEANA requirements

in diverse systems more compatible and more accessible. They also provide a framework to which existing art information systems can be mapped and upon which new systems can be developed. Subject documentation (museum) Relation http://www.getty.edu/research/conducting_research/standards/cdwa/cdwalite.h tml (CDWA Lite) museumdat Title museumdat Creator Fachgruppe Dokumentation im Deutschen Museumsbund Institut für Museumsforschung SMB-PK Zuse-Institut Berlin Publisher Fachgruppe Dokumentation im Deutschen Museumsbund / Institut für Museumsforschung SMB-PK / Zuse-Institut Berlin Date 2006-2007 Identifier http://museum.zib.de/museumdat/museumdat-v1.0.xsd Rights Copyright FG Dokumentation im Deutschen Museumsbund / Institut für Museumsforschung SMB-PK /Zuse-Institut Berlin Description A harvesting format (XML schema) optimized for retrieval and publication, meant to deliver automatically core data to museum portals. Subject description (cultural object) Relation http://museum.zib.de/museumdat/museumdat-v1.0-en.pdf (documentation) http://www.museumdat.org/index.php?ln=de&t=home (German website) http://www.museumdat.org/index.php?ln=en (English website)

Object ID Title Object ID Creator Thornes, Robin (et al) Publisher J. Paul Getty Trust Date 1999 Identifier http://www.object-id.com/guide/guide_index.html Rights Copyright The J. Paul Getty Trust. All rights reserved Description Standard for describing cultural objects. Developed through the collaboration of the

16/26 First report of content metadata alignment with EUROPEANA requirements

museum community, police and customs agencies, the art trade, insurance industry, and valuers of art and antiques. Subject description (cultural object) Relation http://www.object-id.com/checklist/check_eng.html (checklist)

17/26 First report of content metadata alignment with EUROPEANA requirements

5. Conclusions

5.1 Content already digitized The survey has shown that the digitization process has started in all but one of partner institutions and the work process has been successfully established. So far more than one million pages of texts and thousands of images, with an impressive number of photos, are in digital form, aligned with metadata that reflect current standards according to the type of institution. All institutions use metadata systems that have the capability to export files into XML files, thus allowing the data transfer to Europeana, according to standards. These data are ready to be uploaded to Europeana at a chosen date.

The detailed results of the survey are presented in Annex 2, excel sheet no 2. They give an account of the progress made in the course of 2010, showing that all the institutions have already carried out substantial digitization work. The digitized objects include books, valuable manuscripts and archival material as well as a wide range of objects, including textiles, coins, personal and religious objects. These texts and images have all been digitized according to Europeana standards.

5.2 Content to be digitized The survey has shown that Judaica Europeana will be able to present a vast range of interesting objects of Jewish cultural heritage. The institutions represented are libraries, archives and museums and thus the material comprises all categories of objects according to the 4 types of Europeana classification: Images, text, sound and video.

Text is the prevalent type, as all the institutions, including the museums, possess vast amounts of precious texts that are invaluable witnesses of the Jewish past in European cities. Photographic collections, art objects, paintings and illustrations – counted as images - are the second largest category of objects being digitized. They are part of the museums and archives collections. The category of sound is well represented, as a large audio collection, containing ca. 6000 records of Jewish songs that have already been partly digitized is being held by the Medem Library. 10 % of the ca.6000 records are in the public domain, while a solution is being sought for the rest of the collection. Additional audio material is being digitized by the Jewish Museum in London. The category of video contains only a few examples of films in the collection of the Alliance Israélite Universelle.

The collections include also text material as books; manuscripts in the form of valuable old parchments, letters, archival documents and postcards; photographs; religious artefacts; textiles; coins; paintings, as well as music and video recording. The objects are distributed unequally, some institutions like the Jewish Museum of Greece will contribute objects of all kinds, from text material to textiles, others like the Bibliotheca Palatina di Parma or the Jewish Historical Institute in Warsaw will digitize only one kind of objects, in these cases text content only.

The results of the survey are listed in detail in Annex 1, excel-sheet no.1.

5.3 Metadata Survey The metadata survey has made it clear that institutions differ considerably regarding the extent and quality of metadata in use. A comparison of the standards of metadata schema accepted internationally with the current metadata systems used by the partner institutions providing content

18/26 First report of content metadata alignment with EUROPEANA requirements to Judaica Europeana shows that while most of the partners use standard metadata schema, others use in-house solutions. These particular solutions are however compatible with Dublin Core (DC), that has been used extensively as a ‘minimum’ metadata standard for resource discovery, most notably with Europeana itself. Despite the variety of metadata schemata used by the partners, they will be able to map it to ESE (European Semantic Elements) data model, as the ESE is basically an extended version of Dublin Core.

5.4 Use of Controlled Vocabulary The aim of implementing a set of standard descriptive metadata scheme is to achieve the best results for access to the digital images of the objects presented in Judaica Europeana. In the same way, an implementation of a controlled vocabulary is needed in order to facilitate the best possible results when searching the relevant information.1 Therefore controlled vocabularies suitable for Jewish cultural heritage that are part of a normative vocabulary like VIAF (Virtual International Authority File), defining personal names (a project of the national libraries in connection with OCLC) have been tested for their benefits for Judaica Europeana.2 The German National Library (DNB), the Library of Congress (LoC) as well as others, are participating in VIAF or exchanging their data with other national authority files such as the French RAMEAU.3 The Israeli National Library will cooperate with its database in order to check the adaptability of VIAF for the Jewish context. Concerning the description of the content of the digital objects such as key words or subject headings, (schemata for identifying the “what” of the objects) there are standard vocabularies that are already in use by some of the partner institutions. The museums use the vocabulary developed by the Getty trust as basis for key words.4 The identification of places related to Jewish life is an especially difficult task, as the locations have to be fixed in their variety of names, spellings, languages and historical context. However there are some data sources for geographical information – general and specific Jewish - that have to be tested more extensively for use in context to specific Jewish content. There is a variety of general geographical thesauri, like NGA GEOnet Names Server (GNS), Open Geospatial Consortium, (OGC) or GeoNames and a few vocabularies for Jewish-related information like JewishGen or Yad Vashem.5

5.5 Future work of WP 2 The next stage of WP2 will therefore have to monitor closely the ongoing practice of defining the standard metadata schema by the content providers in their institutional setting and assist in facilitating the adaptation of international accepted norms and the implementation of controlled vocabularies. The quality of the metadata record, its content and structure, will have a great impact on the effectiviness of the access to the offered material. It will therefore be necessary to assist some of the smaller institutions involved.

Regarding the use of controlled vocabularies, these have to be tested for their quality, international acceptance and simplicity in handling. WP2 will check the feasibility of representing the relevant

1 The program of actions to implement Standard Jewish Vocabularies isdescribed in: Dov Winer, Jewish semantics in the Linked Data Semantic Web-Vocabularies, Internal Deliverable ID, Confidential, August 2010.

2 VIAF – The Virtual International Authority File: http://viaf.org 3 RAMEAU (Répertoire d'autorité-matière encyclopédique et alphabétique unifié) http://rameau.bnf.fr 4 http://www.object-id.com/guide/guide index.html 5 NGA GEOnet Names Server (GNS) http://earth-info.nga.mil/gns/html/, http://www.opengeospatial.org/, http://www.geonames.org/about.html, http://www.jewishgen.org/ShtetlSeeker/loctown.htm, http://www.yadvashem.org/wps/portal/IY_HON_Welcome.

19/26 First report of content metadata alignment with EUROPEANA requirements vocabularies in the SKOS RDF format (Simple Knowledge Organizational System). Search results will be improved and the effectiveness of Judaica Europeana increased through the mapping of metadata between major metadata standards, for example, DC, including the Europeana Semantic Elements (ESE), as well as Judaica specific vocabularies.

WP2 is following the development of the EDM, European Data Model that will be implemented in the next edition of Europeana.1 The new model enables the uploading of metadatawith all the information gathered by the provider, according to the provider’s intention rather than cutting it down and thus flattening it to a restricted schema as is the case at present.

WP2 is seeking to prepare Judaica Europeana to benefit from this new model. It is presently checking the feasibility of publishing a database of documents about the Jewish Enlightenment as Linked Data. If this pilot proves feasible, it may pave the way for additional upload of Jewish content to Europeana making full use of the EDM facilities.

1 See Europeana Data Model Primer, version 05.08.2010, http://group.europeana.eu/c/document_library/get_file? uuid=718a3828-6468-4e94-a9e7-7945c55eec65&groupId=10605

20/26 First report of content metadata alignment with EUROPEANA requirements

Annex 1 Content to be digitized by the providers

Content Provider Collection Name Type of object Qantity in pages Format Metadata IPR Export to XML Frankfurt University Library/ Judaica Frankfurteriana Paper 200.000 Text MAB 2 Public Domain yes Freimann-Collection, Part II Paper 500.000 Text MAB 2 Public Domain yes

Alliance Israelite Universelle AIU- Jewish Periodicals, 19th &20th Paper 1.000.000 Text Marc 21 Public Domain yes Library century Jewish Journals, 20th century Paper 200.000 Text Marc 21 AIU + consent yes Books Paper 30000 Text MARC 21 Public Domain yes Photo-Collection Paper 5000 Image File Maker Pro AIU Conversion into MARC 21 Data Film Collection Film 8 films x 3-45 mn Video not yet-Aleph Marc 21 AIU yes AIU- Archival Material Paper 1.000.000 Text EAD, EAC AIU yes Archives Medem Library Records of Jewish songs mp3 6.000 Audio Marc 21 only 643 Medem yes Yiddish Books Paper 78.500 Text Marc 21 Medem yes Photo-Collection Paper 1.000 Timage Marc 21 Medem yes

Jewish Museum of Greece Books -Titlepages Paper 648 Text In house JMG yes Books, complete Paper 2.300 Text In house JMG yes Manuscripts Paper 107 Text In house JMG yes Archival material -Documents Paper 1150 Text In house JMG yes Postcards Paper 25 Text In house JMG yes Religipus Artifacts Object 596 Image In house JMG yes Textiles Object 403 Image In house JMG yes Costumes Object 478 Image In house JMG yes Domestic Artifacts Object 477 Image In house JMG yes Personal Objects Object 400 Image In house JMG yes Ephemera Object 99 Image In house JMG yes Etchings Paper 94 Image In house JMG yes

21/26 First report of content metadata alignment with EUROPEANA requirements

Photo-Collection -Archive Paper 2673 Image In house JMG yes Photo-Collection Paper 873 Image In house JMG yes Coins Object 125 Image In house JMG yes Architectural Elements Object 27 Image In house JMG yes Contemporary Artworks Object 47 Image In house JMG yes Arditis Archives Paper 230 Text In house JMG yes Patras Archives Paper 2740 Text In house JMG yes Molho Archives Paper 110 Text In house JMG yes Florentin Archives Paper 60 Text In house JMG yes Yoel Archives - Documents Paper 270 Text In house JMG yes Yoel Archives - Photos Paper 45 Image In house JMG yes Hidden Children - Photos Paper 340 Image In house JMG yes Jewish Neighbourhoods-Photo Paper 150 Image In house JMG yes WWII and Holocaust-Photo Paper 376 Image In house JMG yes Holocaust Survivors Paper 55 Image In house JMG yes Young People Paper 420 Image In house JMG yes Ioannina -Collection Object 270 Image In house JMG yes

Hungarian Jewish Archives Dohany Street Synagoge-Pews Paper 2900 Text In house HJA yes Jews of Pest - Conscription Paper 1761 Text In house HJA yes

MIBAC Bibliotheca Palatina di Parma Incunabula Paper Text Dublin Core - MAG Public Domain yes 16th Century Books Paper 48.000 Text Dublin Core - MAG Public Domain yes

State Archive Venice Jewish Welfare - Registers 1581- Paper 9 registers Text EAD Public Domai yes 1799 Regulations for Jews1516-1799 Paper 3 files Text EAD Public Domain yes Jews- 1750-1796 Paper 20 files + 1 register Text EAD Public Domain yes - Documents on Jews 1705-1795 Paper 26 files = 1 + 55 Text EAD Public Domain yes registers Ghetto 1587-1647 Paper 1 register + 1 file Text EAD Public Domain yes Jewish Necrology Paper 3 registers Text EAD Public Domain yes

22/26 First report of content metadata alignment with EUROPEANA requirements

Ghetto 1661-1740 Paper 4 registers Text EAD Public Domain yes Politics towards Jews Paper 13 registers Text EAD Public Domain yes 42.000 Jewish Historical Institute, Warsaw Breslau, Electors registers Paper 4544 Text In house Public Domain yes Breslau - Rabbinate Paper 2432 Text In house Public Domain yes Burial Society Paper 41.912 Text In house Public Domain yes Community members Paper 5057 Text In house Public Domain yes Jewish cemeteries Paper 11.614 Text In house Public Domain yes Jewish Theological Library Paper 8640 Text In house Public Domain yes Jewish Charity Paper 42.070 Text In house Public Domain yes Assoc. Of German Jews Paper 2027 Text In house Public Domain yes Emigration Paper 1600 Text In house Public Domain yes Schools Paper 4000 Text In house Public Domain yes Registers of death Paper 1.863 Text In house Public Domain yes Synagogues- documents Paper 6123 Text In house Public Domain yes Administration 1791-1938 Paper 11.436 Text In house Public Domain yes Jewish Communal Life Paper 4.306 Text In house Public Domain yes Photo Collection Paper 90 Image In house Public Domain yes

Jewish Museum London Audio Recordings WAV and MP3 150 Audio In house JML yes Cassettes Books Paper 6.000 Text In house JML yes Art Object 30 Image In house JML Yes Photos Object 300 Image In house JML Yes

23/26 First report of content metadata alignment with EUROPEANA requirements

Annex 2 Content already digitized by the providers

Content Provider Collection Name Type of object Qantity in pages Format Quality Metadata IPR Export to XML Frankfurt University Library/Judaica Collection Freimann-Collection Paper 750.000 Text TIFF & JPG MAB 2 Public Domain yes Yiddish prints Paper 200.000 Text TIFF & JPG MAB 2 Public Domain yes Compact Memory Paper 700.000 Text TIFF & JPG Dublin Core Public Domain yes

Alliance Israelite Universelle AIU Books and Pamphlets Paper 16.000 Text 300 dpi, black Marc 21 Public Domain yes Library &white, jpg AIU Bulletins Paper 300.000 Text 300 dpi, grey scale, Marc 21 AIU yes tiff Photo Collection Paper 1.200 Image Tiff, 300 dpi,+ jpg File Marker Pro AIU Conversion 80 dpi, into Aleph black&white Marc21 Film Collection Film 8 Movies, Wma Video Wma colour & File Marker Pro AIU Conversion black and white into Aleph Marc21

Medem Library Records of Jewish songs mp3 4.876 Audio Marc 21 only 643 Medem yes Yiddish Books Paper 20.000 Text Marc 21 Medem yes Photo-Collection Paper 200 Timage Marc 21 Medem yes

Jewish Museum of Greece Manuscripts Paper 90 Text TIFF and jpg ESE v. 3.2. JMG yes Documents Paper 1.000 Text TIFF and jpg ESE v. 3.2. JMG yes Textiles Object 6 Image TIFF and jpg ESE v. 3.2. JMG yes Coins Object 125 Image TIFF and jpg ESE v. 3.2. JMG yes Architectural Elements Object 24 Image TIFF and jpg ESE v. 3.2. JMG yes Photos Paper 873 Image TIFF and jpg ESE v. 3.2. JMG yes Etchings Paper 90 Image TIFF and jpg ESE v. 3.2. JMG yes Ephemera Object 80 Image TIFF and jpg ESE v. 3.2. JMG yes

24/26 First report of content metadata alignment with EUROPEANA requirements

Personal Objects Object 30 Image TIFF and jpg ESE v. 3.2. JMG yes Religious Artifacts Object 54 Image TIFF and jpg ESE v. 3.2. JMG yes Costumes Object 12 Image TIFF and jpg ESE v. 3.2. JMG yes Contemporary Artworks Object 47 Image TIFF and jpg ESE v. 3.2. JMG yes -Paintings

Hungarian Jewish Archives Postcard Paper 1766 Images jpg 300 dpi colour Inhouse HJA yes

Jewish Historical Institute, Warsaw Breslau, Electors registers Paper 3694 Text TIFF, JPG 300 dpi In house Public Domain yes Community members Paper 1508 Text In house Public Domain yes Administration 1791-1938 Paper 1.404 Text In house Public Domain yes Jewish Communal Life Paper 1.716 Text In house Public Domain yes Breslau-Representatives Paper 1307 Text In house Public Domain yes Communal organization Paper 2260 Text In house Public Domain yes Jewish families Paper 1021 Text In house Public Domain yes Correspondence Paper 381 Text In house Public Domain yes Jewish oath 1829-1869 Paper 268 Text In house Public Domain yes Antisemitic incidents 1879- Paper 526 Text In house Public Domain yes 1895 Bulletin, correspondence Paper 2812 Text In house Public Domain yes 1924-1937 Jewish Telegraphic Agency Paper 726 Text In house Public Domain yes 1925 Properties and buildings Paper 973 Text In house Public Domain yes 1919-1937 Christian Names 1836/37 Paper 24 Text In house Public Domain yes Jewish Society of Friends Paper 179 Text In house Public Domain yes Statutes Paper 1.428 Text In house Public Domain yes

Jewish Museum London Album-pages Paper 203 Text JPG 150 dpi In house JML yes colour/grayscale Letters Paper 130 Text JPG 150 dpi In house JML yes colour/grayscale

25/26 First report of content metadata alignment with EUROPEANA requirements

Theatre Programmes Paper 53 Text JPG 150 dpi In house JML yes colour/grayscale Leaflets Paper 41 Text JPG 150 dpi In house JML yes colour/grayscale Booklets Paper 3.200 Text JPG 150 dpi In house JML yes colour/grayscale Jewish Marriage Contracts Paper 28 Text JPG 150 dpi In house JML yes colour/grayscale Haggadah Paper 300 Text JPG 150 dpi In house JML yes colour/grayscale Posters Paper 20 Image JPG 150 dpi In house JML yes colour/grayscale Annual reports Paper 33 Text JPG 150 dpi In house JML yes colour/grayscale Certificates Paper 92 Text JPG 150 dpi In house JML yes colour/grayscale Books Paper 1.500 Text JPG 150 dpi In house JML yes colour/grayscale Business cards Paper 21 Image JPG 150 dpi In house JML yes colour/grayscale postcards Paper 34 Image JPG 150 dpi In house JML yes colour/grayscale Tickets Paper 16 Image JPG 150 dpi In house JML yes colour/grayscale Passports Paper 19 Image JPG 150 dpi In house JML yes colour/grayscale Photo-collection Paper 900 Image JPG 150 dpi In house JML yes colour/grayscale Membership documents Paper 200 Image JPG 150 dpi In house JML yes colour/grayscale Paintings Paper 100 Image JPG 150 dpi In house JML yes colour/grayscale Total 5.500 Text Total 1390 Images

26/26