The “F” from FAIR and NARCIS
Total Page:16
File Type:pdf, Size:1020Kb
The “F” from FAIR and NARCIS Pathways to make your data Findable in a generic national cataloging system Cees H.J. Hof @CeesH_DANS 18 September 2018, Data Federation Hub meet-up, UMC Groningen dans.knaw.nl DANS is an institute of KNAW and NWO Short intro……. (Why am I here?) • At DANS: • Project acquisition • Liaison life sciences • European Open Science Cloud • Software sustainability • Background in Biology (taxonomy & systematics) • +10 years involved in development of Global Biodiversity Information System (GBIF) • FAIR data avant la lettre • Cataloguing biodiversity data • Developing and implementing the DarwinCore data standard • Community building www.gbif.org The data deluge…. Numbers of (easy) accessible publications and datasets only growing…… Launched 4 September 2018 To find your way through the publications & data forest…. Catalogues are finding tools …. NARCIS: catalogue of Dutch research information National Academic Research and Collaborations Information System NARCIS: origin and evolution 1992: Nederlandse Onderzoek Databank (NOD) 2003: Berlin Declaration Open Access 2003: Dare Programme with DAREnet and NARCIS 2007: NOD and DAREnet merged into NARCIS (KNAW Bureau) 2011: NARCIS as one of the technical core services of DANS - Ongoing integration of research information - Increasing number of linked repositories - Improved search and visualization facilities https://www.narcis.nl NARCIS harvests metadata from 43 repositories NARCIS: catalogue of Dutch research information Access to: • Publications • Datasets • Research • People • Organisations NARCIS: Access to almost 300.000 datasets Take care….. • DANS provides back- up to Dryad (currently not only Dutch data in NARCIS) • Classification is subjective • Range from spreadsheets to GigaBites NARCIS & FAIR data The FAIR Guiding Principles To be Findable: F1. (meta)data are assigned a globally unique and persistent identifier F2. data are described with rich metadata (related to Reusable…) F3. metadata clearly and explicitly include the identifier of the data it describes F4. (meta)data are registered or indexed in a searchable resource Source: Mark D. Wilkinson et al. (2016) The FAIR Guiding Principles for scientific data management and stewardship NARCIS & FAIR data The FAIR Guiding Principles To be Findable: F1. (meta)data are assigned a globally unique and persistent identifier F2. data are described with rich metadata (related to Reusable…) F3. metadata clearly and explicitly include the identifier of the data it describes F4. (meta)data are registered or indexed in a searchable resource FAIR NARCIS NARCIS developments - Not only to show data end-products, also providing an overview of dynamic datasets relevant to science: long term monitoring, cohort studies, research collections, etc. - Include richer and better metadata - Interlinking of data – publications – researchers – projects – funders - Cover all domains of science - NARCIS as a gateway to full text and data harvesting… The NARCIS holy triplet: Dublincore or DataCite Metadata standards Exchange protocol Identifiers OAI-PMH DOI Open Archives Initiative Handle Protocol for Metadata Harvesting Etc. Case study: “zorggegevens” RIVM Dublincore Title In the end: DataCite But… much richer (15 terms) Creator The ultimate (19 terms) because of the use of Subject Choice has to sub-properties. Description For example: Publisher be made….. Contributor Coverage 18: GeoLocation Date “open” fields versus 18.1: geoLocationPoint Type more structured 18.2: geoLocationBox Format 18.3: geoLocationPlace Identifier 18.4: geoLocationPolygon Source Language Relation Rights Case study: “zorggegevens” RIVM - There is no rule of law in the mapping process - Focus on the potential “searchers” of your data - Best practical means to start with…. - Agile process - Rethink your database structure and set-up - Consider the addition of “new” fields in your database to facilitate optimal findability and interoperability - Identifiers… is usually an issue - NARCIS can be fine tuned for your purpose Case study: “zorggegevens” RIVM Case study: “zorggegevens” RIVM NARCIS: other routes…. Harvesting from international aggregators NARCIS: other routes…. Harvesting from international aggregators IPT toolkit harvesting rich biodiversity data www.gbif.org OAI-PMH + Filter on Dutch data providers NARCIS Dutch Biodiversity data providers In NARCIS and further on…. Plans for the future… (in the life sciences) Try to get the major life science databases and aggregators in NARCIS - Working with BBMRI (Biobanks) - In touch with “Zorginstituut” - In touch with Ministry “VWS” - Talking to ZonMw - What about the FAIR Data Point (DTL) - Workshops “Pimp your Metadata” - Possibly deploy metadata supporting services like the inclusion of DarwinCore metadata and Taxonomic services - What you would like…… Thanks for your attention! Cees Hof, with the help of Elly Dijk & Chris Baars [email protected] @CeesH_DANS dans.knaw.nl DANS is an institute of KNAW en NWO.