<<

1/22 ) a Morgan: Project SPECTRa SPECTRa Morgan: Project xposure of E IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, esearch Dat R

Peter Morgan reservation, and eaching and P T

Project SPECTRa Project Director, SPECTRa Cambridge University Library hemistry ubmission, C S ( Facilitating the deposit of experimental data in institutional repositories: 2/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, –the problem –the –methodology –investigations –software Research data and Open Access Institutional context SPECTRa project Conclusions Outline ƒ ƒ ƒ ƒ 3/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, Open Access Research data & 4/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, – e-Science – Web Semantic – through informatics re-analysis Open Access declarations (Budapest, OECD, etc.) include access to data 855 repositories - only 6% contain data machine-understandable data is needed for: Research data & Open Access ƒ ƒ ƒ 5/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, Open Data is not the same as Access OA licences often don't address reuse of data OA papers associated with restricted data Open Data can co-exist with non-OA papers many publishers are unconvinced about OA therefore many researchers are also unconvinced about OA lack of exemplars to illustrate benefits lack of practical tools to overcome obstacles Research data & Open Access ƒ ƒ ƒ ƒ ƒ ƒ ƒ ƒ 6/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, Institutional context 7/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, – Cambridge University Library + Computing Service DSpace@Cambridge, 2003- accepts all file formats, types of content contains open- and closed-access collections few OA research papers large collection (>175,000 files) of chemistry data files Chemistry Dept (Peter Murray-Rust) keen to explore potential of repository University of Cambridge ƒ ƒ ƒ ƒ ƒ ƒ 8/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, – accept texts and supporting datasets to member of LEAP (London Eprints consortium) DSpace planned as institutional repository (Henry Rzepa) in close collaboration with Cambridge colleagues potential links with High Performance Computing ƒ ƒ ƒ ƒ 9/22 ) a Morgan: Project SPECTRa SPECTRa Morgan: Project xposure of E IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, esearch dat R reservation, & P eaching and T SPECTRa ubmission, hemistry S C ( 10/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, (Joint Information Systems Digital Repositories Programme 18-month project partnership between in collaboration with eBank-UK funded by JISC Committee) Cambridge University Library (lead site) Cambridge University Chemistry Dept Imperial College London - Chemistry Dept Imperial College London - Library 3 project staff plus librarians & chemists ended March 2007 SPECTRa project ƒ ƒ ƒ ƒ ƒ 11/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, academic chemistry research community in capturing and re-using experimental scientific data, facilitating the routine extraction of data in high volumes and their ingest into institutional repositories” “to investigate the needs of SPECTRa aims 12/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, survey at Cambridge and Imperial of researchers’ requirements in crystallography, computational chemistry, and synthetic organic chemistry development of customised Open Source tools as part of researchers’ workflow to enable deposit of, and access to, Open Data using DSpace institutional repositories Methodology ƒ ƒ 13/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, – interviews with key research leaders – interviews with key research leaders interviews – survey of current use computers & – questionnaire (28 questions) –follow-up – analysis of results Crystallography Computational Chemistry: Synthetic organic chemistry Investigations ƒ ƒ ƒ 14/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, much data not stored electronically many file formats (mainly proprietary) significant ignorance of digital repositories repositories must be able to restrict access to experimental data Survey results ƒ ƒ ƒ ƒ 15/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, – publishes work involving those structures, or – moves on to a different line of research publication of chemical structures must be embargoed until the chemist chemists may need to hide research data from competitive research groups or for commercial reasons researchers are unlikely to deposit their data without adequate recognition of concerns need for an embargo process to control public release of data Data embargo ƒ ƒ ƒ ƒ 16/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, capture selected data validate against file specifications add metadata inc. InChI and systematic name add persistent identifiers (Handles) add METS packaging deposit in closed ‘embargo’ repository manage the release of embargoed data into open repository by agreement data can then be searched and harvested The SPECTRa workflow ƒ ƒ ƒ ƒ ƒ ƒ ƒ ƒ 17/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, – departmental installations – embargo repository managed at departmental level – potential role as intermediate repositories DSpace already adopted by both libraries separate DSpace repositories for SPECTRa institutional repository architecture to include co-ordinated network of repositories? Repository platform ƒ ƒ ƒ 18/22 Open Open Embargo Repository Dept Repository Dept Repository Central Institutional Central Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, & OAI-PMH harvesting SPECTRa search tools Departmental data management data Departmental using SPECTRatools deposit Validation1. extraction Metadata 2. creation Embargo 3. METS packaging 4. File upload SPECTRa Project architecture 19/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, Conclusions 20/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, Discipline-specific repository tools The “Golden Moment” Data embargo Automated deposit & human editing DOIs have cost implications DSpace Handle system underdeveloped Conclusions (1) ƒ ƒ ƒ ƒ ƒ ƒ 21/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, Data re-use requires IPR guidance Scientific data = major asset value Local solutions are resource-intensive Generic solutions involve compromises Role of departmental repositories in an institutional repository architecture Conclusions (2) ƒ ƒ ƒ ƒ ƒ 22/22 Morgan: Project SPECTRa SPECTRa Morgan: Project IATUL, Stockholm, 11-14 June 2007 June 2007 11-14 Stockholm, IATUL, [email protected] Peter Morgan SPECTRa: www.lib.cam.ac.uk/spectra/ Thank you for listening DSpace@Cambridge: www.dspace.cam.ac.uk