RDM PROGRAMME @ EDINBURGH - Service Interoperation
Total Page:16
File Type:pdf, Size:1020Kb
RDM PROGRAMME @ EDINBURGH - Service Interoperation Stuart Macdonald RDM Service Coordinator University of Edinburgh [email protected] RDMF 12 - Research Data and Repositories (and other systems), RDMF, University of Leicester, 19 November 2014 Background • EDINA and University Data Library (EDL) together are a division within Information Services (IS) of the University of Edinburgh. • EDINA is a Jisc-designated centre for digital expertise & online service delivery - http://edina.ac.uk/ • The Data Library assists Edinburgh University users in the discovery, access, use and management of research datasets: http://www.ed.ac.uk/is/data-library • Research and Learning Services offer specific services to the University with a focus on enabling research (OA publications, research data, bibliometrics) and resource discovery for learners (resource search systems). University of Edinburgh RDM Policy . University of Edinburgh is one of the first Universities in UK to adopt a policy for managing research data: http://www.ed.ac.uk/is/resear ch-data-policy . The policy was approved by the University Court on 16 May 2011. It’s acknowledged that this is an aspirational policy and that implementation will take some years. Governance An RDM Policy Implementation Committee was set up by the VP of Knowledge Management charged with delivering services that will meet RDM policy objectives: • Membership from across Information Services • Iterate with researchers to ensure services meet the needs of researchers The VP also established a Steering Committee led by Prof. Peter Clarke with members of the Research Committee from the 3 colleges, IS, and the Research Office (ERI). Their role is to: • Provide oversight to the activity of the Implementation Committee • Ensure services meet researcher requirements without harming research competitiveness Policy implementation - Research Data Management Roadmap (2012-2015) . Cross-divisional collaboration . Services already in place: o Data management planning o Active working file space = DataStore o Data publication repository = DataShare . Services in development: o Long term data archive = Before research During research After research DataVault o Data Asset Register (DAR) http://edin.ac/1u3sKqy . RDM support: Awareness raising, training & consultancy Research Data Management Planning Performed at the conceptual stage before research data are created (what, where, who, how) Customised instance of DCC’s DMPonline toolkit for University of Edinburgh use: • Funders and local (non-funder) DMP templates • Institutional guidance (storage, services, support) • Piloting customised school-level guidance - end of Jan. 2015 Tailored DMP assistance for researchers submitting research proposals (F-2-F) DataStore . NAS facility to store data that are actively used in current research activities . 1.6PB storage initially. 0.5 TB (500GB) per researchers, PGR upwards . Up to 0.25TB of each allocation can be used for “shared” group storage . Cost of extra storage: £200 per TB per year = 1TB primary storage, 10 days online file history, 60 days backup, DR copy . Infrastructure in place. Allocation of space devolved to IT departments of respective Schools overseen by Heads of IT from each College. De-allocation policy detailing responsibilities and storage costs for ‘orphaned data’ - pending approval by Steering Committee DataShare . Edinburgh DataShare is the University’s open access multi-disciplinary data repository: http://datashare.is.ed.ac.uk . Assists researchers disseminate their research, get credit for data publication, and preserve their data for the long-term (DOI, licence, citation) . Help researchers comply with funder requirements to preserve and share your data and complies with Edinburgh’s RDM Policy Data Vault . Safe, private, store of data that is only accessible by the data creator or their representative . Secure storage: File security; Storage security; Additional security: encryption . Long term assurance . Automatic versioning . Front-end application requirements (authorisation, retention & deletion, file structure, file transfter, integration Data Asset Register (DAR) . A catalogue of data assets produced by University of Edinburgh researchers . A key component of the University of Edinburgh RDM systems . Will give researchers a single place to record the existence of the data assets they produce for discovery, access, and re-use as appropriate. Paper proposing the adoption of PURE as the University’s DAR provisionally accepted the RDM Steering Committee (Oct. 2014) Interoperation Systems do not live in isolation, and become more powerful and more likely to be used if they are integrated with each other. However, the last thing that we want is to introduce further systems that need to be fed with duplicate information. This means interoperation for some or all of the components RDM Support Making the most of local support! • RDM team will work with the Research Administrators in each School. • Academic Support Librarians (who represent each of the 22 Schools). • IT staff in each School. • ERI staff. They will be receiving RDM training. • Each School’s Ethics Committee • Queries can be sent to the IS Helpline who will direct them as appropriate. Awareness raising • Introductory sessions on RDM services and support for research active and research admin staff in Schools / Institutes • RDM website: http://www.ed.ac.uk/is/data- management • RDM blog: http://datablog.is.ed.ac.uk • RDM wiki: https://www.wiki.ed.ac.uk/displa y/RDM/Research+Data+Manage ment+Wiki MANTRA . MANTRA is an internationally recognized self-paced online training course developed here for PGR’s and early career researchers in data management issues. Data handling exercises with open datasets in 4 analytical packages: R, SPSS, NVivo, ArcGIS . CC License & embed units in VLE’s e.g. Moodle http://datalib.edina.ac.uk/mantra Training: Tailored Courses . Creating a data management . A range of training plan for your grant application programmes on research data management (RDM) in . Research Data Management Programme at the University of the form of workshops, Edinburgh power sessions, seminars and drop in sessions to . Good practice in Research help researchers with Data Management research data management . Handling data using SPSS issues . Handling data with ArcGIS . http://www.ed.ac.uk/schools- http://edin.ac/1kRMPv3 departments/information- services/research-support/data- . management/rdm-training Service Integration • DataShare is a customised DSpace instance with a selection of OAI-PMH compliant DCMI metadata fields for data discovery through Google and other search engines • Records are harvested by Data Citation Index • SWORD API utilised for batch deposit of large and/or many files from remote computers (‘Push using http’) • Internal batch ingest of many/large files to circumvent 2.1GB limit via the web interface (‘Pull via command line interface’) • Use of checksums to determine that delivered object mirrors deposited object • Working with F1000Research to define a workflow for depositors to get credit for data as research output by publishing data articles - http://f1000research.com/ • Published new list of data journals for our depositors DSpace GITHUB plugin* - allows software to be archived from GitHub (or similar) source code repository into DataShare, which can then be assigned a DOI to facilitate citation - using the SWORD deposit protocol DataSync - to allow sharing of data on DataStore: • drop-box type functionality • uses open source ‘ownCloud’ technology • desktop and mobile machines synchronize files with the ownCloud server • file updates are pushed between all devices connected to a user's account. Research data deposit from RSpace Electronic Lab Notebook (ELN) interface into DataShare (and Datastore & Data Vault) using SWORD * http://blog.stuartlewis.com/2014/09/09/github-to-repository-deposit/ Integrating an electronic lab notebook with a university research infrastructure: Case Study with RSpace at the University of Edinburgh Rory Macneil Research Space [email protected] RDMF 12 - Research Data and Repositories (and other systems), RDMF, University of Leicester, 19 November 2014 Overview ● ELNs – where the demand coming from ● RSpace – origins and overview ● RSpace at Edinburgh – Linking to files in Edinburgh DataStore – Depositing content in Edinburgh DataShare – Archiving in Edinburgh DataVault ● Platform for integration with other RDM infrastructures Who and what is driving demand for ELNs? ● Researchers – Utility and convenience of paper lab book + online capabilities – On multiple devices – File management/integration ● Groups/PIs – Controlled sharing – Collaboration – Group management – File management/integration ● Institutions: data librarians, research admins, IT, commercialisation offices – Enterprise features: Scalable deployment, Single Sign On – IP protection: audit trail, signing – Publishing – Archiving – Repository integration – File management/integration RSpace ● Conceived in response to Wisconsin RFP and trial 2011 - 2012 ● Developed with Wisconsin by Research Space 2012 - 2013 Researcher experience Sketching √ Image annotation √ Chemical structures √ Notebook √ Forms √ Templating √ Snippets √ PDF export √ Export to html √ File gallery √ Journal view √ Tablet friendly √ Clean design √ Performance √ Round trip editing √ Offline access √ PI/Lab support Sharing √ Messaging √ Lab set up enabled √ Group management √ Inter-group collaboration √ Institutional requirements (IT,