Managing Sensitive Data and Authorship in Humanities and Social Sciences

Total Page:16

File Type:pdf, Size:1020Kb

Managing Sensitive Data and Authorship in Humanities and Social Sciences Managing sensitive data and authorship in Humanities and Social Sciences Louise Corti Collections Development and Producer Support ODIN conference, Cologne October 2013 Overview • Introducing the UK Data Service • Our data portfolio and users • Citation, impact measurement and DOIs • Challenges for social science citation The UK Data Archive • Based at the University of Essex, since 1967 • 45 years of selecting, ingesting, curating and providing access to social science data • designated as Place of Deposit by The National Archives • Data and data support services for higher and further education for research, teaching and learning • Recently attained the highest information security standard, ISO 27001 University of Essex The Archive SISTER DATA ARCHIVES Council of European Social Science Data Archives (CESSDA ) ICPSR (USA) Inter-University Consortium for Political and Social Research ADA Australian Social Science Data Archive What is the UK Data Service? • Comprehensive data resource funded by the UK Economic and Social Research Council • Single virtual point of access to a wide range of secondary data for social science research (Directed from Essex) • Offer promotion, support, training and guidance What does the UK Data Service do? • Put together a collection of the most valuable data • Preserve data for the long term for future research purposes • Make the data and documentation available for reuse • Provide data management advice for data creators • Provide training and support for users of the service • Bring together owners, producers and users • Demonstrate impact through evidence of usage • Easy access through website - ukdataservice.ac.uk Who is our service for? • Data for secondary analysis, research, policy making • Teaching and learning • Academic researchers and students • Government analysts • Charities and foundations • Business consultants • Independent research centres • Think tanks Our data portfolio • Over 6,000 datasets in the collection • 230 new datasets added each year • Official agencies - mainly central government • International statistical time series • Individual academic’ research grants • Market research agencies • Public records/historical sources • Access to international data via links with other data archives worldwide UK survey series • High quality repeated cross-sectional surveys • Individual or household level data • Cover many topics including health, work, crime, social attitudes, family expenditure, living costs, housing etc. • Labour Force Survey • British Crime Survey • Health Survey for England • British Social Attitudes • Annual Population Survey …. Cross-national surveys and macro databanks • Eurobarometers • European Social Survey • European Values Survey • International Social Survey Programme • Time series data aggregated to country/region • International governmental organisations (IMF, OECD, IEA, World Bank) Longitudinal studies • British Household Panel Survey and Understanding Society • Understanding Society (2009-) • English Longitudinal Study of Ageing • Families and Children Study • Growing Up in Scotland • Longitudinal Study of Young People in England UK census data • 1971-2011 census data • Baseline for other statistics • Detailed combinations of characteristics • Small geographies • Census outputs • Aggregate data • Boundary data • Flow data • Microdata Business data • Collected through a wide range of surveys, and administrative sources: • productivity, innovation, workforce skills, earnings • international trade, foreign direct investment • research and development • business demography • industrial relations Qualitative data • Interviews, focus groups • Essays, diaries, open-ended survey questions • Observations, case notes etc. • Family Life and Work Experience before 1918, Middle and Upper Class Families in the Early 20th Century,1870-1977 • Gender Difference, Anxiety and the Fear of Crime, 1995 • Mothers Alone: Poverty and the Fatherless Family, 1955-1966 Usage of data • Operate a spectrum of access • over 22,000 • Web download under End registered users User Licence • approximately • Permission only via Special 60,000 downloads Licence access worldwide p.a. • ‘Approved researcher’ access • 3,000+ user support via remote secure access queries • End user licence includes: • Appropriate data usage • Full citation of data and informing us of re-use • Have always provided a citation format Evidence of access and re-use User access information • Collect user information and ‘projects’ upon registration • Collate data and documentation download statistics • Users can share project information for others to see • Report data access stats on demand Usage information • Email all users every 6 months after registration about activity • Manually add all research outputs references to the data record • Reporting rate of publications is poor! • Prior to DOIs, have scanned citation literature for dataset mentions – very manual and unreliable, and poorly cited Impactful case studies of use • Identify and seek out case studies of re-use: research or teaching. • Very successful! • 125 case studies in our database • Can help provide impact stories for data owners/producers and users • And can inspire others! • Some are harvested by ESRC for their website • Often include ongoing work – no need to wait for publications Our Persistent identifiers approach • Our data collections are not digital objects • Need to capture changes made to data • Versioning data in a commonly understood manner • Needed rule-based definition of a‘significant’change • Integrate processes with digital preservation activities & work flows • In 2011 we assigned Datacite DOIs for all of our collections • Mint and update DOIs with our metadata management infrastructure Recording significant change • Approx. 15% UKDA data collections are altered within first year after first publication • We have distinguished between major and minor changes to a data collection = high impact vs. low impact • DOI allocated to a metadata instance of a data collection • DOIs resolve to jump page pointing to all external instances • New DOI = High Impact change, with explicit logging • Provided access only to most up-to-date version of data Major changes – high impact • New variable added • New labels/value codes added • Weighting variables reconstructed • Wrong data supplied (e.g., March not April) • Mis-coded data (e.g., Don’t know/Refused confused) • Change in format (file migration) • Significant changes in documentation • Change in access conditions Raising awareness in the social sciences • ESRC funding for short-term project on citation • Advocacy for best practice in citing research data • Audiences • Professional organisations • Academic publishers and journal editors • Researchers and postgraduates • Key activities • Data citation principles for social sciences • Personal communications • Events with BL DataCite, JISC and wider PI community • Outreach through Doctoral Training Centres Making Demonstrating impact with citation • Assuming better use of DOIS… • Starting to search for use of our DOIs – Google • Automate this process and compile reports; promote • Gather data citation statistics from Thomson Reuters Data Citation Index. One of the early 20 feeder repositories, but our own access limited! • Work with BL Datacite and ODIN to gain connectivity between identifiers & outputs – early adopters CHALLENGES FOR THE FUTURE • Citing parts (fragments) of data collections • single files • subsets of quantitative data • extracts of textual data • ESRC project Digital Futures will enable extract level citation within a web-based browsing system • Using rich highly structured XML metadata • GUIDS for everything UK Quali Bank Resolving citation objects • Will enable extract level citation • Citation object and citation format created on the fly – using GUIDS and URI • URI resolves directly to the data extract • Some more sensitive collections will be closed, so cannot resolve to data • As yet uncertain of relationship to our collection- level DOIs CONTACT UK Data Service University of Essex Wivenhoe Park Colchester Essex CO4 3SQ • ……………..…..……………………….. T +44 (0)1206 872001 E [email protected] .
Recommended publications
  • Data and Services Data and Services
    Data and services Data and services The UK Data Service provides researchers with flexible research support, training and access to the UK’s largest collection of social, economic and population data. We are a critical part of the UK’s research infrastructure, established to facilitate high quality, impactful research and education across all sectors. We enable researchers, teachers and policy makers to get an in-depth understanding of social and economic issues – and greater insights into tackling social challenges, now and in the future. A partnership of five universities and Jisc, we are funded by the Economic and Social Research Council (ESRC) and the collections of national data resources we provide access to have been built up over 50 years. Our holdings include major UK government-sponsored surveys, longitudinal studies, UK census data, international macrodata, business microdata, cross-national surveys, qualitative data and administrative data. Services that meet Data Assessment your needs We offer significant expertise across ukdataservice.ac.uk the data publishing life cycle, from data appraisal, quality and disclosure risk Teaching assessment, through to preparation The use of real data in coursework and documentation. adds authenticity and relevance to teaching resources and gives students We work closely with data owners to the data analysis skills necessary help triage and share data with the to make significant contributions appropriate governance measures to society. in place. Research We offer data cleaning, rescue and We provide a Trusted Digital Repository data 'archaeology' services, which we for social and economic researchers’ undertake alongside our core data data through our lead partner, the UK preparation activities.
    [Show full text]
  • Using Numeric Datasets in Learning and Teaching Final Report, 2002
    An enquiry into the use of numeric data in learning & teaching S e p t e m b e r, 2001 Robin Rice (Project Manager) Edinburgh UniversityData Library Peter Burnhill (Project Director) Edinburgh UniversityData Library, EDINA Melanie Wright (Project Te a m ) The UK Data Archive Sean Townsend (Project Te a m ) British Library of Political and Economic Science Part 1: Report and recommendations C o n t e n t s Executive summary 3 Introduction and participants 4 Aims and objectives of the project 5 Nature of the enquiry (or why it matters) 5 Methods used in the enquiry 6 Discussion of results 7 Recommendations and conclusions 17 References 20 Appendix A: Related developments 21 Appendix B: Case studies of current practice 23 Part 2: Teachers’ survey results 4 1 An enquiry into the use of numeric data in learning & teaching page 3 Executive summary Within UK higher education the renewed attention to learning and teaching is an impetus for change. Advances in information technology create new space for learning beyond the traditional classroom lecture format. New initiatives are creating networked teaching materials for shared use across institutions. But little is known about the readiness of teachers and students to take advantage of these resources for teaching and study. Are universities providing the support needed for using these networked resources in classrooms, computer labs, and independent study? An academic Task Force on the use of numeric data in learning and teaching has issued a report on the barriers faced by teachers and students to using national data services across a number of disciplines, including but not limited to the social sciences.
    [Show full text]
  • Skills, Not Just Diplomas
    Restoring and sustaining growth in Eastern Europe and Central Asia requires reforms to Managing Education for Results in Eastern EuropeSkills, and Central Not Asia Just Diplomas boost competitiveness and increase labor productivity. Among the required changes are reforms to education. In surveys conducted immediately before the economic crisis, companies in the region reported shortage of skills as one of the most significant bottle- necks in their operations, suggesting that education systems in Eastern Europe and Central Asia— with a reputation for high enrollment rates and well-trained teachers—still need to improve their performance. In fact, international test results show that many students—outside of a handful of coun- tries in the region—are failing to acquire more than the most basic literacy and numeracy skills. Anecdotal evidence also indicates that the rapid expansion in higher education has led to a decline in the quality and relevance of education provided. At the same time, there are few opportunities for adults to retrain, upgrade, or acquire new skills—the life-long learning needed for employability. As Skills, Not Just Diplomas suggests, the shortage of skills is a wake-up call to reform education and training systems to provide higher quality education with the flexibility for students and training institutions to better respond to market signals. Such deep reform will have to center on the following: DIRECTIONSINDEVELOPMENT • Focusing more on measuring whether students learn and graduates find jobs, and Human Development using this information to actively improve teaching and learning. • Using incentives across the education system, including granting greater autonomy to institutions on curriculum, teaching methods, resource use and institutional mission, and increasing accountability for learning.
    [Show full text]
  • Digital Humanities and Future Archives
    Digital Humanities and Future Archives SAMPO VIIRI REPORTS The Finnish Institute is a London-based private trust. Our mission is to identify emerging issues relevant to contemporary society and to act as catalyst for positive social change through partnerships. We work with artists, researchers, experts and policy makers in the United Kingdom, Finland and the Republic of Ireland to promote strong networks in the fields of culture and society. We encourage new and unexpected collaborations and support artistic interventions, research, the creative industries, foresight and social innovation in new, socially central areas. The Reports of the Finnish Institute in Unit 1, 3 York Way N1C 4AE London London is a series of publications, which United Kingdom publishes research, studies and results of T +44(0)20 3764 5090 collaborative projects carried out by the www.finnish-institute.org.uk institute. The reports provide evidence Digital Humanities and Future Archives and ideas for policy-makers and civic by the Finnish Institute in London is society organisations dealing with licensed under a Creative Commons contemporary social and cultural Attribution-ShareAlike 3.0 Unported License challenges. August 2014 Table of Contents Preface Executive Summary 1 Introduction 1 1.1 Objectives 1 1.2 Methods 3 1.3 Defining Digital Humanities 4 2 The Researcher's Perspective 8 2.1 Access to Historical Resources and Public Interest 8 2.2 New Tools and Methods 14 2.3 Sharing and Preserving 19 2.4 Narrowing Gaps and Building Bridges 20 2.5 Measuring the Impacts
    [Show full text]
  • A Tale of Research Data Management in Practice
    A tale of research data management in practice Veerle Van den Eynden UK Data Archive University of Essex Embracing Data Management - Bridging the Gap Between Theory and Practice VLIR-FOSTER seminar Brussels, 4 June 2015 Science advances through data sharing Data used (national surveys): Public Risk Perceptions, Climate Change and the Reframing of UK Energy Policy in Britain, 2005; Public Perceptions of Climate Change and Energy Futures in Britain, 2010; … Society benefits from data sharing Helping researchers manage and share data • Data management = organisation, documentation, storage, safeguarding, preservation and accessibility of data, incl. ethical and legal aspects of data handling and data ownership • Data sharing = release of data for use by other people Why manage research data well ? • Data creation in research is often expensive • Data = cornerstone of research • Data underpin published findings • Good quality data = good quality research • Protect data from loss, destruction,… • Compliance with ethical codes, data protection laws, journal requirements, funder policies Research integrity Openness Boost factors: research funders, EU European open access policies: Horizon 2020, European Research Council (ERC) • communication & recommendation on access to / preservation of scientific information (July 2012) (publications & research data) • pilot on open access to research data, primarily data underlying (open access) scientific publications for Horizon 2020 • data management guidelines for Horizon 2020 (~ policies) generally based on OECD Principles and Guidelines for Access to Research Data fro m Public Funding Boost factors: research funders, UK • Publicly funded research data are a public good, produced in the public interest, that should be made openly available with as few restrictions as possible in a timely and responsible manner that does not harm intellectual property.
    [Show full text]
  • VIII. Appraisal R8. the Repository Accepts Data and Metadata Based
    Assessment Information CoreTrustSeal Requirements 2017–2019 Repository: UK Data Archive Website: https://www.data-archive.ac.uk/ Certification Date: 18 May 2020 This repository is owned by: University of Essex CoreTrustSeal Board W www.coretrustseal.org E [email protected] UK Data Archive Notes Before Completing the Application We have read and understood the notes concerning our application submission. True Reviewer Entry Reviewer 1 Comments: Reviewer 2 Comments: CORE TRUSTWORTHY DATA REPOSITORIES REQUIREMENTS Background & General Guidance Glossary of Terms BACKGROUND INFORMATION Context R0. Please provide context for your repository. Repository Type. Select all relevant types from: Domain or subject-based repository, National repository system; including governmental, Library/Museum/Archives, Research project repository Reviewer Entry Reviewer 1 Comments: Accept Reviewer 2 Comments: Accept Brief Description of Repository The UK Data Archive (W01) domain/subject-based repository in that, through the UK Data Service Collections Development Policy (F01) we focus on data of interest to researchers in the social sciences and humanities. We also hold the legacy collection of the History Data Service. We are part of our national repository system, including governmental data and receive deposits from the Office of National Statistics (ONS), which reports through the UK Statistics Authority to the UK Parliament. Library/Museum/Archives: We are a traditional data archive, as part of the range of data services that we support. We are a research project repository as a mandated place of deposit for research data from Economic and Social Research Council (ESRC) (R01) grants. W01 UK Data Archive web site https://www.data-archive.ac.uk/ F01 Collections Development Policy https://ukdataservice.ac.uk/media/398725/cd227-collectionsdevelopmentpolicy.pdf R01 Economic and Social Research Council (ESRC) https://esrc.ukri.org/ Reviewer Entry Reviewer 1 Comments: Accept Reviewer 2 Comments: Accept Brief Description of the Repository’s Designated Community.
    [Show full text]
  • Effectiveness of Tax Incentives to Boost (Retirement) Saving: Theoretical Motivation and Empirical Evidence
    OECD Economic Studies No. 39, 2004/2 EFFECTIVENESS OF TAX INCENTIVES TO BOOST (RETIREMENT) SAVING: THEORETICAL MOTIVATION AND EMPIRICAL EVIDENCE Orazio P. Attanasio, James Banks and Matthew Wakefield TABLE OF CONTENTS Introduction ................................................................................................................................. 146 How are tax incentives supposed to work? a simple theoretical outline............................ 147 Evidence on the effectiveness of individual retirement accounts in the United States.................................................................................................................... 150 The UK experience ..................................................................................................................... 155 Aggregate evidence on the effectiveness of TESSAs ........................................................ 157 Micro-evidence on the effectiveness of ISAs ...................................................................... 159 Conclusions and thoughts for further research ....................................................................... 166 Appendix. Analysis at the individual level ................................................................................. 169 Bibliography ............................................................................................................................... 172 The authors are from University College London, Institute for Fiscal Studies and National Bureau of Economic Research; University
    [Show full text]
  • The Role of Case Studies in Effective Data Sharing, Reuse and Impact by Rebecca Parsons1 and Scott Summers2
    IASSIST QUARTERLY The Role of Case Studies in Effective Data Sharing, Reuse and Impact by Rebecca Parsons1 and Scott Summers2 Abstract The effectiveness and impact of social science research is under constant review. From the sharing, reuse and archiving of social science research data to the outcomes, reach and impact of research, social science professionals are under increasing pressure to realise the maximum potential of their data collections and their research findings. The UK Data Service is playing a key role in supporting researchers in this process and is using detailed and well-received case studies to provide them with guidance on the best practice for sharing and reusing data and also identifying and capturing the impact of research. The impact of research is now routinely considered when examining the ‘success’ of funded projects, but the reality of identifying and capturing impact can be a challenge. Publishing data in its own right is now recognised as being impactful to funders, yet exposing this narrative through ‘showcasing’ is under exploited. Such narratives can incentivise others to share data and also improve the quality of the data and documentation. This paper seeks to explore the role that case studies of research can play in this regard. The paper also examines the role that depositor and user case studies can play in enhancing the reuse of a showcased data collection. To achieve this, a variety of illustrative depositor and impact case studies are discussed, highlighting the role that these can have on research projects. Keywords Case Study, Impact, Data Sharing, Data Management. Introduction This paper explores the use of case studies in a variety of ways; First, it considers cases studies in regards to research and impact.
    [Show full text]
  • Policy-Making for Research Data in Repositories: a Guide
    Policy-making for Research Data in Repositories: A Guide http://www.disc-uk.org/docs/guide.pdf Image © JupiterImages Corporation 2007 May 2009 Version 1.2 by Ann Green Stuart Macdonald Robin Rice DISC-UK DataShare project Data Information Specialists Committee -UK Table of Contents i. Introduction 3 ii. Acknowledgements 4 iii. How to use this guide 4 1. Content Coverage 5 a. Scope: subjects and languages 5 b. Kinds of research data 5 c. Status of the research data 6 d. Versions 7 e. Data file formats 8 f. Volume and size limitations 10 2. Metadata 13 a. Access to metadata 13 b. Reuse of metadata 13 c. Metadata types and sources 13 d. Metadata schemas 16 3. Submission of Data (Ingest) 18 a. Eligible depositors 18 b. Moderation by repository 19 c. Data quality requirements 19 d. Confidentiality and disclosure 20 e. Embargo status 21 f. Rights and ownership 22 4. Access and Reuse of Data 24 a. Access to data objects 24 b. Use and reuse of data objects 27 c. Tracking users and use statistics 29 5. Preservation of Data 30 a. Retention period 30 b. Functional preservation 30 c. File preservation 30 d. Fixity and authenticity 31 6. Withdrawal of Data and Succession Plans 33 7. Next Steps 34 8. References 35 INTRODUCTION The Policy-making for Research Data in Repositories: A Guide is intended to be used as a decision-making and planning tool for institutions with digital repositories in existence or in development that are considering adding research data to their digital collections.
    [Show full text]
  • UK Data Service Collections Development Policy
    Collections Development Policy External 18 February 2021 Version: 07.00 T +44 (0)1206 872001 E [email protected] ukdataservice.ac.uk CD227-CollectionsDevelopmentPolicy Contents 1. Scope ........................................................................................................................................................ 2 2. Definition of Terms ............................................................................................................................. 2 3. Roles and Responsibilities ............................................................................................................... 3 4. Policy ........................................................................................................................................................ 4 4.1. Background...................................................................................................................................... 4 4.2. Scope of Collections ...................................................................................................................... 4 4.3. Curation Categories ....................................................................................................................... 5 4.4. External Influences ........................................................................................................................ 5 4.5. Principles of Collections Development ....................................................................................... 6 5. Procedures for
    [Show full text]
  • Recommendations for Improving Research Access to Potentially
    Ymchwil gymdeithasol Social research Rhif/Number: 26/2014 Recommendations for Improving Research Access to Potentially Disclosive Data Author Tanvi Desai London School of Economics Views expressed in this report are those of the researcher and not necessarily those of the Welsh Government) For further information please contact: Sarah Lowe Knowledge and Analytical Services Welsh Government Cathays Park Cardiff CF10 3NQ Tel: 02920 826229 Email: [email protected] Welsh Government Social Research, 2014 ISBN: 978-1-4734-1083-1 © Crown Copyright 2014 Table of Contents Glossary ..................................................................................................................... 3 1. Introduction .......................................................................................................... 4 2. Methods .............................................................................................................. 6 3. Data Access Landscape ...................................................................................... 8 International Landscape ....................................................................................... 11 UK Landscape ...................................................................................................... 12 4. Legal and Ethical Considerations ...................................................................... 15 Data sharing ......................................................................................................... 17 Data linking ..........................................................................................................
    [Show full text]
  • UK Data Archive Preservation Policy
    Preservation Policy UK Data Archive PUBLIC 26 January 2021 Version: 12.00 ------------------- T +44 (0)1206 872001 E [email protected] www.data-archive.ac.uk ------------------- ……………………………............................................................................... WE ARE SUPPORTED BY THE UNIVERSITY OF ESSEX, THE UNIVERSITY OF MANCHESTER, THE ECONOMIC AND SOCIAL RESEARCH COUNCIL, AND JISC CD062-PreservationPolicy_12_00w Contents 1 Purpose ................................................................................................................ 3 2 Scope .................................................................................................................... 4 2.1 Objectives ......................................................................................................... 4 3 Requirements ....................................................................................................... 5 3.1 Legal and regulatory framework ........................................................................... 6 4 Roles and Responsibilities .................................................................................... 6 5 Model .................................................................................................................... 7 5.1 Pre-ingest function ............................................................................................. 7 5.2 Ingest function ................................................................................................... 8 5.3 Archival storage
    [Show full text]