Science As an Open Enterprise
Total Page:16
File Type:pdf, Size:1020Kb
Science as an open enterprise June 2012 Science as an open enterprise The Royal Society Science Policy Centre report 02/12 Issued: June 2012 DES24782 ISBN: 978-0-85403-962-3 © The Royal Society, 2012 The text of this work is licensed under Creative Commons Attribution-NonCommercial-ShareAlike CC BY-NC-SA. The license is available at: creativecommons.org/licenses/by-nc-sa/3.0/ Images are not covered by this license and requests to use them should be submitted to [email protected] Requests to reproduce all or part of this document should be submitted to: The Royal Society Science Policy Centre 6 – 9 Carlton House Terrace London SW1Y 5AG T +44 20 7451 2500 E [email protected] W royalsociety.org Cover image: The Spanish Cucumber E. Coli. In May 2011, there was an outbreak of a unusual Shiga-Toxin producing strain of E.Coli, beginning in Hamburg in Germany. This has been dubbed the ‘Spanish cucumber’ outbreak because the bacteria were initially thought to have come from cucumbers produced in Spain. This figure compares the genome of the outbreak E. Coli strain C227-11 (left semicircle) and the genome of a similar E. Coli strain 55989 (right semicircle). The 55989 reference strain and other similar E.Coli have been associated with sporadic human cases but never large scale outbreak. The ribbons inside the track represent homologous mappings between the two genomes, indicating a high degree of similarity between these genomes. The lines show the chromosomal positioning of repeat elements, such as insertion sequences and other mobile elements, which reveal some heterogeneity between the genomes. Section 1.3 explains how this genome was analysed within weeks because of a global and open effort; data about the strain’s genome sequence were released freely over the internet as soon as they were produced. This figure is from Rohde H et al (2011). Open-Source Genomic Analysis of Shiga-Toxin– Producing E. coli O104:H4. New England Journal of Medicine, 365, 718-724. © New England Journal of Medicine. Science as an open enterprise: open data for open science Contents Working group ...............................................5 Chapter 3 – The boundaries of openness .....44 Summary ........................................................7 3.1 Commercial interests and economic The practice of science ...........................................7 benefits..........................................................44 Drivers of change: making intelligent openness 3.1.1 Data ownership and the exercise of ....... standard ..................................................................7 intellectual property rights ...................45 New ways of doing science: computational and 3.1.2 The exercise of intellectual property ....... communications technologies ................................7 rights in university research .................47 Enabling change ......................................................8 3.1.3 Public-private partnerships...................49 Communicating with citizens ..................................8 3.1.4 Opening up commercial information in .. The international dimension ....................................9 the public interest .................................51 Qualified openness ..................................................9 3.2 Privacy ...........................................................51 Recommendations ........................................10 3.3 Security and safety .......................................57 Data terms ....................................................12 Chapter 4 – Realising an open data Chapter 1 – The purpose and practice culture: management, of science .................................13 responsibilities, tools and 1.1 The role of openness in science ...................13 costs .........................................60 1.2 Data, information and effective 4.1 A hierarchy of data management .................60 communication .............................................14 4.2 Responsibilities..............................................62 1.3 The power of intelligently open data ............15 4.2.1 Institutional strategies ..........................63 1.4 Open science: aspiration and reality .............16 4.2.2 Triggering data release .........................64 1.5 The dimensions of open science: value 4.2.3 The need for skilled data scientists .....64 outside the science community ....................17 4.3 Tools for data management .....................644.4 1.5.1 Global science, global benefits ............17 Costs .....................................................................66 1.5.2 Economic benefit .................................19 1.5.3 Public and civic benefit ........................22 Chapter 5 – Conclusions and recommendations .....................70 Chapter 2 – Why change is needed: 5.1 Roles for national academies ........................70 challenges and opportunities ....24 5.2 Scientists and their institutions .....................71 2.1 Open scientific data in a data-rich world ......26 5.2.1 Scientists ..............................................71 2.1.1 Closing the data-gap: maintaining .......... 5.2.2 Institutions (universities and research .... science’s self-correction principle ........26 institutes) ..............................................71 2.1.2 Making information accessible: 5.3 Evaluating university research ......................73 Diverse data and diverse demands ......28 5.4 Learned societies, academies and 2.1.3 A fourth paradigm of science? .............31 Professional bodies .......................................74 2.1.4 Data linked to publication and the .......... 5.5 Funders of research: research councils promise of linked data technologies ....31 and charities ..................................................74 2.1.5 The advent of complex computational ... 5.6 Publishers of scientific journals ....................76 simulation .............................................35 5.7 Business funders of research ........................76 2.1.6 Technology-enabled networking and ...... 5.8 Government ...................................................76 collaboration .........................................37 5.9 Regulators of privacy, safety and security ....78 2.2 Open science and citizens ............................38 2.2.1 Transparency, communication and trust ...............................................38 2.2.2 Citizens’ involvement in science ..........39 2.3 System integrity: exposing bad practice and fraud .......................................................41 Science as an open enterprise 3 Contents Glossary ........................................................79 Appendix 3 – Examples of costs of digital repositories ............................92 Appendix 1 – Diverse databases ..................83 International and large national repositories Discipline-wide openness - major international (Tier 1 and 2) .........................................................92 bioinformatics databases ......................................83 1. Worldwide protein data bank Processing huge data volumes for networked (wwpdb) ....................................................92 particle physics .....................................................83 2. UK data archive .........................................93 Epidemiology and the problems of data 3. Arxiv.Org ...................................................94 heterogeneity ........................................................84 4. Dryad .........................................................95 Improving standards and supporting regulation Institutional repositories (tier 3) ............................96 In nanotechnology ................................................84 5. Eprints soton .............................................96 The avon longitudinal study of parents and 6. Dspace@mit ..............................................97 children (alspac) ....................................................84 7. Oxford university research archive Global ocean models at the uk national and databank ............................................99 oceanography centre ............................................84 The UK land cover map at the centre for Appendix 4 – Acknowledgements, ecology & hydrology .............................................85 evidence, workshops and Scientific visualisation service for the consultation ..........................100 international space innovation centre ...................85 Evidence submissions .........................................100 Laser interferometer gravitational-wave Evidence gathering meetings ............................. 101 observatory project ...............................................85 Further consultation ............................................104 Astronomy and the virtual observatory ................86 Appendix 2 – Technical considerations for open data ..........................87 Dynamic data ........................................................87 Indexing and searching for data ...........................87 Servicing and managing the data lifecycle ...........87 Provenance ............................................................89 Citation ..................................................................90 Standards and interoperability ..............................91 Sustainable data ....................................................92 4 Science as an open enterprise Membership of Working Group The members of the Working Group involved in