Turning FAIR Data Into Reality

Total Page:16

File Type:pdf, Size:1020Kb

Turning FAIR Data Into Reality Final Report and Action Plan from the European Commission Expert Group on FAIR Data TURNING FAIR INTO REALITY Research and Innovation 2018 Turning FAIR into reality European Commission Directorate General for Research and Innovation Directorate B – Open Innovation and Open Science Unit B2 – Open Science Contact Athanasios Karalopoulos E-mail [email protected] [email protected] European Commission B-1049 Brussels Manuscript completed in November 2018. This document has been prepared for the European Commission however it reflects the views only of the authors, and the Commission cannot be held responsible for any use which may be made of the information contained therein. More information on the European Union is available on the internet (http://europa.eu). Luxembourg: Publications Office of the European Union, 2018 Print ISBN 978-92-79-96547-0 doi:10.2777/54599 KI-06-18-206-EN-C PDF ISBN 978-92-79-96546-3 doi: 10.2777/1524 KI-06-18-206-EN-N © European Union, 2018. Reuse is authorised provided the source is acknowledged. The reuse policy of European Commission documents is regulated by Decision 2011/833/EU (OJ L 330, 14.12.2011, p. 39). For any use or reproduction of photos or other material that is not under the EU copyright, permission must be sought directly from the copyright holders. The Expert Group operates in full autonomy and transparency. The views and recommendations in this report are those of the Expert Group members acting in their personal capacities and do not necessarily represent the opinions of the European Commission or any other body; nor do they commit the Commission to implement them. 2018 EUROPEAN COMMISSION Turning FAIR into reality Final Report and Action Plan from the European Commission Expert Group on FAIR Data European Commission Expert Group on FAIR Data Sandra Collins, National Library of Ireland, Ireland: https://orcid.org/0000-0003-2286-8540 Françoise Genova, Observatoire Astronomique de Strasbourg, France: https://orcid.org/0000-0002-6318-5028 Natalie Harrower, Digital Repository of Ireland, Ireland: https://orcid.org/0000-0002-7487-4881 Simon Hodson, CODATA, France, Chair of the Group: https://orcid.org/0000-0003-3179-7270 Sarah Jones, Digital Curation Centre, UK, Rapporteur: https://orcid.org/0000-0002-5094-7126 Leif Laaksonen, CSC-IT Center for Science, Finland: https://orcid.org/0000-0002-2161-4461 Daniel Mietchen, Data Science Institute, University of Virginia, USA: https://orcid.org/0000-0001-9488-1870 Rūta Petrauskaité, Vytautas Magnus University, Lithuania: http://orcid.org/0000-0002-6948-3202 Peter Wittenburg, Max Planck Computing and Data Facility, Germany: https://orcid.org/0000-0003-3538-0106 2018 Directorate-General for Research and Innovation 4 Turning FAIR into reality Final Report and Action Plan on FAIR Data Table of contents Foreword by Commissioner Carlos Moedas ...............................................................................................................................................7 Preface ............................................................................................................................................................................................................................8 1. Executive summary .........................................................................................................................................................................................10 1.1 Concepts for FAIR ..................................................................................................................................................................................10 1.2 Research culture and FAIR ...............................................................................................................................................................11 1.3 Technical ecosystem for FAIR data .............................................................................................................................................12 1.4 Data science and stewardship skills ..........................................................................................................................................13 1.5 Metrics for FAIR data and assessment frameworks to certify FAIR services ......................................................13 1.6 Sustainable and strategic funding ..............................................................................................................................................14 1.7 Priority recommendations ................................................................................................................................................................14 2. Concepts – why FAIR? .....................................................................................................................................................................................18 2.1 Origin of FAIR...........................................................................................................................................................................................18 2.2 Definition of FAIR ...................................................................................................................................................................................19 2.3 FAIR and Open data .............................................................................................................................................................................21 2.4 Application and implementation of FAIR ..................................................................................................................................22 2.5 A FAIR ecosystem to support FAIR Digital Objects .............................................................................................................25 3. Creating a culture of FAIR data ................................................................................................................................................................26 3.1 Research culture and FAIR data ....................................................................................................................................................26 3.2 Developing disciplinary interoperability frameworks for FAIR .....................................................................................27 3.3 Making research workflows FAIR ..................................................................................................................................................29 3.4 Data Management Plans and FAIR ..............................................................................................................................................30 3.5 Benefits and incentives ......................................................................................................................................................................32 4. Creating a technical ecosystem for FAIR data .................................................................................................................................35 4.1 FAIR Digital Objects ..............................................................................................................................................................................35 4.2 The technical ecosystem for FAIR data.....................................................................................................................................36 4.3 Data standards, metadata standards, vocabularies and ontologies .......................................................................40 4.4 Registries, repositories and certification ..................................................................................................................................43 4.5 Automatic processing at scale .......................................................................................................................................................45 5. Skills and capacity building .........................................................................................................................................................................46 5.1 Data science and data stewardship skills for FAIR ............................................................................................................46 5.2 Professionalising roles and curricula .........................................................................................................................................47 6. Measuring change ............................................................................................................................................................................................50 6.1 Metrics / indicators ...............................................................................................................................................................................50 6.2 A maturity model for FAIR ................................................................................................................................................................51 6.3 How to track and evidence change and improvements ..................................................................................................54 5 Turning FAIR into reality Final Report and Action Plan on FAIR Data 7. Funding and sustaining FAIR data ...........................................................................................................................................................55
Recommended publications
  • Etriks -Standards Starter Pack Standards Guidelines
    eTRIKS -Standards Starter Pack Standards Guidelines Release 1.1 - 25th April 2016 Authors (by alphabetical order) Bratfalean, Dorina – CDISC Europe Foundation, Braxenthaler, Michael – Roche Innovation Center New York, Houston, Paul – CDISC Europe Foundation (*), Munro, Robin – ID Business Solutions Limited, Richard, Fabien – Centre National de la Recherche Scientifique, Rocca-Serra, Philippe – Oxford e-Research Centre, University of Oxford (*), Romacker, Martin – Roche Innovation Center Basel, Sansone, Susanna-Assunta – Oxford e-Research Centre, University of Oxford. (*) Correspondence to [email protected] or [email protected] Version History Version Date Who Role Notes 1.0 11 Philippe Rocca- Draft creator and Draft for internal use February Serra, Fabien maintainer 2015 Richard, Dorina Bratfalean 1.1 25 April Philippe Rocca- maintainer Public release 2016 Serra Licence: https://creativecommons.org/licenses/by-sa/4.0/ 1 A Business Case for Standards in eTRIKS Part 1. Introduction 1.1 eTRIKS mission and objectives 1.2 Document objective 1.3 Intended Audience 1.4 Standard Definition and Typology 1.4.1 Definition of Standards: 1.4.2 Typology of Standards 1.5 Purpose of Standards Part 2. Procedure for standards selection and recommendation 2.1 Procedure outline 2.2 Attributes of standards 2.3 Versioning of Standards 2.4. Standardization Bodies and Service Providers 2.4. Gaps in Standards 2.4.1 Coverage gap in a domain covered by an existing standard 2.4.2 Coverage gap in a domain not covered by standards 2.5 Changes, maintenance and updates to eTRIKS Standard Starter Pack Part 3. Standards in data management 3.1 Standards for Data Security, Data Privacy and Compliance with Ethical Guidelines.
    [Show full text]
  • Easychair Preprint Machine Learning Algorithm for Assessing Reusability
    EasyChair Preprint № 4142 Machine Learning Algorithm for Assessing Reusability in Component Based Software Development Pooja Negi and Umesh Kumar Tiwari EasyChair preprints are intended for rapid dissemination of research results and are integrated with the rest of EasyChair. September 7, 2020 MACHINE LEARNING ALGORITHM FOR ASSESING REUSABILITY IN COMPONENT BASED SOFTWARE DEVELOPMENT 1Pooja Negi, 2Umesh Kumar Tiwari 1,2 Department of Computer Science and Engineering, Graphic Era Deemed to be University, Dehradun [email protected], 2 [email protected] Abstract- Software reusability has been present for several decades. Software reusability is defined as making new software from existing one. Objects that can be reused: design, code, software framework. We reviewed several approaches in this dissertation, i.e. object-oriented metrics, coupling factor, etc., by which the software's reusability increases. Therefore this thesis analysis on how to classify and reuse the program using those metrics and apply the algorithm of machine learning. In this thesis we test open source software and generate a ck metric of that source code then a machine learning algorithm will process the data using weka tool to give the result. We test coefficient of correlation, mean absolute error, root mean square error, relative absolute error and root relative square error less the program would be better from this we get 98.64 accuracy on online examination system software. Keywords- Reusability, Machine Learning Algorithm, Random Forest, ck-metric. 1. Introduction In today’s world every sector of service or industry is dependent on computer based application. Industry which develops and outsources the software service is major and growing rapidly in the world.
    [Show full text]
  • Quality-Based Software Reuse
    Quality-Based Software Reuse Julio Cesar Sampaio do Prado Leite1, Yijun Yu2, Lin Liu3, Eric S. K. Yu2, John Mylopoulos2 1Departmento de Informatica, Pontif´ıcia Universidade Catolica´ do Rio de Janeiro, RJ 22453-900, Brasil 2Department of Computer Science, University of Toronto, M5S 3E4 Canada 3School of Software, Tsinghua University, Beijing, 100084, China Abstract. Work in software reuse focuses on reusing artifacts. In this context, finding a reusable artifact is driven by a desired functionality. This paper proposes a change to this common view. We argue that it is possible and necessary to also look at reuse from a non-functional (quality) perspective. Combining ideas from reuse, from goal-oriented requirements, from aspect-oriented programming and quality management, we obtain a goal-driven process to enable the quality-based reusability. 1 Introduction Software reuse has been a lofty goal for Software Engineering (SE) research and prac- tice, as a means to reduced development costs1 and improved quality. The past decade has seen considerable progress in fulfilling this goal, both with respect to research ideas and industrial practices (e.g., [1–3]). Current reuse techniques focus on the reuse of software artifacts on the basis of de- sired functionality. However, non-functional properties (qualities) of a software system are also crucial. Systems fail because of inadequate performance, security, reliability, usability, or precision, to name a few. Quality concerns, therefore, should also be front and centre in methods for software reuse. For example, in designing for the NASA Mars Spirit spacecraft, one would not adopt a “cosine” function from an arbitrary mathemat- ical library.
    [Show full text]
  • Adaptability Evaluation at Software Architecture Level Pentti Tarvainen*
    The Open Software Engineering Journal, 2008, 2, 1-30 1 Open Access Adaptability Evaluation at Software Architecture Level Pentti Tarvainen* VTT Technical Research Centre of Finland, Kaitoväylä 1, P.O. Box 1100, FIN-90571 Oulu, Finland Abstract: Quality of software is one of the major issues in software intensive systems and it is important to analyze it as early as possible. An increasingly important quality attribute of complex software systems is adaptability. Software archi- tecture for adaptive software systems should be flexible enough to allow components to change their behaviors depending upon the environmental and stakeholders' changes and goals of the system. Evaluating adaptability at software architec- ture level to identify the weaknesses of the architecture and further to improve adaptability of the architecture are very important tasks for software architects today. Our contribution is an Adaptability Evaluation Method (AEM) that defines, before system implementation, how adaptability requirements can be negotiated and mapped to the architecture, how they can be represented in architectural models, and how the architecture can be evaluated and analyzed in order to validate whether or not the requirements are met. AEM fills the gap from requirements engineering to evaluation and provides an approach for adaptability evaluation at the software architecture level. In this paper AEM is described and validated with a real-world wireless environment control system. Furthermore, adaptability aspects, role of quality attributes, and diversity of adaptability definitions at software architecture level are discussed. Keywords: Adaptability, adaptation, adaptive software architecture, software quality, software quality attribute. INTRODUCTION understand the system [6]. Examples of design decisions are the decisions such as “we shall separate user interface from Today, quality of a software system plays an increasingly the rest of the application to make both user interface and important role in the domain of software engineering.
    [Show full text]
  • Software Maintainability and Reusability Using Cohesion Metrics
    International Journal of Computer Trends and Technology (IJCTT) – Volume 54 Issue 2-December2017 Software Maintainability and Reusability using Cohesion Metrics Adekola, O.D#1, Idowu, S.A*2, Okolie, S.O#3, Joshua, J.V#4, Akinsanya, A.O*5, Eze, M.O#6, EbiesuwaSeun#7 #1Faculty, Computer Science Department, Babcock University,Ilishan-Remo, Ogun State, Nigeria *2Faculty, Computer Science Department, Babcock University,Ilishan-Remo, Ogun State, Nigeria #3Faculty, Computer Science Department, Babcock University,Ilishan-Remo, Ogun State, Nigeria #4Faculty, Computer Science Department, Babcock University,Ilishan-Remo, Ogun State, Nigeria *5Faculty, Computer Science Department, Babcock University,Ilishan-Remo, Ogun State, Nigeria #6Faculty, Computer Science Department, Babcock University,Ilishan-Remo, Ogun State, Nigeria #7Faculty, Computer Science Department, Babcock University,Ilishan-Remo, Ogun State, Nigeria Abstract - Among others, remarkable external software’s lifetime. Ahn et al., (2003) estimated that quality attributes of interest to software practitioners/ maintenance takes up to 80% of the total costof engineers include testability, maintainability and producing software applications. Expectation of reusability.Software engineers still combat achieving more reliable, quicker time-to-market and softwarecrisis and even chronic software affliction maintainable systems. A lot of research has gone into not because there is no standardized software the areas of software reuse and maintenance due to development process but because enough attention is the fact that these among other issues concern not given to seemingly insignificant but crucial intimately system developers/architects/engineers details of internal design attributes such as cohesion rather than end-users. Therehas been enormous and coupling especially in object-oriented systems. growth in software reuse research from the days of Consequently, the aftermath is increased structured programming concepts to object-oriented maintenance cost, effort and time which negatively methods and beyond (e.g.
    [Show full text]
  • Software Reusability: Approaches and Challenges
    International Journal of Research and Innovation in Applied Science (IJRIAS) |Volume VI, Issue V, May 2021|ISSN 2454-6194 Software Reusability: Approaches and Challenges Moko Anasuodei1, Ojekudo, Nathaniel Akpofure2 1Department of Computer Science and Informatics, Faculty of Science, Federal University Otuoke, Nigeria 2Department of Computer Science, Faculty of Natural and Applied Sciences, Ignatius Ajuru University of Education, Nigeria Abstract: Software reuse is used to aid the software phases. Software engineering has been more centered on development process which in recent times can improve the original development which gives an optimal software at a resulting quality and productivity of software development, by faster and less costly price, a design process based on assisting software engineers throughout various software systemic reusable is now recognized. engineering phases to enhance the quality of software, provide quick turnaround time for software development using few Software reuse reduces efforts and cost, because software people, tools, and methods, which creates a good software development costs can be extremely high, but they are quality by enhancing integration of the software system to shared, such that the cost of reuse can be extremely low. provide a competitive advantage. This paper examines the One major advantage of software reuse suggested by concept of software reuse, the approaches to be considered for keswani et al (2014) explains that there is a significant need software reuse, which is broadly shared into three categories: component-based software reuse, domain engineering and for the number of bugs to be reduced during software software product lines, architecture-based software reuse and development process, such that instead of developing an challenges that affect the software reuse development process.
    [Show full text]
  • Is It Transferrable? Information's Reusability, Adaptability, And
    Is it Transferrable? Information’s Reusability, Adaptability, and Transportability through SCORM Macarena Aspillaga, Ph.D. VSD Corporation Lane, Suite 200 Virginia Beach, VA 23462 Abstract The need for the sharable content object reference model (SCORM) to decrease the size of its shareable content object (SCO) is evident, especially since the introduction of Web 2.0 environments and new delivery systems. If SCORM is to be part of these emerging technologies, it needs to decrease its SCO size to the activity level to allow greater reusability, repurpose, adaptability, and portability of its learning objects. This will keep courses current at a lower cost, as well as enhance the transfer of knowledge; it will also help teach competencies, which will boost productivity. Greater reusability will help increase mental models. Emerging technologies will require that SCORM incorporate new standards for navigation, as new mobile learning environments communicate in shorter segments, requiring smaller SCOs and a different data model. Background Advanced Distributed Learning (ADL) developed a collection of specifications and standards known as the sharable content object reference model, or SCORM, as a way to standardize e-learning within the defense industry. The need arose because each government contractor had its own system and guidelines, resulting in many inconsistencies. As a result, ADL is now in charge of publishing, governing, and updating SCORM specifications and standards. There have been several versions since SCORM’s inception in 1997. The latest version, SCORM 1.3, launched in 2004, and includes the ability to specify sequencing of activities that use content objects, and resolve ambiguities. This latest version also allows using and sharing information, regarding success status for multiple learning objectives or competencies across content objects and across courses for the same learner within the same learning management system (LMS).
    [Show full text]
  • Review of Software Reusability
    International Conference on Computer Science and Information Technology (ICCSIT'2011) Pattaya Dec. 2011 Review of Software Reusability Neha Budhija and Satinder Pal Ahuja Abstract— Reusability is the likelihood a segment of source The organizations that has experience in developing code that can be used again to add new functionalities with software, but not yet used the software reuse concept, slight or no modification. Reusable modules and classes there exists extra cost to develop the reusable reduce implementation time, increase the likelihood that prior components from scratch to build and strengthen their testing and use has eliminated bugs and localizes code reusable software reservoir [2].. The cost of developing modifications when a change in implementation is required. the software from scratch can be saved by identifying Subroutines or functions are the simplest form of reuse. A and extracting the reusable components from already chunk of code is regularly organized using modules or developed and existing software systems or legacy namespaces into layers. Proponents claim that objects and systems [3]. software components offer a more advanced form of reusability, although it has been tough to objectively measure and define II. RELATED WORK levels or scores of reusability. Reusability implies some explicit management of build, packaging, distribution, Software reuse has been practiced since programming installation, configuration, deployment, maintenance and began. Reuse as a distinct field of study in software upgrade issues. If these issues are not considered, software engineering, however, is often traced to Doug Mcilroy’s may appear to be reusable from design point of view, but paper which proposed basing the software industry on will not be reused in practice.
    [Show full text]
  • White Paper on Implementing the FAIR Principles for Data in the Social, Behavioural, and Economic Sciences
    A Service of Leibniz-Informationszentrum econstor Wirtschaft Leibniz Information Centre Make Your Publications Visible. zbw for Economics Betancort Cabrera, Noemi et al. Working Paper White Paper on implementing the FAIR principles for data in the social, behavioural, and economic sciences RatSWD Working Paper, No. 274 Provided in Cooperation with: German Data Forum (RatSWD) Suggested Citation: Betancort Cabrera, Noemi et al. (2020) : White Paper on implementing the FAIR principles for data in the social, behavioural, and economic sciences, RatSWD Working Paper, No. 274, Rat für Sozial- und Wirtschaftsdaten (RatSWD), Berlin, http://dx.doi.org/10.17620/02671.60 This Version is available at: http://hdl.handle.net/10419/229719 Standard-Nutzungsbedingungen: Terms of use: Die Dokumente auf EconStor dürfen zu eigenen wissenschaftlichen Documents in EconStor may be saved and copied for your Zwecken und zum Privatgebrauch gespeichert und kopiert werden. personal and scholarly purposes. Sie dürfen die Dokumente nicht für öffentliche oder kommerzielle You are not to copy documents for public or commercial Zwecke vervielfältigen, öffentlich ausstellen, öffentlich zugänglich purposes, to exhibit the documents publicly, to make them machen, vertreiben oder anderweitig nutzen. publicly available on the internet, or to distribute or otherwise use the documents in public. Sofern die Verfasser die Dokumente unter Open-Content-Lizenzen (insbesondere CC-Lizenzen) zur Verfügung gestellt haben sollten, If the documents have been made available under an Open
    [Show full text]
  • REUSABILITY and MAINTAINABILITY in OBJECT ORIENTED LANGUAGES Suvarnalata Hiremath1, C M Tavade2 1Associate Professor, Dept
    REUSABILITY AND MAINTAINABILITY IN OBJECT ORIENTED LANGUAGES Suvarnalata Hiremath1, C M Tavade2 1Associate Professor, Dept. of CS&E, BKEC, Basavakalyan, India, 2 Professor, Dept of E&TC, SIT-COE, Yadrav, India Abstract In object-oriented languages, inheritance plays an important part for software reusability and maintainability. The separation of sub typing and inheritance makes inheritance a more flexible mechanism reusing code. Object-oriented programming has been widely acclaimed as the technology that will support the creation of reusable software, particularly because of the "inheritance” facility. In this paper, we Object Oriented Programming is a practical and explore the importance of reusability and useful programming methodology that maintainability in object oriented language. encourages modular design and software reuse. KEYWORDS: Object Oriented Object oriented Language make the promises of programming Language, Inheritance, reduced maintainance,code reusability, Software reuse and maintainability. improved reliability and flexibility and easier maintenance through better data encapsulation. I. INTRODUCTION To achieve these gains, object oriented Object-Oriented Programming (OOP) is the term language introduce the concepts of objects, used to describe a programming approach based classes, data abstraction and encapsulation, on objects and classes. The object-oriented inheritance and polymorphism. paradigm allows us to organise software as a collection of objects that consist of both data and In object oriented Language, the objects are behaviour. This is in contrast to conventional well-defined data structures coupled with a set functional programming practice that only of operations, these operations are called loosely connects data and behaviour. behavior and are only visible part of an object The object-oriented programming approach and only these operations can manipulate the encourages: objects.
    [Show full text]
  • Achieving Quality Requirements with Reused Software Components
    Achieving Quality Requirements with Reused Software Components: Challenges to Successful Reuse Second International Workshop on Models and Processes for the Evaluation of off-the-shelf Components (MPEC’05) 21 May 2005 Donald Firesmith Software Engineering Institute Carnegie Mellon University Pittsburgh, PA 15213 [email protected] 1 2005 Software Engineering Institute Topics • Introduction • Reusing Software • Quality Models and Requirements • Risks and Risk Mitigation • Conclusion 2 2005 Software Engineering Institute Introduction 1 • When reusing components, many well known problems exist regarding achieving functional requirements. • Reusing components is an architectural decision as well as a management decision. • Architectures are more about achieving quality requirements than achieving functional requirements. • If specified at all, quality requirements tend to be specified as very high level goals rather than as feasible requirements. For example: • “The system shall be secure.” 3 2005 Software Engineering Institute Introduction 2 • Actual quality requirements (as opposed to goals) are often less negotiable than functional requirements. • Quality requirements are much harder to verify. • Quality requirement achievability and tradeoffs is one of top 10 risks with software-intensive systems of systems. (Boehm et al. 2004) • How can you learn what quality requirements were originally used to build a reusable component? • What should architects know and do? 4 2005 Software Engineering Institute Reusing Software • Scope of Reuse • Types of Reusable Software • Characteristics of Reusable Software 5 2005 Software Engineering Institute Scope of Reuse • Our subject is the development of software- intensive systems that incorporate some reused component containing or consisting of software. • We are not talking about developing software for reuse in such systems (i.e., this is not a ‘design for reuse’ discussion).
    [Show full text]
  • SOFTWARE SECURITY PATTERNS in SECURITY ENGINEERING Rshma Chawla* Dr
    IJRIM Volume 2, Issue 2 (February 2012) (ISSN 2231-4334) SOFTWARE SECURITY PATTERNS IN SECURITY ENGINEERING Rshma Chawla* Dr. Naveeta Mehta** ABSTRACT Secure software is the essential need of the time. Security has become a important feature of any software system. Security issues are always the secondary task for the developers in SDLC process. But in accordance with current scenario security should be given the highest priority in all phases of SDLC. The major problem faced by developers is unavailability of information about recent attacks and the way to cure them. This paper elaborates Security patterns as reusable solutions for security related problems in SDLC as even till date security comes after the development phase. Why it is not included in each phase? Better quality specification can be produce at lower cost by indulging generalization, reusable security requirements. Keywords: Security, Patterns, Design Patterns, Security Patterns. *Lecturer, M.M. Institute of Computer Technology & Business Management, M.M. University, Mullana, India. **Associate Professor, M.M. Institute of Computer Technology & Business Management, M.M. University, Mullana, India. International Journal of Research in IT & Management 327 http://www.mairec.org IJRIM Volume 2, Issue 2 (February 2012) (ISSN 2231-4334) 1.0 INTRODUCTION Faults created by software developers open defect in applications and expose their vulnerabilities. Give security expertise to software developers is difficult .In SDLC security expertise is one frequent missing quality that needs to be addressed strongly, by taking advantage of the scaling effect of security patterns[1]. Security patterns capture security experts’ knowledge for a given security problem. Therefore security patterns are developed by security experts and are used by as software developers.
    [Show full text]