University of Massachusetts Medical School eScholarship@UMMS

University of Massachusetts and New England Area Librarian e-Science Symposium 2011 e-Science Symposium

Apr 6th, 12:00 AM - 11:15 AM

Panel Discussion presentation: "DataONE: Facilitating eScience through Collaboration"

Suzie Allard

Follow this and additional works at: https://escholarship.umassmed.edu/escience_symposium

Part of the Library and Information Science Commons

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 License.

Repository Citation Allard, S. (2011). Panel Discussion presentation: "DataONE: Facilitating eScience through Collaboration". University of Massachusetts and New England Area Librarian e-Science Symposium. https://doi.org/ 10.13028/9dck-z235. Retrieved from https://escholarship.umassmed.edu/escience_symposium/2011/ program/9

Creative Commons License

This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 License. This material is brought to you by eScholarship@UMMS. It has been accepted for inclusion in University of Massachusetts and New England Area Librarian e-Science Symposium by an authorized administrator of eScholarship@UMMS. For more information, please contact [email protected]. DataONE: Facilitating eScience through Collaboration

Suzie Allard, Ph.D. University of Tennessee

Presented at the Third University of Massachusetts & New England Area Librarian eScience Symposium 6 April 2011 A prominent feature of e-Science is the generation of immense data sets that can be rapidly disseminated to other researchers via the internet.

-- Donna Kafel, MLIS Project Coordinator for the New England e-Science Portal for Librarians “e-Science and Its Relevance for Research Libraries” Environmental Science Challenges

3 Vision:

Provide universal access to data about life on earth and the environment that sustains it.

4 DataONE team and funders

• Amber Budden, Roger Dahl, Rebecca Koskela, • Ewa Deelman Bill Michener, Robert Nahf, Mark Servilla • Dave Vieglais • Peter Honeyman

• Suzie Allard, Carol Tenopir, Maribeth Manoff, • Jeff Horsburgh Robert Waltz, Bruce Wilson • John Cobb, Bob Cook, Giri Palanismy, • Robert Sandusky Line Pouchard • Patricia Cruse, John Kunze • Bertram Ludaescher

• Mike Frame, Viv Hutchison, Jeff Morisette, • Peter Buneman Jake Weltzin, Lisa Zolly • Chad Berkley, Stephanie Hampton, • Cliff Duke Matt Jones • Paul Allen, Rick Bonney, Steve Kelling • Carole Goble

• Ryan Scherle, Todd Vision • Donald Hobern

• Randy Butler • David DeRoure

LEON LEVY FOUNDATION 5 Mission

Support science by: (1) engaging the relevant science, data, and policy communities; (2) providing easy, secure, and persistent storage of data; and (3) disseminating integrated and user- friendly tools for data discovery, analysis, visualization, and decision- making.

6 DataONE structure

External Advisory Board NSF Principal Investigator Leadership Team DataNet William Michener Partners Executive Director Rebecca Koskela

Director Development & Operations DataONE Director Community Engagement & Outreach Dave Vieglais Office Amber Budden

R&D Operations &D Operations DUG

Coordinating Nodes Education and Core CI Team Outreach Team Member Nodes

Infrastructure and Research Working Groups Engagement Working Groups Federated security Sociocultural barriers to data sharing and preservation Distributed storage Community engagement and education Data preservation, metadata, and interoperability Citizen science and public outreach Scientific workflows Long-term sustainability and governance and semantics

Exploration, Visualization, Analysis Exploration, Visualization, Analysis Usability and assessment Usability and assessment 7 Data Life Cycle

Collect

Analyze Assure

Integrate Education Describe

Discover Deposit

Preserve 1.Build on existing cyberinfrastructure

2. Create new cyberinfrastructure

3.Support new communities of practice

9 Infrastructure Components

Three major components Member Nodes for a flexible, scalable, • diverse institutions Coordinating Nodes sustainable network • serve local community •retain complete • Investigatorprovide resources Toolkit for metadata catalog managing their data •indexing for search • retain copies of data •network-wide services •ensure content availability (preservation) •replication services

10 Candidate Nodes Around the World

11 Community Engagement Approach

• Engagement • Working Groups • All Hands’ Meetings • External Advisory Board • DataONE Users Group • Communication • Newsletter • Web presence • Training • Outreach

2009 First All Hands’ Meeting 12 Stakeholder Network Administration Research Faculty

Students Teaching Faculty

Libraries Librarians Publishers Libraries Librarians Professional Private Societies Academia Citizen-scientists Industry Citizen-activists Foundations Libraries Librarians Social Svcs Scientists K-12 Teachers Librarians Non-profit Community Think Tank - Informal Educators Advocacy Curriculum Builders Govt Policymaker Local s Federal Administrators Policymaker State Advisory s Judiciary Policymaker Committee s Decision Makers Administrators Administrators Advisory Advisory Libraries Librarians Committee Committee Decision Makers Decision Makers

Libraries Librarians Libraries Librarians 13 Assessment-stakeholders

Public Officials Publishers Data Managers

Scientists Libraries & Librarians Citizen- Students scientists & Teachers Community DataONE User Scenarios

Project Planning Research Activity Publication data discovery data mgmt visualization and access planning data upload data visualization online training best practices data download for citation and integration tool discovery Scientist data upload

Project Planning Research Activity Publication data discovery tool document session and access discovery activity spatial data archive/cite final referencing report develop range visualizations

Librarian Govt advisory committees Diplomats Intergovernmental organizations Academic scientists The Cloverleaf Local, state, federal governments Administrators/program managers Non governmental organizations Stewardship Govt advisory committee members Commercial Sector Policy Commercial developers Model of Academe Legal regulation Community Attorneys/judges Legal compliance Legislators DataONE Product s Stakeholders

Observation networks Intergovernmental organizations Secondary: Publishers Local, state, federal data Academe governments Software companies Non governmental organizations application Repositories Commercial sector Archives Academia Museums Visualization Primary: Primary: Curation Experimentation Discovery Long-term Preservation Observation of new Access data & knowledge Curators Scientists Administrators Citizen scientists IT (technician) personnel Industrial researchers Secondary: Software developers Data managers Education Archivists Librarians

Individuals Organizations K-12 libraries Education Research libraries Curriculum K-12 teachers US Dept of Education Teacher Training K-12 librarians Administrators – higher education College faculty/lecturers Accrediting organizations Learning College/research librarians Textbook companies College students Boards of education K-12 students DataONEpedia

www.dataone.org/dataonepedia Editors: Cook, Michener Contributors: Best practices workshop participants 1717 Engaging Citizens in Science

www.CitizenScience.org

18 The Information Puzzle

• Discover needs of all stakeholders • Help improve data lifecycle functions & processes • Educate next generation of data specialists • Test approaches, systems, & progress • Tackle Socio-cultural issues

19 Linking Education and Science to Develop the Next Generation of Educators for Science Librarians and Data, Information and Communication Specialists

20 Digital Curation Education in Research Centers

[Photo credits: Copyright University Corporation for Atmospheric Research] 21 Thank You!

Suzie Allard [email protected]

22 Extra slides Education Data Curation Education in Research Centers

Data Curation Doctoral Program Science Data Curation Master’s Speciality Graduate School of Library and Information Science School of Information Sciences University of Illinois at Urbana-Champaign University of Tennessee, Knoxville

Selected Seminars & Courses Representative Research Areas Selected Courses Practice and research

Research Problems in Data Cross-disciplinary data sharing Research Problems in Data Cross-disciplinary data sharing Curation and reuse Curation and reuse Ontologies in the Sciences Data units, identity, and Information Organization and Data management practices of Representation scientists Information Transfer & Collection and integration of Sources and Services in Science Participation in research teams Collaboration in Science data for scientific problem- & Engineering solving Metadata in Theory & Practices Data and literature dynamics Bioinformatics Shadowing scientists in laboratory environments Foundations of Data Curation Culture and practice of data Practicum and Research Culture and practice of data communities Participation communities

24