Generalist Repository Comparison Chart


This chart is designed to assist researchers in finding a generalist repository should no domain repository be available to preserve their research data. Generalist repositories accept data regardless of data type, format, content, or disciplinary focus. For this chart, we included a repository available to all researchers specific to clinical trials (Vivli) to bring awareness to those in this field.

doi: 10.5281/zenodo.3946720
https://fairsharing.org/collection/GeneralRepositoryComparison

Brief Description

Harvard Dataverse: A free data repository open to all researchers from any discipline, both inside and outside of the Harvard community, where you can share, archive, cite, access, and explore research data.

Dryad: Open-source, community-led data curation, publishing, and preservation platform for CC0 publicly available research data. Dryad is an independent non-profit that works directly with:
• researchers to publish datasets utilizing best practices for discovery and reuse
• publishers to support the integration of data availability statements and data citations into their workflows
• institutions to enable scalable campus support for research data management best practices at low cost

Figshare: A free, open access data repository where users can make all outputs of their research available in a discoverable, reusable, and citable manner. Users can upload files of any type and are able to share diverse research products including datasets, code, multimedia files, workflows, posters, presentations, and more. With discoverable metadata supporting FAIR principles, file visualizations, and integrations, researchers can make their work more impactful and move research further faster.

Mendeley Data: A free repository specialized for research data. Search more than 20 million datasets indexed from thousands of data repositories, and collect and share datasets with the research community following the FAIR data principles.

OSF: A free and open source project management tool that supports researchers throughout their entire project lifecycle in open science best practices.

Vivli: An independent, non-profit organization that has developed a global data-sharing and analytics platform. Its focus is on sharing individual participant-level data from completed clinical trials to serve the international research community.

Zenodo: Powering Open Science, built on Open Source. Built by researchers for researchers. Run from the CERN data centre, whose purpose is long-term preservation for the High Energy Physics discipline, one of the largest scientific datasets in the world.

Size limits

Harvard Dataverse: No byte size limit per dataset; Harvard Dataverse currently sets a file size limit of 2.5GB.
Dryad: 300GB/dataset.
Figshare: Soft limit of 20GB/file for free accounts; system limit of 5,000GB/file. Unlimited storage of public data, but 20GB of storage for private data on free accounts. Email [email protected] to have upload and storage limits raised.
Mendeley Data: 10GB per dataset.
OSF: Projects currently have no storage limit. There is a 5GB/file upload limit for native OSF Storage. There is no limit imposed by OSF on the amount of storage used across add-ons connected to a given project.
Vivli: If more than 10GB of data per study, reach out to us.
Zenodo: 50GB per dataset; contact us via https://zenodo.org/support for higher limits.
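Limits like these are easy to misread when scattered across a chart. As a purely illustrative sketch (not an official API of any repository; the figures are simply transcribed from the per-file limits above, per-dataset and per-file limits are conflated for simplicity, Vivli is omitted because its limit is handled by arrangement, and the helper name is ours), the chart's numbers can be encoded and filtered:

```python
# Illustrative sketch: per-file/per-dataset upload limits (in GB) as listed
# in the chart for free/default accounts. These numbers are transcribed from
# the chart, not queried from any repository API.
FILE_LIMIT_GB = {
    "Harvard Dataverse": 2.5,  # default file size limit
    "Dryad": 300,              # 300GB/dataset (treated per upload here)
    "Figshare": 20,            # soft per-file limit for free accounts
    "Mendeley Data": 10,       # 10GB per dataset
    "OSF": 5,                  # 5GB/file on native OSF Storage
    "Zenodo": 50,              # 50GB per dataset; higher on request
}

def repositories_for(file_gb):
    """Return repositories whose listed limit accommodates a file of file_gb GB."""
    return sorted(name for name, limit in FILE_LIMIT_GB.items() if file_gb <= limit)

print(repositories_for(15))  # a 15GB file fits Dryad, Figshare, and Zenodo
```

Raising a limit usually means contacting the repository (e.g. Zenodo via its support page), so a lookup like this is only a first-pass filter.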
Storage space per researcher

Harvard Dataverse: 1 TB per researcher.
Dryad: No limit.
Figshare: No limit.
Mendeley Data: No limit.
OSF: No limit.
Vivli: No limit.
Zenodo: No limit.

Persistent, Unique Identifier Support

Harvard Dataverse: DOI, Handle.
Dryad: DOI.
Figshare: DOI.
Mendeley Data: DOI.
OSF: DOI.
Vivli: DOI.
Zenodo: DOI.

Licensing Options

Harvard Dataverse: By default, datasets are published in the public domain (CC0). Depositors can change this to apply their own licenses.
Dryad: CC0.
Figshare: Default licenses supported: CC0 1.0, CC BY 4.0, MIT, Apache 2.0, GPL v3, GPL v2.
Mendeley Data: Default licenses supported: CC0 1.0, CC BY 4.0, CC BY NC 3.0, MIT, Apache 2.0, BSD 3-Clause, BSD 2-Clause, GPL v3, GPL v2, LGPL, MPL-2.0, CeCILL, CeCILL-B, CERN OHL, TAPR OHL.
OSF: The following 14 licenses are available: No License (a copyright license for the project authors and contributors), CC0 1.0, CC-By 4.0, MIT, Apache 2.0, BSD 2-Clause, BSD 3-Clause, GPL 3.0, GPL 2.0, Artistic 2.0, Eclipse 1.0, LGPL 3.0, LGPL 2.1, Mozilla 2.0; plus Other, where the user defines a license in a .txt file and uploads it to the project (not available on registrations or collections).
Vivli: https://vivli.org/resources/vivli-data-use-agreement/
Zenodo: Content is available publicly under any one of 400 open licences (from opendefinition.org and spdx.org). Restricted and Closed content is also supported.

Costs to the researcher

Harvard Dataverse: Free for all researchers worldwide (up to 1 TB).
Dryad: Costs covered by institutional, publisher, and funder members; otherwise a one-time fee of $120 for authors to cover the cost of curation and preservation.
Figshare: Free.
Mendeley Data: Free.
OSF: Free.
Vivli: https://vivli.org/about/share-data/ All fees to share, store, and access COVID-19 clinical research are waived.
Zenodo: Free up to 50GB per dataset; larger datasets possible by arrangement.

Equitable free and ongoing access to data

Harvard Dataverse: Use of CC0 and free access highly encouraged. Support for broad indexing and long-term preservation strategies.
Dryad: CC0, publicly available, broadly indexed research data with long-term preservation in a CoreTrustSeal-certified repository.
Figshare: Metadata is CC0. All files and metadata can be accessed from figshare.com. Figshare has also partnered with DuraSpace and Chronopolis to offer further assurances that public data will be archived under the stewardship of Chronopolis. In the highly unlikely event of multiple AWS S3 failures, Figshare can restore public user content from Chronopolis.
Mendeley Data: Metadata is licensed CC0. Datasets are and will continue to be free access. Long-term access in the event of cessation of operations is granted by DANS. Access to archived datasets will be provided for free in perpetuity.
OSF: Open, public API to support broad indexing; partnership with Internet Archive for long-term preservation; $250k preservation fund.
Vivli: Managed Access as Human Subject Data. No-charge period for data that is accessible only within a research environment; costs apply after the no-charge time period ends. https://vivli.org/resources/vivli-secure-research-environment-2/
Zenodo: Metadata is licensed CC0. Content is kept both online on disk and offline on tape as part of the long-term preservation policy.
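All seven repositories issue DOIs as persistent identifiers (Harvard Dataverse also supports Handles), and a DOI is resolved through the doi.org proxy. A minimal sketch of normalizing a citation-style DOI into a resolvable URL (the helper name is ours):

```python
def doi_to_url(doi):
    """Turn a DOI such as '10.5281/zenodo.3946720' into a resolvable URL
    via the doi.org proxy, stripping common prefixes first."""
    doi = doi.strip()
    for prefix in ("https://doi.org/", "http://doi.org/", "doi:"):
        if doi.lower().startswith(prefix):
            doi = doi[len(prefix):]
    return "https://doi.org/" + doi

print(doi_to_url("doi:10.5281/zenodo.3946720"))
# https://doi.org/10.5281/zenodo.3946720
```

This is why a bare DOI in a data availability statement is enough: the proxy redirects to whichever repository currently hosts the dataset, even if the landing page moves.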
Version Support

Harvard Dataverse: Yes, including version comparison and W3C provenance support.
Dryad: Yes, versioning is supported.
Figshare: Yes.
Mendeley Data: Yes, including version comparison, to easily see what has changed from version to version.
OSF: Yes for OSF Storage, and for integrated storage providers when supported.
Vivli: Coming soon.
Zenodo: Yes, with a "Concept" DOI to represent "all versions".

Characteristics - Standards Supported

Metadata Schemas

Harvard Dataverse: Dublin Core, Data Documentation Initiative (DDI Codebook), DataCite, OpenAIRE, Schema.org, OAI_ORE.
Dryad: DataCite, schema.org.
Figshare: Dublin Core (oai_dc), Datacite (oai_datacite), RDF (rdf), CERIF XML (cerif), Qualified Dublin Core (qdc) (hasPart support), Metadata Encoding and Transmission Standard (mets), and UKETD_DC (uketd_dc).
Mendeley Data: Dublin Core, DataCite, Schema.org.
OSF: DataCite, Crossref (preprints).
Vivli: DataCite.
Zenodo: DataCite.

Formats Supported for Export

Harvard Dataverse: JSON, XML.
Dryad: Fully documented API available for direct integration. Exports available in JSON, XML, schema.org.
Figshare: Dublin Core (oai_dc), Datacite (oai_datacite), RDF (rdf), CERIF XML (cerif), Qualified Dublin Core (qdc) (hasPart support), Metadata Encoding and Transmission Standard (mets), and UKETD_DC (uketd_dc).
Mendeley Data: JSON, XML.
OSF: JSON, API, JSONAPI.
Vivli: Not listed.
Zenodo: DataCite, Dublin Core, DCAT-AP, JSON, JSON-LD, GeoJSON, MARCXML, Citation Style Language JSON, plus support for custom metadata formats.

Supported Community Vocabulary or Taxonomy

Harvard Dataverse: ISO 3166-1 CV for geospatial metadata, ISO 639-1 CV for languages, DataCite's dataset contributor vocab, and subsets of the OBI Ontology and NCBI Taxonomy for Organisms.
Dryad: ORCID - Authors; ROR - Institutions.
Figshare: ORCID - Authors; Dimensions - Funding; Field of Research codes; GRID ids; implementing ROR and Make Data Count; PLOS Thesaurus - Keywords.
Mendeley Data: Omniscience taxonomy for subject categories and keywords; additional custom vocabularies can be loaded on the institutional version.
OSF: BePress 3-tier taxonomy for preprints, and custom taxonomy mapped to Bepress on preprints.
Vivli: Cochrane Linked Data Vocabulary; Projects Database - Funding.
Zenodo: ORCID - Authors; FundRef + OpenAIRE; Open Funder Registry - Funding.

Characteristics Useful for Linking data to other relevant digital information

Support for creators/authors identifiers

Harvard Dataverse: ORCID, ISNI, LCNA, VIAF, GND, DAI, ResearcherID, and ScopusID.
Dryad: ORCID required for corresponding author; co-author ORCID supported.
Figshare: ORCID.
Mendeley Data: ORCID, Mendeley ID, Scopus Author ID.
OSF: ORCID, ResearcherID.
Vivli: ORCID.
Zenodo: ORCID.

Support linking to related publications

Harvard Dataverse: Yes.
Dryad: DataCite relation types.
Figshare: DataCite relation types.
Mendeley Data: Yes, DataCite and Scholix relation types.
OSF: Preprints link to peer-reviewed publications using Crossref "isPreprint of".
Vivli: Yes.
Zenodo: DataCite relation types.

Linking of derived products from another

Harvard Dataverse: Yes.
Dryad: DataCite relation types.
Figshare: DataCite relation types.
Mendeley Data: Yes, DataCite and Scholix relation types.
OSF: No.
Vivli: Not listed.
Zenodo: DataCite relation types.

Grant ID(s)

Harvard Dataverse: Yes.
Dryad: Yes.
Figshare: Yes, hard-linked to the Dimensions.ai database.
Mendeley Data: Yes, customizable on the institutional version.
OSF: No.
Vivli: Yes.

Grant ID
Harvard Dataverse: Yes.
Dryad: Open Funder Registry.
Figshare: Yes, hard-linked to
Mendeley Data: Yes,
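Several of the export formats above (oai_dc, oai_datacite, mets, uketd_dc) are OAI-PMH metadata prefixes, meaning records can be harvested with a standard GetRecord request rather than a repository-specific API. A sketch of building such a request URL; the endpoint and record identifier below are placeholders of ours, not values from the chart:

```python
from urllib.parse import urlencode

def oai_get_record_url(base_url, identifier, metadata_prefix="oai_dc"):
    """Build an OAI-PMH GetRecord request URL (protocol v2.0 query parameters)."""
    query = urlencode({
        "verb": "GetRecord",
        "identifier": identifier,
        "metadataPrefix": metadata_prefix,
    })
    return f"{base_url}?{query}"

# Hypothetical endpoint and record identifier, for illustration only:
url = oai_get_record_url("https://repository.example.org/oai",
                         "oai:example.org:record/123",
                         metadata_prefix="oai_datacite")
print(url)
```

The same record can be harvested in each advertised prefix just by changing metadataPrefix, which is what makes these repositories easy to index broadly.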