Institutional Repository Cloud Service: JAIRO Cloud
National Institute of Informatics, JAPAN

7th Japan-China-Korea SciTec Information Joint Seminar, 23 November 2017

National Research and Education Network

• SINET is a Japanese academic backbone network serving more than 800 universities and research institutions and about 3 million users.
• SINET covers 100% of national, 78% of municipal, and 55% of private universities.

Category                                  Organizations  Coverage
National Universities                     86             100%
Municipal Universities                    71             78%
Private Universities                      348            55%
Junior Colleges                           62             18%
Colleges of Technology                    55             97%
Inter-Univ. Labs and Research Institutes  16             100%
Others                                    179
Total                                     817

(As of March 2015)

[Figure: map of SINET nodes with city labels Sapporo, Tokyo, Osaka, and Fukuoka, showing domestic lines (100 Gbps or more) and international lines (100 Gbps and 10 Gbps) to the US, Europe, and Asia.]

21st Century Academic Information Infrastructure: SINET5 for Advancing Collaboration and Promotion in Research and Education

Resource (Federation)
• Promotion of academic information circulation and expansion of institutional repositories

Collaborative (GakuNin Federation)
• Collaborative promotion of authentication between universities

Cloud (GakuNin-Cloud, Direct Connection)
• Dramatic cost reduction and enhancement of the research and education environment through tailored cloud services

Security (Flow Analysis, VPN)
• Network flow analysis and dynamic control
• Raising the security level for SINET users

Network
• Nationwide 100-Gbps backbone network and scalable network expansion
• High-speed direct international lines to USA, Europe, and Asia
• Introduction of new technologies such as SDN in response to user needs

Scholarly Information Infrastructure

Services (record numbers are as of March 2017):
• CiNii Articles (journal articles): metadata and links of Japanese journal articles, 19 M records
• JAIRO (journal articles): metadata and links of Japanese institutional repositories, 2.5 M records
• CiNii Books (catalog information): catalog of materials held by universities; bibliographic info 11 M records, holding information 137 M records
• KAKEN (research information): project reports of MEXT-supported scientific research, 820 K records

Sources and processes shown in the diagram:
• NACSIS-CAT: compilation by university libraries (more than 1,300 libraries)
• Institutional repositories: more than 800 universities and research institutions
• Academic societies, J-Stage (JST), NDL, JSPS, MEXT
• Shared compilation, repositories, digitization, integration, and linkage to other DB services

Discovery Service: CiNii Articles

[Figure: chart for 2005 to 2016 of the number of articles (thousand), split into full text (internal) and meta search, and the number of searches (thousand). Annotations: indexed by Google (2007.4); drastic UI renewal (2009.4). Monthly averages for 2016: search 4.93M, detail view 10.9M, full text DL 4.52M.]

Scholarly Information Infrastructure

Services (record numbers are as of March 2017):
• CiNii Articles (journal articles): metadata and links of Japanese journal articles, 19 M records
• JAIRO (journal articles): metadata and links of Japanese institutional repositories, 2.5 M records
• CiNii Books (catalog information): catalog of materials held by universities; bibliographic info 11 M records, holding information 137 M records
• KAKEN (research information): project reports of MEXT-supported scientific research, 820 K records

Sources and processes shown in the diagram:
• NACSIS-CAT: compilation by university libraries (more than 1,300 libraries)
• Institutional repositories: more than 600 universities and research institutions
• Academic societies, J-Stage (JST), NDL, JSPS, MEXT
• Shared compilation, repositories, digitization, integration, and linkage to other DB services

NII-funded Institutional Repository Program
• NII-IRP (Institutional Repositories Program) http://www.nii.ac.jp/irp/en/
  • Phase 1: FY2005-2007
  • Phase 2: FY2008-2009
  • Phase 3: FY2010-2012
• Three categories of funding
  • Area 1: Support for developing IRs and content creation
  • Area 2: Research and development
  • Area 3: Support for community activities

                        Phase 1            Phase 2      Phase 3
                        2005  2006  2007   2008  2009   2010  2011  2012
Area 1 (Institutions)   19    57    70     68    74     24    31    34
Area 2 (Projects)       -     22    14     21    21     8     8     7
Area 3 (Projects)       -     -     -      -     -      5     4     4

JAIRO Cloud
• Background
  • Limited resources and limited technical knowledge hamper the implementation of IRs, especially at small universities.
  • Since April 2012, JAIRO Cloud has provided a shared instance of an IR system on a virtual server hosted by NII.
• Service Architecture

Number of IRs in Japan

[Figure: stacked bar chart of the number of IRs in Japan over time, broken down into university on-premise systems, JAIRO Cloud (production operation), and JAIRO Cloud (pilot operation). Labels on the chart include "784 IRs" and "811 IRs".]

Portal services of Japanese IRs

• NII harvests almost all Japanese IRs via OAI-PMH
  - JAIRO is a "gyroscope" of IR content. Use it to search all IRs in Japan at once!
  - IRDB Content Analysis shows how contents are growing and gives detailed information on each IR.
  - CiNii is the largest database of articles in Japan. Metadata on journal articles and departmental bulletins goes to CiNii and is linked to the full texts in the IRs.
  - junii2 is a "Dublin Core" application profile for institutional repositories with an OpenURL-compliant schema. It has been adopted by almost all IRs in Japan. http://www.nii.ac.jp/irp/en/archive/pdf/junii2_en_20090213.pdf
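To make the OAI-PMH harvesting step concrete, here is a minimal sketch in Python of how a harvester might build a ListRecords request and read Dublin Core titles from the response. It is an illustration under stated assumptions, not a description of NII's harvester: the base URL, the record identifier, and the sample XML are hypothetical, and the standard `oai_dc` prefix is used because the slides do not give the metadataPrefix under which a repository exposes junii2.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespaces defined by the OAI-PMH 2.0 and Dublin Core specifications.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc", from_date=None):
    """Build an OAI-PMH ListRecords request URL."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if from_date:
        params["from"] = from_date  # selective harvesting by datestamp
    return base_url + "?" + urlencode(params)


def parse_records(xml_text):
    """Return (identifier, title) pairs from a ListRecords response."""
    root = ET.fromstring(xml_text)
    results = []
    for record in root.iterfind(".//oai:record", NS):
        identifier = record.findtext("oai:header/oai:identifier", namespaces=NS)
        title = record.findtext(".//dc:title", namespaces=NS)
        results.append((identifier, title))
    return results


# Hypothetical response, hand-written for illustration only.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:repo.example.ac.jp:00000001</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Sample departmental bulletin paper</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

# The base URL is a placeholder, not a real repository endpoint.
url = list_records_url("https://repo.example.ac.jp/oai", from_date="2017-01-01")
records = parse_records(SAMPLE)
```

A real harvester would also follow the `resumptionToken` that OAI-PMH uses to page through large result sets; that is omitted here to keep the sketch short.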

Content Types Stored in Japanese IRs

Content type                  Items       Share
Journal Article               288,709     14.1%
Thesis or Dissertation        103,478     5.1%
Departmental Bulletin Paper   1,080,358   52.9%
Conference Paper              35,303      1.7%
Presentation                  12,251      0.6%
Book                          32,839      1.6%
Technical Report              43,313      2.1%
Research Paper                63,771      3.1%
Article                       54,470      2.7%
Preprint                      624         0.0%
Learning Material             4,578       0.2%
Data or Dataset               53,736      2.6%
Software                      46          0.0%
Others                        268,744     13.2%
Total                         2,042,220

Source: NII Institutional Repositories DataBase Contents Analysis, http://irdb.nii.ac.jp/analysis/index_e.php

Repository Community
• Digital Repository Federation, since 2006
• JAIRO Cloud Community, since 2012
• Institutional Repository Promotion Committee, since 2013

From 2016
• Japan Consortium for Open Access Repository (JPCOAR)
  • Working Group
    • Training WG
    • JAIRO Cloud Operation WG
    • Promotion WG
  • Task Force
    • Next Generation Metadata Schema TF
    • Research Data TF
    • Open Access Policy and Tracking TF
    • Repository Evaluation TF
    • ORCID TF
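As a quick arithmetic check on the IRDB content-type counts reported earlier in the deck, the following Python sketch recomputes the total and the percentage shares from the per-type counts. The counts are copied from the slide; the variable names are illustrative only.

```python
# Item counts per content type, copied from the IRDB Contents Analysis slide.
counts = {
    "Journal Article": 288_709,
    "Thesis or Dissertation": 103_478,
    "Departmental Bulletin Paper": 1_080_358,
    "Conference Paper": 35_303,
    "Presentation": 12_251,
    "Book": 32_839,
    "Technical Report": 43_313,
    "Research Paper": 63_771,
    "Article": 54_470,
    "Preprint": 624,
    "Learning Material": 4_578,
    "Data or Dataset": 53_736,
    "Software": 46,
    "Others": 268_744,
}
REPORTED_TOTAL = 2_042_220  # total stated on the slide

total = sum(counts.values())
# Share of each content type, in percent, rounded to one decimal place.
shares = {name: round(100 * n / total, 1) for name, n in counts.items()}

# List content types from largest to smallest share.
for name, pct in sorted(shares.items(), key=lambda kv: -kv[1]):
    print(f"{name:28s} {counts[name]:>9,d} {pct:5.1f}%")
```

The recomputed total agrees with the reported total of 2,042,220, and departmental bulletin papers alone account for just over half of all items.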

From Open Access to Open Science

Open Science Report from Japanese Cabinet Office (2015)
http://www8.cao.go.jp/cstp/sonota/openscience/150330_openscience_summary_en.pdf

Framework of the Open Science in Japan
http://www8.cao.go.jp/cstp/sonota/openscience/150330_openscience_summary_en.pdf

National Research Infrastructure for OS

Research Data Infrastructure for Open Science

Discovery Service (DOI, metadata management, metadata aggregation; linked to international metadata aggregators and subject repositories)
• Linking function between articles and data
• Researcher and research project identification and management function
• Data exchange with international discovery services

RDM Platform (research data management system; private, shared, and public areas on hot and cold storage)
• High-speed access using SINET5
• Data sharing function using virtual network and ID federation
• Effective data storage switcher

Publication Platform (research data repository; access control, metadata management, storage area for long-term preservation)
• Data-oriented self-archiving function
• Versioning and auto-packaging function
• User-dependent personal data
• Pseudonym function

Diagram labels: user flow and data flow; data depositor (Exp/Store, Archive); data user (Search/Find, Re-use); journal article, supplemental data, research data.

How to realize our RDM Platform

• Establish collaborative development with the Western projects
• Our Functional Requirements
  • Connection to institutional storage service
  • Institutional level control panel
  • SAML authentication and connection with VO Platform
  • Metadata management functionality
  • Easy deposit function to JAIRO Cloud
  • Mash-up with other scholarly information services in Japan

An advantage of OSF (Open Science Framework) is its "Flexible" and "Extensible" architecture.

Relationship between Research Data Infrastructure and Research Workflow

[Figure: research workflow mapped onto the RDM Platform, the Publication Platform (institutional or domain repository), and the Discovery Service (aggregation).]
• Phase 1: Project start (application), member management, initial setting
• Phase 2: Experiment, data acquisition, data management, data analysis
• Phase 3: Paper writing, deposit with supplemental data

Possible Use Case in Phase 3: Publicizing Journal Paper and Supplemental Data

RDM Platform
1. Manage using version control function
2. Manage reference information using external service (e.g. Mendeley) add-on
3. Manage supplemental data in the paper
4. Manage tables and figures

Submit Paper and Data
5. Share reviewers' comments and prepare the response

Accept: Publication Platform (Institutional Repository, Domain/FA Repository)
6. Deposit paper in publication platform based on FA OA policy
7. Deposit supplemental data in publication platform based on publisher's policy and DMP
8. Validation of metadata by librarians and data curators
9. Assign DOI

Open/Close Control
• Embargo periods configuration
• Metadata only publication
• Freeze and timestamping hidden original data

* The procedure may change depending on OA type and reviewing process.

Deployment Plan
• FY2016
  • Initial Development
  • α Testing with major Universities
• FY2017
  • System Development (RDM Platform, Publication Platform, Discovery Service)
  • Small Scale Feasibility Study (β Testing)
• FY2018
  • Large Scale Feasibility Study
• FY2019
  • Pilot Operation
• FY2020
  • Production Operation
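The open/close controls listed for the publication step (embargo periods, metadata-only publication) can be pictured with a small data model. The Python sketch below is hypothetical: the class, field, and function names are invented for illustration, and the rule that metadata stays visible during an embargo is an assumption, not something the slides state.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class PublicationSettings:
    """Hypothetical per-deposit settings; field names are illustrative."""
    embargo_until: Optional[date] = None  # None means no embargo
    metadata_only: bool = False           # publish the record but not the files


def visible_parts(settings: PublicationSettings, today: date) -> set:
    """Return which parts of a deposit are publicly visible on `today`.

    Assumption: metadata is always public; files become public only when
    no embargo is active and the deposit is not metadata-only.
    """
    parts = {"metadata"}
    under_embargo = (
        settings.embargo_until is not None and today < settings.embargo_until
    )
    if not settings.metadata_only and not under_embargo:
        parts.add("files")
    return parts


# Example: a deposit embargoed until 1 April 2018, checked on two dates.
embargoed = PublicationSettings(embargo_until=date(2018, 4, 1))
before = visible_parts(embargoed, date(2017, 11, 23))
after = visible_parts(embargoed, date(2018, 4, 1))
```

Keeping the decision in one pure function makes the policy easy to test against different OA types, which matters because the procedure is expected to vary with OA type and reviewing process.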
