Rosetta Update: Roadmap, community and more

Daniel Greenberg | Rosetta Product Manager

© 2021 Ex Libris | Confidential & Proprietary

Agenda

• Recently introduced

• Roadmap

• Community Updates

V7.0 Highlights

Preservation

• Events for DNX Validation

• Migration reports

• Migration error management

• Preservation planning robustness

User Experience

• Simplified system setup

• WCAG 2.1

Delivery

• IIIF Support for non-image files

Infrastructure

• Large scale env

• Reporting enhancement

• 3rd party upgrades

Data Management

• Institution-level publishing

• Enhanced S3 storage management

• Apply Storage Extension Limitations

Integrations

• REST – Deposit and SIP Processing

v7.0 Delivery

IIIF Support for non-image files

• Upgrade to IIIF Presentation API v3, to support a wider range of formats – PDF, audio/visual

• Upgrade to the equivalent Universal Viewer version

• Contributed a fix to the Universal Viewer open-source project

v7.0 Infrastructure

Performance for large scale archives

Unique local test environment setup at Ex Libris to simulate repositories of over 1bn files of diverse types and structures.

• Main issues identified and scheduled for 2021 releases

Reporting Improvements

Reporting integrity improved by flattening selected DNX values in the DB.

• New "Property Flattening" Configuration to select DNX properties to extract and save separately in the DB.

Monitor Oracle tablespace critical issues

System check to alert on Oracle tablespace critical utilization

• New bundled startup check plugin StartUp-OracleTablespaceUtilizationChecker

• Warning and error thresholds of 5% and 1%
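To illustrate how a two-level threshold check of this kind can behave, here is a minimal Python sketch. It is not the plugin's code: it assumes the 5% and 1% thresholds refer to remaining free space in the tablespace and that the boundaries are inclusive, and the names `Status`, `free_percent` and `classify_free_space` are hypothetical.

```python
from enum import Enum


class Status(Enum):
    OK = "ok"
    WARNING = "warning"
    ERROR = "error"


def free_percent(used_bytes: int, total_bytes: int) -> float:
    """Percentage of the tablespace that is still free."""
    if total_bytes <= 0:
        raise ValueError("total_bytes must be positive")
    return 100.0 * (total_bytes - used_bytes) / total_bytes


def classify_free_space(free_pct: float,
                        warn_at: float = 5.0,
                        error_at: float = 1.0) -> Status:
    """Map free space to a status (assumption: thresholds are inclusive)."""
    if free_pct <= error_at:
        return Status.ERROR
    if free_pct <= warn_at:
        return Status.WARNING
    return Status.OK


if __name__ == "__main__":
    # Hypothetical sizes, for illustration only.
    for used, total in [(50, 100), (96, 100), (99, 100)]:
        pct = free_percent(used, total)
        print(f"{pct:.1f}% free -> {classify_free_space(pct).value}")
```

Checking the error threshold first keeps the two levels mutually exclusive, so a nearly full tablespace is reported once, at the more severe level.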

v7.0 Preservation

Preservation Planning Robustness

• Significantly improved processing of mass file migration as part of the preservation planning flow

• External migration: Export only files to be migrated

Migration Improvements

• Report on migrated IEs: Row action "View Report" added per Execution

• Display informative error message received from migration tool

• "View Log" for system errors

User Experience

Simplified Material Flow Setup

Simplified setup of material flows:

• Quick setup of sub-components

• Branching out and back from the material flow setup to add / remove / edit components of the material flow

Accessibility - WCAG

• All native viewers (General IE, representation and viewers) are fully accessible and comply with the latest WCAG 2.1 standard

• Deposit page fully accessible

• Multiple back-end pages

• 3rd party consultancy

Data Management

Institution-level Publishing

Institution-level publishing schedule:

• Full transparency and control over scheduled and manual runs.

• Centralized IE and collection publishing

Enhanced S3 storage management

Storage plugin upgraded to use AWS SDK version 2:

• Includes validation of MD5 checksum value retrieved from S3 storage against preserved value

• Ongoing improvements to remote storage integration
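The checksum comparison mentioned above can be illustrated with a short Python sketch. It is not the storage plugin's implementation and does not use the AWS SDK: it only shows how an MD5 value can be computed over a stream and compared with a preserved value. The function names `md5_hex` and `checksum_matches` are hypothetical, and both values are assumed to be hex strings.

```python
import hashlib
import io
from typing import BinaryIO


def md5_hex(stream: BinaryIO, chunk_size: int = 1 << 20) -> str:
    """Compute the MD5 digest of a stream, reading it in chunks."""
    digest = hashlib.md5()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def checksum_matches(retrieved: str, preserved: str) -> bool:
    """Compare two hex checksums, ignoring case and surrounding whitespace."""
    return retrieved.strip().lower() == preserved.strip().lower()


if __name__ == "__main__":
    # Stand-in for file content read back from remote storage.
    content = io.BytesIO(b"hello")
    preserved_value = "5D41402ABC4B2A76B9719D911017C592"
    print(checksum_matches(md5_hex(content), preserved_value))
```

Reading in chunks keeps memory use flat for large files, which matters when the same check runs across a whole repository.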

Apply Storage Extension Limitations

• Define, via regular expressions, the extensions that are not supported in the storage

• Files whose extension matches the list will be stored with the 'unknown' extension

Extension Handling examples:
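As one illustration, the rule can be sketched in Python. This is an assumption-laden sketch rather than Rosetta's implementation: the extension is taken as the text after the last dot (consistent with the examples on this slide), patterns are matched against the whole extension, and a file with no extension is left without one. The names `extract_extension` and `storage_extension` and the sample patterns are hypothetical.

```python
import re
from typing import Iterable, Optional


def extract_extension(full_name: str) -> Optional[str]:
    """Return the text after the last dot, or None if there is none."""
    _, dot, ext = full_name.rpartition(".")
    return ext if dot and ext else None


def storage_extension(full_name: str,
                      unsupported: Iterable[str]) -> Optional[str]:
    """Extension to use in storage; 'unknown' if a pattern matches."""
    ext = extract_extension(full_name)
    if ext is None:
        return None  # assumption: nothing to replace when no extension exists
    if any(re.fullmatch(pattern, ext) for pattern in unsupported):
        return "unknown"
    return ext


if __name__ == "__main__":
    patterns = [r"exe|bat", r"tmp\d*"]  # hypothetical unsupported extensions
    for name in ["abc.efg", "abc.efg.hij", "abc.efg.", "run.exe", "x.tmp12"]:
        print(name, "->", extract_extension(name), "/",
              storage_extension(name, patterns))
```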

Full Name      Extension
abc.efg        efg
abc.efg.hij    hij
abc.efg.       (null)

Integrations

REST APIs – Deposit and SIP Processing

• Initial drop of REST APIs to provide web services for Deposit and SIP processing functionality.

• To access the WADL on your server, go to: http://:/rest
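A WADL document lists the resources and HTTP methods a REST service exposes, so it can be inspected programmatically. The Python sketch below assumes the `/rest` address returns standard WADL XML (namespace `http://wadl.dev.java.net/2009/02`). The host, port, sample document and resource paths are placeholders invented for illustration, not actual Rosetta endpoints, and the helper names are hypothetical.

```python
import urllib.request
import xml.etree.ElementTree as ET

WADL_NS = {"wadl": "http://wadl.dev.java.net/2009/02"}

# Hypothetical WADL content, used only to demonstrate the parsing.
SAMPLE_WADL = """
<application xmlns="http://wadl.dev.java.net/2009/02">
  <resources base="http://example.org:8080/rest/">
    <resource path="example-resource">
      <method name="GET"/>
      <resource path="{id}">
        <method name="GET"/>
      </resource>
    </resource>
  </resources>
</application>
"""


def wadl_url(host: str, port: int) -> str:
    """Build the WADL address from a server's host and port."""
    return f"http://{host}:{port}/rest"


def fetch_wadl(host: str, port: int) -> str:
    """Download the WADL document (requires a reachable server)."""
    with urllib.request.urlopen(wadl_url(host, port)) as response:
        return response.read().decode("utf-8")


def list_resources(wadl_xml: str) -> list[tuple[str, str]]:
    """Return (HTTP method, full path) pairs described by a WADL document."""
    found: list[tuple[str, str]] = []

    def walk(node: ET.Element, prefix: str) -> None:
        for res in node.findall("wadl:resource", WADL_NS):
            path = prefix.rstrip("/") + "/" + res.get("path", "").lstrip("/")
            for method in res.findall("wadl:method", WADL_NS):
                found.append((method.get("name", ""), path))
            walk(res, path)  # nested resources extend the parent path

    root = ET.fromstring(wadl_xml)
    for resources in root.findall("wadl:resources", WADL_NS):
        walk(resources, resources.get("base", ""))
    return found


if __name__ == "__main__":
    for method, path in list_resources(SAMPLE_WADL):
        print(method, path)
```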

v7.1 and Beyond

v7.1 Preservation

• JHOVE error IDs

  • JHOVE 1.22 introduced error IDs. These will be:

    1. Stored in Rosetta DNX
    2. Added to events for ignoring errors
    3. Fully indexed in the back end

• Save SIP processing events as provenance

  • SIP processing stages should be considered as provenance and therefore saved in the AIP:

    1. Completed processing stages will be recorded as provenance events in underlying IEs
    2. Improve usability of events administration: better documentation and cleanup of unused events

• Expose linkingIEIdentifier for editing

v7.1 User Experience

• Customizable dashboard

  • Allow individual users to replace dashboard reports with a personalized set of reports

• Simplified system setup – continued simplification of navigation between material flow components

v7.1 Infrastructure

• Upgrade to latest 15

• Red Hat 8.X certification

• Preservation robustness – continued streamlining of mass files migration

• Custom TLS certificates for SAML – certificates to be agnostic to Rosetta version

• Huge volume stability – continued testing and remediation based on setup simulating over 1bn files

• WCAG 2.1 – Working towards full adherence to latest accessibility standard

v7.1 Data Management

• Filter SIPs by error

  • Facilitate mass SIP processing management by filtering all SIPs in technical analyst workbench by error and running bulk actions by error

• Automated BagIt ingest

  • Submission jobs and API ingests to support BagIt structure

• REST APIs

  • Delivery: Return JSON METS object to a Delivery HTTP request

  • Expose existing REST APIs via Developers Network framework

  • Provide example for external viewer integrated via REST
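For readers unfamiliar with the BagIt structure mentioned above, the following Python sketch checks a minimal bag as described in the BagIt specification (RFC 8493): a `bagit.txt` declaration, a `data/` payload directory, and a payload manifest of checksum and path pairs. It is an illustrative check only, not the planned Rosetta ingest logic; `validate_bag` is a hypothetical name, and the sketch verifies only file presence and manifest checksums.

```python
import hashlib
from pathlib import Path


def validate_bag(bag_dir: Path, algorithm: str = "md5") -> list[str]:
    """Return a list of problems; an empty list means these checks passed."""
    problems: list[str] = []
    if not (bag_dir / "bagit.txt").is_file():
        problems.append("missing bagit.txt")
    if not (bag_dir / "data").is_dir():
        problems.append("missing data/ directory")

    manifest = bag_dir / f"manifest-{algorithm}.txt"
    if not manifest.is_file():
        problems.append(f"missing {manifest.name}")
        return problems

    # Each manifest line holds a checksum followed by a payload path.
    for line in manifest.read_text(encoding="utf-8").splitlines():
        if not line.strip():
            continue
        parts = line.split(maxsplit=1)
        if len(parts) != 2:
            problems.append(f"malformed manifest line: {line!r}")
            continue
        expected, rel_path = parts
        payload = bag_dir / rel_path
        if not payload.is_file():
            problems.append(f"listed but missing: {rel_path}")
            continue
        actual = hashlib.new(algorithm, payload.read_bytes()).hexdigest()
        if actual != expected.lower():
            problems.append(f"checksum mismatch: {rel_path}")
    return problems
```

A call such as `validate_bag(Path("/path/to/bag"))` (placeholder path) returns an empty list for a bag that passes, which makes it easy to gate an ingest step on the result.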

Roadmap Focus Areas 2021 / 2022 and beyond

• REST APIs

• Accessibility

• Huge volume stability

• Simplified flows

• Esploro integration

• Structural IEs

• Expand rendered formats: MS Office, 3D and more

• Validation Stack

• PRONOM and beyond

• Redesigned Viewer

Structural IEs

Preservation of hierarchical relations between IEs

• Expand data dictionary for hierarchical structures

• Ongoing PREMIS 3 compliance

• Collaborative design with Working Groups

Enhanced Indexing

• Access copies to be fully indexed

• Full-text generated and indexed via OCR tool

• Integration of additional OCR tools

Validation Stack Overhaul

Discussions led by working groups on:

• Splitting file validation and metadata extraction to separate tasks

• Rethinking risk identification

• More

PRONOM and beyond

• End game: Support additional format registries beyond PRONOM, to expand coverage and reliability

• Initial phase: Additional format identification tool beyond DROID

• Discussions led by working groups

Enhanced Delivery

• Render Microsoft Office files

• Redesigned and customizable IE / Representation Viewer

• 3D delivery via IIIF v3

• Enrich out-of-the-box (OTB) delivery tools

Reporting Designer

Customer Benefits: Generation of new reports made easier; Multiple display options

• Migrate reporting infrastructure to a new platform, providing:

• Modern and customizable display of reporting results

• Easy method for generating customized reports

Operational data sharing

Anonymous sharing of system usage, allowing:

• Pre-emptive and efficient support

• Intelligent roadmap planning based on actual system usage

Next phase - community portal for analyzing technical data profiles and sharing best practices for

Community Updates

A major religious Institution

Serving more than 250 Institutions from 17 Countries

Institution Types

National & Subject Libraries State Libraries

Archives, Governmental, Public Libraries Academic Institutions & Cultural Heritage

Recent Joiners

National Library of the Netherlands

Coming soon...

Rosetta @ DP Community

WDPD

JHOVE Hack Week

Thank you!

[email protected]
