Rosetta Update
Roadmap, community and more
Daniel Greenberg | Rosetta Product Manager
© 2021 Ex Libris | Confidential & Proprietary
Agenda
• Recently introduced
• Roadmap
• Community Updates
v7.0 Highlights
Preservation
• Events for DNX Validation
• Migration reports
• Migration error management
• Preservation planning robustness
User Experience
• Simplified system setup
• WCAG 2.1
Delivery
• IIIF Support for non-image files
Infrastructure
• Large scale env
• Reporting enhancement
• 3rd party upgrades
Data Management
• Institution-level publishing
• Enhanced S3 storage management
• Apply Storage Extension Limitations
Integrations
• REST APIs – Deposit and SIP Processing
v7.0 Delivery
IIIF Support for non-image files
• Upgrade to IIIF Presentation API v3 to support a wider range of formats: PDF, audio/visual
• Upgrade to the equivalent Universal Viewer version
• Contributed a fix to the Universal Viewer open-source project
v7.0 Infrastructure
Performance for large scale archives
Unique local test environment setup at Ex Libris to simulate repositories of over 1bn files of diverse types and structures.
• Main issues identified and scheduled for 2021 releases
Reporting Improvements
Reporting integrity improved by flattening selected DNX values in the DB.
• New "Property Flattening" Configuration to select DNX properties to extract and save separately in the DB.
Monitor Oracle tablespace critical issues
System check to alert on Oracle tablespace critical utilization
• New bundled startup check plugin StartUp-OracleTablespaceUtilizationChecker
• Warning and error thresholds of 5% and 1%
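A threshold check of this kind might be sketched as follows. This is an illustration only, not the bundled plugin's code. It assumes the 5% and 1% figures refer to the free space remaining in a tablespace (the slide does not say which quantity they measure), and the names `Status`, `classify_tablespace`, `used_bytes` and `max_bytes` are hypothetical.

```python
from enum import Enum


class Status(Enum):
    OK = "ok"
    WARNING = "warning"
    ERROR = "error"


# Assumption: thresholds are percentages of FREE space remaining.
WARNING_FREE_PCT = 5.0
ERROR_FREE_PCT = 1.0


def classify_tablespace(used_bytes: int, max_bytes: int) -> Status:
    """Classify one tablespace by how much of its capacity is still free."""
    if max_bytes <= 0:
        raise ValueError("max_bytes must be positive")
    free_pct = 100.0 * (max_bytes - used_bytes) / max_bytes
    # Check the stricter (error) threshold first.
    if free_pct <= ERROR_FREE_PCT:
        return Status.ERROR
    if free_pct <= WARNING_FREE_PCT:
        return Status.WARNING
    return Status.OK
```

In a real check the used and maximum sizes would come from the database itself; that query is left out here because the slide gives no detail on how the plugin collects them.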
v7.0 Preservation
Preservation Planning Robustness
• Significantly improved processing of mass file migrations as part of the preservation planning flow
• External migration: export only the files to be migrated
Migration Improvements
• Report on migrated IEs: Row action "View Report" added per Execution
• Display informative error message received from migration tool
• "View Log" for system errors
User Experience
Simplified Material Flow Setup
Simplified setup of material flows:
• Quick setup of sub-components
• Branching out and back from the material flow setup to add / remove / edit components of the material flow
Accessibility - WCAG
• All native viewers (General IE, representation and file viewers) are fully accessible and comply with the latest WCAG 2.1 standard
• Deposit page fully accessible
• Multiple back-end pages
• 3rd party consultancy
Data Management
Institution-level Publishing
Institution-level publishing schedule:
• Full transparency and control over scheduled and manual runs.
• Centralized IE and collection publishing
Enhanced S3 storage management
Storage plugin upgraded to use AWS SDK version 2:
• Includes validation of MD5 checksum value retrieved from S3 storage against preserved value
• Ongoing improvements to remote storage integration
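The checksum comparison can be illustrated with a minimal sketch. It does not call the AWS SDK and is not the storage plugin's code: it computes the MD5 locally over a byte stream that stands in for the object retrieved from S3, then compares it with the preserved value. The function names `md5_hex` and `matches_preserved_md5` are hypothetical.

```python
import hashlib
import io
from typing import BinaryIO


def md5_hex(stream: BinaryIO, chunk_size: int = 1024 * 1024) -> str:
    """Return the hex MD5 digest of a binary stream, read in chunks."""
    digest = hashlib.md5()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def matches_preserved_md5(stream: BinaryIO, preserved_md5: str) -> bool:
    """Compare the stream's MD5 with the preserved value, ignoring case and padding."""
    return md5_hex(stream) == preserved_md5.strip().lower()


# Stand-in for retrieved content; a real check would read the stored object.
retrieved = io.BytesIO(b"hello")
print(matches_preserved_md5(retrieved, "5D41402ABC4B2A76B9719D911017C592"))  # prints True
```

Reading in chunks keeps memory use flat for large files, which matters for archival content.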
Apply Storage Extension Limitations
• Define, via regular expressions, the extensions that are not supported in the storage
• Files whose extension is included in the list will be stored with the 'unknown' extension
Extension Handling examples:
Full Name    Extension
abc.efg      efg
abc.
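The extension rule can be sketched roughly as below. This is an assumption-laden illustration, not Rosetta's implementation: the patterns in `UNSUPPORTED_EXTENSION_PATTERNS` and the function name `storage_extension` are hypothetical. The source does not show how a name ending in a bare dot is handled, so the sketch simply returns the empty extension in that case.

```python
import re

# Hypothetical patterns for extensions the storage does not support.
UNSUPPORTED_EXTENSION_PATTERNS = [r"exe", r"tmp\d*"]


def storage_extension(file_name: str, patterns=UNSUPPORTED_EXTENSION_PATTERNS) -> str:
    """Return the extension to store a file under, or 'unknown' if it is unsupported."""
    _, dot, ext = file_name.rpartition(".")
    if not dot:
        # No dot in the name, so there is no extension to check.
        return ""
    for pattern in patterns:
        # The whole extension must match the pattern, case-insensitively.
        if re.fullmatch(pattern, ext, flags=re.IGNORECASE):
            return "unknown"
    return ext
```

With these example patterns, `abc.efg` keeps the extension `efg`, while `setup.EXE` and `cache.tmp12` would both be stored as `unknown`.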
REST APIs – Deposit and SIP Processing
• Initial drop of REST APIs to provide web services for Deposit and SIP processing functionality.
• To access the WADL on your server, go to: http://
v7.1 and Beyond
v7.1 Preservation
• JHOVE error IDs
  • JHOVE 1.22 introduced error IDs. These will be:
    1. Stored in Rosetta DNX
    2. Added to events for ignoring errors
    3. Fully indexed in the back end
• Save SIP processing events as provenance
  • SIP processing stages should be considered as provenance and therefore saved in the AIP:
    1. Completed processing stages will be recorded as provenance events in underlying IEs
    2. Improve usability of events administration: better documentation and cleanup of unused events
• Expose linkingIEIdentifier for editing
v7.1 User Experience
• Customizable dashboard
  • Allow individual users to replace dashboard reports with a personalized set of reports
• Simplified system setup – continued simplification of navigation between material flow components
v7.1 Infrastructure
• Upgrade to latest Java 15
• Red Hat 8.x certification
• Preservation robustness – continued streamlining of mass files migration
• Custom TLS certificates for SAML – certificates to be agnostic to Rosetta version
• Huge volume stability – continued testing and remediation based on setup simulating over 1bn files
• WCAG 2.1 – working towards full adherence to the latest accessibility standard
v7.1 Data Management
• Filter SIPs by error
  • Facilitate mass SIP processing management by filtering all SIPs in the technical analyst workbench by error and running bulk actions by error
• Automated BagIt ingest
  • Submission jobs and API ingests to support BagIt structure
• REST APIs
  • Delivery: Return JSON METS object to a Delivery HTTP request
  • Expose existing REST APIs via Developers Network framework
  • Provide example for external viewer integrated via REST
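For readers unfamiliar with the BagIt structure mentioned under automated ingest, the sketch below checks the basic layout that the BagIt specification (RFC 8493) requires: a `bagit.txt` declaration, a `data/` payload directory, and at least one payload manifest named `manifest-<algorithm>.txt`. It does not show how Rosetta will ingest bags, and the function name `bag_structure_problems` is hypothetical.

```python
import tempfile
from pathlib import Path
from typing import List


def bag_structure_problems(bag_dir) -> List[str]:
    """List the required BagIt elements missing from a directory (empty list if none)."""
    root = Path(bag_dir)
    problems = []
    if not (root / "bagit.txt").is_file():
        problems.append("missing bagit.txt declaration")
    if not (root / "data").is_dir():
        problems.append("missing data/ payload directory")
    if not any(root.glob("manifest-*.txt")):
        problems.append("missing payload manifest (manifest-<algorithm>.txt)")
    return problems


# Usage: build a minimal bag in a temporary directory and check it.
with tempfile.TemporaryDirectory() as tmp:
    bag = Path(tmp)
    (bag / "bagit.txt").write_text("BagIt-Version: 1.0\nTag-File-Character-Encoding: UTF-8\n")
    (bag / "data").mkdir()
    (bag / "manifest-sha256.txt").write_text("")
    print(bag_structure_problems(bag))  # prints []
```

This checks layout only. A full validation would also verify every payload file against the checksums listed in the manifest.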
Roadmap Focus Areas 2021 / 2022 and beyond
• REST APIs
• Accessibility
• Huge volume stability
• Simplified flows
• Esploro integration
• Structural IEs
• Expand rendered formats: MS Office, 3D and more
• Validation Stack
• Pronom and beyond
• Redesigned Viewer
Structural IEs
Preservation of hierarchical relations between IEs
• Expand data dictionary for hierarchical structures
• On-going PREMIS 3 compliance
• Collaborative design with Working Groups
Enhanced Indexing
• Access copies to be fully indexed
• Full-text generated and indexed via OCR tool
• Integration of additional OCR tools
Validation Stack Overhaul
Discussions led by working groups on:
• Splitting file validation and metadata extraction to separate tasks
• Rethinking risk identification
• More
PRONOM and beyond
• End game: Support additional format registries beyond PRONOM, to expand coverage and reliability
• Initial phase: Additional format identification tool beyond DROID
• Discussions led by working groups
Enhanced Delivery
• Render Microsoft Office files
• Redesigned and customizable IE / Representation Viewer
• 3D delivery via IIIF v3
• Enrich out-of-the-box (OTB) delivery tools
Reporting Designer
Customer Benefits: Generation of new reports made easier; Multiple display options
• Migrate reporting infrastructure to a new platform, providing:
  • Modern and customizable display of reporting results
  • Easy method for generating customized reports
Operational data sharing
Anonymous sharing of system usage, allowing:
• Pre-emptive and efficient support
• Intelligent roadmap planning based on actual system usage
Next phase - community portal for analyzing technical data profiles and sharing best practices for digital preservation
Community Updates
A major religious Institution
Serving more than 250 Institutions from 17 Countries
Institution Types
National & Subject Libraries State Libraries
Archives, Governmental, Public Libraries Academic Institutions & Cultural Heritage
Recent Joiners
National Library of the Netherlands
Coming Soon..
Rosetta @ DP Community
WDPD
JHOVE Hack Week
Thank you!