Hitachi Solution for Databases – Optimized Enterprise Reduce Costs, Speed Access and Gain More Value

SOLUTION PROFILE

Data is at the core of modern digital business, but immense data growth presents challenges for IT organizations. As data volume increases, inefficiencies in your enterprise data warehouse (EDW) can prevent you from realizing the full value of your data. Extract-transform-load (ETL) processes consume more compute and storage resources, leading to higher licensing and management costs. Scheduled downtime to manually manage databases interrupts availability. Backup and archive cycles increasingly take longer, significantly slowing access times for users. Query operations must sort through massive amounts of data, much of it cold, infrequently accessed or irrelevant. As a result, you experience degraded performance.

Optimizing your EDW by offloading cold and unused data to a data lake can help you overcome these challenges. With this approach, you can reduce costs, deliver faster access to data, and provide better information for decision-making. EDW Inefficiencies Impede Innovation and Digital Transformation

Optimizing your enterprise data warehouse is critical. In a typical EDW, 50-70% of data is cold or unused, while only 2.8% of data is hot1. You are challenged to offload unused and cold data to a cost-effective, certified NoSQL MongoDB or (Cloudera or MapR) environment to reduce the amount of data queried and backed up for critical operations. At the same time, you must provide timely access to all data and lay a foundation for data blending and analysis.

TYPICAL ORACLE ENTERPRISE HITACHI SOLUTION FOR DATA WAREHOUSE DATABASES WITH ORACLE EDW (EDW) ARCHITECTURE OPTIMIZATION

Systems of Certified MongoDB or Record Enterprise Data Hadoop Appliance Cluster Warehouse With Hitachi VSP, COLD DATA with UCP and VSP Exadata, EMC and so on OFFLOAD

RMDB ERP <30%Hot 60% -Cold70% Hot DataData Cold Data Hadoop CRM Other

RMDB = relational database management system, ERP = enterprise resource processing, CRM = customer relationship management, VSP = Hitachi Virtual Storage Platform, UCP = Hitachi Unified Compute Platform

The typical EDW architecture is inefficient. Up to 70% of the data stored is cold or unused, resulting in increased query and backup times as well as higher costs. © Hitachi Vantara Corporation 2017. All Rights Reserved

Offload Cold Data to a Cost-Effective NoSQL MongoDB or Hadoop (Cloudera or MapR) Database Using Pentaho Data Integration Hitachi Vantara solutions and services can help you optimize your EDW environment. Cold data is offloaded to a MongoDB or Hadoop (Cloudera or MapR) database running on general-purpose servers and storage. Hot and warm data remains in your existing EDW, supported by advanced, specialized hardware. Hitachi Vantara Global Services uses a software toolkit to automatically map data between your Oracle database and MongoDB or Hadoop (Cloudera or MapR). This approach speeds the offload operation and lowers costs. It also diminishes the risk of human errors by reducing the number of manual processes by up to 90%.

By optimizing the placement of data according to cost and availability priority, this solution helps you reduce database management costs, improve data availability, and increase your overall EDW performance. Hitachi Vantara and our partners can fully manage this solution implementation, ensuring seamless deployment.

1 Source: Innovation and Strategy Team and Appfluent Analysis

2 Pentaho Data Integration for EDW Offload: Features and Benefits

Integrated Management Accelerated Backup and Archival Pentaho Data Integration lets you access both your Optimized storage tiers let you perform backup existing EDW and a second environment operations on smaller subsets of data at the from a single tool. Intuitive drag-and-drop appropriate frequency. For example, hot data can integration with a graphical ETL designer simplifies be backed up daily, while offloaded cold data is data pipeline creation. replicated for availability using MongoDB or Hadoop (Cloudera or MapR) Automate schema mapping between your Oracle and MongoDB or Hadoop (Cloudera or Increase data availability and reduce backup MapR) databases, lower costs and eliminate and archival times. human errors.

Hitachi Solution for Databases With Optimized Oracle EDW and Hitachi Unified Compute Platform ■■ Offload cold and unused data to a cost-effective big data environment.

■■ Store all data in an orchestrated data lake.

■■ Submit queries to the data lake.

■■ Return unified data sets to users. Hadoop

© Hitachi Vantara Corporation 2017. All Rights Reserved Extreme Scalability Cost-Effective Storage Tiering Horizontal scalability of MongoDB and Cloudera A second, low-cost, MongoDB or Hadoop lets you offload many terabytes of cold data from (Cloudera or MapR) based storage tier reduces the existing specialized EDW servers onto cost-effective number of EDW licenses you need to process your general-purpose infrastructure. Offloaded data data. It also lets you store cold data on general- remains readily accessible. purpose infrastructure, decreasing the amount of specialized EDW infrastructure needed. Adapt easily to increasing data growth while maximizing your budget. Reduce licensing and infrastructure costs for your data environment. Optimized Data Placement Pentaho Data Integration helps you divide your data into optimized storage tiers. Store hot and warm data in your existing EDW system to maximize availability and performance. Offload cold and unused data onto general-purpose servers to minimize cost.

Maximize EDW performance by limiting the amount of data to be examined.

3 Take Advantage of Proven Database Solution Expertise

Hitachi Vantara is a trusted and experienced provider of database solutions. We help you find innovative ways to achieve your business goals by focusing on the value of your data. With a global partner ecosystem, we deliver proven, high-performance, enterprise-class solutions and services, worldwide.

Using converged infrastructure, advanced software and proven database and industry expertise, we can help your organization develop and execute a data management strategy. Optimize your database environment, realize the full value of your data, and pave the way for business innovation. At Hitachi Vantara, we can help you get there faster.

4 Next Steps

Learn more about the Hitachi Solution for Databases and related Hitachi Vantara solutions and services in the following guides.

Hitachi Solution for Databases in an Enterprise Data Warehouse Offload Package for Oracle Database Reference Architecture.

Hitachi Solution for Enterprise Data Intelligence with MongoDB Reference Architecture Guide.

Find out more about Hitachi Solution for Databases here.

Hitachi Vantara at a Glance Your data is the key to new revenue, better customer experiences and lower costs. With technology and expertise, Hitachi Vantara drives data to meaningful outcomes.

Hitachi Vantara

Corporate Headquarters Contact Information 2535 Augustine Drive USA: 1-800-446-0744 Santa Clara, CA 95054 USA Global: 1-858-547-4526 HitachiVantara.com | community.HitachiVantara.com HitachiVantara.com/contact

HITACHI is a trademark or registered trademark of Hitachi, Ltd. VSP is a trademark or registered trademark of Hitachi Vantara Corporation. All other trademarks, service marks, and company names are properties of their respective owners. SP-270-B BTD March 2019