FUSION

WANDISCO FUSION® GOOGLE CLOUD DATAPROC

Cloud migration and hybrid cloud WANdisco Fusion for Google Cloud Dataproc extends on-premises Hadoop clusters to the cloud for active burst-out processing, offsite disaster recovery and GOOGLE data archiving. CLOUD DATAPROC

Guaranteed data consistency Take control of your data with our game-changing, patented technology from on-premises Hadoop clusters running any distribution to Google Cloud Dataproc. Data is FUSION replicated between on-premises and cloud environments as it changes. Consistency is guaranteed and recovery is automatic after hardware or network outages.

On-demand analytics FUSION FUSION Replicate fast streaming data to Google Cloud Dataproc as it is ingested on-premises to leverage Dataproc’s scalability and performance for demanding real-time analytics applications. WANdisco Fusion supports Dataproc’s ability to spin up and shutdown clusters on demand, so you only pay for resources when you HADOOP use them. LOCAL AND NFS MOUNTED Automatic recovery FILE SYSTEMS Recovery is automatic after planned or unplanned network or hardware outages in both on-premises and cloud environments. DistCp-based solutions offered by Hadoop vendors: Seamless, flexible and easy to install • Run in batch Replicates data between on-premises clusters deployed • Require significant administrator overhead for setup, on any Hadoop compatible storage and Google Cloud maintenance and monitoring Dataproc. Uses standard Google utilities for installation • Impose significant overhead when moving data that and deployment. prevents other applications from performing Overcomes challenges of other hybrid • Don’t guarantee data consistency across on-premises cloud solutions for Hadoop and Google Cloud Dataproc clusters Fusion runs as a proxy to on-premises Hadoop clusters • Can’t replicate data as it’s ingested and Google Cloud Dataproc, replicating data as it • Require manual intervention to handle out-of- sync changes in either environment. conditions • Risk administrator error leading to data loss and extended downtime during recovery.

Copyright © 2019 WANdisco, Inc. All rights reserved. Supported environments HHadoop Cloud • Amazon EMR • Amazon • Cloudera CDH • Alibaba Cloud • Google Cloud • Google Cloud™ Dataproc • ® • Hortonworks (HDP) • Oracle® Cloud ® • IBM BigInsights File • MapR • Amazon S3 • Microsoft Azure • IBM COS HDInsight® • Local and NFS • Oracle Big Data Cloud mounted file ® • Oracle BDA and systems BDCS • NetApp ONTAP Operating Systems • OpenStack® Swift • Centos • Oracle Object Storage • (OCI and Classic) • RHEL • Virtustream Storage • SLES Cloud •

ABOUT WANDISCO

WANdisco is the LiveData company that empowers enterprises to revolutionize their IT infrastructure with its groundbreaking distributed coordination engine (DConE) in the WANdisco Fusion platform, enabling companies to generate hyperscale economics with the same IT budget — across multiple development environments, data centers, and cloud providers. WANdisco Fusion powers hundreds of the Global 2000, including , Allianz, AMD, Juniper, Morgan Stanley and more. With significant OEM relationships with IBM and Dell EMC and go-to-market partnerships with , Cisco, Microsoft Azure, Google Cloud, Oracle, Alibaba and other industry titans – WANdisco is igniting a LiveData movement worldwide. For more information on WANdisco, visit wandisco.com or contact [email protected].

Talk to one of our specialists today wandisco.com EMEA +44 114 303 9985 Join us online to access our APAC +61 2 8211 0620 extensive resource library. ALL OTHER +1 925 380 1728 US +1 877 926 3472 Follow us to stay in touch

5000 Executive Parkway, Suite 270 San Ramon, California 94583

Copyright © 2019 WANdisco, Inc. All rights reserved.