Data Center Disaster Recovery
Total Page:16
File Type:pdf, Size:1020Kb
Data Center Disaster Recovery KwaiSeng Consulting Systems Engineer Presentation_ID © 2006 Cisco Systems, Inc. All rights reserved. Cisco Confidential 1 Agenda Data Center—The Evolution Data Center Disaster Recovery Objectives Failure Scenarios Design Options Components of Disaster Recovery Site Selection—Front End GSLB Server High Availability—Clustering Data Replication and Synchronization—SAN Extension Data Center Technology Trends Summary © 2006 Cisco Systems, Inc. All rights reserved. 2 The Evolution of Data Centers © 2006 Cisco Systems, Inc. All rights reserved. 3 Data Center Evolution Networked Data Center Phase Data Center Continuous Data Center Availability Virtualization Data Center Network Consolidation Optimization Compute Internet Evolution Computing Data Center Client/ Networking Server 1. Consolidation Mainframes 2. Integration Content 3. Virtualization Networking 4. High Availability Thin Client: HTTP Business Agility Business TCP/IP Network Terminal Evolution 1960 1980 2000 2010 © 2006 Cisco Systems, Inc. All rights reserved. 4 Today’s Data Center Integration of Many Systems and Services Storage N-Tier Front End Network Applications Network Application/Server WAN/ Optimization FC Security Internet Switch Web Servers Resilient Cache IP Firewall DR Data Center Scalable Infrastructure Application and Server Optimization NAS App Servers Content Data Center Security IDS Switch MAN/ DC Storage Networks Internet VSANs Distributed Data Centers DB Servers FC Switch Mainframe IP Comm. Operations FC Switch RAID Metro Network DWDM/SONET/Ethernet FC Tape SAN Secondary Data Center © 2006 Cisco Systems, Inc. All rights reserved. 5 What Is Distributed Data Center? App A App B App A App C DataReplication FC FC Primary Secondary Data Center Data Center © 2006 Cisco Systems, Inc. All rights reserved. 6 Distributed Data Centers Required by disaster recovery and business continuance Avoid single , concentrated data depositary High availability of applications and data access Load balancing together with performance scalability Better response and optimal content routing: proximity to clients © 2006 Cisco Systems, Inc. All rights reserved. 7 Front-End IP Access Layer “Content Routing” SiteSelection App A App B App A App C FC FC Primary Secondary Data Center Data Center © 2006 Cisco Systems, Inc. All rights reserved. 8 Application and Database Layer “Content Switching” Load Balancing App A App B “Server Clustering” App A App C High Availability FC FC Primary Secondary Data Center Data Center © 2006 Cisco Systems, Inc. All rights reserved. 9 Backend SAN Extension “Storage” and “Optical” App A App B DataReplication App A App C and Transporting FC FC Primary Secondary Data Center Data Center © 2006 Cisco Systems, Inc. All rights reserved. 10 Data Center Disaster Recovery © 2006 Cisco Systems, Inc. All rights reserved. 11 Agenda Introduction to Data Center—The Evolution Data Center Disaster Recovery Objectives Failure Scenarios Design Options Components of Disaster Recovery Site Selection—Front End GSLB Server High Availability—Clustering Data Replication and Synchronization—San Extension Data Center Technology Trends Summary © 2006 Cisco Systems, Inc. All rights reserved. 12 Disaster Recovery Recovery of data and resumption of service—Ensuring business can recover and continue after failure or disaster Ability of a business to adapt, change and continue when confronted with various outside impacts Mitigating the impact of a disaster © 2006 Cisco Systems, Inc. All rights reserved. 13 Disaster Recovery What It Means for Business Business Resilience Continued Operation of Business During a Failure Business Continuance Restoration of Business After a Failure Disaster Recovery Protecting Data Through Offsite Data Replication Zero Down Time Is and Backup the Ultimate Goal © 2006 Cisco Systems, Inc. All rights reserved. 14 Disaster Recovery Planning Business Impact Analysis ( BIA ) Determines the impacts of various disasters to specific business functions and company assets Risk analysis Identifies important functions and assets that are critical to company’s operations Disaster Recovery Plan ( DRP ) Restores operability of the target systems, applications, or computing facility at the secondary data center after the disaster © 2006 Cisco Systems, Inc. All rights reserved. 15 Disaster Recovery Objectives Recovery Point Objective (RPO) The point in time (prior to the outage) in which system and data must be restored to Tolerable lost of data in event of disaster or failure The impact of data loss and the cost associated with the loss Recovery Time Objective (RTO) The period of time after an outage in which the systems and data must be restored to the predetermined RPO The maximum tolerable outage time © 2006 Cisco Systems, Inc. All rights reserved. 16 Recovery Point/Time vs. Cost Critical Data Is Disaster Systems Recovered Recovered Strikes and Operational Time Recovery Point Recovery Time time t 0 Time t 1 Time t 2 Days Hours Mins Secs Secs Mins Hours Days Weeks Tape Periodic Asynchronous Synchronous Extended Manual Tape backup Replication Replication Replication Cluster Migration Restore $$$ Increasing Cost $$$ Increasing Cost Smaller RPO/RTO Larger RPO/RTO Higher $$$, replication, Lower $$$, tape backup/restore, hot standby cold standby © 2006 Cisco Systems, Inc. All rights reserved. 17 Agenda Introduction to Data Center—The Evolution Data Center Disaster Recovery Objectives Failure Scenarios Design Options Components of Disaster Recovery Site Selection—Front End GSLB Server High Availability—Clustering Data Replication and Synchronization—San Extension Data Center Technology Trends Summary © 2006 Cisco Systems, Inc. All rights reserved. 18 Failure Scenarios Disaster Could Mean Many Types of Failure Network failure Device failure Storage failure Site failure © 2006 Cisco Systems, Inc. All rights reserved. 19 Network Failures ISP failure Service Internet Service Dual ISP connections Provider A Provider B Multiple ISP Connection failure within the network EtherChannel ® Multiple route paths © 2006 Cisco Systems, Inc. All rights reserved. 20 Device Failures Routers, switches, Service Internet Service FWs Provider A Provider B HSRP VRRP Hosts HA cluster LB server farm NIC teaming © 2006 Cisco Systems, Inc. All rights reserved. 21 Storage Failures Disk arrays Service Internet Service RAID Provider A Provider B Disk controllers Storage Replication Site to Site Mirroring Optimization © 2006 Cisco Systems, Inc. All rights reserved. 22 Site Failures Partial site failure Service Internet Service Application maintenance Provider A Provider B Application migration Application scheduled DR exercise Complete site failure Disaster © 2006 Cisco Systems, Inc. All rights reserved. 23 Agenda Introduction to Data Center—The Evolution Data Center Disaster Recovery Objectives Failure Scenarios Design Options Components of Disaster Recovery Site Selection—Front End GSLB Server High Availability—Clustering Data Replication and Synchronization—San Extension Data Center Technology Trends Summary © 2006 Cisco Systems, Inc. All rights reserved. 24 Warm Standby A data center that is equipped with hardware and communications interfaces capable of providing backup operating support Latest backups from the production data center must be delivered Network access needs to be activated Application needs to be manually started © 2006 Cisco Systems, Inc. All rights reserved. 25 Disaster Recovery—Active/Standby App A App B App A App C IP/Optical Network FC Secondary FC Primary Data Center Data Center (Warm Standby) © 2006 Cisco Systems, Inc. All rights reserved. 26 Hot Standby A data center that is environmentally ready and has sufficient hardware, software to provide data processing service with little down time Hot backup offers disaster recovery, with little or no human intervention Application data is replicated from the primary site A hot backup site provides better RTO/RPO than warm standby but cost more to implement Business continuance © 2006 Cisco Systems, Inc. All rights reserved. 27 Disaster Recovery—Active/Standby App A App B App A App C IP/Optical Network FC FC Primary Secondary Data Center Data Center © 2006 Cisco Systems, Inc. All rights reserved. 28 Active/Active DR Design Multiple Tiers of Application Service Internet Service Provider A Provider B Presentation Tier Application Tier Storage Tier © 2006 Cisco Systems, Inc. All rights reserved. 29 Active/Active Data Centers Internal Internet Network Service Service Provider A Provider B Internal Network Active/Active Web Hosting Active/Active Application Processing Active/Standby Database Processing or Active/Active for Different Application © 2006 Cisco Systems, Inc. All rights reserved. 30 Components of Disaster Recovery © 2006 Cisco Systems, Inc. All rights reserved. 31 Agenda Introduction to Data Center—The Evolution Data Center Disaster Recovery Objectives Failure Scenarios Design Options Components of Disaster Recovery Site Selection—Front End GSLB Server High Availability—Clustering Data Replication and Synchronization—SAN Extension Data Center Technology Trends Summary © 2006 Cisco Systems, Inc. All rights reserved. 32 Site Selection Mechanisms Site selection mechanisms depend on the technology or mix of technologies adopted for request routing : 1. HTTP redirect 2. DNS-based 3. L3 Routing with Route Health Injection (RHI) Health of servers and/or applications needs to be taken into account Optionally, other metrics (like