Best Practices in Data Center Transformation, Into the Next Century

Total Page:16

File Type:pdf, Size:1020Kb

Best Practices in Data Center Transformation, Into the Next Century riding the clouds - best practices in data center transformation, into the next century and beyond EMC Proven Professional Knowledge Sharing 2011 Paul Brant EMC Corporation [email protected] 2011 EMC Proven Professional Knowledge Sharing 2 Table of Contents Table of Figures................................................................................................. 10 Table of Equations ............................................................................................ 12 Abstract .............................................................................................................. 13 Introduction ....................................................................................................... 15 The need for Preservation in the Cloud/Next Generation Data Center ....................... 17 The ascent of the Computer in the Cloud/Next Generation Data Center .................... 18 The ascent of Data Longevity in the Cloud/Next Generation Data Center .................. 19 The ascent of Virtualization in the Cloud/Next Generation Data Center ..................... 21 The ascent of the Appliance in the Cloud/Next-Generation Data Center .................... 25 The ascent of the Intercloud – AKA Interconnected Global Connectivity .................... 26 The ascent of the Personal Data Store in the Cloud/Next-Generation Data Center .... 27 The Personal Information Management Ecosystem ................................................ 31 The Drivers for the Next-Generation Data Center ...................................................... 32 Attributes and Technologies ............................................................................ 35 Massively Parallel Processing (MPP) and other architectures .................................... 37 Best Practice – Understand advantages and trade-offs of scalable systems in the NGDC ............................................................................................................................... 37 SMP Architectures .................................................................................................. 38 Clustered Architecture Systems .............................................................................. 39 Best Practice – Understand Advantages in MPP (massively parallel processing) in addressing the NGDC ..................................................................................................... 41 Best Practice – Understand Amdahl’s and Gustafin’s Laws as it relates to MPP to achieve NGDC architectures ........................................................................................................ 42 Applianceization......................................................................................................... 47 Best Practice – Consider Applianceization in Big Data Architectures ...................... 48 Best Practice – Utilize Applianceization in the Financial Sector .............................. 49 Best Practice – For NGDC DBMS, consider utilizing appliances ............................. 51 2011 EMC Proven Professional Knowledge Sharing 3 Best Practice – Implement Applianceization within the infrastructure – VBLOCK Technology and Architectures .................................................................................................... 52 Best Practice – Implement a Unified Management solution for Appliance Architectures ............................................................................................................................... 53 Best Practice – Consider utilizing appliance structures in the cloud (IT in a box) .... 55 Data Transference ..................................................................................................... 57 Best Practice – Implement Directory-based Cache Coherency to create efficient local and remote distributed Data Transference ..................................................................... 59 Best Practice – Implement Directory-Based Cache Coherency for Data Federation63 Object Affinity ............................................................................................................ 65 Best Practice – Standardize Cloud Computing Interfaces that is object-based and vendor- neutral .................................................................................................................... 66 Best Practice – Implement an Object-Based Open Cloud Computing Interface into NGDC designs ................................................................................................................... 66 Security ..................................................................................................................... 70 Best Practice – Implement Identity assurance into the Personal Data Store Architecture ............................................................................................................................... 70 Best Practice – Implement PDS to enable the Cloud in an ever-changing social media world ............................................................................................................................... 71 Best Practice – Implement Federated login and password for Personal Data Stores72 Best Practice – Embrace Identity methodologies to Enhance Security ................... 73 Best Practice – Consider Containerization to determine who you should trust with your data ............................................................................................................................... 74 Virtualization .............................................................................................................. 75 Best Practice – Consider Virtualization considerations when implementing a Cloud or Intercloud................................................................................................................ 75 Best Practice – Implement the NGDC with VM metadata abstraction considerations using OVF ........................................................................................................................ 76 Best Practice – Consider Client and Server Virtualization into the NGDC ............... 77 Best Practice – Utilize Virtualization to address cost reduction and cost avoidance in the NGDC ..................................................................................................................... 78 Best Practice – Understand the benefits of Virtualized Service-Based Delivery ...... 79 Personal Data Store Attributes ................................................................................... 82 2011 EMC Proven Professional Knowledge Sharing 4 Best Practice – Implement Personal Data Stores into the NGDC architecture ........ 82 Best Practice – Implement additional Data Store flows to the existing lexicon ........ 87 Best Practice – Implement NGDC‘s to address the needs of Personal Data Stores as a service to the individual. ......................................................................................... 89 Archival Ecosystem ................................................................................................... 94 What is an Archive .......................................................................................................... 94 Best Practice – Implement OAIS in the Next Generation Data Center Archive Architectures ................................................................................................................... 97 Best Practice – Utilize Archival strategies for digital content preservation by leveraging the knowledge of the archival profession ............................................................................. 101 Best Practice – Implement the Self-contained Information Retention Format (SIRF) for long-term retention in the NGDC ................................................................................... 102 Best Practice – Integrate SIRF and OAIS into the NGDC .............................................. 106 Best Practice – Integrate SIRF and XAM into the NGDC ............................................... 109 Best Practice – Integrate SIRF and JHOVE into the NGDC ........................................... 110 Best Practice – Integrate SIRF and Bagit into the NGDC .............................................. 111 Best Practice – Next-Generation Data Center containerization should address specific Media and Platform requirements .................................................................................. 111 Storage Ecosystem .................................................................................................. 114 Storage Object Affinity .......................................................................................... 114 Storage for Cloud Computing ............................................................................... 114 Best Practice – Implement an Object-based Data Management Infrastructure to Address Interoperability and Transparency Requirements .......................................................... 115 Best Practice – Implement CDMI Metadata Properties Consistent with the CDMI Specification .................................................................................................................. 117 Data Portability ..................................................................................................... 117 Best Practice – Implement Data Portability techniques into the NGDC architecture ...... 118 Storage Tiering ....................................................................................................
Recommended publications
  • Using the EMC SRDF Adapter for Vmware Site Recovery Manager
    Using the EMC SRDF Adapter for VMware Site Recovery Manager Version 5.0 • SRDF Overview • Installing and Configuring the SRDF SRA • Initiating Test Site Failover Operations • Performing Site Failover/Failback Operations Drew Tonnesen September 2017 VMAX Engineering Copyright © 2017 EMC Corporation. All rights reserved. EMC believes the information in this publication is accurate as of its publication date. The information is subject to change without notice. THE INFORMATION IN THIS PUBLICATION IS PROVIDED “AS IS.” EMC CORPORATION MAKES NO REPRESENTATIONS OR WARRANTIES OF ANY KIND WITH RESPECT TO THE INFORMATION IN THIS PUBLICATION, AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Use, copying, and distribution of any EMC software described in this publication requires an applicable software license. For the most up-to-date regulatory document for your product line, go to the Technical Documentation and Advisories section on support.emc.com. For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com. All other trademarks used herein are the property of their respective owners. EMC is now part of the Dell group of companies. Part number H10553.8 2 Using EMC SRDF Adapter for VMware vCenter Site Recovery Manager Contents Preface Chapter 1 Symmetrix Remote Data Facility Introduction ....................................................................................... 14 SRDF overview.................................................................................
    [Show full text]
  • Netezza EOL: IBM IAS Is a Flawed Upgrade Path Netezza EOL: IBM IAS Is a Flawed Upgrade Path
    Author: Marc Staimer WHITE PAPER Netezza EOL: What You’re Not Being Told About IBM IAS Is a Flawed Upgrade Path Database as a Service (DBaaS) WHITE PAPER • Netezza EOL: IBM IAS Is a Flawed Upgrade Path Netezza EOL: IBM IAS Is a Flawed Upgrade Path Executive Summary It is a common technology vendor fallacy to compare their systems with their competitors by focusing on the features, functions, and specifications they have, but the other guy doesn’t. Vendors ignore the opposite while touting hardware specs that mean little. It doesn’t matter if those features, functions, and specifications provide anything meaningfully empirical to the business applications that rely on the system. It only matters if it demonstrates an advantage on paper. This is called specsmanship. It’s similar to starting an argument with proof points. The specs, features, and functions are proof points that the system can solve specific operational problems. They are the “what” that solves the problem or problems. They mean nothing by themselves. Specsmanship is defined by Wikipedia as: “inappropriate use of specifications or measurement results to establish supposed superiority of one entity over another, generally when no such superiority exists.” It’s the frequent ineffective sales process utilized by IBM. A textbook example of this is IBM’s attempt to move their Netezza users to the IBM Integrated Analytics System (IIAS). IBM is compelling their users to move away from Netezza. In the fall of 2017, IBM announced to the Enzee community (Netezza users) that they can no longer purchase or upgrade PureData System for Analytics (the most recent IBM name for its Netezza appliances), and it will end-of-life all support next year.
    [Show full text]
  • EMC Greenplum Data Computing Appliance: High Performance for Data Warehousing and Business Intelligence an Architectural Overview
    EMC Greenplum Data Computing Appliance: High Performance for Data Warehousing and Business Intelligence An Architectural Overview EMC Information Infrastructure Solutions Abstract This white paper provides readers with an overall understanding of the EMC® Greenplum® Data Computing Appliance (DCA) architecture and its performance levels. It also describes how the DCA provides a scalable end-to- end data warehouse solution packaged within a manageable, self-contained data warehouse appliance that can be easily integrated into an existing data center. October 2010 Copyright © 2010 EMC Corporation. All rights reserved. EMC believes the information in this publication is accurate as of its publication date. The information is subject to change without notice. THE INFORMATION IN THIS PUBLICATION IS PROVIDED “AS IS.” EMC CORPORATION MAKES NO REPRESENTATIONS OR WARRANTIES OF ANY KIND WITH RESPECT TO THE INFORMATION IN THIS PUBLICATION, AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Use, copying, and distribution of any EMC software described in this publication requires an applicable software license. For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com All other trademarks used herein are the property of their respective owners. Part number: H8039 EMC Greenplum Data Computing Appliance: High Performance for Data Warehousing and Business Intelligence—An Architectural Overview 2 Table of Contents Executive summary .................................................................................................
    [Show full text]
  • How to Start Your Disaster Recovery in This “Cloudy” Landscape EMC Proven Professional Knowledge Sharing 2011
    how to start your disaster recovery in this “cloudy” landscape EMC Proven Professional Knowledge Sharing 2011 Roy Mikes Storage and Virtualization Architect Mondriaan Zorggroep [email protected] Table of Contents About This Document 3 Who Should Read This Document? 3 Introduction 4 1. What is a Disaster 6 2. What is a Disaster Recovery Plan (DR plan) 7 2.1. Other benefits of a Disaster Recovery Plan 7 3. Business Impact Analysis (BIA) 8 3.1. Maximum Tolerable Downtime (MTD) 9 3.2. Recovery Time Objective (RTO) 9 3.3. Recovery Point Objective (RPO) 9 4. Data Classification 10 5. Risk Assessment 13 5.1. Component Failure Impact Analysis (CFIA) 16 5.2. Identifying Critical Components 18 5.2.1. Personnel 18 5.2.2. Systems 18 5.3. Dependencies 19 5.4. Redundancy 21 6. Emergency Response Team (ERT) 23 7. Developing a Recovery Strategy 24 7.1. Types of backup 26 7.2. Virtualized Servers and Disaster Recovery 27 7.3. Other thoughts 28 8. Testing Recovery Plans 29 9. Role of virtualization 30 9.1. Role of VMware 31 9.2. Role of EMC 33 9.3. Role of VMware Site Recovery Manager (SRM) 35 10. VMware Site Recovery Manager 36 11. Standardization 41 12. Conclusion 42 References 44 EMC Proven Professional Knowledge Sharing 2 About This Article Despite our best efforts and precautions, disasters of all kind eventually strike an organization, usually unanticipated and unannounced. Natural disasters such as hurricanes, floods, or fires can threaten the very existence of an organization. Well-prepared organizations establish plans, procedures, and protocols to survive the effects that a disaster may have on continuing operations and help facilitate a speedy return to working order.
    [Show full text]
  • Dell EMC VPLEX: SAN Connectivity Implementation Planning and Best Practices
    Best Practices Dell EMC VPLEX: SAN Connectivity Implementation planning and best practices Abstract This document describes SAN connectivity for both host to Dell EMC™ VPLEX™ front-end and VPLEX to storage array back end. This document covers active/active and active/passive and ALUA array connectivity with illustrations detailing PowerStore™, Dell EMC Unity™ XT, and XtremIO™. Included also is information for metro node connectivity. January 2021 H13546 Revisions Revisions Date Description May 2020 Version 4: Updated ALUA connectivity requirements January 2021 Added metro node content Acknowledgments Author: VPLEX CSE Team [email protected] This document may contain certain words that are not consistent with Dell's current language guidelines. Dell plans to update the document over subsequent future releases to revise these words accordingly. This document may contain language from third party content that is not under Dell's control and is not consistent with Dell's current guidelines for Dell's own content. When such third party content is updated by the relevant third parties, this document will be revised accordingly. The information in this publication is provided “as is.” Dell Inc. makes no representations or warranties of any kind with respect to the information in this publication, and specifically disclaims implied warranties of merchantability or fitness for a particular purpose. Use, copying, and distribution of any software described in this publication requires an applicable software license. Copyright © January 2021 Dell Inc. or its subsidiaries. All Rights Reserved. Dell Technologies, Dell, EMC, Dell EMC and other trademarks are trademarks of Dell Inc. or its subsidiaries. Other trademarks may be trademarks of their respective owners.
    [Show full text]
  • Asia Pacific
    Enterprise Data Storage Market Insights Future Technologies and Trends Will Drive Market Growth P8CD-72 September 2015 Contents Section Slide Number Scope and Market Overview 3 Enterprise Storage Market Trends 7 Future Technologies in Storage 12 Business Models of Vendors 16 Business Models of OEMs 26 Business Models of Distributors 31 Implications of Storage Trends on Data Centers 39 Frost & Sullivan Story 45 P8CD-72 2 Scope and Market Overview Return to contents P8CD-72 3 Scope of the Study Objectives • To provide an overview of the global enterprise data storage market, focusing on the trends, technology, and business models of major market participants • To understand the trends affecting the global storage market and the implications of these trends on the data center market • Future technologies entering the storage market • To gain a detailed understanding on the distribution structure and channel partner program for key storage vendors EMC and NetApp • To provide an understanding of the business model for ODM/OEMs in terms of service and support capabilities across regions; these include Foxconn and Supermicro ODM: Original Design Manufacturer OEM: Original Equipment Manufacturer Source: Frost & Sullivan P8CD-72 4 Market Overview The enterprise data storage market is changing at a rapid pace. More data is being digitized than ever before and stored on disks of various size capacities. Globally, by 2020, there are expected to be about 26 billion connected devices. These devices would generate large amounts of data that need to be stored and analyzed at some point of time. Storage devices need to be agile, scalable, low cost, able to handle huge data loads, and durable to sustain the huge data growth.
    [Show full text]
  • Microsoft's SQL Server Parallel Data Warehouse Provides
    Microsoft’s SQL Server Parallel Data Warehouse Provides High Performance and Great Value Published by: Value Prism Consulting Sponsored by: Microsoft Corporation Publish date: March 2013 Abstract: Data Warehouse appliances may be difficult to compare and contrast, given that the different preconfigured solutions come with a variety of available storage and other specifications. Value Prism Consulting, a management consulting firm, was engaged by Microsoft® Corporation to review several data warehouse solutions, to compare and contrast several offerings from a variety of vendors. The firm compared each appliance based on publicly-available price and specification data, and on a price-to-performance scale, Microsoft’s SQL Server 2012 Parallel Data Warehouse is the most cost-effective appliance. Disclaimer Every organization has unique considerations for economic analysis, and significant business investments should undergo a rigorous economic justification to comprehensively identify the full business impact of those investments. This analysis report is for informational purposes only. VALUE PRISM CONSULTING MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS DOCUMENT. ©2013 Value Prism Consulting, LLC. All rights reserved. Product names, logos, brands, and other trademarks featured or referred to within this report are the property of their respective trademark holders in the United States and/or other countries. Complying with all applicable copyright laws is the responsibility of the user. Without limiting the rights under copyright, no part of this report may be reproduced, stored in or introduced into a retrieval system, or transmitted in any form or by any means (electronic, mechanical, photocopying, recording, or otherwise), or for any purpose, without the express written permission of Microsoft Corporation.
    [Show full text]
  • EMC VPLEX FAMILY Continuous Availability and Data Mobility Within and Across Data Centers
    EMC VPLEX FAMILY Continuous Availability and Data Mobility within and across data centers DELIVERING CONTINUOUS AVAILABILITY AND DATA MOBILITY FOR MISSION-CRITICAL APPLICATIONS Storage infrastructure is evolving to deliver new methods of making data mobile, freeing it from physical device limitations. Virtualizing storage extends the benefits of server virtualization, providing automation, integration with existing infrastructure and growth on demand. In the past, application data was tied to a single array. Providing Availability/mobility was challenging and doing so without application interruption was impossible. BENEFITS • Continuous Availability - Deliver application and data availability within the data center and over distance with active-active infrastructure utilization, eliminating planned and unplanned downtime. • Data Mobility - Move The EMC VPLEX family delivers data availability and mobility across arrays in a applications, virtual machines single data center and across arrays in data centers separated by distance. VPLEX and data within and across is a continuous availability and data mobility platform that enables mission-critical data centers and over distance applications to remain up and running during a variety of planned and unplanned without impacting users downtime scenarios, even in the event of a data center site failure. VPLEX enables non-disruptive data mobility, leveraging technologies like VMware and other • Migrate and relocate virtual clusters that were built assuming a single storage instance and enabling them to machines, applications, and function across arrays in a single data center and across distance. data non-disruptively So, how does VPLEX work? VPLEX enables the exact same data to be read/write • Eliminate planned and accessible across two arrays at the same time.
    [Show full text]
  • Dell Quickstart Data Warehouse Appliance 2000
    Dell Quickstart Data Warehouse Appliance 2000 Reference Architecture Dell Quickstart Data Warehouse Appliance 2000 THIS DESIGN GUIDE IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND TECHNICAL INACCURACIES. THE CONTENT IS PROVIDED AS IS, WITHOUT EXPRESS OR IMPLIED WARRANTIES OF ANY KIND. © 2012 Dell Inc. All rights reserved. Reproduction of this material in any manner whatsoever without the express written permission of Dell Inc. is strictly forbidden. For more information, contact Dell. Dell , the DELL logo, the DELL badge, and PowerEdge are trademarks of Dell Inc . Microsoft , Windows Server , and SQL Server are either trademarks or registered trademarks of Microsoft Corporation in the United States and/or other countries. Intel and Xeon are either trademarks or registered trademarks of Intel Corporation in the United States and/or other countries. Other trademarks and trade names may be used in this document to refer to either the entities claiming the marks and names or their products. Dell Inc. disclaims any proprietary interest in trademarks and trade names other than its own. December 2012 Page ii Dell Quickstart Data Warehouse Appliance 2000 Contents Introduction ............................................................................................................ 2 Audience ............................................................................................................. 2 Reference Architecture ..............................................................................................
    [Show full text]
  • Lynn Conway Professor of Electrical Engineering and Computer Science, Emerita University of Michigan, Ann Arbor, MI 48109-2110 [email protected]
    IBM-ACS: Reminiscences and Lessons Learned From a 1960’s Supercomputer Project * Lynn Conway Professor of Electrical Engineering and Computer Science, Emerita University of Michigan, Ann Arbor, MI 48109-2110 [email protected] Abstract. This paper contains reminiscences of my work as a young engineer at IBM- Advanced Computing Systems. I met my colleague Brian Randell during a particularly exciting time there – a time that shaped our later careers in very interesting ways. This paper reflects on those long-ago experiences and the many lessons learned back then. I’m hoping that other ACS veterans will share their memories with us too, and that together we can build ever-clearer images of those heady days. Keywords: IBM, Advanced Computing Systems, supercomputer, computer architecture, system design, project dynamics, design process design, multi-level simulation, superscalar, instruction level parallelism, multiple out-of-order dynamic instruction scheduling, Xerox Palo Alto Research Center, VLSI design. 1 Introduction I was hired by IBM Research right out of graduate school, and soon joined what would become the IBM Advanced Computing Systems project just as it was forming in 1965. In these reflections, I’d like to share glimpses of that amazing project from my perspective as a young impressionable engineer at the time. It was a golden era in computer research, a time when fundamental breakthroughs were being made across a wide front. The well-distilled and highly codified results of that and subsequent work, as contained in today’s modern textbooks, give no clue as to how they came to be. Lost in those texts is all the excitement, the challenge, the confusion, the camaraderie, the chaos and the fun – the feeling of what it was really like to be there – at that frontier, at that time.
    [Show full text]
  • EMC Secure Remote Services 3.18 Site Planning Guide
    EMC® Secure Remote Services Release 3.26 Site Planning Guide REV 01 Copyright © 2018 EMC Corporation. All rights reserved. Published in the USA. Published January 2018 EMC believes the information in this publication is accurate as of its publication date. The information is subject to change without notice. The information in this publication is provided as is. EMC Corporation makes no representations or warranties of any kind with respect to the information in this publication, and specifically disclaims implied warranties of merchantability or fitness for a particular purpose. Use, copying, and distribution of any EMC software described in this publication requires an applicable software license. EMC2, EMC, and the EMC logo are registered trademarks or trademarks of EMC Corporation in the United States and other countries. All other trademarks used herein are the property of their respective owners. For the most up-to-date regulatory document for your product line, go to Dell EMC Online Support (https://support.emc.com). 2 EMC Secure Remote Services Site Planning Guide CONTENTS Preface Chapter 1 Overview ESRS architecture........................................................................................ 10 ESRS installation options ...................................................................... 10 Other components ................................................................................ 11 Requirements for ESRS customers......................................................... 11 Supported devices.....................................................................................
    [Show full text]
  • Celerra NSX Data Sheet
    DATA SHEET EMC Celerra NSX Series IP Storage Reach new heights of availability and scalability with EMC Celerra NSX Meeting the information-sharing challenge Performance bottlenecks, security issues, and the high cost of data protection and management associated with deploying file servers using general-purpose operating systems become non-issues ® ® The Big Picture with the EMC Celerra NSX. NSX combines commodity CPUs with flexible and sophisticated Data Access in Real Time (DART) file-serving software into a purpose-built, scalable, clustered file server • Ensure no-compromise availability optimized for moving mission-critical data. through integrated advanced clustering, managed as a single device Unparalleled EMC Symmetrix® and CLARiiON® storage technologies, combined with Celerra’s • Receive advanced functionality at no extra impressive I/O system architecture, offer industry-leading availability, scalability, performance, cost with the most comprehensive suite of and ease of management. built-in features • Leverage EMC Celerra’s price/performance With Celerra NSX, you get distributed file services in a centrally managed information storage system. leadership to support large user commu- You can dynamically grow, share, and cost-effectively manage file systems with multi-protocol file nities with an intuitive, GUI-based, single access capabilities. And you take advantage of simultaneous support for NFS, CIFS, and iSCSI proto- point of management for up to eight file server nodes in an advanced clustering cols, concurrently, that
    [Show full text]