Intel® Solutions for Lustre* Software SC16 Technical Training

Total Page:16

File Type:pdf, Size:1020Kb

Intel® Solutions for Lustre* Software SC16 Technical Training SC’16 Technical Training All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest Intel product specifications and roadmaps Tests document performance of components on a particular test, in specific systems. Differences in hardware, software, or configuration will affect actual performance. Consult other sources of information to evaluate performance as you consider your purchase. For more complete information about performance and benchmark results, visit http://www.intel.com/performance. Intel technologies’ features and benefits depend on system configuration and may require enabled hardware, software or service activation. Performance varies depending on system configuration. No computer system can be absolutely secure. Check with your system manufacturer or retailer or learn more at http://www.intel.com/content/www/us/en/software/intel-solutions-for-lustre-software.html. You may not use or facilitate the use of this document in connection with any infringement or other legal analysis concerning Intel products described herein. You agree to grant Intel a non-exclusive, royalty-free license to any patent claim thereafter drafted which includes subject matter disclosed herein. No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document. The products described may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Intel disclaims all express and implied warranties, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non-infringement, as well as any warranty arising from course of performance, course of dealing, or usage in trade. This document contains information on products, services and/or processes in development. All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest forecast, schedule, specifications and roadmaps. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked "reserved" or "undefined". Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The information here is subject to change without notice. Do not finalize a design with this information. 3D XPoint, Intel, the Intel logo, Intel Core, Intel Xeon Phi, Optane and Xeon are trademarks of Intel Corporation in the U.S. and/or other countries. * Other names and brands may be claimed as the property of others. © 2016 Intel Corporation 2 Introductions Lustre Overview Roadmap Deep Dive 10:30 Break Lustre on ZFS Update Lunch (provided) Intel® Omni-Path with Lustre Knights Landing with Lustre Introduction to Intel® HPC Orchestrator Lustre Performance Tuning Review 3 Intel® Scalable System Framework A Holistic Solution for All HPC Needs Small Clusters Through Supercomputers Compute Memory / Storage Compute and Data-Centric Computing Fabric Software Standards-Based Programmability On-Premise and Cloud-Based Intel® Solutions for Lustre* Intel® HPC Orchestrator Intel® Xeon® Processors Intel® Omni-Path Architecture Intel® Optane™ Technology Intel® Software Tools Intel® Xeon Phi™ Processors Intel® Silicon Photonics 3D XPoint™ Technology Intel® Cluster Ready Program Intel® FPGAs and Server Solutions Intel® Ethernet Intel® SSDs Intel Supported SDVis 4 Lets go around the room! 5 71% 9 of Top10 Sites 71% of Top100 Most Adopted PFS Most Scalable PFS Open Source GPL v2 18% Commercial Packaging Vibrant Community 4% 7% Lustre GPFS NFS Other December 2015: Intel’s Analysis of Top 100 Systems (top100.org) 6 Commit per Organization Lines of codes per organization 2% 1% 1% 1%1% 2% 2% 2% 2% 3% 2% 3% 4% 6% 6% 18% 8% Intel Intel 65% 65% Intel ORNL* Seagate* Cray* DDN* Intel ORNL Cray Atos Seagate Atos* LLNL* CEA* IU Other DDN IU CEA Other 1 Source: Chris Morrone, Lead of OpenSFS Lustre Working Group, April 2016 7 Bioscience Government research and defense Large-scale manufacturing Genomic data analysis, modeling and Government funded research. Surveillance, Mechanical, computer-aided design & simulations Signal Processing, encryption etc. computer-aided engineering systems Weather and climate Energy Finance Highly complex CGI rendering Seismic processing, reservoir modeling / Fraud detection, Monte Carlo simulations, characterization, sensor data analysis risk management analysis * Other names and brands may be claimed as the property of others. 8 Intel® Scalable System Framework for HPC Intel® FOUNDATION Edition Intel® ENTERPRISE Edition Intel® CLOUD Edition for Lustre* software for Lustre* software for Lustre* software Delivers the latest functions and Maximum performance with minimal Cost-effective access to parallel features, fully supported by Intel complexity and cost for multi- storage on Amazon Web Services* Ideal for organizations that prefer to petabyte file system. Management (AWS) and Microsoft Azure* to boost design and deploy their own open with Intel® Manager for Lustre* cloud-computing source configurations software * Other names and brands may be claimed as the property of others. 9 Read/White OST Heat Map Balance Metadata Read/White Operations Bandwidth 10 * Other names and brands may be claimed as the property of others. 11 * Other names and brands may be claimed as the property of others. 12 Management Object Storage Object Storage Target (MGT) MetadataTarget (MDT) Targets (OSTs) Targets (OSTs) Metadata Object Storage Servers Servers (1-10s) (10s-1000s) Management Network Intel Manager for Lustre High Performance Data Network (Infiniband*, 10GbE) Lustre Clients (1 – 100,000+) Native Lustre* Client for Intel® Xeon Phi™ processor Intel® Omni-Path Support Robin Hood OpenZFS, RAIDz Hadoop* Adapters HSM * Other names and brands may be claimed as the property of others. 13 14 15 Lustre w/ZFS – Unique Features ZFS System Design Software Installation Lustre ZFS HA Overview 16 Raidz2: Data+2 parity data protection scheme Raidz3: Data+3 parity data protection scheme Vdev: Collection of devices (eg: raidz2 9+2 Vdev) Zpool: Collection of vdevs Zpools become Lustre OSTs You can have many devs in a zpool L2arc cache: ZFS Read Cache 17 Incredible reliability – Data is always consistent on disk; silent data corruption is detected and corrected; smart rebuild strategy Compression – Maximize usable capacity for increased ROI Snapshot – support built into Lustre – Consistent snapshot across all the storage targets without stopping the file system. Hybrid Storage Pool – Data is tiered automatically across DRAM, SSD/NVMe and HDD accelerating random & small file read performance Manageability – Powerful storage pool management makes it easy to assemble and maintain Lustre storage targets from individual devices 18 Silent Data Corruption is a real world issue: “Data ~= Dada” Causes: Interface Design Manufacturing Defects Cable Defects Heat/Power/Vibrations Software defects Netapp Study* : 1.5 Million Drives: 41 Months:400,000 Errors * https://atg.netapp.com/wp-content/uploads/2008/03/corruption-fast08.pdf 19 On Write: Write data + checksum On Read: Read data and re-compute checksum then compare to original On Error: If running zRaid discard read and recalculate from VDEV Notify user and continue on 20 Enable more space allocation to users minimizes hardware costs more data in the same footprint Increase the file transfer rate Increase throughput by up to 25% See Laval University’s presentation from HP CAST 2015: http://www.hp-cast.org/ Compression effects on genomics files Text based output of genomic sequence systems Human genome can generate 600GB file size 21 How Can Lustre* Snapshots Be Used? Undo/undelete/recover file(s) from the snapshot . Removed file by mistake, application failure causes data invalid Quickly backup the filesystem before system upgrade . Upgrade Lustre/kernel may hit some trouble and need to roll back Prepare a consistent frozen data view for backup tools . Ensure system is consistent for the whole backup Intel, the Intel logo, Xeon, and others are trademarks of Intel Corporation in the U.S. and/or other countries. 22 * Other names and brands may be claimed as the property of others. © 2016 Intel Corporation ZFS-based Lustre* Snapshot Overview . ZFS snapshot created on each target with a new fsname . Mount as separate read-only Lustre filesystem on client(s) . Architecture details: http://wiki.lustre.org/Lustre_Snapshots commands lctl API lctl snapshot Lustre ZFS control control MGS Lustre kernel Userspace MDSs Lustre kernel ZFS tools set Lustre kernel ZFS tools set OSSs Intel, the Intel logo, Xeon, and others are trademarks of Intel Corporation in the U.S. and/or other countries. 23 * Other names and brands may be claimed as the property of others. © 2016 Intel Corporation Global Write Barrier “Freeze” the system during creating snapshot pieces on every target. Write barrier on MDTs only . No orphans, no dangling references New lctl commands for the global write barrier . lctl barrier_freeze <fsname> [timeout (seconds)] . lctl barrier_thaw <fsname> . lctl barrier_stat <fsname> Intel, the Intel logo, Xeon, and others are trademarks of Intel Corporation in the U.S. and/or other countries. 24 * Other names and brands may be claimed as the property of others.
Recommended publications
  • Elastic Storage for Linux on System Z IBM Research & Development Germany
    Peter Münch T/L Test and Development – Elastic Storage for Linux on System z IBM Research & Development Germany Elastic Storage for Linux on IBM System z © Copyright IBM Corporation 2014 9.0 Elastic Storage for Linux on System z Session objectives • This presentation introduces the Elastic Storage, based on General Parallel File System technology that will be available for Linux on IBM System z. Understand the concepts of Elastic Storage and which functions will be available for Linux on System z. Learn how you can integrate and benefit from the Elastic Storage in a Linux on System z environment. Finally, get your first impression in a live demo of Elastic Storage. 2 © Copyright IBM Corporation 2014 Elastic Storage for Linux on System z Trademarks The following are trademarks of the International Business Machines Corporation in the United States and/or other countries. AIX* FlashSystem Storwize* Tivoli* DB2* IBM* System p* WebSphere* DS8000* IBM (logo)* System x* XIV* ECKD MQSeries* System z* z/VM* * Registered trademarks of IBM Corporation The following are trademarks or registered trademarks of other companies. Adobe, the Adobe logo, PostScript, and the PostScript logo are either registered trademarks or trademarks of Adobe Systems Incorporated in the United States, and/or other countries. Cell Broadband Engine is a trademark of Sony Computer Entertainment, Inc. in the United States, other countries, or both and is used under license therefrom. Intel, Intel logo, Intel Inside, Intel Inside logo, Intel Centrino, Intel Centrino logo, Celeron, Intel Xeon, Intel SpeedStep, Itanium, and Pentium are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries.
    [Show full text]
  • IBM Spectrum Scale CSI Driver for Container Persistent Storage
    Front cover IBM Spectrum Scale CSI Driver for Container Persistent Storage Abhishek Jain Andrew Beattie Daniel de Souza Casali Deepak Ghuge Harald Seipp Kedar Karmarkar Muthu Muthiah Pravin P. Kudav Sandeep R. Patil Smita Raut Yadavendra Yadav Redpaper IBM Redbooks IBM Spectrum Scale CSI Driver for Container Persistent Storage April 2020 REDP-5589-00 Note: Before using this information and the product it supports, read the information in “Notices” on page ix. First Edition (April 2020) This edition applies to Version 5, Release 0, Modification 4 of IBM Spectrum Scale. © Copyright International Business Machines Corporation 2020. Note to U.S. Government Users Restricted Rights -- Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. iii iv IBM Spectrum Scale CSI Driver for Container Persistent Storage Contents Notices . vii Trademarks . viii Preface . ix Authors. ix Now you can become a published author, too . xi Comments welcome. xii Stay connected to IBM Redbooks . xii Chapter 1. IBM Spectrum Scale and Containers Introduction . 1 1.1 Abstract . 2 1.2 Assumptions . 2 1.3 Key concepts and terminology . 3 1.3.1 IBM Spectrum Scale . 3 1.3.2 Container runtime . 3 1.3.3 Container Orchestration System. 4 1.4 Introduction to persistent storage for containers Flex volumes. 4 1.4.1 Static provisioning. 4 1.4.2 Dynamic provisioning . 4 1.4.3 Container Storage Interface (CSI). 5 1.4.4 Advantages of using IBM Spectrum Scale storage for containers . 5 Chapter 2. Architecture of IBM Spectrum Scale CSI Driver . 7 2.1 CSI Component Overview. 8 2.2 IBM Spectrum Scale CSI driver architecture.
    [Show full text]
  • File Systems and Storage
    FILE SYSTEMS AND STORAGE On Making GPFS Truly General DEAN HILDEBRAND AND FRANK SCHMUCK Dean Hildebrand manages the PFS (also called IBM Spectrum Scale) began as a research project Cloud Storage Software team that quickly found its groove supporting high performance comput- at the IBM Almaden Research ing (HPC) applications [1, 2]. Over the last 15 years, GPFS branched Center and is a recognized G expert in the field of distributed out to embrace general file-serving workloads while maintaining its original and parallel file systems. He pioneered pNFS, distributed design. This article gives a brief overview of the origins of numer- demonstrating the feasibility of providing ous features that we and many others at IBM have implemented to make standard and scalable access to any file GPFS a truly general file system. system. He received a BSc degree in computer science from the University of British Columbia Early Days in 1998 and a PhD in computer science from Following its origins as a project focused on high-performance lossless streaming of multi- the University of Michigan in 2007. media video files, GPFS was soon enhanced to support high performance computing (HPC) [email protected] applications, to become the “General Parallel File System.” One of its first large deployments was on ASCI White in 2002—at the time, the fastest supercomputer in the world. This HPC- Frank Schmuck joined IBM focused architecture is described in more detail in a 2002 FAST paper [3], but from the outset Research in 1988 after receiving an important design goal was to support general workloads through a standard POSIX inter- a PhD in computer science face—and live up to the “General” term in the name.
    [Show full text]
  • Filesystems HOWTO Filesystems HOWTO Table of Contents Filesystems HOWTO
    Filesystems HOWTO Filesystems HOWTO Table of Contents Filesystems HOWTO..........................................................................................................................................1 Martin Hinner < [email protected]>, http://martin.hinner.info............................................................1 1. Introduction..........................................................................................................................................1 2. Volumes...............................................................................................................................................1 3. DOS FAT 12/16/32, VFAT.................................................................................................................2 4. High Performance FileSystem (HPFS)................................................................................................2 5. New Technology FileSystem (NTFS).................................................................................................2 6. Extended filesystems (Ext, Ext2, Ext3)...............................................................................................2 7. Macintosh Hierarchical Filesystem − HFS..........................................................................................3 8. ISO 9660 − CD−ROM filesystem.......................................................................................................3 9. Other filesystems.................................................................................................................................3
    [Show full text]
  • A Sense of Time for Node.Js: Timeouts As a Cure for Event Handler Poisoning
    A Sense of Time for Node.js: Timeouts as a Cure for Event Handler Poisoning Anonymous Abstract—The software development community has begun to new Denial of Service attack that can be used against EDA- adopt the Event-Driven Architecture (EDA) to provide scalable based services. Our Event Handler Poisoning attack exploits web services. Though the Event-Driven Architecture can offer the most important limited resource in the EDA: the Event better scalability than the One Thread Per Client Architecture, Handlers themselves. its use exposes service providers to a Denial of Service attack that we call Event Handler Poisoning (EHP). The source of the EDA’s scalability is also its Achilles’ heel. Multiplexing unrelated work onto the same thread re- This work is the first to define EHP attacks. After examining EHP vulnerabilities in the popular Node.js EDA framework and duces overhead, but it also moves the burden of time sharing open-source npm modules, we explore various solutions to EHP- out of the thread library or operating system and into the safety. For a practical defense against EHP attacks, we propose application itself. Where OTPCA-based services can rely on Node.cure, which defends a large class of Node.js applications preemptive multitasking to ensure that resources are shared against all known EHP attacks by making timeouts a first-class fairly, using the EDA requires the service to enforce its own member of the JavaScript language and the Node.js framework. cooperative multitasking [89]. An EHP attack identifies a way to defeat the cooperative multitasking used by an EDA-based Our evaluation shows that Node.cure is effective, broadly applicable, and offers strong security guarantees.
    [Show full text]
  • Shared File Systems: Determining the Best Choice for Your Distributed SAS® Foundation Applications Margaret Crevar, SAS Institute Inc., Cary, NC
    Paper SAS569-2017 Shared File Systems: Determining the Best Choice for your Distributed SAS® Foundation Applications Margaret Crevar, SAS Institute Inc., Cary, NC ABSTRACT If you are planning on deploying SAS® Grid Manager and SAS® Enterprise BI (or other distributed SAS® Foundation applications) with load balanced servers on multiple operating systems instances, , a shared file system is required. In order to determine the best shared file system choice for a given deployment, it is important to understand how the file system is used, the SAS® I/O workload characteristics performed on it, and the stressors that SAS Foundation applications produce on the file system. For the purposes of this paper, we use the term "shared file system" to mean both a clustered file system and shared file system, even though" shared" can denote a network file system and a distributed file system – not clustered. INTRODUCTION This paper examines the shared file systems that are most commonly used with SAS and reviews their strengths and weaknesses. SAS GRID COMPUTING REQUIREMENTS FOR SHARED FILE SYSTEMS Before we get into the reasons why a shared file system is needed for SAS® Grid Computing, let’s briefly discuss the SAS I/O characteristics. GENERAL SAS I/O CHARACTERISTICS SAS Foundation creates a high volume of predominately large-block, sequential access I/O, generally at block sizes of 64K, 128K, or 256K, and the interactions with data storage are significantly different from typical interactive applications and RDBMSs. Here are some major points to understand (more details about the bullets below can be found in this paper): SAS tends to perform large sequential Reads and Writes.
    [Show full text]
  • Comparative Analysis of Distributed and Parallel File Systems' Internal Techniques
    Comparative Analysis of Distributed and Parallel File Systems’ Internal Techniques Viacheslav Dubeyko Content 1 TERMINOLOGY AND ABBREVIATIONS ................................................................................ 4 2 INTRODUCTION......................................................................................................................... 5 3 COMPARATIVE ANALYSIS METHODOLOGY ....................................................................... 5 4 FILE SYSTEM FEATURES CLASSIFICATION ........................................................................ 5 4.1 Distributed File Systems ............................................................................................................................ 6 4.1.1 HDFS ..................................................................................................................................................... 6 4.1.2 GFS (Google File System) ....................................................................................................................... 7 4.1.3 InterMezzo ............................................................................................................................................ 9 4.1.4 CodA .................................................................................................................................................... 10 4.1.5 Ceph.................................................................................................................................................... 12 4.1.6 DDFS ..................................................................................................................................................
    [Show full text]
  • Zfs-Ascalabledistributedfilesystemusingobjectdisks
    zFS-AScalableDistributedFileSystemUsingObjectDisks Ohad Rodeh Avi Teperman [email protected] [email protected] IBM Labs, Haifa University, Mount Carmel, Haifa 31905, Israel. Abstract retrieves the data block from the remote machine. zFS also uses distributed transactions and leases, instead of group- zFS is a research project aimed at building a decentral- communication and clustering software. We intend to test ized file system that distributes all aspects of file and stor- and show the effectiveness of these two features in our pro- age management over a set of cooperating machines inter- totype. connected by a high-speed network. zFS is designed to be zFS has six components: a Front End (FE), a Cooper- a file system that scales from a few networked computers to ative Cache (Cache), a File Manager (FMGR), a Lease several thousand machines and to be built from commodity Manager (LMGR), a Transaction Server (TSVR), and an off-the-shelf components. Object Store (OSD). These components work together to The two most prominent features of zFS are its coop- provide applications/users with a distributed file system. erative cache and distributed transactions. zFS integrates The design of zFS addresses, and is influenced by, issues the memory of all participating machines into one coher- of fault tolerance, security and backup/mirroring. How- ent cache. Thus, instead of going to the disk for a block ever, in this article, we focus on the zFS high-level archi- of data already in one of the machine memories, zFS re- tecture and briefly describe zFS’s fault tolerance character- trieves the data block from the remote machine.
    [Show full text]
  • ZFS Basics Various ZFS RAID Lebels = & . = a Little About Zetabyte File System ( ZFS / Openzfs)
    = BDNOG11 | Cox's Bazar | Day # 3 | ZFS Basics & Various ZFS RAID Lebels. = = ZFS Basics & Various ZFS RAID Lebels . = A Little About Zetabyte File System ( ZFS / OpenZFS) =1= = BDNOG11 | Cox's Bazar | Day # 3 | ZFS Basics & Various ZFS RAID Lebels. = Disclaimer: The scope of this topic here is not to discuss in detail about the architecture of the ZFS (OpenZFS) rather Features, Use Cases and Operational Method. =2= = BDNOG11 | Cox's Bazar | Day # 3 | ZFS Basics & Various ZFS RAID Lebels. = What is ZFS? =0= ZFS is a combined file system and logical volume manager designed by Sun Microsystems and now owned by Oracle Corporation. It was designed and implemented by a team at Sun Microsystems led by Jeff Bonwick and Matthew Ahrens. Matt is the founding member of OpenZFS. =0= Its development started in 2001 and it was officially announced in 2004. In 2005 it was integrated into the main trunk of Solaris and released as part of OpenSolaris. =0= It was described by one analyst as "the only proven Open Source data-validating enterprise file system". This file =3= = BDNOG11 | Cox's Bazar | Day # 3 | ZFS Basics & Various ZFS RAID Lebels. = system also termed as the “world’s safest file system” by some analyst. =0= ZFS is scalable, and includes extensive protection against data corruption, support for high storage capacities, efficient data compression and deduplication. =0= ZFS is available for Solaris (and its variants), BSD (and its variants) & Linux (Canonical’s Ubuntu Integrated as native kernel module from version 16.04x) =0= OpenZFS was announced in September 2013 as the truly open source successor to the ZFS project.
    [Show full text]
  • Lessons Learned from Using Samba in IBM Spectrum Scale
    Lessons learned from using Samba in IBM Spectrum Scale Christof Schmitt [email protected] [email protected] IBM / Samba Team sambaXP 2020 Christof Schmitt (IBM / Samba Team) Using Samba in IBM Spectrum Scale sambaXP 2020 1 / 33 Table of Contents 1 Integrating Samba in IBM Spectrum Scale product 2 Examples of issues faced 3 Lessons learned Christof Schmitt (IBM / Samba Team) Using Samba in IBM Spectrum Scale sambaXP 2020 2 / 33 Abstract IBM Spectrum Scale is a software defined storage offering of a clustered file system bundled together with other services. Samba is included as part of the product to provide a clustered SMB file server and integration into Active Directory. This talk discusses from a development point of view the integration of Samba into a storage product and what the development team has learned over the years. Topics will include the requirement for automated testing on multiple levels and the collaboration with the upstream Samba project. Examples will be used to illustrate problems encountered over time and how they have been solved. Further topics will be challenges that have been solved and gaps that have been seen with the usage of Samba. Christof Schmitt (IBM / Samba Team) Using Samba in IBM Spectrum Scale sambaXP 2020 3 / 33 About me Senior Software Engineer at IBM Currently working on Samba integration in IBM Spectrum Scale Customer support Samba team member Previous roles: Samba integration and support in other products. Linux device driver maintenance. Christof Schmitt (IBM / Samba Team) Using Samba in IBM Spectrum Scale sambaXP 2020 4 / 33 Table of Contents 1 Integrating Samba in IBM Spectrum Scale product 2 Examples of issues faced 3 Lessons learned Christof Schmitt (IBM / Samba Team) Using Samba in IBM Spectrum Scale sambaXP 2020 5 / 33 Context: IBM Spectrum Scale Clustered file system as software defined storage (former name GPFS).
    [Show full text]
  • Implementing High Availability and Disaster Recovery Solutions with SAP HANA on IBM Power Systems
    Front cover Implementing High Availability and Disaster Recovery Solutions with SAP HANA on IBM Power Systems Dino Quintero Luis Bolinches Rodrigo Ceron Ferreira de Castro Fabio Martins John Wright Redpaper International Technical Support Organization Implementing High Availability and Disaster Recovery Solutions with SAP HANA on IBM Power Systems October 2017 REDP-5443-00 Note: Before using this information and the product it supports, read the information in “Notices” on page vii. First Edition (October 2017) This edition applies to: SAP HANA V2.0 SPS01 revision 10 (2.00.010.00.1491294693) SUSE Linux Enterprise Server for SAP Applications V12 SP2 IBM Virtual I/O Server (VIOS): V2.2.4.23 IBM Hardware Management Console (HMC): V8.8.6.0 SP1 + MH01703 IBM Spectrum Scale V4.2.3.1 © Copyright International Business Machines Corporation 2017. All rights reserved. Note to U.S. Government Users Restricted Rights -- Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents Notices . vii Trademarks . viii Preface . ix Authors. ix Now you can become a published author, too! . xi Comments welcome. xi Stay connected to IBM Redbooks . xi Chapter 1. Introduction. 1 1.1 About this publication . 2 1.2 The SAP HANA platform. 2 1.3 High availability for HANA . 3 1.3.1 Disaster recovery: SAP HANA Systems Replication . 5 1.3.2 High availability: SAP HANA Host Auto-Failover . 6 1.3.3 High availability: SAP HANA Systems Replication with SUSE Linux High Availability Extension . 8 Chapter 2. Planning your installation . 11 2.1 SAP requirements for HANA on Power Systems implementations.
    [Show full text]
  • So You Think You Understand Multi-Instance Queue Managers?
    SoSo YouYou ThinkThink YouYou UnderstandUnderstand MultiMulti --InstanceInstance QueueQueue Managers?Managers? Christopher Frank IBM WebSphere Connectivity [email protected] Capitalware's MQ Technical Conference v2.0.1.4 Introduction Topics to be covered in this presentation: Brief overview of multi-instance queue manager support and its role as an HA solution How to validate the file system for use with multi-instance queue managers File system requirements and why they are what they are Queue manager status How file locking is used Liveness checking Troubleshooting Summary Capitalware's MQ Technical Conference v2.0.1.4 High Availability Options for MQ High availability managers Products designed to provide comprehensive system high availability Can cover multiple products – MQ, IIB, DB2, Oracle, WAS etc. Requires an HA manager such as HACMP for AIX ServiceGuard for HP Solaris Cluster Veritas Cluster Server MSCS for Windows Server Linux-HA Multi-instance support for MQ and IBM Integration Bus Provides basic failover for MQ and WMB/IIB only Software only Comes out of the box – no external HA coordinator needed Capitalware's MQ Technical Conference v2.0.1.4 HA Cluster Coordination behavior (1) 1. Normal Execution Client Client Application Application network HA coordinator IP Address Machine A Machine B can fail-over QM1 shared disks Critical data persisted on shared disks Capitalware's MQ Technical Conference v2.0.1.4 HA Cluster Coordination behavior (2) 2. Disaster strikes Client Client Application Application network HA coordinator Machine A Machine B Fails-over QM1 shared disks Capitalware's MQ Technical Conference v2.0.1.4 HA Cluster Coordination behavior (3) 3.
    [Show full text]