IBM System P5 Approaches to 24X7 Availability Including AIX 5L
Total Page:16
File Type:pdf, Size:1020Kb
Front cover IBM System p5 Approaches to 24x7 Availability Including AIX 5L Planning and maintaining the IBM System p5 server for availability Service processor and HMC RAS features explained Including hardware, service processor, firmware, HMC, AIX 5L, and the VIO server Bruno Blanchard Steve Edwards Brad Gough Hans Mozes ibm.com/redbooks International Technical Support Organization IBM System p5 Approaches to 24x7 Availability Including AIX 5L August 2006 SG24-7196-00 Note: Before using this information and the product it supports, read the information in “Notices” on page xvii. First Edition (August 2006) This edition applies to: IBM eServer p5 570, IBM eServer p5 590, IBM eServer p5 595, and similar models of POWER5 processor-based systems, using LIC version SF230_145 and above. HMC Model 7310-CR2 installed with code at level Version 4 Release 5.0 and later. AIX 5L, Version 5.3 at Maintenance Level 3 (Note: Levels of AIX starting with ML3 are known as Technology Levels). Note: This book is based on a pre-GA version of a product and may not apply when the product becomes generally available. We recommend that you consult the product documentation or follow-on versions of this redbook for more current information. © Copyright International Business Machines Corporation 2006. All rights reserved. Note to U.S. Government Users Restricted Rights -- Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents Figures . ix Tables . xv Notices . xvii Trademarks . xviii Preface . xix The team that wrote this redbook. xix Become a published author . xxi Comments welcome. xxi Chapter 1. Overview, concepts, and scope . 1 1.1 24x7 availability . 2 1.2 Scope of this redbook . 4 1.3 Introduction to the chapters. 5 Chapter 2. Reliability, availability, and serviceability. 7 2.1 RAS overview . 8 2.2 Recent additions to RAS . 9 2.3 Redundant and highly available components . 10 2.3.1 Power and cooling. 10 2.3.2 Memory . 10 2.3.3 IBM System p5 processor . 11 2.3.4 POWER Hypervisor . 12 2.3.5 Service processor and clocks . 12 2.3.6 The I/O subsystem . 14 2.4 Availability, fault detection, and isolation. 15 2.4.1 Memory . 16 2.4.2 Processor . 21 2.4.3 First Failure Data Capture. 23 2.4.4 The I/O subsytem and PCI bus error recovery . 24 2.5 Serviceability . 26 2.5.1 Environment . 27 2.5.2 Operator panel and light strip . 27 2.5.3 Service processor . 28 2.5.4 Diagnostics . 28 2.5.5 Integrated Virtualization Manager . 28 2.5.6 Error log analysis . 29 © Copyright IBM Corp. 2006. All rights reserved. iii 2.5.7 Service Focal Point . 30 2.5.8 Service Agent . 31 2.5.9 Hardware Information Center . 34 2.5.10 Blind-swap PCI adapters. 34 2.6 System dumps. 36 Chapter 3. Site planning . 39 3.1 Physical site planning . 40 3.1.1 Site inspection. 40 3.1.2 Electrical power planning . 41 3.1.3 Raised floor requirements . 44 3.1.4 Service clearance and floor loading . 47 3.1.5 Vibration and shock. 49 3.2 System specifications . 51 3.2.1 Power planning p5-570 . 53 3.2.2 Power planning p5-590 and p5-595 . 59 3.2.3 Physical package p5-570 . 62 3.2.4 Physical package p5-590 and p5-595. 65 3.2.5 Maintenance clearances . 68 3.2.6 Air conditioning and cooling . 69 3.2.7 Optional Rear Door Heat eXchanger (FC 6858) . 74 Chapter 4. Server hardware planning . 77 4.1 Planning for PCI placement. 78 4.2 Internal I/O subsystem of the p5-570 . 78 4.2.1 PCI-X slots and adapters . 78 4.2.2 EEH adapters and partitioning . 79 4.2.3 Remote I/O setup . 80 4.2.4 7311 Model D10 and 7311 Model D11 I/O drawers . 80 4.2.5 7311 Model D20 I/O drawer . 82 4.2.6 7311 I/O drawer and RIO-2 cabling . 83 4.2.7 7311 I/O drawer and SPCN cabling . 85 4.3 p5-590 and p5-595 I/O . 87 4.3.1 I/O drawer attachment. 88 4.3.2 Supported PCI I/O adapters . 95 4.4 InfiniBand attachment . 95 4.4.1 Application clustering . 98 4.4.2 Interprocessor communication . 98 4.4.3 Storage area networks . 99 4.4.4 InfiniBand technical overview . 99 4.4.5 InfiniBand layers . 100 4.4.6 Physical layer . 100 4.4.7 Link layer. 101 iv IBM System p5 Approaches to 24x7 Availability Including AIX 5L 4.4.8 Network layer . 103 4.4.9 Transport layer . 103 4.4.10 InfiniBand elements. 104 4.4.11 InfiniBand architecture . 104 4.4.12 Channel adapters . 104 4.4.13 Switch . 105 4.4.14 Host Channel Adapters . ..