IBM Power Systems 775 HPC Solution
Total Page:16
File Type:pdf, Size:1020Kb
Front cover IBM Power Systems 775 for AIX and Linux HPC Solution Unleashes computing power for HPC workloads Provides architectural solution overview Contains sample scenarios Dino Quintero Kerry Bosworth Puneet Chaudhary Rodrigo Garcia da Silva ByungUn Ha Jose Higino Marc-Eric Kahle Tsuyoshi Kamenoue James Pearson Mark Perez Fernando Pizzano Robert Simon Kai Sun ibm.com/redbooks International Technical Support Organization IBM Power Systems 775 for AIX and Linux HPC Solution October 2012 SG24-8003-00 Note: Before using this information and the product it supports, read the information in “Notices” on page vii. First Edition (October 2012) This edition applies to IBM AIX 7.1, xCAT 2.6.6, IBM GPFS 3.4, IBM LoadLelever, Parallel Environment Runtime Edition for AIX V1.1. © Copyright International Business Machines Corporation 2012. All rights reserved. Note to U.S. Government Users Restricted Rights -- Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents Figures . vii Tables . xi Examples . xiii Notices . xvii Trademarks . xviii Preface . xix The team who wrote this book . xix Now you can become a published author, too! . xxi Comments welcome. xxii Stay connected to IBM Redbooks . xxii Chapter 1. Understanding the IBM Power Systems 775 Cluster. 1 1.1 Overview of the IBM Power System 775 Supercomputer . 2 1.2 Advantages and new features of the IBM Power 775 . 3 1.3 Hardware information . 4 1.3.1 POWER7 chip. 4 1.3.2 I/O hub chip. 10 1.3.3 Collective acceleration unit (CAU) . 12 1.3.4 Nest memory management unit (NMMU) . 14 1.3.5 Integrated switch router (ISR) . 15 1.3.6 SuperNOVA . 16 1.3.7 Hub module. 17 1.3.8 Memory subsystem. 20 1.3.9 Quad chip module (QCM) . 21 1.3.10 Octant . 22 1.3.11 Interconnect levels . 25 1.3.12 Node . 25 1.3.13 Supernodes. 28 1.3.14 System . 31 1.4 Power, packaging and cooling . 38 1.4.1 Frame . 39 1.4.2 Bulk Power and Control Assembly (BPCA). 41 1.4.3 Bulk Power Control and Communications Hub (BPCH) . 43 1.4.4 Bulk Power Regulator (BPR). 43 1.4.5 Water Conditioning Unit (WCU) . 44 1.5 Disk enclosure (Rodrigo). 47 1.5.1 Overview . 47 1.5.2 High level description . 48 1.5.3 Configuration. 50 1.6 Cluster management. 52 1.6.1 HMC . 52 1.6.2 EMS . 52 1.6.3 Service node . 53 1.6.4 Server and management networks . 54 1.6.5 Data flow . 55 © Copyright IBM Corp. 2012. All rights reserved. iii 1.6.6 LPARs. 56 1.6.7 Utility nodes . 56 1.6.8 GPFS I/O nodes . 58 1.7 Typical connection scenario between EMS, HMC, Frame . 58 1.8 Software stack. 59 1.8.1 ISNM . 62 1.8.2 DB2 . 70 1.8.3 Extreme Cluster Administration Toolkit (xCAT). 70 1.8.4 Toolkit for Event Analysis and Logging (TEAL). 73 1.8.5 Reliable Scalable Cluster Technology (RSCT) . 73 1.8.6 GPFS . 74 1.8.7 IBM Parallel Environment . 85 1.8.8 LoadLeveler . 90 1.8.9 ESSL. 91 1.8.10 Parallel ESSL . 92 1.8.11 Compilers . 92 1.8.12 Parallel Tools Platform (PTP) . 95 Chapter 2. Application integration. 97 2.1 Power 775 diskless considerations . 98 2.1.1 Stateless vs. Statelite . 98 2.1.2 System access . 102 2.2 System capabilities . 103 2.3 Application development . ..