HPC Solution on IBM POWER8
Total Page:16
File Type:pdf, Size:1020Kb
Front cover Implementing an IBM High-Performance Computing Solution on IBM POWER8 Dino Quintero Wei Li Wainer dos Santos Moschetta Mauricio Faria de Oliveira Alexander Pozdneev Redbooks International Technical Support Organization Implementing an IBM High-Performance Computing Solution on IBM POWER8 September 2015 SG24-8263-00 Note: Before using this information and the product it supports, read the information in “Notices” on page vii. First Edition (September 2015) This edition applies to IBM Spectrum Scale 4.1.0-6, Ubuntu Server 14.04.1 LTS, xCAT 2.9, Mellanox OFED 2.3-2.0.0, XLC 13.1.1-0, XL Fortran 15.1.1-0, IBM Advance Toolchain 8.0 (provides GCC-4.9.2), Parallel Environment Runtime 2.1.0.0, Parallel Environment Developer Edition 2.1.0.0, ESSL 5.3.1-0, PESSL 5.1.0-0, IBM JDK 7.1-2.0, Platform LSF 9.1.3, and NVIDIA CUDA Toolkit 5.5-54 (on Ubuntu 14.10). © Copyright International Business Machines Corporation 2015. All rights reserved. Note to U.S. Government Users Restricted Rights -- Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents Notices . vii Trademarks . viii IBM Redbooks promotions . ix Preface . xi Authors. xi Now you can become a published author, too! . xii Comments welcome. xii Stay connected to IBM Redbooks . xiii Chapter 1. Introduction. 1 1.1 Why implement an IBM HPC solution on IBM POWER8 . 2 1.2 IBM Power System S824L . 3 1.2.1 The IBM POWER8 processor chip . 4 1.2.2 Memory subsystem. 9 1.3 High-performance Interconnect. 11 1.4 Storage . 11 1.5 Software stack. 12 Chapter 2. Planning for your high-performance computing environment. 15 2.1 Reference architecture . 16 2.1.1 Hardware . 17 2.1.2 Firmware . 17 2.1.3 Software . 17 2.1.4 Network. 21 2.1.5 Storage . 21 2.2 Power, packaging, and cooling . 22 Chapter 3. Software deployment and configuration . 25 3.1 Hardware, firmware, and the software stack. 26 3.2 OPAL firmware and the ASM interface . 27 3.2.1 Power off. 27 3.2.2 Set firmware type . 27 3.2.3 Reset the FSP. 28 3.3 Intelligent Platform Management Interface (IPMI). 28 3.4 xCAT overview . 30 3.4.1 xCAT cluster: Nodes and networks. 30 3.4.2 xCAT database: Tables and objects . 31 3.4.3 xCAT node booting . 32 3.4.4 xCAT node discovery . 33 3.4.5 xCAT FSP discovery. 33 3.4.6 xCAT operating system installation types: Disks and state. 34 3.4.7 xCAT network adapters: Primary and secondary or additional. 34 3.4.8 xCAT Software Kits. 34 3.4.9 xCAT version . 35 3.4.10 xCAT cluster scenario: Networks, IP addresses, and hostnames . 35 3.5 xCAT management node . 36 3.5.1 Obtain super-user privileges . 36 © Copyright IBM Corp. 2015. All rights reserved. iii 3.5.2 Install xCAT packages . 36 3.5.3 Configure the xCAT networks . 39 3.5.4 Configure the xCAT management node host name . 41 3.5.5 Configure the xCAT DNS server. 42 3.5.6 Configure the xCAT DHCP server . 44 3.5.7 Configure the xCAT IPMI password . 46 3.5.8 Configure the xCAT console server . 47 3.6 xCAT node discovery . 47 3.6.1 Define the xCAT nodes. 47 3.6.2 Generate the boot image for node discovery . 50 3.6.3 Configure bootloader for node discovery and booting . 52 3.6.4 Configure the FSP network for node discovery. 54 3.6.5 Configure the xCAT FSP nodes and compute nodes for node discovery. 56 3.6.6 Perform the node discovery process. 58 3.7 xCAT compute nodes: Operating system . 63 3.7.1 Set node attributes for operating system network installation. 63 3.7.2 Set the root password for nodes . 64 3.7.3 Create the operating system image object definitions . 65 3.7.4 Download the netboot files . 65 3.7.5 Create a package repository for installer modules that are compatible with the netboot files . 66 3.7.6 Set the operating system image of the nodes. 68 3.7.7 Start the operating system provisioning . 69 3.8 xCAT compute nodes: Software stack . 72 3.8.1 Configure nodes for access to the Internet . 72 3.8.2 Check for errors in package updates . 78 3.8.3 Mellanox InfiniBand. 79 3.8.4 XL C/C++ compiler . 83 3.8.5 XL Fortran compiler . 88 3.8.6 Parallel Environment Runtime Edition (PE RTE) . 89 3.8.7 Parallel Environment Developer Edition (PE DE) . ..