Using Platform LSF™ HPC

Total Page:16

File Type:pdf, Size:1020Kb

Using Platform LSF™ HPC Using Platform LSF™ HPC Version 7 Update 6 Release date: August 2009 Last modified: August 20, 2009 Support: [email protected] Comments to: [email protected] Copyright © 1994-2009, Platform Computing Inc. We’d like to hear from You can help us make this document better by telling us what you think of the content, you organization, and usefulness of the information. If you find an error, or just want to make a suggestion for improving this document, please address your comments to [email protected]. Your comments should pertain only to Platform documentation. For product support, contact [email protected]. Although the information in this document has been carefully reviewed, Platform Computing Inc. (“Platform”) does not warrant it to be free of errors or omissions. Platform reserves the right to make corrections, updates, revisions or changes to the information in this document. UNLESS OTHERWISE EXPRESSLY STATED BY PLATFORM, THE PROGRAM DESCRIBED IN THIS DOCUMENT IS PROVIDED “AS IS” AND WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. IN NO EVENT WILL PLATFORM COMPUTING BE LIABLE TO ANYONE FOR SPECIAL, COLLATERAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES, INCLUDING WITHOUT LIMITATION ANY LOST PROFITS, DATA, OR SAVINGS, ARISING OUT OF THE USE OF OR INABILITY TO USE THIS PROGRAM. Document redistribution and translation This document is protected by copyright and you may not redistribute or translate it into another language, in part or in whole. Internal redistribution You may only redistribute this document internally within your organization (for example, on an intranet) provided that you continue to check the Platform Web site for updates and update your version of the documentation. You may not make it available to your organization over the Internet. Trademarks LSF is a registered trademark of Platform Computing Inc. in the United States and in other jurisdictions. PLATFORM COMPUTING, PLATFORM SYMPHONY, PLATFORM JOBSCHEDULER, PLATFORM ENTERPRISE GRID ORCHESTRATOR, PLATFORM EGO, PLATFORM VM ORCHESTRATOR, PLATFORM VMO, ACCELERATING INTELLIGENCE, and the PLATFORM and PLATFORM LSF logos are trademarks of Platform Computing Inc. in the United States and in other jurisdictions. UNIX is a registered trademark of The Open Group in the United States and in other jurisdictions. Linux is the registered trademark of Linus Torvalds in the U.S. and other countries. Microsoft is either a registered trademark or a trademark of Microsoft Corporation in the United States and/or other countries. Windows is a registered trademark of Microsoft Corporation in the United States and other countries. Macrovision, Globetrotter, and FLEXlm are registered trademarks or trademarks of Macrovision Corporation in the United States of America and/or other countries. Topspin is a registered trademark of Topspin Communications, Inc. Intel, Itanium, and Pentium are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other products or services mentioned in this document are identified by the trademarks or service marks of their respective owners. Third Party License Agreements http://www.platform.com/legal-notices/third-party-license-agreements Contents 1 About Platform LSF HPC . 7 What Is Platform LSF HPC? . 8 LSF HPC Components . 11 2 Running Parallel Jobs . 13 blaunch Distributed Application Framework . 14 OpenMP Jobs . 23 PVM Jobs . 24 SGI Vendor MPI Support . 25 HP Vendor MPI Support . 28 LSF HPC Generic Parallel Job Launcher Framework . 30 How the Generic PJL Framework Works . 31 Integration Method 1 . 37 Integration Method 2 . 39 Tuning PAM Scalability and Fault Tolerance . 41 Running Jobs with Task Geometry . 42 Enforcing Resource Usage Limits for Parallel Tasks . 45 Example Integration: LAM/MPI . 47 Tips for Writing PJL Wrapper Scripts . 55 Other Integration Options . 57 3 Using Platform LSF HPC with HP-UX Processor Sets . 59 About HP-UX Psets . 60 Configuring LSF HPC with HP-UX Psets . 63 Using LSF HPC with HP-UX Psets . 66 4 Using Platform LSF HPC with IBM POE . 71 Running IBM POE Jobs . 72 Migrating IBM Load Leveler Job Scripts to Use LSF Options . 79 Controlling Allocation and User Authentication for IBM POE Jobs . 86 Submitting IBM POE Jobs over InfiniBand . 89 5 Using Platform LSF HPC for Linux/QsNet . 91 About Platform LSF HPC for Linux/QsNet . 92 Using Platform LSF HPC 3 Configuring Platform LSF HPC for Linux/QsNet . 95 Operating Platform LSF HPC for Linux/QsNet . 100 Submitting and Monitoring Jobs . 103 6 Using Platform LSF HPC with SGI Cpusets . 111 About SGI cpusets . 112 Configuring LSF HPC with SGI Cpusets . 115 Using LSF HPC with SGI Cpusets . 122 Using SGI Comprehensive System Accounting facility (CSA) . 132 Using SGI User Limits Database (ULDB—IRIX only) . 134 SGI Job Container and Process Aggregate Support . 136 7 Using Platform LSF HPC with LAM/MPI . 139 About Platform LSF HPC and LAM/MPI . 140 Configuring LSF HPC to work with LAM/MPI . 142 Submitting LAM/MPI Jobs . 143 8 Using Platform LSF HPC with MPICH-GM . 145 About Platform LSF HPC and MPICH-GM . 146 Configuring LSF HPC to Work with MPICH-GM . 148 Submitting MPICH-GM Jobs . 150 Using AFS with MPICH-GM . 151 9 Using Platform LSF HPC with MPICH-P4 . 153 About Platform LSF HPC and MPICH-P4 . 154 Configuring LSF HPC to Work with MPICH-P4 . 156 Submitting MPICH-P4 Jobs . 157 10 Using Platform LSF HPC with MPICH2 . 159 About Platform LSF HPC and MPICH2 . 160 Configuring LSF HPC to Work with MPICH2 . 162 Building Parallel Jobs . 164 Submitting MPICH2 Jobs . 165 11 Using Platform LSF HPC with MVAPICH . 167 About Platform LSF HPC and MVAPICH . 168 Configuring LSF HPC to Work with MVAPICH . 170 Submitting MVAPICH Jobs . 171 12 Using Platform LSF HPC with Intel® MPI . 173 About Platform LSF HPC and the Intel® MPI Library . 174 Configuring LSF HPC to Work with Intel MPI . 176 Working with the Multi-purpose Daemon (MPD) . 177 4 Using Platform LSF HPC Submitting Intel MPI Jobs . 178 13 Using Platform LSF HPC with Open MPI . 181 About Platform LSF HPC and the Open MPI Library . 182 Configuring LSF HPC to Work with Open MPI . 184 Submitting Open MPI Jobs . 185 14 Using Platform LSF HPC Parallel Application Integrations . 187 Using LSF HPC with ANSYS . 188 Using LSF HPC with NCBI BLAST . 191 Using LSF HPC with FLUENT . 192 Using LSF HPC with Gaussian . 196 Using LSF HPC with Lion Bioscience SRS . 197 Using LSF HPC with LSTC LS-Dyna . 198 Using LSF HPC with MSC Nastran . 204 15 Using Platform LSF HPC with the Etnus TotalView® Debugger . 207 How LSF HPC Works with TotalView . 208 Running Jobs for TotalView Debugging . 210 Controlling and Monitoring Jobs Being Debugged in TotalView . 213 16 Using Platform LSF HPC with SLURM . 215 About Platform LSF HPC and SLURM . 216 Installing a New Platform LSF HPC Cluster with SLURM . 221 Configuring Platform LSF HPC with SLURM . 224 Operating Platform LSF HPC with SLURM . 229 Submitting and Monitoring Jobs with SLURM . 236 SLURM Command Reference . 245 SLURM File Reference . 247 17 pam Command Reference . 253 SYNOPSIS . 254 DESCRIPTION . 254 OPTIONS . 255 EXIT STATUS . 257 SEE ALSO . 257 Index . 259 Using Platform LSF HPC 5 6 Using Platform LSF HPC C HAPTER 1 About Platform LSF HPC Contents ◆ “What Is Platform LSF HPC?” on page 8 ◆ “LSF HPC Components” on page 11 Using Platform LSF HPC 7 What Is Platform LSF HPC? Platform LSF™ HPC (“LSF HPC”) is the distributed workload management solution for maximizing the performance of High Performance Computing (HPC) clusters. Platform LSF HPC is fully integrated with Platform LSF, the industry standard workload management software product, to provide load sharing in a distributed system and batch scheduling for compute-intensive jobs. Platform LSF HPC provides support for: ◆ Dynamic resource discovery and allocation (resource reservation) for parallel batch job execution ◆ Full job-level control of the distributed processes to ensure no processes will become un-managed. This effectively reduces the possibility of one parallel job causing severe disruption to an organization's computer service ◆ The standard MPI interface ◆ Full integration with Platform LSF, providing heterogeneous resource-based batch job scheduling including job-level resource usage enforcement Advanced HPC scheduling policies Platform LSF HPC enhances the job management capability of your cluster through advanced scheduling policies such as: ◆ Policy-based job preemption ◆ Advance reservation ◆ Memory and processor reservation ◆ Memory and processor backfill ◆ Cluster-wide resource allocation limits ◆ User and project-based fairshare scheduling ◆ Topology-aware scheduling LSF daemons Run on every node to collect resource information such as processor load, memory availability, interconnect states, and other host-specific as well as cluster-wide resources. These agents coordinate.
Recommended publications
  • Platform LSF Foundations Chapter 1
    Platform LSF Version 9 Release 1.3 Foundations SC27-5304-03 Platform LSF Version 9 Release 1.3 Foundations SC27-5304-03 Note Before using this information and the product it supports, read the information in “Notices” on page 21. First edition This edition applies to version 9, release 1 of IBM Platform LSF (product number 5725G82) and to all subsequent releases and modifications until otherwise indicated in new editions. Significant changes or additions to the text and illustrations are indicated by a vertical line (|) to the left of the change. If you find an error in any Platform Computing documentation, or you have a suggestion for improving it, please let us know. In the IBM Knowledge Center, add your comments and feedback to any topic. You can also send your suggestions, comments and questions to the following email address: [email protected] Be sure include the publication title and order number, and, if applicable, the specific location of the information about which you have comments (for example, a page number or a browser URL). When you send information to IBM, you grant IBM a nonexclusive right to use or distribute the information in any way it believes appropriate without incurring any obligation to you. © Copyright IBM Corporation 1992, 2014. US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents Chapter 1. IBM Platform LSF: An Job submission .............12 Overview ..............1 Job scheduling and dispatch .........13 Introduction to IBM Platform LSF .......1 Host selection .............15 LSF cluster components...........2 Job execution environment .........15 Chapter 2.
    [Show full text]
  • IBM Platform Computing Cloud Service Ready to Use Platform LSF & Symphony Clusters in the Softlayer Cloud
    Platform Computing IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud February 25, 2014 1 © 2014 IBM Corporation Platform Computing Agenda v Mapping clients needs to cloud technologies v Addressing your pain points v Introducing IBM Platform Computing Cloud Service v Product features and benefits v Use cases v Performance benchmarks 2 © 2014 IBM Corporation Platform Computing HPC cloud characteristics and economics are different than general-purpose computing • High-end hardware and special purpose devices (e.g. GPUs) are typically used to supply the needed processing, memory, network, and storage capabilities • The performance requirements of technical computing and service-oriented workloads means that performance may be impacted in a virtualized cloud environment, especially when latency or I/O is a constraint • HPC cluster/grid utilization is usually in the 70-90% range, removing a major potential advantage of a public cloud service provider for stable workload volumes HPC Workloads Recommended for Private Cloud HPC Workloads with Best Potential for Virtualized Public & Hybrid Cloud Primary HPC Workloads 3 © 2014 IBM Corporation Platform Computing IBM’s HPC cloud strategy provides a flexible approach to address a variety of client needs Private Hybrid Public Clouds Clouds Clouds Evolve existing infrastructure to Enable integrated HPC Cloud to enhance approach to improve Access additional responsiveness, HPC cost and HPC capacity with flexibility, and capability variable cost model
    [Show full text]
  • Platform LSF Quick Reference IBM Platform LSF 9.1.2 Quick Reference
    Platform LSF Version 9 Release 1.2 Quick Reference GC27-5309-02 Platform LSF Version 9 Release 1.2 Quick Reference GC27-5309-02 Note Before using this information and the product it supports, read the information in “Notices” on page 9. First edition This edition applies to version 9, release 1 of IBM Platform LSF (product number 5725G82) and to all subsequent releases and modifications until otherwise indicated in new editions. Significant changes or additions to the text and illustrations are indicated by a vertical line (|) to the left of the change. If you find an error in any Platform Computing documentation, or you have a suggestion for improving it, please let us know. Send your suggestions, comments and questions to the following email address: [email protected] Be sure include the publication title and order number, and, if applicable, the specific location of the information about which you have comments (for example, a page number or a browser URL). When you send information to IBM, you grant IBM a nonexclusive right to use or distribute the information in any way it believes appropriate without incurring any obligation to you. © Copyright IBM Corporation 1992, 2013. US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents Notices ...............9 Privacy policy considerations ........11 Trademarks ..............11 © Copyright IBM Corp. 1992, 2013 iii iv Platform LSF Quick Reference IBM Platform LSF 9.1.2 Quick Reference Sample UNIX installation directories Daemon error log files Daemon error log files are stored in the directory defined by LSF_LOGDIR in lsf.conf.
    [Show full text]
  • Administering IBM Platform LSF Chapter 1
    Platform LSF Version 9 Release 1.3 Administering Platform LSF SC27-5302-03 Platform LSF Version 9 Release 1.3 Administering Platform LSF SC27-5302-03 Note Before using this information and the product it supports, read the information in “Notices” on page 827. First edition This edition applies to version 9, release 1 of IBM Platform LSF (product number 5725G82) and to all subsequent releases and modifications until otherwise indicated in new editions. Significant changes or additions to the text and illustrations are indicated by a vertical line (|) to the left of the change. If you find an error in any Platform Computing documentation, or you have a suggestion for improving it, please let us know. In the IBM Knowledge Center, add your comments and feedback to any topic. You can also send your suggestions, comments and questions to the following email address: [email protected] Be sure include the publication title and order number, and, if applicable, the specific location of the information about which you have comments (for example, a page number or a browser URL). When you send information to IBM, you grant IBM a nonexclusive right to use or distribute the information in any way it believes appropriate without incurring any obligation to you. © Copyright IBM Corporation 1992, 2014. US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents Chapter 1. Managing Your Cluster . 1 Job Directories and Data..........441 Working with Your Cluster .........1 Resource
    [Show full text]
  • Experimental Study of Remote Job Submission and Execution on LRM Through Grid Computing Mechanisms
    Experimental Study of Remote Job Submission and Execution on LRM through Grid Computing Mechanisms Harshadkumar B. Prajapati Vipul A. Shah Information Technology Department Instrumentation & Control Engineering Dharmsinh Desai University Department Nadiad, INDIA Dharmsinh Desai University e-mail: [email protected], Nadiad, INDIA [email protected] e-mail: [email protected], [email protected] Abstract —Remote job submission and execution is The fundamental unit of work-done in Grid fundamental requirement of distributed computing done computing is successful execution of a job. A job in using Cluster computing. However, Cluster computing Grid is generally a batch-job, which does not interact limits usage within a single organization. Grid computing with user while it is running. Higher-level Grid environment can allow use of resources for remote job services and applications use job submission and execution that are available in other organizations. This execution facilities that are supported by a Grid paper discusses concepts of batch-job execution using LRM and using Grid. The paper discusses two ways of middleware. Therefore, it is very important to preparing test Grid computing environment that we use understand how a job is submitted, executed, and for experimental testing of concepts. This paper presents monitored in Grid computing environment. experimental testing of remote job submission and Researchers and scientists who are working in higher- execution mechanisms through LRM specific way and level Grid services and applications do not pay much Grid computing ways. Moreover, the paper also discusses attention to the underlying Grid infrastructure in their various problems faced while working with Grid published work.
    [Show full text]
  • Turbo-Charging Open Source Hadoop for Faster, More Meaningful Insights
    Turbo-Charging Open Source Hadoop for Faster, more Meaningful Insights Gord Sissons Senior Manager, Technical Marketing IBM Platform Computing [email protected] Agenda • Some Context – IBM Platform Computing • Low-latency scheduling meets open-source • Breakthrough performance • Multi-tenancy (for real!) • Cluster-sprawl - The elephant in the room • Side step the looming challenges IBM Platform Computing De facto standard for commercial high- • Acquired by IBM in 2012 performance computing Powers financial • 20 year history in high-performance computing analytics grids for 60% of top investment banks • 2000+ global customers Over 5 million CPUs • 23 of 30 largest enterprises under management • High-performance, mission-critical, extreme scale Breakthrough performance in Big • Comprehensive capability Data analytics Technical Computing - HPC Platform LSF Scalable, comprehensive workload Family management for demanding heterogeneous environments Simplified, integrated HPC management Platform HPC software bundled with systems Analytics Infrastructure Software High-throughput, low-latency compute Platform and data intensive analytics Symphony Family • An SOA infrastructure for analytics • Extreme performance and scale • Complex Computations (i.e., risk) • Big Data Analytics via MapReduce Our worldview – shaped by time critical analytics • Financial firms compete on the their ability to maximize use of capital • Monte-Carlo simulation is a staple technique for simulating market outcomes • Underlying instruments are increasingly complex •
    [Show full text]
  • Platform LSF Command Reference Chapter 99
    Platform LSF Version 9 Release 1.3 Command Reference SC27-5305-03 Platform LSF Version 9 Release 1.3 Command Reference SC27-5305-03 Note Before using this information and the product it supports, read the information in “Notices” on page 521. First edition This edition applies to version 9, release 1 of IBM Platform LSF (product numbers 5725G82 and 5725L25) and to all subsequent releases and modifications until otherwise indicated in new editions. Significant changes or additions to the text and illustrations are indicated by a vertical line (|) to the left of the change. If you find an error in any Platform Computing documentation, or you have a suggestion for improving it, please let us know. In the IBM Knowledge Center, add your comments and feedback to any topic. You can also send your suggestions, comments and questions to the following email address: [email protected] Be sure include the publication title and order number, and, if applicable, the specific location of the information about which you have comments (for example, a page number or a browser URL). When you send information to IBM, you grant IBM a nonexclusive right to use or distribute the information in any way it believes appropriate without incurring any obligation to you. © Copyright IBM Corporation 1992, 2014. US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents Chapter 1. bacct ...........1 Chapter 23. blcollect ........165 Chapter 2. badmin ..........15 Chapter 24. blcstat .........167 Chapter 3. bapp ...........31 Chapter 25. blhosts .........169 Chapter 4.
    [Show full text]
  • Cabot Partners Cabot Group, Inc
    How IBM Platform LSF Architecture Accelerates Technical Computing Sponsored by IBM Srini Chari, Ph.D., MBA September, 2012 mailto:[email protected] Executive Summary Advances in High Performance Computing (HPC) have resulted in dramatic improvements in application processing performance across a wide range of disciplines ranging from engineering, manufacturing, finance - risk analysis and revenue management to geological, life and earth sciences. This mainstreaming of technical computing has driven solution providers towards innovative solutions that are faster, scalable, reliable, and secure. But these mission critical technical computing solutions are further challenged with reducing cost and managing complexity. Further, data explosion has morphed application workloads in traditional technical computing www.cabotpartners.com environments from compute intensive to both compute and data intensive. Also, there continues to be , an unrelenting appetite and need to increasingly solve problems that evolve, grow large and are complex, further challenging the scale of technical computing processing environments. These newer domains cover a broad range of emerging applications including fraud detection, anti-terrorist analysis, social and biological network analysis, semantic analysis, financial and economic modeling, drug discovery and epidemiology, weather and climate modeling, oil exploration, and power grid management1. Although most technical computing environments are quite sophisticated, several IT organizations are still challenged to fully take advantage of available processing capacity to adequately address newer business needs. For these organizations, effective resource management and job submission that meets stringent service level agreement (SLA) requirements across multiple departments because sharing IT infrastructure is an extremely complex process. They require higher levels of shared infrastructure utilization and better application processing throughput, while keeping costs lower.
    [Show full text]
  • Installing Platform LSF on UNIX and Linux Chapter 1
    Platform LSF Version 9 Release 1.2 Installing on UNIX and Linux SC27-5314-02 Platform LSF Version 9 Release 1.2 Installing on UNIX and Linux SC27-5314-02 Note Before using this information and the product it supports, read the information in “Notices” on page 39. First edition This edition applies to version 9, release 1 of IBM Platform LSF (product number 5725G82) and to all subsequent releases and modifications until otherwise indicated in new editions. Significant changes or additions to the text and illustrations are indicated by a vertical line (|) to the left of the change. If you find an error in any Platform Computing documentation, or you have a suggestion for improving it, please let us know. Send your suggestions, comments and questions to the following email address: [email protected] Be sure include the publication title and order number, and, if applicable, the specific location of the information about which you have comments (for example, a page number or a browser URL). When you send information to IBM, you grant IBM a nonexclusive right to use or distribute the information in any way it believes appropriate without incurring any obligation to you. © Copyright IBM Corporation 1992, 2013. US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents Chapter 1. Example installation directory Chapter 8. install.config .......19 structure ..............1 About install.config............19 Parameters ..............19 Chapter 2. Plan your installation ....3 EGO in the LSF cluster ...........4 Chapter 9. slave.config ........31 EGO_DAEMON_CONTROL.........32 Chapter 3.
    [Show full text]
  • Platform LSF™ Foundations
    Platform LSF™ Foundations Platform LSF Version 7.0 Update 5 Release date: July 2009 Last modified: July 17, 2009 Copyright © 1994-2009 Platform Computing Inc. Although the information in this document has been carefully reviewed, Platform Computing Corporation (“Platform”) does not warrant it to be free of errors or omissions. Platform reserves the right to make corrections, updates, revisions or changes to the information in this document. UNLESS OTHERWISE EXPRESSLY STATED BY PLATFORM, THE PROGRAM DESCRIBED IN THIS DOCUMENT IS PROVIDED “AS IS” AND WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. IN NO EVENT WILL PLATFORM COMPUTING BE LIABLE TO ANYONE FOR SPECIAL, COLLATERAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES, INCLUDING WITHOUT LIMITATION ANY LOST PROFITS, DATA, OR SAVINGS, ARISING OUT OF THE USE OF OR INABILITY TO USE THIS PROGRAM. We'd like to hear You can help us make this document better by telling us what you think of the content, organization, and usefulness of the information. from you If you find an error, or just want to make a suggestion for improving this document, please address your comments to [email protected]. Your comments should pertain only to Platform documentation. For product support, contact [email protected]. Document This document is protected by copyright and you may not redistribute or translate it into another language, in part or in whole. redistribution and translation Internal You may only redistribute this document internally within your organization (for example, on an intranet) provided that you continue redistribution to check the Platform Web site for updates and update your version of the documentation.
    [Show full text]
  • Running Jobs with IBM Platform LSF
    Running Jobs with IBM Platform LSF IBM Platform LSF Version 8.3 May 2012 Copyright Note: Before using this information and the product it supports, read the information in Notices on page 49. This edition applies to version 8, release 3, modification 0 of IBM Platform LSF (product number 5725G82) and to all subsequent releases and modifications until otherwise indicated in new editions. © Copyright Platform Computing Inc., an IBM Company 1992, 2012. US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents 1 About IBM Platform LSF ............................................................................................................. 5 Clusters, jobs, and queues ............................................................................................. 6 Hosts ............................................................................................................................... 9 LSF daemons ................................................................................................................ 11 Batch jobs and tasks ..................................................................................................... 14 Host types and host models .......................................................................................... 16 Users and administrators .............................................................................................. 17 Resources ....................................................................................................................
    [Show full text]
  • HP HPC Linux Value Pack
    QuickSpecs HP HPC Linux Value Pack Overview HP HPC Linux Value Pack V4.1 (HPC-LVP) is a specially-priced software bundle that is specifically designed for the development, use, and management of high performance applications running on HP servers and workstations. HPC-LVP includes Platform LSF, software, which delivers a robust set of workload management capabilities for distributed computing environments. HPC-LVP also includes a web-based interface, workload monitoring and reporting, a simplified application integration process, and the HP MPI parallelization library. Why use HP HPC Linux Value Pack? Platform LSF Workload Management Powerful, comprehensive, policy driven workload management Topology and resource aware, including GPU scheduling Rich job and system information for ease of use and management Comprehensive workload accounting and reporting MPI integration for quickly launching MPI tasks, precisely tracking MPI job resource usage, and reliable MPI job management. High performance, throughput and utilization Highly available and secure Worldwide HP support HP- MPI Reduces development effort through the use of the same MPI library for multiple architectures on both Linux and Windows Optimizes performance without compromising portability or flexibility. Support for NVIDIA GPUDirect™ with CUDA drivers. Supports a wide range of Operating Systems, interconnects and workload managers, including support for interconnect protocols such as RoCE, RDMA, and OFED. Provides the quality and robustness that come from over a decade of production
    [Show full text]