Mellanox Scalableshmem Support the Openshmem Parallel Programming Language Over Infiniband

SOFTWARE PRODUCT BRIEF Mellanox ScalableSHMEM Support the OpenSHMEM Parallel Programming Language over InfiniBand The SHMEM programming library is a one-side communications library that supports a unique set of parallel programming features including point-to-point and collective routines, synchronizations, atomic operations, and a shared memory paradigm used between the processes of a parallel programming application. SHMEM Details API defined by the OpenSHMEM.org consortium. There are two types of models for parallel The library works with the OpenFabrics RDMA programming. The first is the shared memory for Linux stack (OFED), and also has the ability model, in which all processes interact through a to utilize Mellanox Messaging libraries (MXM) globally addressable memory space. The other as well as Mellanox Fabric Collective Accelera- HIGHLIGHTS is a distributed memory model, in which each tions (FCA), providing an unprecedented level of FEATURES processor has its own memory, and interaction scalability for SHMEM programs running over – Use of symmetric variables and one- with another processors memory is done though InfiniBand. sided communication (put/get) message communication. The PGAS model, The use of Mellanox FCA provides for collective – RDMA for performance optimizations or Partitioned Global Address Space, uses a optimizations by taking advantage of the high for one-sided communications combination of these two methods, in which each performance features within the InfiniBand fabric, – Provides shared memory data transfer process has access to its own private memory, including topology aware coalescing, hardware operations (put/get), collective and also to shared variables that make up the operations, and atomic memory multicast and separate quality of service levels for global memory space. operations collective operations. The figure below shows the – Hybrid model allows use of message SHMEM, which stands for SHared MEMory, uses type of scalability that can be gained when the passing and shared memory (MPI and the PGAS model to allow processes to globally collective operations take advantage of FCA. SHMEM) share variables by allowing each process to see BENEFITS the same variable name, but each process keeps – Provides a programming library for its own copy of the variable. Modification to Figure 1. shared memory communication model another process address space is then accom- extending use of InfiniBand to SHMEM plished using put/get (or write/read) semantics. applications – Seamless integration with MPI libraries The ability of put/get operations, or one-sided and job schedulers communication, is one of the major differences – Improve collective scalability through between SHMEM and MPI (Message Passing integrations with Mellanox Fabric Interface) which only uses two-sided, send/ Collective Accelerator (FCA). receive semantics. – Provides maximum performance and scalability SHMEM solution Mellanox ScalableSHMEM Mellanox ScalableSHMEM 2.0 is based on the Figure 2. ©2012 Mellanox Technologies. All rights reserved. Mellanox ScalableSHMEM- Support the OpenSHMEM Parallel Programming Language over InfiniBand page 2 In addition, the Mellanox MXM acceleration library allows for a high level of performance and scalability for the underlying put/get messages that SHMEM uses for its node-node communications. MXM is integrated with the MLNX_OFED software stack, and Scal- ableSHMEM will automatically take advantage of the performance improvements offered by it when MXM presence is detected. To further understand how these various pieces fit into the software stack refer to Figure 3. Mellanox Advantage Mellanox Technologies is a leading supplier of end-to-end servers and storage connec- tivity solutions to optimize high performance Figure 3. computing performance and efficiency. Mellanox InfiniBand adapters, switches, and software are powering the largest supercom- puters in the world. For the best in server and storage performance and scalability with the lowest TCO, Mellanox interconnect products are the solution. 350 Oakmead Parkway, Suite 100, Sunnyvale, CA 94085 Tel: 408-970-3400 • Fax: 408-970-3403 www.mellanox.com © Copyright 2012. Mellanox Technologies. All rights reserved. 3780PB Rev 1.0 Mellanox, BridgeX, ConnectX, CORE-Direct, InfiniBridge, InfiniHost, InfiniScale, PhyX, SwitchX,irtual V Protocol Interconnect and Voltaire are registered trademarks of Mellanox Technologies, Ltd. FabricIT, MLNX-OS and Unbreakable-Link are trademarks of Mellanox Technologies, Ltd. All other trademarks are property of their respective owners. .

Mellanox Scalableshmem Support the Openshmem Parallel Programming Language Over Infiniband

Introduction to Parallel Programming

An Open Standard for SHMEM Implementations Introduction

Enabling Efficient Use of UPC and Openshmem PGAS Models on GPU Clusters

Introduction to Parallel Computing

Bridging Parallel and Reconfigurable Computing with Multilevel PGAS and SHMEM+

Distributed Shared Memory – a Survey and Implementation Using Openshmem

NUG Monthly Meeting

A Space Exploration Framework for GPU Application and Hardware Codesign

Parallel Computing, Models and Their Performances

Benchmarking Parallel Performance on Many-Core Processors

SIMD Programmingprogramming

Experiences at Scale with PGAS Versions of a Hydrodynamics Application