SIMD Processing on IBM z14, z13 and z13s

Total Pages: 16

File Type: PDF, Size: 1020 KB

SIMD Processing on IBM z14, z13 and z13s

Background

Companies want to quickly and efficiently process huge amounts of information for analytics, mobile applications, data serving and more. To achieve these efficiencies cost-effectively, new and innovative technologies are needed. Vector processing and SIMD optimize performance by processing data in parallel on IBM Z®.

What is SIMD?

SIMD stands for Single Instruction Multiple Data. Unlike instructions that perform a single operation on a single data point, SIMD instructions can perform the same operation on multiple data points at once.

SIMD is an innovation that was delivered with IBM z13® (z13) and IBM z13s™ (z13s). With SIMD, entire arrays of data can be processed by a single instruction, enabling more efficient processing of data. In addition, the z13 and z13s superscalar processors feature 32 double-width (128-bit) vector registers and 139 instructions to accelerate processing. Additional instructions in support of decimal operations were introduced with the new IBM z14™ (z14); they are designed to give a performance boost to traditional workloads using COBOL by tailoring the instructions to the specific needs of financial applications. The z14 also enhanced the throughput of the binary SIMD floating point used in new applications like analytics.

Candidates for SIMD

Existing programs are excellent candidates for SIMD and can make use of it simply by being recompiled with current compilers. String and character array processing, and traditional loop and array processing with integer and floating point data, can all be accelerated using SIMD. Complex mathematical processing, algorithmic-intensive programs, image processing applications, and other applications can be processed efficiently with SIMD. SIMD also enables a new class of applications on IBM Z, such as analytics and mathematical optimization, without requiring an offload to specialized processors. Standard SIMD math and linear algebra libraries enable easier porting from x86 platforms.

Candidate applications for SIMD:

• String and character array processing
• Loop processing
• Binary floating point processing such as mathematical optimization and modeling
• Telematics applications
• Repetitive security calculations
• Graphics processing, video processing
• Java character and string processing
• XML System Services
• Other Java 8 SR1 applications

Analytics applications such as Apache Spark for z/OS benefit from SIMD improvements in decimal operations when processing highly iterative analysis on large volumes of data.

Public-Key Cryptography Standards (PKCS) #11 is a cryptographic token interface standard. Select PKCS #11 clear key operations can take advantage of SIMD for performance gains.

Compilers

SIMD helps you achieve efficiencies in applications with character or integer manipulation using IBM z/OS® XL C/C++ V2.1.1, IBM Enterprise COBOL 5.2 and IBM Enterprise PL/I for z/OS V4.5. The IBM z/OS XL C/C++ V2.1.1 compiler offers new capabilities, such as architecture sections, inline assembly, and high optimization levels which, when combined with SIMD, provide a performance boost. It also supports new data types and built-in functions to exploit SIMD instructions. Enterprise COBOL 5.2 uses SIMD to improve the processing of INSPECT...TALLYING and INSPECT...REPLACING statements and to accelerate string operations. Similarly, Enterprise PL/I for z/OS V4.5 also exploits SIMD, with expanded use of the Decimal Floating Point Facility for packed decimal calculations.
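As a concrete picture of the programming model these compilers expose, here is a short C sketch. It is an editorial illustration, not taken from the IBM brief: it uses the portable GCC/Clang vector-extension syntax as a stand-in, since the exact z/OS XL C/C++ vector type names and built-ins are defined in the compiler documentation and may differ.

    /* Illustrative only: SIMD-style processing in C via compiler vector
     * extensions (a stand-in for the z/OS XL C/C++ vector types and
     * built-ins, whose exact names may differ). A single 128-bit vector
     * add performs four 32-bit integer additions at once. */
    #include <stdio.h>

    typedef int v4si __attribute__((vector_size(16)));  /* 128-bit vector: 4 x int */

    int main(void) {
        v4si x = {1, 2, 3, 4};
        v4si y = {10, 20, 30, 40};
        v4si z = x + y;              /* one vector operation: 4 adds in parallel */

        for (int i = 0; i < 4; i++)
            printf("%d ", z[i]);     /* prints: 11 22 33 44 */
        printf("\n");

        /* Recompiling is often enough: at higher optimization levels an
         * auto-vectorizing compiler can turn a plain scalar loop like this
         * into the same kind of vector instructions automatically. */
        enum { N = 1024 };
        static float a[N], b[N], c[N];
        for (int i = 0; i < N; i++)
            c[i] = a[i] + b[i];
        printf("%.1f\n", c[0]);
        return 0;
    }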
Operating Systems

z/OS V2.3, and z/OS V2.2 or z/OS V2.1 with PTFs, support SIMD instructions. z/VM V6.4, and z/VM V6.3 with PTF UM34752, enable guest support for SIMD, allowing Linux® to use SIMD. KVM executing on IBM Z also supports SIMD.

Using SIMD is Easy

With the new vector processing in IBM Z, many existing programs need only be recompiled in order to take advantage of SIMD. Programs or workloads using IBM Java exploit SIMD automatically with the newest Java 8 version.

Java

Java® is a language of choice for next-generation cloud and analytics workloads. IBM z14, z13s, z13 and IBM Java 8 drive Java performance to new levels with SIMD, simultaneous multithreading (SMT) and improved cryptographic processing.

Java applications exploiting SMT-enabled zIIP specialty engines benefit from many performance improvements due to Java 8 code optimizations and the use of SIMD by Java string operations.

Java uses SIMD to provide dramatic acceleration of matrix operations, string processing and more. The Java JIT compiler offers native operations that work directly on Java byte arrays, eliminating the need for interim Java objects. Text accelerations include code page conversion and array compare operations as part of java.lang.String class operations.

Auto-vectorization is a Just-In-Time (JIT) compiler optimization in IBM Java 8 that accelerates simple scalar loops by leveraging vector operations on z13. Matrix multiplication operations were up to 60% faster when using the new Java 8 JIT in IBM lab measurements¹.

Summary

Software for analytics, business intelligence and database processing, as well as traditional applications, can easily get a boost in processing with SIMD on z14, z13s and z13, contributing to the value of IBM Z for processing your most data-intensive workloads.

For More Information

• SIMD Business Analytics Acceleration on z Systems – Point of View, REDP-5145-00: http://www.redbooks.ibm.com/abstracts/redp5145.html?Open
• The SIMD accelerator for business analytics on the IBM z13: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?reload=true&arnumber=7175128

¹ http://mainframeinsights.com/java-performance-ibm-z-systems-ibm-z13-ibm-java-sdk-8/

© Copyright IBM Corporation 2017. IBM, the IBM logo, IBM Z, z13, z13s, z14, z/OS and z Systems are trademarks or registered trademarks of International Business Machines Corporation in the United States, other countries, or both. Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both. Java and all Java-based trademarks and logos are trademarks or registered trademarks of Oracle and/or its affiliates. Microsoft, Windows, Windows NT, and the Windows logo are trademarks of Microsoft Corporation in the United States, other countries, or both. Other company, product and service names may be trademarks or service marks of others. References in this publication to IBM products and services do not imply that IBM intends to make them available in all countries in which IBM operates. All statements regarding IBM's future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only.

ZSL03398-USEN-03
Recommended publications
  • 2.5 Classification of Parallel Computers
    2.5 Classification of Parallel Computers

    2.5.1 Granularity

    In parallel computing, granularity means the amount of computation in relation to communication or synchronisation. Periods of computation are typically separated from periods of communication by synchronization events.

    • fine level (same operations with different data)
      ◦ vector processors
      ◦ instruction-level parallelism
      ◦ fine-grain parallelism:
        – Relatively small amounts of computational work are done between communication events
        – Low computation-to-communication ratio
        – Facilitates load balancing
        – Implies high communication overhead and less opportunity for performance enhancement
        – If granularity is too fine, it is possible that the overhead required for communication and synchronization between tasks takes longer than the computation
    • operation level (different operations simultaneously)
    • problem level (independent subtasks)
      ◦ coarse-grain parallelism:
        – Relatively large amounts of computational work are done between communication/synchronization events
        – High computation-to-communication ratio
        – Implies more opportunity for performance increase
        – Harder to load balance efficiently

    2.5.2 Hardware: Pipelining

    (was used in supercomputers, e.g. the Cray-1) With N elements flowing through the pipeline, where a single element takes L clock cycles, the whole calculation takes L + N cycles; without pipelining it takes L * N cycles (a short worked example follows this excerpt).

    Example of good code for pipelining:

        do i = 1, k
          z(i) = x(i) + y(i)
        end do

    Vector processors provide fast vector operations (operations on arrays). The previous example is also good for a vector processor (vector addition), but recursion, for example, is hard to optimise for vector processors. Example: Intel MMX – a simple vector processor.
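    A quick worked example of the pipelining formulas above (the numbers are illustrative, not from the notes): with L = 5 and N = 100,

        \[
          T_{\text{pipe}} = L + N = 5 + 100 = 105 \text{ cycles}, \qquad
          T_{\text{serial}} = L \cdot N = 5 \cdot 100 = 500 \text{ cycles},
        \]
        \[
          \text{speedup} = \frac{T_{\text{serial}}}{T_{\text{pipe}}}
                         = \frac{500}{105} \approx 4.8,
          \quad \text{approaching } L \text{ as } N \to \infty .
        \]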
  • IBM z Systems Introduction May 2017
    IBM z Systems Introduction May 2017
    IBM z13s and IBM z13 Frequently Asked Questions, Worldwide (ZSQ03076-USEN-15)

    Table of Contents: z13s Hardware; z13 Hardware; Performance; z13 Warranty; Hardware Management Console (HMC); Power requirements (including High Voltage DC Power option); Overhead Cabling and Power; z13 Water cooling option; Secure Service Container; …
  • SIMD Extensions
    SIMD Extensions

    Contents: SIMD; MMX (instruction set); 3DNow!; Streaming SIMD Extensions; SSE2; SSE3; SSSE3; SSE4; SSE5; Advanced Vector Extensions; CVT16 instruction set; XOP instruction set

    SIMD

    Flynn's taxonomy:

                        Single instruction   Multiple instruction
        Single data     SISD                 MISD
        Multiple data   SIMD                 MIMD

    Single instruction, multiple data (SIMD) is a class of parallel computers in Flynn's taxonomy. It describes computers with multiple processing elements that perform the same operation on multiple data simultaneously. Thus, such machines exploit data-level parallelism.

    History

    The first use of SIMD instructions was in vector supercomputers of the early 1970s such as the CDC Star-100 and the Texas Instruments ASC, which could operate on a vector of data with a single instruction. Vector processing was especially popularized by Cray in the 1970s and 1980s. Vector-processing architectures are now considered separate from SIMD machines, based on the fact that vector machines processed the vectors one word at a time through pipelined processors (though still based on a single instruction), whereas modern SIMD machines process all elements of the vector simultaneously.[1]

    The first era of modern SIMD machines was characterized by massively parallel processing-style supercomputers such as the Thinking Machines CM-1 and CM-2. These machines had many limited-functionality processors that would work in parallel.
  • The Von Neumann Computer Model
    The von Neumann Computer Model
    CIS-77 Home: http://www.c-jump.com/CIS77/CIS77syllabus.htm

    Contents: 1. The von Neumann Computer Model 2. Components of the von Neumann Model 3. Communication Between Memory and Processing Unit 4. CPU data-path 5. Memory Operations 6. Understanding the MAR and the MDR 7. Understanding the MAR and the MDR, Cont. 8. ALU, the Processing Unit 9. ALU and the Word Length 10. Control Unit 11. Control Unit, Cont. 12. Input/Output 13. Input/Output Ports 14. Input/Output Address Space 15. Console Input/Output in Protected Memory Mode 16. Instruction Processing 17. Instruction Components 18. Why Learn Intel x86 ISA? 19. Design of the x86 CPU Instruction Set 20. CPU Instruction Set 21. History of IBM PC 22. Early x86 Processor Family 23. 8086 and 8088 CPU 24. 80186 CPU 25. 80286 CPU 26. 80386 CPU 27. 80386 CPU, Cont. 28. 80486 CPU 29. Pentium (Intel 80586) 30. Pentium Pro 31. Pentium II 32. Itanium processor

    1. The von Neumann Computer Model

    Von Neumann computer systems contain three main building blocks: the central processing unit (CPU), memory, and input/output devices (I/O). These three components are connected together using the system bus. The following block diagram shows the major relationships between CPU components; the most prominent items within the CPU are the registers, which can be manipulated directly by a computer program.

    http://www.c-jump.com/CIS77/CPU/VonNeumann/lecture.html
  • Parallel Programming
    Parallel Programming

    Parallel Computing Hardware

    Shared memory: multiple CPUs are attached to the bus; all processors share the same primary memory; the same memory address on different CPUs refers to the same memory location. The CPU-to-memory connection becomes a bottleneck, so shared memory computers cannot scale very well.

    Distributed memory: each processor has its own private memory; computational tasks can only operate on local data; available memory can grow without limit by adding nodes; requires more difficult programming.

    OpenMP versus MPI

    OpenMP (Open Multi-Processing): easy to use; loop-level parallelism (non-loop-level parallelism is more difficult); limited to shared memory computers; cannot handle very large problems. MPI (Message Passing Interface): requires low-level, more difficult programming; scalable in cost/size; can handle very large problems.

    MPI

    Distributed memory: each processor can access only the instructions/data stored in its own memory. The machine has an interconnection network that supports passing messages between processors. A user specifies a number of concurrent processes when the program begins. Every process executes the same program, though the flow of execution may depend on the processor's unique ID number (e.g. "if (my_id == 0) then ···"); a minimal sketch of this pattern follows this excerpt. Each process performs computations on its local variables, then communicates with other processes (and repeats), to eventually achieve the computed result. In this model, processors pass messages both to send/receive information and to synchronize with one another.

    Introduction to MPI

    Communicators and Groups: MPI uses objects called communicators and groups to define which collection of processes may communicate with each other.
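    The "every process executes the same program, branching on its ID" pattern described above can be made concrete with a minimal MPI program in C (an editorial illustration, not part of the original publication; the file name and message contents are hypothetical):

        /* minimal_mpi.c (hypothetical name): every rank runs this same program;
         * ranks != 0 compute a local value and send it to rank 0, which prints. */
        #include <mpi.h>
        #include <stdio.h>

        int main(int argc, char **argv) {
            MPI_Init(&argc, &argv);

            int my_id, n_procs;
            MPI_Comm_rank(MPI_COMM_WORLD, &my_id);   /* this process's unique ID */
            MPI_Comm_size(MPI_COMM_WORLD, &n_procs); /* number of concurrent processes */

            if (my_id == 0) {                        /* "if (my_id == 0) then ..." */
                for (int src = 1; src < n_procs; src++) {
                    int value;
                    MPI_Recv(&value, 1, MPI_INT, src, 0, MPI_COMM_WORLD,
                             MPI_STATUS_IGNORE);
                    printf("rank 0 received %d from rank %d\n", value, src);
                }
            } else {
                int value = my_id * my_id;           /* compute on local data ... */
                MPI_Send(&value, 1, MPI_INT, 0, 0, MPI_COMM_WORLD); /* ... then communicate */
            }

            MPI_Finalize();
            return 0;
        }

    Build and run with any MPI implementation, e.g. mpicc minimal_mpi.c && mpirun -np 4 ./a.out.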
  • Lecture Notes
    Lecture #4-5: Computer Hardware (Overview and CPUs)
    CS106E Spring 2018, Young

    In these lectures, we begin our three-lecture exploration of computer hardware. We start by looking at the different types of computer components and how they interact during basic computer operations. Next, we focus specifically on the CPU (Central Processing Unit). We take a look at the machine language of the CPU and discover it's really quite primitive. We explore how compilers and interpreters allow us to go from the high-level languages we are used to programming in to the low-level machine language actually used by the CPU.

    Most modern CPUs are multicore. We take a look at when multicore provides big advantages and when it doesn't. We also take a short look at Graphics Processing Units (GPUs) and what they might be used for. We end by taking a look at Reduced Instruction Set Computing (RISC) and Complex Instruction Set Computing (CISC). Stanford President John Hennessy won the Turing Award (computer science's equivalent of the Nobel Prize) for his work on RISC computing.

    Hardware and Software: Hardware refers to the physical components of a computer. Software refers to the programs or instructions that run on the physical computer.

    - We can entirely change the software on a computer without changing the hardware, and it will transform how the computer works. I can take an Apple MacBook, for example, remove the Apple software and install Microsoft Windows, and I now have a Windows computer.
    - In the next two lectures we will focus entirely on hardware.
  • Real-Time Performance During CUDA™: A Demonstration and Analysis of RedHawk™ CUDA RT Optimizations
    A Concurrent Real-Time White Paper
    2881 Gateway Drive, Pompano Beach, FL 33069
    (954) 974-1700, www.concurrent-rt.com

    Real-Time Performance During CUDA™: A Demonstration and Analysis of RedHawk™ CUDA RT Optimizations
    By: Concurrent Real-Time Linux® Development Team, November 2010

    Overview

    There are many challenges to creating a real-time Linux distribution that provides guaranteed low process-dispatch latencies and minimal process run-time jitter. Concurrent Real-Time's RedHawk Linux distribution meets and exceeds these challenges, providing a hard real-time environment on many qualified hardware configurations, even in the presence of a heavy system load. However, there are additional challenges faced when guaranteeing real-time performance of processes while CUDA applications are simultaneously running on the system. The proprietary CUDA driver supplied by NVIDIA® frequently makes demands upon kernel resources that can dramatically impact real-time performance.

    This paper discusses a demonstration application developed by Concurrent to illustrate that RedHawk Linux kernel optimizations allow hard real-time performance guarantees to be preserved even while demanding CUDA applications are running. The test results will show how RedHawk performance compares to CentOS performance running the same application. The design and implementation details of the demonstration application are also discussed in this paper.

    Demonstration

    This demonstration features two selectable real-time test modes:
    1. Jitter Mode: measure and graph the run-time jitter of a real-time process
    2. PDL Mode: measure and graph the process-dispatch latency of a real-time process

    While the demonstration is running, it is possible to switch between these different modes at any time.
  • An Introduction to GPUs, CUDA and OpenCL
    An Introduction to GPUs, CUDA and OpenCL
    Bryan Catanzaro, NVIDIA Research

    Overview
    • Heterogeneous parallel computing
    • The CUDA and OpenCL programming models
    • Writing efficient CUDA code
    • Thrust: making CUDA C++ productive

    Heterogeneous Parallel Computing
    • Latency-optimized CPU: fast serial processing
    • Throughput-optimized GPU: scalable parallel processing

    Why do we need heterogeneity?
    • Why not just use latency-optimized processors? Once you decide to go parallel, why not go all the way and reap more benefits?
    • For many applications, throughput-optimized processors are more efficient: faster and using less power. The advantages can be fairly significant.

    Why heterogeneity?
    • Different goals produce different designs: throughput-optimized designs assume the workload is highly parallel; latency-optimized designs assume the workload is mostly sequential.
    • To minimize the latency experienced by one thread: lots of big on-chip caches and sophisticated control.
    • To maximize the throughput of all threads: multithreading can hide latency, so skip the big caches; simpler control, with cost amortized over ALUs via SIMD.

    Latency vs. Throughput

    Specifications                   Westmere-EP (32 nm)                       Fermi (Tesla C2050)
    Processing elements              6 cores, 2-issue, 4-way SIMD @ 3.46 GHz   14 SMs, 2-issue, 16-way SIMD @ 1.15 GHz
    Resident strands/threads (max)   6 cores, 2 threads, 4-way SIMD:           14 SMs, 48 SIMD vectors, 32-way SIMD:
                                     48 strands                                21504 threads
    SP GFLOP/s                       166                                       1030
    Memory bandwidth                 32 GB/s                                   144 GB/s
    Register file                    ~6 kB                                     1.75 MB
    Local store/L1 cache             192 kB                                    896 kB
    L2 cache                         1.5 MB                                    0.75 MB
  • Unit 4: Processes and Threads in Distributed Systems
    Unit 4: Processes and Threads in Distributed Systems

    Thread

    A program has one or more loci of execution, and each execution is called a thread of execution. In traditional operating systems, each process has an address space and a single thread of execution. The thread is the smallest unit of processing that can be scheduled by an operating system; it is a single sequence stream within a process. Because threads have some of the properties of processes, they are sometimes called lightweight processes. Within a process, threads allow multiple executions of streams.

    Thread Structure

    A process is used to group resources together, and threads are the entities scheduled for execution on the CPU. A thread has a program counter that keeps track of which instruction to execute next. It has registers, which hold its current working variables. It has a stack, which contains the execution history, with one frame for each procedure called but not yet returned from. Although a thread must execute in some process, the thread and its process are different concepts and can be treated separately. What threads add to the process model is to allow multiple executions to take place in the same process environment, to a large degree independent of one another. Having multiple threads running in parallel in one process is similar to having multiple processes running in parallel in one computer (a small sketch of the one-process, multiple-threads case follows this excerpt).

    Figure: (a) Three processes each with one thread. (b) One process with three threads. In the former case, the processes share physical memory, disks, printers and other resources; in the latter case, the threads share an address space, open files, and other resources.
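    A minimal POSIX-threads sketch of the "one process with three threads" case described above (an editorial illustration, not part of the original unit; assumes a POSIX system, with the program linked against -lpthread and the file name hypothetical):

        /* three_threads.c (hypothetical name): one process, three threads.
         * Each thread has its own stack and registers, but all of them share
         * the process address space, so they all see the same global counter. */
        #include <pthread.h>
        #include <stdio.h>

        static int shared_counter = 0;                /* shared: lives in the process address space */
        static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

        static void *worker(void *arg) {
            long id = (long)arg;                      /* private: a variable on this thread's own stack */
            pthread_mutex_lock(&lock);
            shared_counter++;                         /* all threads update the same location */
            pthread_mutex_unlock(&lock);
            printf("thread %ld finished\n", id);
            return NULL;
        }

        int main(void) {
            pthread_t threads[3];
            for (long i = 0; i < 3; i++)
                pthread_create(&threads[i], NULL, worker, (void *)i);
            for (int i = 0; i < 3; i++)
                pthread_join(threads[i], NULL);
            printf("shared_counter = %d\n", shared_counter); /* prints 3 */
            return 0;
        }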
  • CUDA C Programming Guide
    CUDA C Programming Guide
    PG-02829-001_v10.0 | October 2018 | Design Guide

    Changes from Version 9.0
    ‣ Documented the restriction that operator overloads cannot be __global__ functions, in Operator Function.
    ‣ Removed the guidance to break 8-byte shuffles into two 4-byte instructions; 8-byte shuffle variants are provided since CUDA 9.0. See Warp Shuffle Functions.
    ‣ Passing __restrict__ references to __global__ functions is now supported; updated the comment in __global__ functions and function templates.
    ‣ Documented CUDA_ENABLE_CRC_CHECK in CUDA Environment Variables.
    ‣ Warp matrix functions now support matrix products with m=32, n=8, k=16 and m=8, n=32, k=16, in addition to m=n=k=16.

    Table of Contents (excerpt)
    Chapter 1. Introduction: 1.1. From Graphics Processing to General Purpose Parallel Computing; 1.2. CUDA®: A General-Purpose Parallel Computing Platform and Programming Model; 1.3. A Scalable Programming Model; 1.4. Document Structure
    Chapter 2. Programming Model: 2.1. Kernels; 2.2. Thread Hierarchy …
  • GPU Concurrency
    GPU Concurrency
    Robert Searles, 5/26/2021

    Execution Scheduling & Management

    Pre-emptive scheduling: processes share the GPU through time-slicing; scheduling is managed by the system. Concurrent scheduling: processes run on the GPU simultaneously; the user creates and manages scheduling streams.

    CUDA Concurrency Mechanisms

                               Streams          MPS               MIG
    Partition type             Single process   Logical           Physical
    Max partitions             Unlimited        48                7
    Performance isolation      No               By percentage     Yes
    Memory protection          No               Yes               Yes
    Memory bandwidth QoS       No               No                Yes
    Error isolation            No               No                Yes
    Cross-partition interop    Always           IPC               Limited IPC
    Reconfigure                Dynamic          Process launch    When idle

    MPS: Multi-Process Service; MIG: Multi-Instance GPU

    CUDA Streams: Stream Semantics

    1. Two operations issued into the same stream will execute in issue order. Operation B issued after Operation A will not begin to execute until Operation A has completed.
    2. Two operations issued into separate streams have no ordering prescribed by CUDA. Operation A issued into stream 1 may execute before, during, or after Operation B issued into stream 2.

    Operation: usually cudaMemcpyAsync or a kernel call; more generally, most CUDA API calls that take a stream parameter, as well as stream callbacks.

    Stream Examples

    Host/device execution concurrency:

        Kernel<<<b, t>>>(…);               // this kernel execution can overlap with
        cpuFunction(…);                    // this host code

    Concurrent kernels:

        Kernel<<<b, t, 0, streamA>>>(…);   // these kernels have the possibility
        Kernel<<<b, t, 0, streamB>>>(…);   // to execute concurrently

    In practice, concurrent
  • CPU MF Counters for Efficiency
    CPU MF Counters for Efficiency
    John Burg, IBM WSC – [email protected]
    2019 IBM Systems Technical University, October 9, 2019 | Las Vegas

    Notices and disclaimers

    — © 2019 International Business Machines Corporation. No part of this document may be reproduced or transmitted in any form without written permission from IBM.
    — U.S. Government Users Restricted Rights — use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM.
    — Information in these presentations (including information relating to products that have not yet been announced by IBM) has been reviewed for accuracy as of the date of initial publication and could include unintentional technical or typographical errors. IBM shall have no responsibility to update this information. This document is distributed "as is" without any warranty, either express or implied. In no event shall IBM be liable for any damage arising from the use of this information, including but not limited to, loss of data, business interruption, loss of profit or loss of opportunity.
    — Performance data contained herein was generally obtained in controlled, isolated environments. Customer examples are presented as illustrations of how those customers have used IBM products and the results they may have achieved. Actual performance, cost, savings or other results in other operating environments may vary.
    — References in this document to IBM products, programs, or services do not imply that IBM intends to make such products, programs or services available in all countries in which IBM operates or does business.
    — Workshops, sessions and associated materials may have been prepared by independent session speakers, and do not necessarily reflect the views of IBM. All materials and discussions are provided