Intel® Omni-Path Performance Tuning

User Guide

April 2016

Order No.: H93143, Rev.: 3.0


Legal Disclaimer

You may not use or facilitate the use of this document in connection with any infringement or other legal analysis concerning products described herein. You agree to grant Intel a non-exclusive, royalty-free license to any patent claim thereafter drafted which includes subject matter disclosed herein.

No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document. All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest Intel product specifications and roadmaps. The products described may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request.

Copies of documents which have an order number and are referenced in this document may be obtained by calling 1-800-548-4725 or by visiting: http://www.intel.com/design/literature.htm

Intel technologies' features and benefits depend on system configuration and may require enabled hardware, software or service activation. Learn more at http://www.intel.com/ or from the OEM or retailer.

Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

*Other names and brands may be claimed as the property of others.

Copyright © 2016, Intel Corporation. All rights reserved.


Contents

1 Introduction
2 BIOS Settings
    2.1 Intel® Xeon® Processor E5 v3 Family
3 Linux Settings
    3.1 irqbalance
    3.2 CPU Frequency Scaling Drivers
        3.2.1 Using the Intel P-State Driver
        3.2.2 Using the ACPI CPUfreq Driver and cpupower Governor
    3.3 Avoid acpi_pad Consuming CPU Resources
4 MPI Performance
    4.1 Intel® MPI Library Settings
    4.2 Intel® MPI Benchmarks (IMB) or OSU Micro Benchmarks (OMB)
    4.3 Tuning for MPI Performance on Nodes with Intel® Xeon Phi™ CPUs
5 System Settings for Verbs Performance
    5.1 HFI1 Driver Module Parameters
    5.2 Parallel File System Concurrency Improvement
    5.3 Lustre Parameter Tuning for Intel® Omni-Path
6 Verbs Benchmarks
    6.1 Perftest
    6.2 RDMA Performance
7 IPoFabric Performance
    7.1 IPoFabric Connected Mode Configuration
    7.2 IPoFabric Datagram (UD) Mode Configuration
    7.3 qperf
    7.4 iperf
Appendix A Driver Module Parameters
    A.1 Listing the Driver Parameters
    A.2 Current Values of Module Parameters
    A.3 Setting HFI1 Driver Parameters


Revision History

Date            Revision    Description

April 2016      3.0         Sec. 3.2 & 3.3 combined to form Sec. 3.2.2. Sec. 3.2.1 is new, describing use of the Intel P-State driver. SPEC MPI tunings generalized to Intel MPI tunings in Sec. 4.1. MPI tuning for Intel® Xeon Phi™ added as Sec. 4.3. New Sec. 5.2 to activate a feature for parallel file system performance, and Sec. 5.3 for Lustre parameter tuning. New Sec. 7.2 to improve IPoFabric UD performance. Updated driver parameter listings in Sec. A.1 and A.2. Clarified example in A.3.

February 2016   2.0         Clarified language in Sec. 3.2 & 5.1.

November 2015   1.0         Document has been updated for revision 1.0.

September 2015  0.7         Initial release of document.


1 Introduction

The Intel® Omni-Path Architecture (OPA) is designed for excellent out-of-the-box performance. However, in some situations, better ways to run benchmarks and applications have evolved compared to the earlier generation of Intel® True Scale Fabric products. This document describes BIOS settings and parameters that have been shown to improve performance, or to make performance more consistent, on the Intel® Omni-Path Architecture. If you are interested in benchmarking the performance of your system, these tips may help you obtain better performance.

The traditional term for sending IP traffic over the InfiniBand* fabric is IP over IB or IPoIB. The Intel® Omni-Path fabric does not implement InfiniBand*. Instead, the Intel implementation is known as IP over Fabric or IPoFabric. From the software point of view, IPoFabric behaves the same way as IPoIB, and, in fact, uses an ib_ipoib driver and sends IP traffic over the ib0 and/or ib1 ports.


2 BIOS Settings

Setting the system BIOS is an important step in configuring a cluster to provide the best mix of application performance and power efficiency. In the following material, we specify settings that should maximize the Intel® Omni-Path Fabric and application performance. Optimally, settings similar to these should be used during a cluster bring-up and validation phase in order to show that the fabric is performing as expected. For the long term, you may want to set the BIOS to provide more power savings, even though that will reduce overall application and fabric performance to some extent.

2.1 Intel® Xeon® Processor E5 v3 Family

The following are the performance-relevant BIOS settings on a server with Intel® Xeon® Processor E5 v3 Family CPUs (Haswell), recommended for all-around performance with an Intel® Omni-Path fabric:

BIOS Setting                          Value

CPU Power and Performance Policy      Balanced Performance
Workload Configuration                Balanced
Uncore Frequency Scaling              Enabled
Performance P-limit                   Enabled
Enhanced Intel SpeedStep(R) Tech      Enabled
Intel Configurable TDP                Disabled
Intel(R) Turbo Boost Technology       Enabled
Intel® VT for Directed I/O (VT-D)     Disabled
Energy Efficient Turbo                Enabled
CPU C-State                           Enabled
C1E Autopromote                       Enabled
Processor C3                          Disabled
Processor C6                          Enabled
Intel(R) Hyper-Threading Tech         No recommendation (test with your applications to see if a benefit occurs)
IOU Non-posted Prefetch               Disabled (where available) (1)
Cluster-on-Die                        Disabled
Early Snoop                           Disabled
NUMA Optimized                        Enabled (2)
MaxPayloadSize                        Auto or 256B (3)
MaxReadReq                            512B or 4096B (3)
Snoop Holdoff Count                   9

(1) Available in the Grantley Refresh BIOS version R016.
(2) Also known as Memory.SocketInterleave=NUMA in some BIOSes.
(3) PCIe Max Payload Size and Max Read Req settings are sometimes not exposed in the BIOS. In that case, the OPA driver (hfi1) parameter pcie_caps=0x51 (which implies setting MaxPayloadSize to 256B and MaxReadReq to 4096B) can be set as described in Appendix A.


3 Linux Settings

The following settings are recommended to enable consistent performance measurements on the Linux distributions supported with Intel® Omni-Path Fabric Host Software.

3.1 irqbalance

The purpose of irqbalance is to distribute hardware interrupts across processors on a multiprocessor system in order to increase performance. Setting --hintpolicy to exact is needed to work with the Receive and SDMA interrupt algorithms in the HFI1 driver. The following is how to implement that setting:

Add a line to the /etc/sysconfig/irqbalance file, such as the line below:

# IRQBALANCE_ARGS
# append any args here to the irqbalance daemon, as documented in the man page
#IRQBALANCE_ARGS=
IRQBALANCE_ARGS=--hintpolicy=exact

The irqbalance service needs to be restarted after the HFI1 driver is loaded:

/bin/systemctl restart irqbalance.service
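As an optional check, you can confirm that the running daemon picked up the new argument by inspecting its command line (a quick sanity check; exact output varies by distribution):

ps -ef | grep [i]rqbalance           # --hintpolicy=exact should appear in the daemon's command line
systemctl status irqbalance.service  # the argument is also visible in the CGroup command line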

3.2 CPU Frequency Scaling Drivers

Methods for saving power on CPUs can adversely impact performance. By reducing the CPU clock frequency based on sustained demand and thermal conditions, CPUs reduce power consumption, which can result in substantial savings on power and cooling requirements. However, this also reduces performance, or makes performance measurements more variable. Thermal conditions are not predictable, resulting in run-to-run variation in CPU performance.

The default scaling driver in RHEL* 7 is Intel P-State, aka intel_pstate. An alternative is available, called ACPI CPUfreq, aka acpi_cpufreq driver. Both have their advantages and disadvantages. We will describe short BKMs (best-known-methods) on how to use either for consistent, best-effort performance measurements. This may be advisable during cluster/fabric bring-up when trying to determine if all components of the cluster are performing up to their full capabilities. For long-run operation of a production cluster/super-computer, settings other than those described below may be desired to scale up for performance when loaded, and to scale down for energy savings when idle.

3.2.1 Using the Intel P-State Driver

The Intel® P-State driver has the advantage that it is the default driver for RHEL* 7, so no additional setup is required. This driver is recommended for use with Intel® Xeon Phi™ CPUs, and may be used with Intel® Xeon® CPUs as well.

If you have previously disabled the P-state driver, re-enable it by removing the intel_pstate=disable parameter from the GRUB_CMDLINE_LINUX command line in /etc/default/grub, and then reboot the system.

To run the CPU at its maximum non-Turbo frequency (also known as P1) without scaling to lower frequencies, set the minimum P-state to 100% as root:

echo 100 > /sys/devices/system/cpu/intel_pstate/min_perf_pct

To run at the maximum Turbo frequency, the following steps are needed.

In the BIOS, set (as described in the BIOS table in Section 2.1):

• Intel(R) Turbo Boost Technology -> Enabled, and

• This setting, if present in your BIOS: Advanced -> Advanced Configuration -> CPU P State Control -> Turbo Mode (Enabled).

Then, as root, allow the P-State driver to use Turbo frequencies:

echo 0 > /sys/devices/system/cpu/intel_pstate/no_turbo

For more information on controlling and tuning the behavior of the Intel® P-State driver, see https://www.kernel.org/doc/Documentation/cpu-freq/intel-pstate.txt.
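As an optional sanity check, the following reads confirm which scaling driver is active and that the settings above took effect (these sysfs paths are present on RHEL* 7 when intel_pstate is in use):

cat /sys/devices/system/cpu/cpu0/cpufreq/scaling_driver    # expected: intel_pstate
cat /sys/devices/system/cpu/intel_pstate/min_perf_pct      # expected: 100 after the setting above
cat /sys/devices/system/cpu/intel_pstate/no_turbo          # expected: 0 when Turbo is allowed
grep "cpu MHz" /proc/cpuinfo | sort | uniq -c              # spot-check per-core frequencies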

The Intel® P-State driver works better with RHEL* 7.2 than with RHEL* 7.1. With RHEL* 7.1, users may experience difficulty re-enabling intel_pstate after disabling it. Also, for nodes with Intel® Xeon Phi™ CPUs, RHEL* 7.2 is recommended for stable intel_pstate operation.

3.2.2 Using the ACPI CPUfreq Driver and cpupower Governor

If you are happy with the behavior of your system when using the Intel® P-State driver, you may skip this section.

As an alternative to the Intel® P-State driver, the acpi_cpufreq driver, in conjunction with cpupower, can be used to set a consistent CPU frequency on all CPU cores. The method for doing this is as follows:

1. Disable intel_pstate in the kernel command line:

Edit /etc/default/grub by adding intel_pstate=disable to GRUB_CMDLINE_LINUX.

For example:

GRUB_CMDLINE_LINUX="vconsole.keymap=us console=tty0 vconsole.font=latarcyrheb-sun16 crashkernel=auto console=ttyS0,115200 intel_pstate=disable"

2. Then, to apply the change:

if [ -e /boot/efi/EFI/redhat/grub.cfg ]; then
    GRUB_CFG=/boot/efi/EFI/redhat/grub.cfg
elif [ -e /boot/grub2/grub.cfg ]; then
    GRUB_CFG=/boot/grub2/grub.cfg
fi
grub2-mkconfig -o $GRUB_CFG

3. Reboot

When the system comes back up with intel_pstate disabled, the acpi_cpufreq driver is loaded. This allows CPU frequencies to be set using cpupower.
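An optional check that the switch took effect after the reboot:

cat /sys/devices/system/cpu/cpu0/cpufreq/scaling_driver    # expected: acpi-cpufreq
cpupower frequency-info                                    # shows the active governor and frequency limits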

To reduce run-to-run performance variations, Intel recommends that you pin the CPU clock frequency to a specific value, and that you use the Performance setting of the CPU power governor.

For example, the following command sets the frequency of all cores to 2.6 GHz with the performance governor, when using the acpi-cpufreq driver:

sudo cpupower -c all frequency-set --min 2.6GHz --max 2.6GHz \
     -g performance

However, note that power savings will diminish and the heat dissipation will increase in the server chassis if the above scheme is used.

If Turbo Mode is set to Enabled in the BIOS (as recommended in the "BIOS Settings" section), then to get the maximum advantage from Turbo Mode you need to set the frequencies (for a 2.6 GHz default clock rate CPU) to:

sudo cpupower -c all frequency-set --min 2.601GHz --max 2.601GHz \
     -g performance

The “01” addition to the clock rate is needed to enable this Turbo advantage. For example, if running on an Intel® Xeon® E5 2699 v3 CPU (nominal 2.3 GHz clock rate), then the corresponding command option would be “2.301GHz” instead of “2.601GHz”, as in the example above.

3.3 Avoid acpi_pad Consuming CPU Resources

acpi_pad is the ACPI Processor Aggregator Driver module. It is intended to handle power management on high-core-count processors. Unfortunately, the driver can cause a system to run acpi_pad threads that consume 100% of each core.

A simple workaround is to blacklist acpi_pad by adding the following line to /etc/modprobe.d/blacklist.conf:

blacklist acpi_pad

This will keep the acpi_pad driver from loading at boot time.

If you need to remove acpi_pad without a reboot, you can use:

modprobe -r acpi_pad

This removes the driver. However, note that the driver will load again upon reboot unless it has been blacklisted.
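To check whether acpi_pad is currently loaded or consuming CPU time, a quick inspection such as the following can be used:

lsmod | grep acpi_pad          # no output means the module is not loaded
top -b -n 1 | grep acpi_pad    # acpi_pad kernel threads near 100% CPU indicate the problem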


4 MPI Performance

For best performance, run MPIs over the PSM2 library included with Intel® Omni-Path Fabric Host Software. For example, the MPI selector utility shows these MPIs as being included with the IFS stack:

$ mpi-selector --list
mvapich2_gcc-2.x
mvapich2_gcc_hfi-2.x
mvapich2_intel_hfi-2.x
mvapich2_pgi_hfi-2.x
openmpi_gcc-1.x.y
openmpi_gcc_hfi-1.x.y
openmpi_intel_hfi-1.x.y
openmpi_pgi_hfi-1.x.y

The MPIs with ‘hfi’ in their name are built to use PSM2 by default.

4.1 Intel® MPI Library Settings

If you are running with Intel® MPI Library, use one of the following mpirun/mpiexec options to ensure use of the PSM2 library:

• -PSM2
• -genv I_MPI_FABRICS shm:tmi, or
• -genv I_MPI_FABRICS tmi:tmi

The first two options have the same effect, and both use the Intel® MPI Library’s shared memory communications. The last one uses PSM2 for shared memory within a node. In general, -PSM2 or shm:tmi performs the best overall, but this is application dependent.

Also, to improve MPI start-up time in large node-count runs, one of the Intel® MPI Library mpirun options should be:

-genv I_MPI_HYDRA_PMI_CONNECT alltoall
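For example, a complete launch combining these options might look like the following sketch (the node names, rank counts, and application name are placeholders, not recommendations from this guide):

mpirun -PSM2 \
       -genv I_MPI_HYDRA_PMI_CONNECT alltoall \
       -ppn 24 -n 48 -hosts node01,node02 ./my_app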


4.2 Intel® MPI Benchmarks (IMB) or OSU Micro Benchmarks (OMB)

When running the MPI microbenchmarks with 1 process per node, and with any MPI, it is important to pin the benchmark process to a core on the socket that is local to the PCIe slot occupied by the Host Fabric Interface (HFI). For example, if the PCIe slot with the HFI is connected to socket 1 (of sockets 0 and 1), then one could use "taskset -c N ./IMB-MPI1" to pin the executable to a core N on socket 1.

For another example, when using Intel® MPI Library 5.1 or newer to run the OSU Micro Benchmarks or Intel® MPI Benchmarks with 1 MPI rank active per node, you need to pin the rank to the socket closest to the HFI for best performance. A typical case is where the HFI is connected by PCIe to socket 0; for a 12-core CPU, set:

-genv I_MPI_PROCESSOR_LIST 0-11

This results in the best possible osu_latency or IMB-MPI1 PingPong latency by running the first MPI rank on each node on CPU 0 (or it could be set to another core number on the first socket). This is needed because, when running 1 rank per node, Intel® MPI Library 5.1 does not pin the rank to a particular core; this 1 PPN (process per node) behavior is intended to enable better hybrid OpenMP/MPI performance when using many threads and only one MPI rank per node. The above setting also performs well when running the multi-PPN cases of the OMB or IMB benchmarks.
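For example, a 2-node latency run with the first rank on each node pinned to the socket attached to the HFI might look like the following sketch (node names are placeholders; adjust the processor list to the socket that actually hosts the HFI in your system):

mpirun -genv I_MPI_PROCESSOR_LIST 0-11 -ppn 1 -n 2 -hosts node01,node02 ./IMB-MPI1 PingPong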

4.3 Tuning for MPI Performance on nodes with Intel® Xeon Phi™ CPUs

To significantly improve MPI performance on Intel® Xeon Phi™ CPUs, certain driver parameter and PSM2 environment variable settings are needed. Add the following hfi1 driver parameter settings to the “options hfi1” line in /etc/modprobe.d/hfi1.conf

num_user_contexts=X max_mtu=10240

Options for the value of X for the num_user_contexts setting:

• X=16 for best performance when running a few MPI ranks per node (16 or fewer), such as when using a hybrid programming model with many more threads than MPI ranks.
• X="number of physical cores per node" if you do not plan on running more MPI ranks than cores.
• X=2x the "number of physical cores per node" if you plan on running up to 2x the number of cores as MPI ranks on each node, with Hyper-Threading turned on.

Also, add these PSM2 environment variable settings to your shell's or run script's environment, or propagate them using your mpirun command's preferred syntax:

PSM2_MQ_RNDV_HFI_WINDOW=4194304
PSM2_MQ_EAGER_SDMA_SZ=65536
PSM2_MQ_RNDV_HFI_THRESH=200000

These settings apply to any MPI using PSM2 on Intel® Xeon Phi™.
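Putting the pieces together, a sketch for a hypothetical 68-core Intel® Xeon Phi™ node running one rank per core might look like the following (the core count, node names, and application name are placeholders):

# /etc/modprobe.d/hfi1.conf
options hfi1 num_user_contexts=68 max_mtu=10240

# Propagating the PSM2 variables with Intel MPI (other MPIs use their own -x/-genv syntax)
mpirun -genv PSM2_MQ_RNDV_HFI_WINDOW 4194304 \
       -genv PSM2_MQ_EAGER_SDMA_SZ 65536 \
       -genv PSM2_MQ_RNDV_HFI_THRESH 200000 \
       -ppn 68 -n 136 -hosts knl01,knl02 ./my_app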


When running single-process-per-node benchmarks, you may find that the best performance is achieved using CPU 7 (as enumerated by the OS) and the lowest using CPU 6, with performance between these levels occurring on the remaining OS-enumerated CPUs. This assumes a properly running irqbalance service (as described in the "irqbalance" section) on nodes with Intel® Xeon Phi™ CPUs.

Note: Intel® Xeon Phi™ nodes have up to 72 cores, and with Hyper Threading turned on, up to 288 CPUs enumerated by the OS. It is unlikely that running with 288 MPI ranks per node will be a good performing choice (due to MPI memory consumption), but if you want to try it, set num_user_contexts=144 (max value is 160), then running 288 MPI ranks is possible with 2:1 context sharing. Another option for avoiding the need for context sharing is to install two Omni-Path adapters per node. That will provide enough hardware contexts to support the 288 MPI ranks.


5 System Settings for Verbs Performance

The following settings can improve multi-process-per-node verbs performance, IPoIB performance and stability, and, likely, performance for storage systems that use these protocols.

5.1 HFI1 Driver Module Parameters

Appendix A provides a listing of the driver module parameters, with a brief explanation of each. A short script that prints the current value of each parameter, and instructions on how to set the parameters in an /etc/modprobe.d/hfi1.conf file, are also included.

Currently, the default tunings for these parameters provide the best performance for many situations.

5.2 Parallel File System Concurrency Improvement

For Intel® Omni-Path software version 10.0.1, there is an optional feature (Adaptive Cache Memcpy) that can improve throughput in high-concurrency situations, for example, many outstanding requests in flight, or many threads or processes receiving storage traffic concurrently. To turn on the Adaptive Cache Memcpy heuristic, add this setting to your /etc/modprobe.d/hfi1.conf file:

options hfi1 sge_copy_mode=2

To activate this setting, restart the hfi1 driver, as specified in the Intel® Omni-Path Fabric Host Software User Guide, Section 3.10 “Managing the Omni-Path Fabric Driver.”

The wss_threshold (working set size threshold percentage) parameter works together with the sge_copy_mode=2 setting; its default value is 80. To use that default, it is not necessary to set it on the options hfi1 line above. You may want to experiment to see whether wss_threshold=70 performs better on your file system benchmarks.

Another /etc/modprobe.d/hfi1.conf file setting that can improve parallel file (storage) system scalability is krcvqs (kernel receive queues). On a compute node that is a storage system client node, you may want to try values larger than the default of krcvqs=2. For example:

options hfi1 sge_copy_mode=2 wss_threshold=70 krcvqs=4

The larger the krcvqs value, the higher the chance that it will have some adverse effect on MPI performance for that compute node, but values in the range of 3 to 5 can be evaluated. However, for a storage server node (an OSS for Lustre, or an NSD server for GPFS, for example), where there is no MPI workload, values larger than 5 can certainly be tested to see which performs best. Also be aware of how many CPU cores are on these server nodes: a setting of krcvqs=8, for example, would mean that 8 CPU cores on the server have the servicing of kernel receive interrupts as part of their workload.
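After the driver restart, the values that actually took effect can be read back from sysfs, for example:

cat /sys/module/hfi1/parameters/sge_copy_mode    # expected: 2
cat /sys/module/hfi1/parameters/wss_threshold    # expected: 70 if overridden, 80 by default
cat /sys/module/hfi1/parameters/krcvqs           # expected: 4 in the example above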

5.3 Lustre Parameter Tuning for Intel® Omni-Path

If you are using Lustre version 2.8 or newer, or Intel® Enterprise Edition for Lustre* (IEEL) version 3.0 or newer, you do not need to apply the following tuning; the Lustre module load process applies these tunings automatically.

If you are running earlier versions of Lustre, for best Lustre performance with the Omni-Path interconnect, please apply the following parameter settings in your /etc/modprobe.d/lustre.conf file:

options lnet networks="o2ib0(ib0)"
options ko2iblnd peer_credits=128 peer_credits_hiw=64 credits=1024 concurrent_sends=256 ntx=2048 map_on_demand=32 fmr_pool_size=2048 fmr_flush_trigger=512 fmr_cache=1
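Once the Lustre modules are loaded, a spot-check that the ko2iblnd options were applied can be done through sysfs (assuming the module exposes its parameters under /sys/module, as kernel modules normally do):

cat /sys/module/ko2iblnd/parameters/peer_credits     # expected: 128
cat /sys/module/ko2iblnd/parameters/map_on_demand    # expected: 32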


6 Verbs Benchmarks

This section describes how to use the perftest suite of benchmarks to best measure RDMA/verbs performance on Intel® OPA.

6.1 Perftest

Perftest is an open source benchmark from OpenFabrics Enterprise Distribution (OFED) for verbs performance. It is a set of microbenchmarks written over user-level verbs to measure latency, and uni- and bi-directional bandwidth.

6.2 RDMA Performance

The best perftest to measure verbs Remote Direct Memory Access (RDMA) bandwidth performance is the ib_write_bw test, with the default connection type Reliable Connection (RC). Please refer to the available options by running ib_write_bw -h. The following discussion highlights optimizations for the ib_write_bw test, but is applicable to other tests as well.

MTU size can have a significant impact on bandwidth performance. For larger message sizes, such as 1 MB, the greater the MTU size, the higher the resulting bandwidth. However, for small message sizes, such as 8B, the larger MTU sizes are not optimal. Therefore, the selection of the optimal MTU size depends on a particular mix of application message sizes.

InfiniBand* supports MTU sizes of 256B, 512B, 1024B, 2048B, and 4096B only. OPA, on the other hand, can support MTU sizes from 2048B (2KB) up to 8192B (8KB) for verbs or PSM2 traffic. Intel recommends you use the 8KB MTU default for RDMA requests of 8KB or more.

In order to select the 8KB MTU size in the ib_write_bw test:

1. Specify the -R switch to connect Queue Pairs (QPs) with rdma_cm.
2. Use the address of the server node's ib0 port to specify the IPoIB interface.
3. The switch must also be set to support at least 8KB MTUs.

Before using the rdma_cm path for QPs, the ib_ipoib driver must be loaded. The sequence of driver installations and execution of the ib_write_bw test is:

1. sudo modprobe hfi1
2. sudo modprobe ib_ipoib
3. sudo ifup ib0
4. ib_write_bw -F -R -s 1048576                          // on the server node
5. ib_write_bw -F -R -s 1048576 <server ib0 IP address>  // on the client node

Note: The -F switch is used to prevent the test from failing when the cpufreq_ondemand module or acpi-cpufreq driver is used.
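A quick way to confirm that the driver-side MTU limit is at its 8KB default is to read the hfi1 max_mtu parameter; the switch-side MTU must still be verified through your fabric management tools:

cat /sys/module/hfi1/parameters/max_mtu    # 8192 is the default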


7 IPoFabric Performance

The traditional term for sending IP traffic over the InfiniBand* fabric is IP over IB, or IPoIB. The Intel® Omni-Path fabric does not implement InfiniBand*. Instead, the Intel implementation for OPA is known as IP over Fabric, or IPoFabric. From the software point of view, IPoFabric behaves the same way as IPoIB, and in fact uses an ib_ipoib driver and sends IP traffic over the ib0 and/or ib1 ports. Therefore, we primarily refer to this traffic as IPoFabric, but will continue to use the term ib_ipoib, refer to the ib0/ib1 ports, and measure performance with traditional IP-oriented benchmarks such as qperf and iperf.

7.1 IPoFabric Connected Mode Configuration

For IPoFabric bandwidth benchmarks, a prerequisite for the best-possible performance is having the 8KB MTU enabled on the fabric. To enable the 8KB MTU, one only needs to have IPoIB configured for Connected Mode. Currently, the best way to accomplish this is as follows:

modprobe ib_ipoib
ifup ib0
echo connected > /sys/class/net/ib0/mode
echo 65520 > /sys/class/net/ib0/mtu

The above commands may not be needed if you have the OFED IPoIB driver (a.k.a. ib_ipoib driver) configured to autostart, and settings in your /etc/sysconfig/network-scripts/ifcfg-ib0 file such as:

ONBOOT=yes
NM_CONTROLLED=no
MTU=65520
CONNECTED_MODE=yes
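After the interface is up, the mode and MTU can be verified directly:

cat /sys/class/net/ib0/mode    # expected: connected
cat /sys/class/net/ib0/mtu     # expected: 65520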

7.2 IPoFabric Datagram (UD) Mode Configuration

Although Connected Mode will provide the best performance in general, there may be situations in which you want to set Datagram mode, aka Unreliable Datagram (UD) mode. For example, UD mode only uses one Queue Pair (QP) per node, so it will have a lower memory footprint than Connected Mode (which has one QP for each destination node for which IPoFabric communications are desired). An example of how to set Datagram mode is:

modprobe ib_ipoib
ifup ib0
echo datagram > /sys/class/net/ib0/mode

In this case, you will see that the IP MTU size for ib0 is set to 2044 bytes:

# cat /sys/class/net/ib0/mtu
2044

To improve throughput, the following paragraphs show how to double the size of this UD mode MTU on RHEL* distributions to nearly 4K bytes.


1) Change the FM's configuration file to allow a larger MTU for multicast:

vi /etc/sysconfig/opafm.xml

change this:

<Create>1</Create>
<MTU>2048</MTU>
<Rate>25g</Rate>

to this (MTU and speed updated):

<Create>1</Create>
<MTU>4096</MTU>
<Rate>100g</Rate>

2) Allow the ifup-ib script in RHEL* to accept the larger MTU size:

vi /etc/sysconfig/network-scripts/ifup-ib

change this:

else
    echo datagram > /sys/class/net/${DEVICE}/mode
    # cap the MTU where we should based upon mode
    [ -z "$MTU" ] && MTU=2044
    [ "$MTU" -gt 2044 ] && MTU=2044
fi

to this (comment out the two lines using #):

else
    echo datagram > /sys/class/net/${DEVICE}/mode
    # cap the MTU where we should based upon mode
    # [ -z "$MTU" ] && MTU=2044
    # [ "$MTU" -gt 2044 ] && MTU=2044
fi

3) Stop ib_ipoib on all hosts, then restart the FM, and then restart ib_ipoib to pick up the update:

modprobe -r ib_ipoib
systemctl stop opafm
systemctl start opafm
opainfo | grep PortState
   PortState:     Active
modprobe ib_ipoib
ifup ib0
ifconfig ib0
ib0: flags=4163<UP,BROADCAST,RUNNING,MULTICAST>  mtu 4092
        inet 10.228.216.150  netmask 255.255.255.0  broadcast 10.255.255.255
        inet6 fe80::211:7501:165:b0ec  prefixlen 64  scopeid 0x20<link>


7.3 qperf

qperf is a benchmark that is included in IFS or OFED. It is designed to be run on a pair of nodes. You arbitrarily designate one node to be the server and the other to be the client.

Run a ping test to ensure that ib_ipoib is running. If you are concerned about performance, a quick and useful pretest is to run qperf on both the server and the client node. On the server node, run qperf with no arguments; on the client node, run:

qperf <server ib0 IP address> -m 1M -ca 6 tcp_bw

In the above command line:

<server ib0 IP address>   The IP address of the ib0 port of the server node.

-ca 6   Can be considered a tuning that pins both the server- and client-side qperf processes to core 6, which is typically on the first socket. Do not specify core 0 or other low-numbered cores where OS or driver interrupts will interfere with the benchmark.

-m   Specifies a message size. In the example above, 1M specifies a 1 megabyte message size.

To stop testing, press Ctrl+C on the server side to kill the server's qperf process. On nodes with Intel® Xeon® E5-2600 v3 CPUs, the OPA IPoIB throughput performance should be greater than 3.9 GB/s, if Connected Mode is in effect and the 8KB MTU is set on your fabric links.
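As a complete example, with the server's ib0 address assumed to be 10.10.10.1 (a placeholder for your own fabric addressing):

# on the server node (listens until killed):
qperf

# on the client node:
qperf 10.10.10.1 -m 1M -ca 6 tcp_bw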

7.4 iperf

iperf is a tool for active measurements of the maximum achievable bandwidth on networks that carry IP traffic. It supports tuning of various parameters related to timing, protocols, and buffers. For each test, iperf reports the bandwidth, loss, and other parameters.

The following set of iperf command lines has been found to improve Intel® OPA's IPoIB throughput:

server: iperf3 -s -1 -f G -A 6
client: iperf3 -c <server ib0 IP address> -f G -t 12 -O 2 --len 1M -A 6


In the above command lines:

-A 6   Similar to -ca with qperf; this sets CPU affinity to core 6.

-t 12 -O 2   Runs the test for 12 seconds, but omits the first 2 seconds when calculating the bandwidth result at the bottom of the output. This typically improves performance and makes performance results more stable.

-f G Indicates the output bandwidth should be in GB (gigabyte) units.

--len Indicates the message size in bytes, in this case 1M = 1 Megabyte

<server ib0 IP address>   The IPoIB address of the server.

iperf3, the current version of iperf, is a new implementation from scratch, with the goal of a smaller, simpler code base, and a library version of the functionality that can be used in other programs. The latest version of iperf is available for download at:

http://software.es.net/iperf/


Appendix A Driver Module Parameters

This section describes 1) the list of HFI1 driver module parameters, 2) how to set them, and 3) how to activate the changes. Default settings for these parameters currently achieve the best performance.

A.1 Listing the Driver Parameters

To get a listing and brief description of the HFI1 driver module parameters:

$ modinfo hfi1
filename:       /lib/modules/3.10.0-229.20.1.el7.x86_64/updates/drivers/infiniband/hw/hfi1/hfi1.ko
version:        0.10-158
description:    Intel Omni-Path Architecture driver
license:        Dual BSD/GPL
rhelversion:    7.1
srcversion:     59650D5AAB893A6594F25F8
alias:          pci:v00008086d000024F1sv*sd*bc*sc*i*
alias:          pci:v00008086d000024F0sv*sd*bc*sc*i*
depends:        ib_core,compat,ib_mad
vermagic:       3.10.0-229.20.1.el7.x86_64 SMP mod_unload modversions
parm:           lkey_table_size:LKEY table size in bits (2^n, 1 <= n <= 23) (uint)
parm:           max_pds:Maximum number of protection domains to support (uint)
parm:           max_ahs:Maximum number of address handles to support (uint)
parm:           max_cqes:Maximum number of completion queue entries to support (uint)
parm:           max_cqs:Maximum number of completion queues to support (uint)
parm:           max_qp_wrs:Maximum number of QP WRs to support (uint)
parm:           max_qps:Maximum number of QPs to support (uint)
parm:           max_sges:Maximum number of SGEs to support (uint)
parm:           max_mcast_grps:Maximum number of multicast groups to support (uint)
parm:           max_mcast_qp_attached:Maximum number of attached QPs to support (uint)
parm:           max_srqs:Maximum number of SRQs to support (uint)
parm:           max_srq_sges:Maximum number of SRQ SGEs to support (uint)
parm:           max_srq_wrs:Maximum number of SRQ WRs support (uint)
parm:           piothreshold:size used to determine sdma vs. pio (ushort)
parm:           sge_copy_mode:Verbs copy mode: 0 use memcpy, 1 use cacheless copy, 2 adapt based on WSS (uint)
parm:           wss_threshold:Percentage (1-100) of LLC to use as a threshold for a cacheless copy (uint)


parm:           wss_clean_period:Count of verbs copies before an entry in the page copy table is cleaned (uint)
parm:           sdma_comp_size:Size of User SDMA completion ring. Default: 128 (uint)
parm:           cache_size:Send and receive side cache size limit (in MB) (ulong)
parm:           sdma_descq_cnt:Number of SDMA descq entries (uint)
parm:           sdma_idle_cnt:sdma interrupt idle delay (ns,default 250) (uint)
parm:           num_sdma:Set max number SDMA engines to use (uint)
parm:           desct_intr:Number of SDMA descriptor before interrupt (uint)
parm:           qp_table_size:QP table size (uint)
parm:           pcie_caps:Max PCIe tuning: Payload (0..3), ReadReq (4..7) (int)
parm:           aspm:PCIe ASPM: 0: disable, 1: enable, 2: dynamic (uint)
parm:           pcie_target:PCIe target speed (0 skip, 1-3 Gen1-3) (uint)
parm:           pcie_force:Force driver to do a PCIe firmware download even if already at target speed (uint)
parm:           pcie_retry:Driver will try this many times to reach requested speed (uint)
parm:           pcie_pset:PCIe Eq Pset value to use, range is 0-10 (uint)
parm:           num_user_contexts:Set max number of user contexts to use (uint)
parm:           krcvqs:Array of the number of non-control kernel receive queues by VL (array of uint)
parm:           rcvarr_split:Percent of context's RcvArray entries used for Eager buffers (uint)
parm:           eager_buffer_size:Size of the eager buffers, default: 2MB (uint)
parm:           rcvhdrcnt:Receive header queue count (default 2048) (uint)
parm:           hdrq_entsize:Size of header queue entries: 2 - 8B, 16 - 64B (default), 32 - 128B (uint)
parm:           user_credit_return_threshold:Credit return threshold for user send contexts, return when unreturned credits passes this many blocks (in percent of allocated blocks, 0 is off) (uint)
parm:           max_mtu:Set max MTU bytes, default is 8192 (uint)
parm:           cu:Credit return units (uint)
parm:           cap_mask:Bit mask of enabled/disabled HW features
parm:           kdeth_qp:Set the KDETH queue pair prefix (uint)
parm:           num_vls:Set number of Virtual Lanes to use (1-8) (uint)
parm:           rcv_intr_timeout:Receive interrupt mitigation timeout in ns (uint)
parm:           rcv_intr_count:Receive interrupt mitigation count (uint)
parm:           link_crc_mask:CRCs to use on the link (ushort)
parm:           loopback:Put into loopback mode (1 = serdes, 3 = external cable (uint)
parm:           mem_affinity:Bitmask for memory affinity control: 0 - device, 1 - process (uint)


A.2 Current Values of Module Parameters

To list the current values for the module parameters, you can run the following script:

$ for x in /sys/module/hfi1/parameters/*; do echo $(basename $x) $(cat $x); done
aspm 0
cache_size 256
cap_mask 0x4c09a00cb9a
cu 1
desct_intr 64
eager_buffer_size 2097152
hdrq_entsize 32
kdeth_qp 128
krcvqs 4
link_crc_mask 3
lkey_table_size 16
loopback 0
max_ahs 65535
max_cqes 196607
max_cqs 131071
max_mcast_grps 16384
max_mcast_qp_attached 16
max_mtu 8192
max_pds 65535
max_qps 16384
max_qp_wrs 16383
max_sges 96
max_srqs 1024
max_srq_sges 128
max_srq_wrs 131071
mem_affinity 0
num_sdma 0
num_user_contexts 28
num_vls 8
pcie_caps 0
pcie_force 0
pcie_pset 255
pcie_retry 5
pcie_target 3
piothreshold 256
qp_table_size 256
rcvarr_split 25
rcvhdrcnt 2048
rcv_intr_count 16
rcv_intr_timeout 840
sdma_comp_size 128
sdma_descq_cnt 2048
sdma_idle_cnt 250
sge_copy_mode 0
user_credit_return_threshold 33
wss_clean_period 256
wss_threshold 80


A.3 Setting HFI1 Driver Parameters

One method for setting the HFI1 driver module parameters is to edit the file /etc/modprobe.d/hfi1.conf. An example of doing so (not a recommendation) would be:

$ cat /etc/modprobe.d/hfi1.conf
options hfi1 pcie_caps=0x51 krcvqs=3

Note: For operating systems such as RHEL* 7.x and SLES* 12.x, you need to rebuild the initrd using the dracut -f command after making changes to the /etc/modprobe.d/hfi1.conf file. This makes the changes persistent across server reboots.

Restart the driver by rebooting the system.
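After the reboot, the values the driver actually loaded with can be read back from sysfs; note that integer parameters are displayed in decimal, so 0x51 appears as 81:

cat /sys/module/hfi1/parameters/pcie_caps    # expected: 81 (decimal form of 0x51)
cat /sys/module/hfi1/parameters/krcvqs       # expected: 3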

The krcvqs=3 setting only affects the first VL. To set a krcvqs value of 3 in the case where 8 VLs are configured, the krcvqs values would look like:

options hfi1 krcvqs=3,3,3,3,3,3,3,3

Again, these are just example settings and not recommended in general.
