Intel® Products Highlights and Best Practices

Edmund Preiss Business Development Manager Entdecken Sie weitere interessante Artikel und News zum Thema auf all-electronics.de!

Hier klicken & informieren! Agenda

• Key enhancements and highlights since ISTEP’11

• Industry segments using ® Software Development Products

• Customer Demo and Best Practices

Copyright© 2012, Intel Corporation. All rights reserved. 2 *Other brands and names are the property of their respective owners. Key enhancements & highlights since ISTEP’11

3 All in One -- Intel® Cluster Studio XE 2012

Analysis & Correctness Tools

Shared & Distributed Memory Application Development Intel Cluster Studio XE supports:

-Shared Memory Processing MPI Libraries & Tools -Distributed Memory Processing & Libraries Programming Models -Hybrid Processing

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Intel® VTune™ Amplifier XE New VTune Amplifier XE features very well received by Software Developers

Key reasons : • More intuitive – Improved GUI points to application inefficiencies • Preconfigured & customizable analysis profiles • Timeline View highlights concurrency issues • New Event/PC counter ratio analysis concept easy to grasp

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Intel® VTune™ Amplifier XE The Old Way versus The New Way

The Old Way: To see if there is an issue with branch misprediction, multiply event value (86,400,000) by 14 cycles, then divide by CPU_CLK_UNHALTED.THREAD (5,214,000,000). Then compare the resulting value to a threshold. If it is too high, investigate.

The New Way: Look at the Branch Mispredict metric, and see if any cells are pink. If so, investigate.

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Development Across Devices

• Common tool set for Intel ® Atom™ processor based platform designs • Early availability of support and optimization • Cross-development solutions for complex setups • Full Awareness of all SoC devices Intel Atom Processor CE4200

TCP/IP

USB JTAG

Macraigor * “usb2demon *“

USB JTAG

Intel® XDP3 JTAG probe Intel Atom Processor E600

A single flexible tool solution for embedded development and validation

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Intel ® Embedded Software Development Tool Suite 2.3 for Intel ® Atom™ Processor

Identify Performance optimization optimize your opportunities Thread Specific code for IA Run Control & Kernel debug; Thread Grouping On-Chip trace & SMP run control

Intel ® Intel ® Intel ® Intel ® Intel ® ++ Integrated VTune™ Application JTAG Performance Amplifier XE Debugger Debugger Primitives

Linux * Based Targets JTAG I/F

Broad Processor Intel ® Atom™ Processor coverage CE4xxx, Z6xx, E6xx, N4xx, D5xx series

Target OS: *; CE Linux *, Yocto *, Essential tools to develop and optimize MeeGo * embedded systems

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Intel Compilers support the Future IA CPUs

45nm 32nm 22nm 14nm

Intel Compilers’ v12.1 Architectural Support

Fortran and ICC 12.1 support already Ivy Bridge and Haswell Potential future options, subject to change without notice.

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Latest Compilers produce Industry Leading Performance

• Intel v12.1 compilers improve performance compared with: – Competitive compilers – Previous version Intel compilers

Copyright© 2012, Intel Corporation. All rights reserved. 10 *Other brands and names are the property of their respective owners. Flow Graph - Intel® Threading Building Block 4.0

Graph object "Using Intel TBB's new flow graph feature, we accomplished what was Graph node previously not possible, parallelize a very sizable task graph with thousands of interrelationships - all in about a week .“

Robert Link, GCAM Project Scientist, Edge Pacific Northwest National Laboratory

Easy to implement Flow Graph describes Task dependenices for parallelism

Copyright© 2012, Intel Corporation. All rights reserved. SSG Software Enabling Summit*Other brands and names are the propertyINTEL of their CONFIDENTIAL respective owners. – Internal Use Only Intel ® ™ Plus is now available in open-source and for GCC 4.7!

• A branch for GCC-4.7 called "cilkplus" with Intel ® Cilk™ Plus extensions available

• More infos at – http://gcc.gnu.org/svn/gcc/branches/cilkplus

• Open Specification at www.cilkplus.org

• Complete runtime source to support the Linux* available

• Tested on both the x86_32 and x86_64 architectures

Copyright© 2012, Intel Corporation. All rights reserved. 12 *Other brands and names are the property of their respective owners. Comprehensive software development environment for OpenCL* applications Intel® OpenCL SDK 1.5 available (free download at ‘http://software.intel.com/en- us/articles/vcsource-tools-opencl-sdk/’)

• Intel OpenCL SDK Implements the OpenCL standard optimized for Intel® processors ( Windows* & Linux* operating systems)

• Intel OpenCL SDK 1.5 - Fully conformant with OpenCL 1.1 specification for the CPU supporting AVX

* OpenCL provides a uniform programming environment to write portable code using a mix of multi-core CPUs and other parallel processors

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Intel® Debugger (IDB) New in 12.1: IDB + Inspector XE (memory analysis)

• IDB can be used with Inspector XE as debugger during memory analysis.

IDB supports all 3 types of memory analysis Start debugger when first error was found or with every application start

Developer can use most current program information

14

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Intel® Parallel Advisor XE ( in Beta Program) Discover, Analyze, Check, Add Parallelism Design Phase Analyzes the executing serial program as developers work through the methodology Survey Target Focuses effort where it will pay off and targets coarse-grain parallelism

Annotate Sources Inserts annotations to sources to describe parallel experiments

Check Suitability Evaluates performance impact and implication of the parallel experiments

Check Correctness Finds and resolves conflicts before parallelism is applied. Reduces debugging problems by defining safe and solid paths to parallelism

Models Intel® Threading Building Blocks, Intel® Cilk™ Plus, Microsoft*PPL, Windows* Threads, OpenMP*

Intel Parallel Advisor XE becomes integral part of XE Supports now Linux and Windows, and C#

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. What Customers Say… About current Intel® Parallel Advisor

“The Intel Parallel Advisor “Intel Parallel Advisor is an design approach was excellent tool for use in instrumental in conjunction with a class on introducing parallelism parallel programming, enabling into our code . The Survey students to quickly comprehend feature helped improve the theory of parallelism and our code by finding areas in implementation .” our serial code that took a lot of CPU time, and where our code would benefit from parallelism.”

Korbinian Molitorisz Dr. William Orttung Dipl.-Inform, Karlsruhe Emeritus Professor Institute of Technology” of Chemistry

Kirk Haller, Director of “Leveraging Intel Parallel Advisor, we were able to test our ideas, and Research at Dassault determine where to spend our development effort . With the help of Intel Systèmes SolidWorks Parallel Advisor, we achieve the performance scaling we were hoping for, Corporation and saved a lot of effort and frustration ." ”

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. What Customers Say… About Intel® Parallel Advisor

“As a “The Intel Parallel Advisor “Intel Parallel Advisor is an C++ developer for many design approach was excellent tool for use in years, and with no previous instrumental in conjunction with a class on experience with parallel introducing parallelism parallel programming, enabling programming, Intel Parallel into our code . The Survey students to quickly comprehend Advisor proved to be a major feature helped improve the theory of parallelism and boon in making it easier our code by finding areas in implementation .” and efficient to implement our serial code that took a parallelism through the lot of CPU time, and where planning and production our code would benefit from phase.” parallelism.”

Korbinian Molitorisz Brian Reynolds Dr. William Orttung Brian Reynolds Dipl.-Inform, Karlsruhe Emeritus Professor Institute of Technology” Research of Chemistry

Kirk Haller, Director of “Leveraging Intel Parallel Advisor, we were able to test our ideas, and Research at Dassault determine where to spend our development effort . With the help of Intel Systèmes SolidWorks Parallel Advisor, we achieve the performance scaling we were hoping for, Corporation and saved a lot of effort and frustration." ”

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Develop Code for Intel® Xeon® Processors with Intel Developer Products today - Easily Extends to Intel® MIC Architecture

“By just utilizing standard programming on both Intel® Xeon processor and Intel® MIC architecture based platforms, the performance met multi-threading scalability expectations and we observed near-theoretical linear performance scaling with the number of threads.” – Hongsuk Yi, Heterogeneous Computing Team Leader, KISTI Supercomputing Center

Copyright© 2012, Intel Corporation. All rights reserved. 18 *Other brands and names are the property of their respective owners. Intel® MPI Application LaunchManager - Hydra Process Manager

• New Intel Hydra Process Manager comes with MPI library

• Hydra Manager speeds up Application launch over existing MPD (Intel MPI) significantly • Tests with ‘empty codes‘ shows performance gains in the range of factor 10

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Pushing the Scalability Envelope Intel® MPI Library 4.0

Intel® MPI Library 4.0 scales beyond 90k processes

Copyright© 2012, Intel Corporation. All rights reserved. 20 *Other brands and names are the property of their respective owners. What customers say about the Intel® MPI Library

"We're delighted by the efforts of "As we scale our astrophysics codes the Intel cluster tools team in and our clusters to handle larger helping us scale our applications to and larger simulations, the Intel® 10s of cores with Intel® MPI MPI Library continues to deliver Library 4.0. and raising the on scaling, performance, and performance bar in providing us stability ...”. with the most scalable Dr. Ue-Li Pen, Associate Director at Canadian Institute for commercial MPI library for Intel Theoretical Astrophysics, University of Toronto architecture based processors.”

Dr. Daniel Gruner, Chief Technical Officer - Software, University of Toronto

Copyright© 2012, Intel Corporation. All rights reserved. 21 New *Other brands and names are the property of their respective owners. Pro Industry Overview –

Industries that are using Intel® Developer Products

22 HPC and Intel Tools usage  Sciencific Research Government  Physics  Chemistry  Biology  Astronomy / Astrophysics  Life Sciences  Universities / Academia Universities  Oil exploration / Geophysics  Seismic simulation  Reservoir modeling  Weather Forecast / Climate research  Manufacturing  CFD / FEA / Simulation Industry  Cars, Planes, Aerospace, Devices,…  Finance  Investment banks / Trading  Risk management  Intense numerical Calculations & Simulations on Clusters - “Number crunching”  Lots of legacy code in most verticals requiring Intel Fortran  HPC a stronghold for Linux

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Main Intel Tools mapped to HPC Segments HPC Segments Mainly used Intel Tools  HPC / Data Centres / Academic Institutes •Intel Fortran & Intel C/C++ Compilers •Intel MKL  Research and Development organisations •Intel MPI, Intel Trace Analyser and Collector •Intel VTune Amplifier XE making inroads  Energy • Intel Fortran & Intel C/C++ Compilers • Intel libraries MKL & IPP  Oil and Gas • Intel MPI • Intel® VTune Amplifier XE • XE  Finance • Significant usage of Intel C/C++ Compiler • Intel MKL • Intel® VTune Amplifier XE • Intel Inspector XE • Manufacturing / Automotive •Intel C/C++ & Fortran Compiler • Intel MKL • Intel Performance Primitives • Intel® VTune Amplifier XE • Intel Inspector XE • Intel MPI growing

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Shared Memory Processing Segments  Primary target = “Native code”  Developers using C, C++  Windows / Linux / MacOS  Performance oriented

 Digital imaging & Signal processing  Image processing (photo, medical imaging, GPS, etc)  Content Creation (music,TV & radio)  Cinema industry (2D & 3D, post-production)  Embedded systems  Healthcare, Telecoms  PC-Games  Enterprise applications  Databases, Business Intelligence, etc

25

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Shared Memory Processing Segments

Segment Mainly used Intel Tools  Digital imaging & Signal processing • Intel C/C++ Compilers  Image processing • Intel® VTune Amplifier XE (photo, GPS, etc) • Intel Performance Primitives  Content Creation (Coding /Decoding of Images and Video, Cryptography) (music,TV & radio) • MKL  Cinema industry • Intel Inspector XE (2D & 3D, post-production) • Intel Threading Building Blocks  Embedded systems • Intel C/C++ Compilers  Healthcare (medical imaging) • Intel® VTune Amplifier XE Telecoms • Intel Inspector XE • Intel Performance Primitives  Enterprise applications • Intel C/C++ Compilers and Fortran (legacy)  Databases, Business Intelligence, etc… • Intel® VTune Amplifier XE • MKL • Intel Performance Primitives • PC Games • Intel® VTune Amplifier XE •Threading Building Blocks

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Best Practices

27 SAP *

• Application : SAP HANA Database A Performance optimised in-Memory Computing Application

“Intel® Performance Counter In the course of optimization for Intel architecture, the SAP HANA Monitor has been a very valuable tool for us to unlock the full potential development team benefited dramatically from the use of Intel® of state-of-the-art multi-core Software Development Products …. processors, especially for non- uniform memory access debugging.

Intel® VTune™ helps take advantage As other tools do not allow of hardware counters to identify micro -architectural events, such for such quick and easy monitoring of memory accesses between as cache misses, and tie them back to specific code locations. different sockets, the Intel This capability has played an essential role in optimizing the code Performan Counter Monitor was a great help for SAP when develping for the out-of-order engine and cache hierarchy. new parallel algorithms that control the memory location on different Intel® Parallel Amplifier simplifies the effort to find multicore sockets for better performance. performance bottlenecks, providing for efficient, robust Without the tool, we were always in optimization for scaling for the parallelism provided by multi- doubt whether the applied memory access pattern really works. processor, multi-core systems with Intel HT Technology…. Additionally, the tool has allowed us to identify unexpected problems related to cache pressure very Source : quickly.” http://a676.g.akamaitech.net/f/676/773/1d/images.delivery.net/ cm50content/intel/software/e-alert3/mc_sap_wp.pdf – Oliver Rebholz, SAP development manager

* Other names, logos and brands may be claimed as the property of others.

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Nik Software Inc. *

• Nik Software, Inc. develops software for the digital- imaging and photography markets. • Nik Software technology & plug-in products for digital photography and imaging professionals: – U Point® technology, Dfine®, Viveza®, HDR Efex Pro™,Silver Efex Pro™, Color Efex Pro™, Nik Sharpener Pro™ software. • Nik Software uses Intel Software development Products Welcome Martin Boehme Nik Software Inc.

* Other names, logos and brands may be claimed as the property of others.

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Tools that Developers Count on Keep Getting Better “I use the in Intel® “Tried the latest Intel Cilk Plus and liked “The Intel tools provided an excellent Fortran Composer XE, and I very much the lower overhead from using Intel return on investment . Intel® Parallel like the new Object Browser. It makes it Cilk Plus spawning compared to Inspector allowed the code to be much easier to navigate in modules that OpenMP* task. The concepts behind validated as ‘data race-free’ on our have many routines. The compiler is Cilk Plus – simplification of adding validation suite, and Intel® Parallel stable and offers outstanding parallelism – is really great.” Amplifier allows us to focus efforts on performance . We are a small company David Carver, Texas Advanced Computing Center the hotspots.” but big proponents of Intel Fortran!” University of Texas at Austin Andrew Cunningham Technical Staff Member, ESI Warner Weiss, Manager, Sugars International LLC Group

"Intel® Parallel Studio XE is a great “The new interface is a joy to use . software development tool for Intel® VTune Amplifier XE gives us performance-oriented Windows*-based precise, down-to-the-metal C++ software developers. I achieved an performance data that’s invaluable for astonishing boost in performance by pinpointing hotspots and evaluating using Intel® Cilk Plus and Array Notation the effect of optimizations” features in my code. If you need Daniel Schwarz, Performance Engineer Nik performance, try Intel Parallel Studio Software reference customer case study XE."

Jorge Martinis, Research & Development Engineer, BR&E Inc.

Copyright© 2012, Intel Corporation. All rights reserved. 30 *Other brands and names are the property of their respective owners. 4/25/2 Summary

• Many new useful features introduced during the last year

• Intel Studio XE Bundles well received

• Intel Software Development Products widely used by many customers in a diverse mix of vertical market segments

• Intel is working intensively on upcoming enhancements

Copyright© 2012, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Thank You

32 Legal Disclaimer & Optimization Notice

INFORMATION IN THIS DOCUMENT IS PROVIDED “AS IS”. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO THIS INFORMATION INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT.

Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components they are considering purchasing. For more information on performance tests and on the performance of Intel products, reference www.intel.com/software/products .

Copyright © , Intel Corporation. All rights reserved. Intel, the Intel logo, Xeon, Core, VTune, and Cilk are trademarks of Intel Corporation in the U.S. and other countries. *Other names and brands may be claimed as the property of others.

Optimization Notice Intel’s compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Notice revision #20110804

Copyright© 2012, Intel Corporation. All rights reserved. 33 *Other brands and names are the property of their respective owners. 34