S O L U T I O N BRIEF

Intel Cluster Ready and the Processor 5500 Series: A Dynamic Duo for Broadening HPC Cluster Adoption Sponsored by: Intel Corporation

Steve Conway Richard Walsh Jie Wu April 2009

EXECUTIVE SUMMARY

Intel's new Intel Xeon processor 5500 series, formerly known as Nehalem, has the turbo-charged specs, especially for bandwidth, that should allow it to drive sustained performance advances on a wide range of high-performance computing (HPC) applications. The challenge is harnessing this newfound power in the context of HPC clusters, which distribute processing power across a set of independent computers that are linked together by a mix of software and networking technologies from different vendors.

Taming HPC cluster complexity is especially daunting for newer HPC users and those considering first-time migration to HPC from desktop computers. These users typically lack access to in-house HPC experts, and they buy systems priced below $250,000 (IDC's HPC workgroup and departmental segments). They are an important source of HPC market growth, and vendors that want to broaden HPC cluster adoption and usage need to address the "ease-of-everything" requirements of this important group of users.

Intel had this goal in mind when it launched the Intel Cluster Ready (ICR) program in 2007 with two other founding vendors. The number of ICR vendor members has grown to more than 150 in the relatively short time since then, testifying to the growing appeal of reference architectures as a means of alleviating cluster complexity. ICR is not the only cluster reference architecture in the HPC market, but it is the best-known one and has built the largest contingent of OEM and software partners.

Not surprisingly, Intel and its ICR partners began preparing for the Intel Xeon processor 5500 series' arrival some time ago. Thanks to that programmatic foresight, HPC users — and desktop users aspiring to HPC — will be able to exploit the Intel Xeon processor 5500 series' advanced capabilities on ICR-compliant (i.e., preintegrated, pretested) cluster platforms from a variety of OEMs as soon as these Intel Xeon 5500 processor-based systems reach the marketplace. Global Headquarters: 5 Speen Street Framingham, MA 01701 USA P.508.872.8200 F.508.935.4015 www.idc.com F.508.935.4015 P.508.872.8200 01701 GlobalUSA Framingham, Street Headquarters:MA 5 Speen

In sum, the combination of ICR and the Intel Xeon processor 5500 series has the potential to become a powerful catalyst for expanding worldwide HPC adoption and use.

SITUATION OVERVIEW

Barriers to Increased Cluster Adoption and Productivity

The Rise of Clusters to HPC Market Dominance

A cluster is a set of independent computers linked together by software and networking technologies from multiple vendors. Clusters originated as do-it-yourself HPC systems. In the late 1990s users began employing inexpensive hardware to cobble together scientific computing systems based on the "Beowulf cluster" concept pioneered by Thomas Sterling and Donald Becker at NASA. From their Beowulf origins, clusters have evolved and matured substantially.

Clusters have made HPC servers one of the fastest-growing IT markets in recent years, almost single-handedly propelling HPC server revenue past $10 billion in 2007. Primarily due to superior price/performance based on the use of standard technologies, clusters have become the dominant species of HPC servers, capturing 65% of market revenue in 2007 and increasing their share in 2008. IDC predicts that as the worldwide economic downturn causes worldwide HPC server revenue to dip modestly in 2009 before resuming growth in 2010, the aggressive price/performance of clusters will make them even more appealing.

An Escalating Problem: Cluster Management and Operational Complexity

But clusters have some downsides, too. The system management issues that plagued the early years of clusters remain in force today because clusters are moving targets. Even as vendors have made progress in taming clusters, cluster complexity has raced out ahead of them. The number of processor elements that need to be managed has grown rapidly as processor counts in the average cluster have ballooned and as multicore processors have doubled and quadrupled the elements in each processor. Add to this an expanding variety of applications, libraries, operating system versions and patches, and management tools, and it is not hard to see the source of the difficulty in managing cluster systems. The deployment of clusters in new environments, especially grids, has added to the management challenges. Because HPC clusters are moving targets, users report the same major pain points today that they reported five years ago.

Bandwidth Limitations Constrain Application Performance

Another notable weakness of HPC clusters has been increasingly poor per-core bandwidth. As standard processors have gone from one to two cores, and then from two to four, they have multiplied their peak performance (flops) without corresponding increases in bandwidth. This dramatic worsening of the bytes/flops ratio has impaired cluster efficiency and productivity by making it more and more difficult to move data into and out of each core fast enough to keep the cores busy, especially for bandwidth-intensive commercial ISV applications. Most HPC ISV applications were originally designed to operate on a single core with relatively strong access to main memory. And although many of these codes have since been modified to scale to a handful of cores, poor per-core bandwidth often seriously constrains their performance.

2 #217715 ©2009 IDC A recent IDC survey revealed that about one in eight HPC sites (12%) had at least some codes whose per-core performance on real-world applications was slower on their newest HPC system than on the prior one — and 50% of the sites said they expected to see this kind of retrograde performance on some codes in the next 12 months.

New and Less Experienced Users Require "Ease of Everything" When the Cray-1 supercomputer debuted in 1976, it came with no operating system. Not a problem: The early government customers employed legions of computer scientists who were perfectly capable of writing their own operating system. As the HPC market expanded to new groups of customers lacking this in-house expertise, especially within industry, HPC systems had to become easier to deploy and operate. But HPC system management was never painless, and when clusters arrived to spur unprecedented HPC market growth, management complexity took a decided turn for the worse.

Despite this increasing complexity, clusters inaugurated a period of rapid HPC market growth starting in 2001. Recognizing that an important source of this growth was at the entry level, IDC began tracking a new "HPC workgroup" segment for systems (primarily clusters) priced below $100,000. This price band and the HPC departmental segment just above it (systems priced from $100,000 to $249,000) together captured about $5.6 billion in 2008, and IDC predicts their combined revenue will reach $7.6 billion in 2013. Users in these segments are typically found in SMBs or in smaller units of tier 1 firms. They lack access to HPC experts and require systems that provide "ease of everything": purchasing, deployment, and management. In 2007 and 2008, OEMs began aiming more products at these growing segments. Many of these new products are ICR compliant.

Waiting behind those who have already migrated from desktop computing to HPC workgroup systems is a large contingent of others who are poised to make this move if barriers are lowered enough. In an IDC study of desktop technical computing users, 57% reported having important problems they could not solve with their desktop computers. A nearly equal percentage (55%) were open to using HPC under the right circumstances.

ICR AND THE INTEL XEON PROCESSOR 5500 SERIES: A CATALYST FOR BROADENING HPC ADOPTION AND USE

Overview of ICR

The Intel Cluster Ready program tackles cluster complexity by offering a reference architecture for Intel-based systems that OEMs can use to certify their configurations and that ISVs can use to test and register their applications. The chief goal of this voluntary compliance program is to ensure fundamental hardware/software integration so that end users can get their work done even in cases where no HPC experts are available to help. ICR is designed to allow overworked IT departments to respond more readily to end users' HPC requirements, enabling SMBs to migrate to clusters from desktop computers and helping large firms to drive HPC further down in the organization. In a nutshell, the ICR program's preintegration aims to enable a much broader group of end users to exploit HPC.

©2009 IDC #217715 3

Intel Xeon Processor 5500 Series Technical Innovations and Benefits Supporting HPC

The arrival of Intel Xeon 5500 processors will be welcome news for the worldwide HPC community, from entry-level to high-end users. The Intel Xeon processor 5500 series is designed to provide the per-core bandwidth that has been needed to advance performance on key scientific and engineering applications. In addition, being able to access the capabilities of Intel Xeon 5500 processors in preintegrated, pretested, ICR-compliant clusters from a variety of OEMs should facilitate first-time HPC adoption, along with increased usage among newer, less experienced HPC users.

Intel Xeon Processor 5500 Series Innovations

Among Intel Xeon processor 5500 series innovations, those most important to HPC clusters include the following:

The Intel QuickPath Interconnect (QPI) replaces the front-side bus and promises to greatly speed up point-to-point communication.

The integrated memory controller provides direct (rather than mediated) access to up to three memory channels of DDR3 SDRAM.

Intel Xeon 5500 processors are designed to reduce energy use by about 30%.

Intel Xeon Processor 5500 Series Benefits for HPC

Better Application Performance

The Intel Xeon processor 5500 series' biggest benefits for HPC will be bandwidth, bandwidth, and bandwidth — because there has been such a dearth of it in recent years. Assuming The Intel Xeon processor 5500 series performs as its specifications indicate, this major leap forward in processor and per-core bandwidth will substantially improve the bytes/flops ratio and advance sustained performance on a wide range of real-world applications in industry, government, and academia. Bandwidth-hungry CAE, CFD, and other economically important industrial applications should especially benefit from the availability of the Intel Xeon processor 5500 series.

Avoiding SMP Expense

To date, entry-level and midrange HPC users who needed exceptional bandwidth at the processor level have had to opt for systems with costlier custom architectures, especially symmetric multiprocessing (SMP) systems. Demand for SMP systems has declined substantially during the past decade because of the systems' high cost. The availability of the Intel Xeon processor 5500 series will give HPC users a way to access stronger bandwidth in price-competitive, standards-based clusters. In HPC systems with high-performance, systemwide custom interconnects, such as Myrinet, Quadrics, and Cray SeaStar, The Intel Xeon processor 5500 series is designed to support high bisection bandwidth.

4 #217715 ©2009 IDC

Improvements in Reducing Cluster Complexity

Advantages of Integrated Intel Xeon Processor 5500 Series-ICR Solutions from OEMs

Given the inherent complexity of clusters, HPC users who want to access the Intel Xeon processor 5500 series' advanced capabilities should strongly consider clusters based on reference architectures — and the ICR reference architecture from Intel has the largest following among OEM cluster vendors.

With the ICR program, Intel set out to create a win-win scenario for the major constituencies in the HPC cluster market:

Hardware vendors and ISVs stand to win by being able to ensure both buyers and users that their products will work well together straight out of the box.

System administrators stand to win by being able to meet corporate demands to push HPC competitive advantages deeper into their organizations while satisfying end users' demands for reliable HPC cycles, all without increasing IT staff.

End users stand to win by being able to get their work done faster, with less downtime, on certified cluster platforms.

Intel has positioned itself to win by expanding the total addressable market (TAM) and reducing time to market for its microprocessors, chip sets, and platforms.

OEMs Offering/Planning Intel Xeon Processor 5500 Series-ICR Solutions

Among the OEMs committed to supporting or expected by Intel to support ICR-compliant, Intel Xeon 5500 processor-based products are the following:

AMAX

Appro

ClusterVision

Colfax

Cray

Dell

ION

Penguin

Rackable

©2009 IDC #217715 5 SGI

SiMec

Streamline

Supermicro

transtec

Viglen

CHALLENGES AND OPPORTUNITIES

Challenges

Benchmark Intel Xeon 5500 processors on a range of HPC applications. IDC understands that the Intel Xeon processor 5500 series has shown impressive early results on a number of HPC codes. HPC users will want to see how the Intel Xeon processor 5500 series performs on their own applications, however. To realize the Intel Xeon processor 5500 series' potential in the HPC market, Intel, in collaboration with its OEM and ISV partners, will need to produce benchmark results on a broad range of HPC applications. This will likely take some time.

Keep pace with increasing cluster complexity. ICR and other reference architectures will need to evolve to reflect important new cluster trends and sources of complexity. All reference architectures reflect technology choices, and to continue making smart choices, Intel will need to stay on top of HPC users' changing requirements.

Compete successfully against major OEMs' own reference architectures. In a shining example of "coopetition," some major OEMs sell ICR-compliant HPC clusters while also offering clusters based on their own reference architectures. Intel has done an admirable job of engaging in this delicate game of coopetition and will need to continue to do so. Therefore, ICR will need to remain at least as appealing to HPC users as the OEMs' offerings, if not more so.

Educate the market about the value of ICR. Although ICR is the best-known HPC reference architecture, not all existing HPC users grasp its value yet, and a sizable contingent of desktop users needs to understand how ICR can facilitate its migration to HPC clusters by helping to address "ease-of-everything" requirements.

Opportunities

Educate the market about the value of ICR and the Intel Xeon processor 5500 series-ICR combination. The challenge of ICR market education also presents a major opportunity for Intel to boost revenue for the Intel Xeon processor 5500 series and other products by using ICR to expand the adoption and use of HPC clusters, particularly (though not exclusively) among newer HPC users and those aspiring to move up to HPC from desktop systems. As noted

6 #217715 ©2009 IDC earlier, the IDC HPC workgroup and departmental segments together captured about $5.6 billion in 2008, and IDC predicts their combined revenue will reach $7.6 billion in 2013. Because the Intel Xeon processor 5500 series' advanced capabilities, especially bandwidth, should make it easier to scale performance on industry applications, the Intel Xeon processor 5500 series-ICR combination has the potential to accelerate growth in these two segments beyond the IDC- forecast level.

Leverage ICR momentum in the HPC market. The ICR program's rapid growth, from three founding vendors in 2007 to more than 150 today, constitutes a strong selling point. Intel should build its portfolio of case studies and other marketing/sales tools to leverage this momentum with HPC OEMs, ISVs, and end users. Documenting OEM successes with ICR can help persuade additional OEMs to join the ICR program. Documenting end-user successes with ICR can help persuade existing OEM members to maintain or expand their commitments to ICR.

CONCLUSION

The combination of the Intel Xeon processor 5500 series and the Intel Cluster Ready program has the potential to become a powerful catalyst for expanding the adoption and use of HPC clusters, a market in which Intel is already the leading processor vendor. ICR has established strong momentum in this market since its 2007 introduction. It is the best-known reference architecture for HPC clusters and the one with the largest following among OEMs serving this market.

The Intel Xeon 5500 processor microarchitecture promises to advance real-world performance on a wide range of HPC codes, especially bandwidth-intensive industry applications. ICR-compliant (preintegrated and pretested) OEM cluster solutions will enable end users to exploit the Intel Xeon processor 5500 series' advanced capabilities with greater assurance. The Intel Xeon processor 5500 series has already demonstrated impressive early results on some HPC codes, and Intel, in collaboration with its OEM and ISV partners, plans to benchmark the Intel Xeon processor 5500 series on a wide range of HPC applications.

Assuming that the Intel Xeon processor 5500 series lives up to performance expectations, ICR-compliant, Intel Xeon 5500 processor-based clusters from a variety of OEMs will have strong potential for exploiting IDC-projected HPC market growth, especially in the workgroup and departmental segments. With sufficient market education from Intel and its partners, this combination could even accelerate growth in these two segments beyond the IDC-forecast level.

Copyright Notice

External Publication of IDC Information and Data — Any IDC information that is to be used in advertising, press releases, or promotional materials requires prior written approval from the appropriate IDC Vice President or Country Manager. A draft of the proposed document should accompany any such request. IDC reserves the right to deny approval of external usage for any reason.

Copyright 2009 IDC. Reproduction without written permission is completely forbidden.

©2009 IDC #217715 7