Fujitsu and the HPC Pyramid
Total Page:16
File Type:pdf, Size:1020Kb
Fujitsu and the HPC Pyramid Wolfgang Gentzsch Executive HPC Strategist (external) Fujitsu Global HPC Competence Center June 20th, 2012 1 Copyright 2012 FUJITSU "Fujitsu's objective is to contribute to a prosperous future through the benefits of supercomputers. To accomplish this, we need the collaboration of researchers & scientists worldwide.” Masahiko Yamada President of Fujitsu's Technical Computing Solutions Unit HPC matters … 2 Copyright 2012 FUJITSU … it matters for Science and Industry 2011 worldwide HPC revenue share by segment Source: IDC, April 2012 Economics / Financials Chemical Engineering Other Weather Mechanical Design DDC & 2 Distribution 3 Government / 4.2 18.9 Lab Geo-Science, 6.0 Engineering 6.3 University / EDA 7.0 % 17.6 Academic 9.6 Defense 12.0 11.9 Bio-Science CAE 3 Copyright 2012 FUJITSU The HPC Pyramid Matching Science and Industry needs to the HPC Pyramid: Supercomputer Divisional Departmental Work Group Copyright 2012 FUJITSU The HPC Pyramid Matching Science and Industry needs to the HPC Pyramid: HPC is becoming mainstream Supercomputer Divisional Departmental Work Group & Workstations Copyright 2012 FUJITSU Fujitsu’s approach to the HPC market Fujitsu covers the HPC customer needs with tailored HPC platforms FX 10 Petascale Supercomputer PRIMERGY HPC solutions Supercomputer NEW Divisional Departmental Work Group Power Workstation Copyright 2012 FUJITSU High Performance Computing with Fujitsu HPC roots K-Computer Development Solution Positioning and Use 7 Copyright 2012 FUJITSU Fujitsu HPC Servers - past, present, future Fujitsu has been developing the most advanced Exascale supercomputers in the world since 1977! Trans-Exascale PRIMERGY in HPC for 10 years! FX10 No.1 in Top500 (June and Nov., 2011) K computer World’s Fastest Vector Processor (1999) FX1 VPP5000 Most Efficient Performance in Top500 (Nov. 2008) Next x86 SPARC generation NWT* Enterprise Developed with NAL VPP300/700 No.1 in Top500 PRIMEQUEST PRIMERGY (Nov. 1993) CX400 Gordon Bell Prize PRIMEPOWER (1994, 95, 96) ⒸJAXA HPC2500 VPP500 World’s Most Scalable VP Series PRIMERGY Supercomputer BX900 AP3000 (2003) Cluster node HX600 F230-75APU Cluster node AP1000 PRIMERGY RX200 Cluster node Japan’s Largest Japan’s First Cluster in Top500 Vector (Array) (July 2004) Supercomputer(1977) *NWT: Numerical Wind Tunnel 8 Copyright 2012 FUJITSU "K computer" Achieves Goal of 10 Petaflops and #1 on the TOP500 Lists June and November 2011 #2 on the TOP500 List in June 2012 LINPACK benchmark performance of 10.51 petaflops Received several big science application awards Highest Performance Efficiency with 93% 9 Copyright 2012 FUJITSU K computer - developed by RIKEN & Fujitsu System Overview Large-scale system – combining 88,128 processors Fujitsu brings together all the advanced technologies Development continues with system software tuning for completion in June 2012. System Theoretical calculation speed : 11.28 Pflops LINPACK : 10.51 Pflops, 93% eff. Processors: 88,128 Total memory: 1.4 petabyte CPU SPARC64TM VIIIfx (8 cores, 128 gigaflops) The K computer in its computer room (as of October 2011) Interconnect 6-dim. mesh/torus topology (Tofu) Tofu cables: 200,000 10 Copyright 2012 FUJITSU Fujitsu PRIMEHPC FX10 Next level of supercomputing introduced Nov. ‘11 Node Theoretical 236.5 GFLOPS computational performance Processor SPARC64™ IXfx (1.848GHz, 16 cores) Memory capacity 32GB, 64GB Memory bandwidth 85GB/s Inter-node transfer 5GB/s × 2 rate (bidirectional) / link System No. racks 4 ~ 1,024 Nodes 384 ~ 98,304 SPARC64™ VIII / IX fx RAS coverage ranges Theoretical 90.80 TFLOPS ~ Hardware-based error computational 23.24 PFLOPS detection + possible self- performance recovery range Total memory 12 ~ 6,291 TB Hardware-based error Interconnect “Tofu” Interconnect detection range Cooling Method Direct water cooling + air cooling Range in which errors (Optional: Exhaust do not affect actual cooling unit) operation 11 Copyright 2012 FUJITSU Fujitsu Real Applications Performance March 2011 cppmd molecular dynamics • 181 million atom simulation • Running on 221,184 cores (3.54 PFlops) • Sustained performance of 1.32 PFLOPS • Efficiency of 37% 2011 Gordon Bell Award, Peak Performance RIKEN, University of Tsukuba, University of Tokyo, Fujitsu RSDFT program : first-principles calculations of electron states of a silicon nanowire Hasegawa, Iwata, Tsuji, Takahashi, Oshiyama, Minami, Boku, Shoji, Uno, Kurokawa, Inoue, Miyoshi, Yokokawa 107,292-atom Si nanowire calculation Running on 442,368 cores (7.08PFlops) Sustained performance of 3.08 PFLOPS Efficiency of 43.6 % Various application programs are in optimization and evaluation stage Courtesy: M.Okuda 12 12 Copyright 2012 FUJITSU High Performance Computing with Fujitsu PRIMERGY x86 based HPC 13 Copyright 2012 FUJITSU Modular HPC growth potential towards …. Scalability, density & energy costs capacity RX200 NEW BX400 CX400 Flexibility to address all kinds of customer requirements NEW: skinless server PRIMERGY CX400 BX900 • Massive scale-out ultra dense server • HPC GPU coprocessor support Latest generation Intel® Xeon® Processor E5 series Scalability, Highest memory performance plus high reliability density & Low latency/high bandwidth Infiniband infrastructure availability Industry leading blade server density Industry leading I/O bandwidth capability 14 Copyright 2012 FUJITSU Most energy efficient server in the world Fujitsu PRIMERGY achieves world record in energy efficiency and holds several best in class ratings World record in SPECpower_ssj2008 by breaking the prestigious milestone of 6,000 overall ssj_ops/watt http://ts.fujitsu.com/ps2/press/read/news_details.aspx?id=6092 Reduce energy consumption and current carbon footprint Up to 73% more performance per Watt compared to the previous generation means • Up to 33% less energy for the same current performance level enables to better meet stringent environmental mandates for data centers • Up to 66% more workloads on current power budget without stressing current data center cooling 15 Copyright 2012 FUJITSU PRIMERGY CX400 S1: Scale-out Smart Multi-Node System for HPC and Cloud Computing with compact granularity PRIMERGY CX400 S1 doubles server density over traditional rack servers New condensed „4in2U“ form factor More performance per U for large scale-out computing 4 servers, each with . two latest Intel Xeon E5-26xx CPUs . up to 512 GB Memory . up to 3 x PCIe Gen3 I/O slots „4in2U“ 36 TB hot-plug storage drives CX400 S1 - front shared redundant power supplies and fans CX400 S1 - back 16 Copyright 2012 FUJITSU PRIMERGY CX2yy Server nodes at a glance Double Performance per U /w condensed dual socket server nodes CX250 S1 : 1U Server Node hot-plug with 2 CPUs standard node for HPC and Cloud Computing 2 x latest Intel® Xeon® processor E5-2600 family up to 512 GB RAM CX250 S1 1 xPCIe expansion slot + 1 x mezzanine card half-wide 1U server node 4 x per CX400 CX270 S1: 2U Server Node hot-plug with 2 CPUs + GPGPU option HPC optimized node /w GPGPU acceleration 2x Intel® Xeon® processor E5-2600 family up to 512 GB RAM 2 xPCIe expansion slots 1 x GPGPU option: NVIDIA Tesla 20 series: M2075 / M2090 CX270 S1 half-wide 2U server node /w GPGPU option Up to 64 Intel Xeon processor cores 2 x per CX400 2U height up to 2.048 GB Memory up to 36 TB local storage 17 Copyright 2012 FUJITSU Building Blocks of PRIMERGY HPC Ecosystem PRIMERGY Server Cluster Operation ISV and Research Partnerships PCM Edition PreDiCT Initiative Eternus Storage Open Petascale Libraries Network Consulting and Integration Services Sizing, Proof of Integration into design concept customer environment Certified system Complete assembly, Ready to and production pre-installation and operate environment quality assurance delivery Ready to Go 18 Copyright 2012 FUJITSU HPC Wales – A Grid of HPC Excellence Initiative’s motivation and background Positioning of Wales at the forefront of supercomputing Promotion of research, technology and skills Improvement of economic development Creation of 400+ quality jobs, 10+ new business Implementation and rollout Distributed HPC clusters among 15 academic sites in Wales Sophisticated tier model with central hubs, tier 1 and 2 sites, portal for transparent, easy use of resources Final rollout of PRIMERGY x86 clusters in 2012 Performance & Technology Solution Design Fujitsu Value Proposition PRIMERGY CX250 and BX922 User-focused solution to access Best-in-class technology combination distributed HPC systems from Latest Intel processor technology desktop browser Mix of Linux and Windows OS ~200 TFlops aggregated peak Multiple components integrated into a Complete tuned software stack performance consistent environment with a single Full service engagement Infiniband, 10 / 1 Gb Ethernet, FCS sign-on Completely integrated design Eternus DX online SAN (home FS) Comprehensive engagement model at Data accessible across the entire all levels (research, development, Parallel File System (up to 10 GB/s) infrastructure, automated movement business) driven by workflow DDN Lustre Professional delivery management Backup & Archiving Collaborative sharing of information and governance Symantec, Quantum and resources End-to-end program management 19 Australia’s National Computational Infrastructure Tokyo, June 15, 2012 — Fujitsu today announces its selection to deliver and install a new supercomputer, as well as to establish a Collaboration with the Australian National University's (ANU) National Computational Infrastructure