33Rd TOP500 List
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Survey of Computer Architecture
Architecture-aware Algorithms and Software for Peta and Exascale Computing Jack Dongarra University of Tennessee Oak Ridge National Laboratory University of Manchester 11/17/2010 1 H. Meuer, H. Simon, E. Strohmaier, & JD - Listing of the 500 most powerful Computers in the World - Yardstick: Rmax from LINPACK MPP Ax=b, dense problem TPP performance Rate - Updated twice a year Size SC‘xy in the States in November Meeting in Germany in June - All data available from www.top2 500.org 36rd List: The TOP10 Rmax % of Power Flops/ Rank Site Computer Country Cores [Pflops] Peak [MW] Watt Nat. SuperComputer NUDT YH Cluster, X5670 1 China 186,368 2.57 55 4.04 636 Center in Tianjin 2.93Ghz 6C, NVIDIA GPU DOE / OS Jaguar / Cray 2 USA 224,162 1.76 75 7.0 251 Oak Ridge Nat Lab Cray XT5 sixCore 2.6 GHz Nebulea / Dawning / TC3600 Nat. Supercomputer 3 Blade, Intel X5650, Nvidia China 120,640 1.27 43 2.58 493 Center in Shenzhen C2050 GPU Tusbame 2.0 HP ProLiant GSIC Center, Tokyo 4 SL390s G7 Xeon 6C X5670, Japan 73,278 1.19 52 1.40 850 Institute of Technology Nvidia GPU Hopper, Cray XE6 12-core 5 DOE/SC/LBNL/NERSC USA 153,408 1.054 82 2.91 362 2.1 GHz Commissariat a Tera-100 Bull bullx super- 6 l'Energie Atomique France 138,368 1.050 84 4.59 229 node S6010/S6030 (CEA) DOE / NNSA Roadrunner / IBM 7 USA 122,400 1.04 76 2.35 446 Los Alamos Nat Lab BladeCenter QS22/LS21 NSF / NICS / kyaken/ Cray 8 USA 98,928 .831 81 3.09 269 U of Tennessee Cray XT5 sixCore 2.6 GHz Forschungszentrum Jugene / IBM 9 Germany 294,912 .825 82 2.26 365 Juelich (FZJ) Blue Gene/P Solution DOE/ NNSA / 10 Cray XE6 8-core 2.4 GHz USA 107,152 .817 79 2.95 277 Los Alamos Nat Lab 36rd List: The TOP10 Rmax % of Power Flops/ Rank Site Computer Country Cores [Pflops] Peak [MW] Watt Nat. -
JSC News No. 166, July 2008
JUMP Succession two more years. This should provide users No. 166 • July 2008 with sufficient available computing power On 1 July 2008, the new IBM Power6 ma- on the general purpose part and time to mi- chine p6 575 – known as JUMP, just like grate their applications to the new cluster. its predecessor – took over the production (Contact: Klaus Wolkersdorfer, ext. 6579) workload from the previous Power4 clus- ter as scheduled. First benchmark tests show that users can expect their applica- Gauss Alliance Founded tions to run about three times faster on the During the ISC 2008, an agreement was new machine. This factor could even be im- signed to found the Gauss Alliance, which proved by making the most out of the follow- will unite supercomputer forces in Ger- ing Power6 features: many. The Gauss Centre for Supercom- • Simultaneous multi-threading (SMT puting (GCS) and eleven regional and topi- mode): 64 threads (instead of 32) can cal high-performance computer centres are be used on one node, where 2 threads participating in the alliance, thus creating a now share one physical processor with computer association that is unique world- its dedicated memory caches and float- wide. The signatories are: Gauss Cen- ing point units. tre for Supercomputing (GCS), Center for • Medium size virtual memory pages Computing and Communication of RWTH (64K): applications can set 64K pages Aachen University, Norddeutscher Ver- as a default during link time and thus bund für Hoch- und Höchstleistungsrech- can benefit from improved hardware ef- nen (HLRN) consisting of Zuse Institute ficiencies by accessing these pages. -
Performance Evaluation of the Intel Sandy Bridge Based NASA
Performance Evaluation of the Intel Sandy Bridge Based NASA Pleiades Using Scientific and Engineering Applications Subhash Saini, Johnny Chang, Haoqiang Jin NASA Advanced Supercomputing Division NASA Ames Research Center Moffett Field, California 94035-1000, USA {subhash.saini, johnny.chang, haoqiang.jin}@nasa.gov Abstract — We present a performance evaluation of Pleiades based based on the Many Integrated Core (code-named Knight’s on the Intel Xeon E5-2670 processor, a fourth-generation eight-core Corner) architecture and Yellowstone [1, 5, 6]. New and Sandy Bridge architecture, and compare it with the previous third extended features of Sandy Bridge architecture are: generation Nehalem architecture. Several architectural features have been incorporated in Sandy Bridge: (a) four memory channels as a) A ring to connect on-chip L3 cache with cores, system opposed to three in Nehalem; (b) memory speed increased from 1333 agent, memory controller, and QPI agent and I/O MHz to 1600 MHz; (c) ring to connect on-chip L3 cache with cores, controller to increase the scalability. L3 cache per core system agent, memory controller, and QPI agent and I/O controller to has been increased from 2 MB to 2.5 MB. increase the scalability; (d) new AVX unit with wider vector registers of 256 bit; (e) integration of PCI-Express 3.0 controllers into the I/O b) New micro-ops (L0) cache that caches instructions as subsystem on chip; (f) new Turbo Boost version 2.0 where base they are decoded. The cache is direct mapped and can frequency of processor increased from 2.6 to 3.2 GHz; and (g) QPI link store 1.5 K micro-ops. -
President Lands at Moffett Field During Silicon Valley Visit by Huong Nguyen and Jessica Culler President Barack Obama’S Arrival Onboard Air Force One on Sunday, Sept
Fall 2011 - A Quarterly Publication Thousands turn out to tour airborne observatory at Ames The Stratospheric Observatory for In- frared Astronomy (SOFIA) visited NASA Ames and provided a rare opportunity to tour the airborne observatory in October. News media and Ames employees were invited to tour on Friday, Oct. 14 and the public on Saturday, Oct. 15. On Friday, an estimated 2,500 people, including representatives from CNET, Fox News, KQED (PBS), New Scientist, Space. com, Mountain View Patch and the San Mateo Daily Journal attended the event. On Saturday, an estimated 5,500 people visited Ames to tour SOFIA. See page 6 for a feature about the SOFIA visit. NASA photo by Doiminic Hart President lands at Moffett Field during Silicon Valley visit BY HUONG NGUYEN AND JESSICA CULLER President Barack Obama’s arrival onboard Air Force One on Sunday, Sept. 25, 2011, at Moffett Federal Airfield marked his first land- ing at NASA Ames. Center Director Pete Worden met President Barack Obama upon his arrival. "I had the honor to meet President Obama when he arrived at Moffett Federal Airfield," said Worden, who along with San Jose Mayor Chuck Reed and Mountain View Mayor Jac Siegel, greeted the president. "It was fitting that the president came to Silicon Valley to talk about his job creation plan, given how critical Silicon Valley is to the future of the U.S. economy," Worden said. NASA Ames is an integral com- ponent of the world-renowned inno- vation economy and actively partici- pates in the valley’s technological Photo credit: Official White House Photo by Pete Souza and scientific evolution. -
Presentación De Powerpoint
Towards Exaflop Supercomputers Prof. Mateo Valero Director of BSC, Barcelona National U. of Defense Technology (NUDT) Tianhe-1A • Hybrid architecture: • Main node: • Two Intel Xeon X5670 2.93 GHz 6-core Westmere, 12 MB cache • One Nvidia Tesla M2050 448-ALU (16 SIMD units) 1150 MHz Fermi GPU: • 32 GB memory per node • 2048 Galaxy "FT-1000" 1 GHz 8-core processors • Number of nodes and cores: • 7168 main nodes * (2 sockets * 6 CPU cores + 14 SIMD units) = 186368 cores (not including 16384 Galaxy cores) • Peak performance (DP): • 7168 nodes * (11.72 GFLOPS per core * 6 CPU cores * 2 sockets + 36.8 GFLOPS per SIMD unit * 14 SIMD units per GPU) = 4701.61 TFLOPS • Linpack performance: 2.507 PF 53% efficiency • Power consumption 4.04 MWatt Source http://blog.zorinaq.com/?e=36 Cartagena, Colombia, May 18-20 Top10 Rank Site Computer Procs Rmax Rpeak 1 Tianjin, China XeonX5670+NVIDIA 186368 2566000 4701000 2 Oak Ridge Nat. Lab. Cray XT5,6 cores 224162 1759000 2331000 3 Shenzhen, China XeonX5670+NVIDIA 120640 1271000 2984300 4 GSIC Center, Tokyo XeonX5670+NVIDIA 73278 1192000 2287630 5 DOE/SC/LBNL/NERSC Cray XE6 12 cores 153408 1054000 1288630 Commissariat a l'Energie Bull bullx super-node 6 138368 1050000 1254550 Atomique (CEA) S6010/S6030 QS22/LS21 Cluster, 7 DOE/NNSA/LANL PowerXCell 8i / Opteron 122400 1042000 1375780 Infiniband National Institute for 8 Computational Cray XT5-HE 6 cores 98928 831700 1028850 Sciences/University of Tennessee 9 Forschungszentrum Juelich (FZJ) Blue Gene/P Solution 294912 825500 825500 10 DOE/NNSA/LANL/SNL Cray XE6 8-core 107152 816600 1028660 Cartagena, Colombia, May 18-20 Looking at the Gordon Bell Prize • 1 GFlop/s; 1988; Cray Y-MP; 8 Processors • Static finite element analysis • 1 TFlop/s; 1998; Cray T3E; 1024 Processors • Modeling of metallic magnet atoms, using a variation of the locally self-consistent multiple scattering method. -
The Outer Limits
Welcome to the outer limits Budapest, March 19, 2013 M.Sc. Ji ří Hlavá č HPC consultant + sales manager for CEE [email protected] ©2012 Silicon Graphics International Corp. / Presented Only Under Non-Disclosure Agreement 1 Jiri Hlavac (by Czech Ji ří Hlavá č) 51 years old, 4 children … • MSc. Computers (1986) • Development of PC OSs for Tesla Czech (1986-1989) • Own SW company (1986-1991) • Owner SGI Distributor @ Czechoslovakia (1991-1995) • Employee @ SGI Czech office (1995-now) • Technical Director, Academic Sales, Enterprise Sales • HPC Consultant (2001-2011) • Sales Manager for Central + East Europe (2005-now) ©2012 Silicon Graphics International Corp. / Presented Only Under Non-Disclosure Agreement 2 SGI = Experts @ HPC Structural Mechanics Structural Mechanics Computational Fluid Electro-Magnetics Implicit Explicit Dynamics Computational Chemistry Computational Chemistry Computational Biology Seismic Processing Quantum Mechanics Molecular Dynamics Reservoir Simulation Rendering / Ray Tracing Climate / Weather Data Analytics Ocean Simulation ©2012 Silicon Graphics International Corp. / Presented Only Under Non-Disclosure Agreement 3 SGI = Focus on every detail (here Power Consumption) ©2012 Silicon Graphics International Corp. / Presented Only Under Non-Disclosure4 Agreement 4 SGI = Frontier @ Research SGI = Winner of last HPCwire Readers' Choice Award (Nov 2012) for "Top Supercomputing Achievement" for SGI contribution to the NASA Ames Pleiades supercomputer. SGI = Winner of last HPCwire Editor's Choice Award (Nov 2012) for "Best use of HPC in 'edge HPC‘ application" for Wikipedia historical mapping and exploration on UV 2000. SGI stock is growing 5 ©2012 Silicon Graphics International Corp. / Presented Only Under Non-Disclosure5 Agreement 5 Advanced Energy Exploration and Production Total: World's Largest Commercial HPC System (2.3 PF) SGI ICE ©2012 Silicon Graphics International Corp. -
Document.Pdf
THE EVOLUTION OF THE HPC FACILITY AT JSC 2019-06-04 I D. KRAUSE (WITH VARIOUS CONTRIBUTIONS) RESEARCH AND DEVELOPMENT @ FZJ on 2.2 Square Kilometres FORSCHUNGSZENTRUM JÜLICH: AT A GLANCE Facts and Figures 1956 Shareholders 11 609.3 5,914 867 90 % Federal Republic FOUNDATION INSTITUTES million euros EMPLOYEES VISITING of Germany REVENUE SCIENTISTS on 12 December 10 % North Rhine- 2 project 2,165 scientists total Westphalia management 536 doctoral from 65 countries organizations researchers (40 % external 323 trainees and funding) students on placement STRATEGIC PRIORITIES CLIMATE RESEARCH QUANTUM COMPUTING LLEC ENERGY STORAGE - SUPER COMPUTING CORE MATERIALS RESEARCH HBP FACILITIES INFORMATION ALZHEIMER’S RESEARCH SOIL BIOECONOMY RESEARCH NEUROMORPHIC COMPUTING - BIO TECHNOLOGY PLANT RESEARCH LARGE-SCALE INSTRUMENTS on campus JÜLICH SUPERCOMPUTING CENTRE JÜLICH SUPERCOMPUTING CENTRE • Supercomputer operation for: • Center – FZJ • Region – RWTH Aachen University • Germany – Gauss Centre for Supercomputing John von Neumann Institute for Computing • Europe – PRACE, EU projects • Application support • Unique support & research environment at JSC • Peer review support and coordination • R-&-D work • Methods and algorithms, computational science, performance analysis and tools • Scientific Big Data Analytics • Computer architectures, Co-Design Exascale Laboratories: EIC, ECL, NVIDIA • Education and Training IBM Power 4+ JUMP, 9 TFlop/s IBM Blue Gene/L IBM Power 6 JUBL, 45 TFlop/s JUMP, 9 TFlop/s JUROPA IBM Blue Gene/P 200 TFlop/s JUGENE, 1 PFlop/s -
NASA Announces a New Approach to Earth Science Data Analysis 20 April 2010
NASA Announces A New Approach To Earth Science Data Analysis 20 April 2010 (PhysOrg.com) -- The way we analyze planet Earth than ten hours. will never be the same, thanks to a new initiative at NASA that integrates supercomputers with global NEX uses a new approach for collaboration among satellite observations and sophisticated models of scientists and science teams working to model the the Earth system in an online collaborative Earth system and analyze large Earth observation environment. As part of its celebration of Earth datasets. Using on-line collaboration technologies, Week, NASA unveiled the NASA Earth Exchange NEX will bring together geographically dispersed (NEX) at a “Green Earth” public forum held at the multi-disciplinary groups of scientists focused on NASA Exploration Center, Moffett Field, Calif. global change research. Scientists will be able to build custom project environments containing the By making NEX available, NASA expects to better datasets and software components needed to solve enable scientists to collaboratively conduct complex Earth science problems. These project research and address the impacts of changes in environments, built using virtualization technology, climate and land use patterns on ecosystems. NEX will be highly portable and reusable and will will link NASA’s supercomputing resources with automatically capture the entire analysis process, massive Earth system data sets, and provide a including the data and processing steps required to collection of tools for analysis and visualization. replicate the results in an open and transparent way. For example, results from the processing of "Currently, it can require months for scientists to the global Landsat data would be available to gather and analyze global-scale data sets, due to scientists with the additional expertise required to computing limitations, data storage requirements analyze rates of urbanization, deforestation, or and network bandwidth constraints”, said biodiversity impacts. -
Performance Evaluation of the Intel Sandy Bridge Based NASA
NAS Technical Report: NAS-2015-05 Performance Evaluation of the Intel Sandy Bridge Based NASA Pleiades Using Scientific and Engineering Applications Subhash Saini, Johnny Chang, Haoqiang Jin NASA Advanced Supercomputing Division NASA Ames Research Center Moffett Field, California 94035-1000, USA {subhash.saini, johnny.chang, haoqiang.jin}@nasa.gov Abstract — We present a performance evaluation of Pleiades based based on the Many Integrated Core (code-named Knight’s on the Intel Xeon E5-2670 processor, a fourth-generation eight-core Corner) architecture and Yellowstone [1, 5, 6]. New and Sandy Bridge architecture, and compare it with the previous third extended features of Sandy Bridge architecture are: generation Nehalem architecture. Several architectural features have been incorporated in Sandy Bridge: (a) four memory channels as a) A ring to connect on-chip L3 cache with cores, system opposed to three in Nehalem; (b) memory speed increased from 1333 agent, memory controller, and QPI agent and I/O MHz to 1600 MHz; (c) ring to connect on-chip L3 cache with cores, controller to increase the scalability. L3 cache per core system agent, memory controller, and QPI agent and I/O controller to has been increased from 2 MB to 2.5 MB. increase the scalability; (d) new AVX unit with wider vector registers of 256 bit; (e) integration of PCI-Express 3.0 controllers into the I/O b) New micro-ops (L0) cache that caches instructions as subsystem on chip; (f) new Turbo Boost version 2.0 where base they are decoded. The cache is direct mapped and can frequency of processor increased from 2.6 to 3.2 GHz; and (g) QPI link store 1.5 K micro-ops. -
Project Status Report High End Computing Capability Strategic Capabilities Assets Program December 10, 2014
Project Status Report High End Computing Capability Strategic Capabilities Assets Program December 10, 2014 Dr. Rupak Biswas – Project Manager NASA Ames Research Center, Moffett Field, CA [email protected] (650) 604-4411 www.nasa.gov Pleiades Achieves 3.38 Petaflops, Ranks 7th in U.S. and 11th Worldwide on TOP500 List • On the November 2014 TOP500 list of the world’s Mission Impact: Enhancing NASA’s most powerful most power supercomputers, Pleiades ranked 7th in supercomputer and running subsequent diagnostic the U.S. and 11th worldwide, with an impressive tests enables HECC to validate the entire system, improving stability for all jobs. efficiency of 84.6%. The new list was released on November 17 at the SC14 conference in New Orleans (see slide 11). • The HECC Systems team worked with SGI engineers to complete system-wide testing on Pleiades, including a LINPACK benchmark run, prior to the release of new Intel Haswell processors for general production on October 27. • During the LINPACK run, Pleiades delivered a sustained performance of 3.38 petaflops using 73% of its compute nodes, representing 89% of the theoretical peak of the overall system. The benchmark was run on 1,024 Haswell nodes, 5,365 Pleiades’ increased LINPACK benchmark performance since 2008, when Ivy Bridge nodes, and 1,792 Sandy Bridge nodes. the original system was installed at the NASA Advanced Supercomputing (NAS) facility. The initial configuration achieved a sustained performance • In addition to measuring performance, the of 0.487 petaflops and 80% efficiency in benchmark testing. With the latest LINPACK run helped the Systems team detect upgrade, Pleiades achieved 84.6% efficiency on the LINPACK benchmark. -
Technological Forecasting of Supercomputer Development: the March to Exascale Computing
Portland State University PDXScholar Engineering and Technology Management Faculty Publications and Presentations Engineering and Technology Management 10-2014 Technological Forecasting of Supercomputer Development: The March to Exascale Computing Dong-Joon Lim Portland State University Timothy R. Anderson Portland State University, [email protected] Tom Shott Portland State University, [email protected] Follow this and additional works at: https://pdxscholar.library.pdx.edu/etm_fac Part of the Engineering Commons Let us know how access to this document benefits ou.y Citation Details Lim, Dong-Joon; Anderson, Timothy R.; and Shott, Tom, "Technological Forecasting of Supercomputer Development: The March to Exascale Computing" (2014). Engineering and Technology Management Faculty Publications and Presentations. 46. https://pdxscholar.library.pdx.edu/etm_fac/46 This Post-Print is brought to you for free and open access. It has been accepted for inclusion in Engineering and Technology Management Faculty Publications and Presentations by an authorized administrator of PDXScholar. Please contact us if we can make this document more accessible: [email protected]. Technological forecasting of supercomputer development: The march to Exascale computing Dong-Joon Lim *, Timothy R. Anderson, Tom Shott Dept. of Engineering and Technology Management, Portland State University, USA Abstract- Advances in supercomputers have come at a steady pace over the past 20 years. The next milestone is to build an Exascale computer however this requires not only speed improvement but also significant enhancements for energy efficiency and massive parallelism. This paper examines technological progress of supercomputer development to identify the innovative potential of three leading technology paths toward Exascale development: hybrid system, multicore system and manycore system. -
Pleiades • the NASA Workload • System Build Strategy • Operational Strategy • Tools and Analysis Software • Issues Do We See • Whats Lustre Does • What We Want
NASA SuperComputing A Ground Based Instrument for Exploration and Discovery LUG 2015 Bob Ciotti Chief Architect/Supercomputing Systems Lead LUG 2015 - Denver LUG - Denver Apr 2015 Discussion • What is Pleiades • The NASA Workload • System Build Strategy • Operational Strategy • Tools and Analysis Software • Issues Do We See • Whats Lustre Does • What We Want LUG - Denver Apr 2015 Supercomputing Support for NASA Missions Jet aircraft wake vortices Jet engine • Agency wide resource noise • Production Supercomputing emissions • Focus on availability • Machines mostly run large ensembles • Some very large calculations (50k) Orbiting, Spinning – Typically o500 jobs running Detailed CLV Black Holes Geometry • Example applications • ARMD – LaRC: Jet wake vortex simulations, to increase airport capacity and safety – GRC: Understanding jet noise simulations, to decrease airport noise • ESMD – ARC: Launch pad flame trench simulations for Ares vehicle safety analysis – MSFC: Correlating wind tunnel tests and simulations of Solar surface Ares I-X test vehicle – ARC/LaRC: High-fidelity CLV flight simulation with convection detailed protuberances • SMD – Michigan State: Ultra-high-resolution solar surface convection simulation – GSFC: Gravity waves from the merger of orbiting, spinning black holes • SOMD – JSC/ARC: Ultra-high-resolution Shuttle ascent analysis • NESC – KSC/ARC: Initial analysis of SRB burn risk in Vehicle 2-SRB Burn Assembly Building in VAB Shuttle Ascent Configuration 3 LUG - Denver Apr 2015 LUG - Denver Apr 2015 ECCO – Ocean Modeling