Appro Xtreme-X Computers with Mellanox QDR Infiniband


Appro Xtreme-X Computers with Mellanox QDR InfiniBand
Intel Cluster Ready
APPRO INTERNATIONAL INC
Steve Lyness, Vice President, HPC Solutions Engineering
[email protected]

Company Overview :: Corporate Snapshot
• Computer systems manufacturer founded in 1991
• From OEM to branded products and solutions in 2001
  – Headquarters in Milpitas, CA
  – Regional office in Houston, TX
  – Global presence via international resellers and partners
  – Manufacturing and hardware R&D in Asia
  – 100+ employees worldwide
• Leading developer of high-performance servers, clusters and supercomputers
  – Balanced architecture delivering a competitive edge
  – Thousands of clusters and servers shipped since 2001
  – Solid growth from $69.3M in 2006 to $86.5M in 2007
• Target markets: Electronic Design Automation, Financial Services, Government/Defense, Manufacturing, Oil & Gas
*Source: IDC WW Quarterly PC Tracker

Company Overview :: Why Appro
• High-performance computing expertise: designing best-in-class supercomputers
• Leadership in price/performance
• Energy-efficient solutions
• Best system scalability and manageability
• High-availability features

Company Overview :: Our Customers
(Customer logos)

Company Overview :: Solid Growth
+61% compound annual revenue growth: $33.2M (2005), $69.3M (2006), $86.5M (2007)

HPC Experience :: Recent Major Installations
• September 2006 – NOAA cluster: 1,424 processor cores, 2.8TB system memory, 15 TFlops
• November 2006 – LLNL Atlas cluster: 9,216 processor cores, 18.4TB system memory, 44 TFlops
• June 2007 – LLNL Minos cluster: 6,912 processor cores, 13.8TB system memory, 33 TFlops
• February 2008 – D. E. Shaw Research cluster: 4,608 processor cores, 9.2TB system memory, 49 TFlops

HPC Experience :: Current Major Projects
• April–June 2008 – TLCC clusters: 426 TFlops total across 8 clusters for LLNL, LANL and SNL
• June 2008 – University of Tsukuba cluster: 95 TFlops total, quad-rail InfiniBand
• July 2008 – Renault F1 CFD cluster: 38 TFlops total, dual-rail InfiniBand

HPC/Supercomputing Trends :: What Are the Pain Points?
• Technology is driven by customers' needs
• Needs are driven by the desire to reduce pain points:
  – Clusters are still hard to use, and as clusters get bigger the problem grows exponentially
  – Power, cooling and floor space are major issues
  – Weak interconnect performance
  – RAS is a growing problem

Intel Cluster Ready and Intel Multi-Core Processors

Intel® Cluster Ready
(Certification flow diagram:) Hardware component certification (Mellanox ConnectX; Appro hardware and software) feeds the Intel® Cluster Ready platform certification of the Appro Xtreme-X1 Supercomputer, while ISV applications complete Intel® Cluster Ready application registration. The platform solutions provider then delivers a SKU that is a certified solution, so end users receive a complete Intel® Cluster Ready solution from a specific platform integrator.

Developer Considerations
Intel® Software Tools for Building HPC Applications: multi-threading ↔ message passing.
Performance | Compatibility | Support | Productivity | Cross-Platform
http://www.intel.com/software

Benefits of Choosing an Intel® Cluster Ready Certified System
• In production faster
  – Simplified selection of compatible systems and applications
  – Faster, higher-confidence initial deployment
• Less downtime
  – Ability to detect and correct problems soon after they occur
  – Better support from ISVs and your system vendor
• Lower initial and ongoing costs
• Higher ongoing productivity
Demand an Intel® Cluster Ready certified system for your next cluster.

HPC/Supercomputing Trends :: New 45nm Processors
Quad-Core Intel® Xeon® processor 5400 series: technology rich, with compelling IT benefits.
• Enhanced Intel Core microarchitecture: greater performance at a given frequency
• Larger caches and higher frequencies
• New SSE4 instructions
• Greater energy efficiency
• 45nm high-k process technology
2nd-generation Intel quad-core; the Dual-Core Intel® Xeon® processor 5200 series is also available.

Nehalem-Based System Architecture
• Up to 25.6 GB/s bandwidth per QuickPath Interconnect (QPI) link
• Benefits:
  – More application performance
  – Improved energy efficiency
  – End-to-end hardware assist: improved Virtualization Technology
  – Stable IT image: software compatible (functional system demonstrated at IDF, September 2007); live migration compatible with today's dual- and quad-core Intel® Core™ processors
• Key technologies extending today's leadership:
  – New 45nm Intel® microarchitecture
  – New Intel® QuickPath Interconnect
  – Integrated memory controller
  – Next-generation memory (DDR3)
  – PCI Express Gen 2
(System diagram: two Nehalem sockets linked by QPI to an I/O hub, with PCI Express Gen 1/2 lanes and a DMI link to the ICH.)
Production in Q4'08; volume ramp in 1H'09.
All future products, dates, and figures are preliminary and are subject to change without any notice.
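The Cluster Ready premise above, that every node presents one identical, certified software stack, can be sketched in a few lines. The node names and package fingerprints below are hypothetical, and this is not Intel's actual cluster checker tool, just an illustration of the uniformity check it performs:

```python
# Illustrative sketch of the Intel Cluster Ready premise: every node must
# present an identical software stack. Node names and package versions are
# made up; a real certification run uses Intel's cluster checker.

def find_stack_mismatches(node_stacks):
    """Compare each node's package->version map against the first node.

    Returns {node: {package: (expected, found)}} for any node that deviates.
    """
    nodes = sorted(node_stacks)
    reference = node_stacks[nodes[0]]
    mismatches = {}
    for node in nodes[1:]:
        diffs = {
            pkg: (reference.get(pkg), node_stacks[node].get(pkg))
            for pkg in set(reference) | set(node_stacks[node])
            if reference.get(pkg) != node_stacks[node].get(pkg)
        }
        if diffs:
            mismatches[node] = diffs
    return mismatches

stacks = {
    "n001": {"kernel": "2.6.18", "glibc": "2.5", "ofed": "1.3"},
    "n002": {"kernel": "2.6.18", "glibc": "2.5", "ofed": "1.3"},
    "n003": {"kernel": "2.6.18", "glibc": "2.5", "ofed": "1.2"},  # drifted node
}
for node, diffs in find_stack_mismatches(stacks).items():
    print(node, diffs)  # reports n003's stale ofed stack
```

A drifted node like n003 is exactly the kind of inconsistency that causes the hard-to-diagnose failures the slides cite; catching it before job launch is the point of the certification.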
Appro Xtreme-X Supercomputers

Xtreme-X Supercomputer :: Reliability by Design
• Redundancy where required:
  – Disk drives: diskless compute architecture
  – Fan trays: N+1 redundancy at the blade
  – Power supplies: hot-swappable N+1 power supplies
  – Network components: redundant InfiniBand and Ethernet networks
  – Servers: management nodes with RAID6 disk storage arrays
• Management nodes: redundant pair (active/standby)
• Sub-management nodes: redundant pair (active/active)
• Compute nodes are hot-swappable in sub-racks and are easily removed without disturbing the cabling of other nodes

Xtreme-X Supercomputer :: The Thermal Problem
• A 10°C increase in ambient temperature produces a 50% reduction in the long-term reliability of electronic equipment
• A 5°C reduction in temperature can triple the life expectancy of electronic equipment
• MTBF and availability depend on cooling
Source: BCC, Inc. Report CB-185R

Xtreme-X Supercomputer :: Current Datacenter Cooling Methodology
(Top view: hot aisle / cold aisle layout)
• As much as 40% of the cold airflow bypasses the equipment
• Because the air is not pushed directly into the equipment, cold air from the chillers mixes with hot air, reducing the efficiency of datacenter cooling

Xtreme-X Supercomputer :: Directed Airflow Cooling Methodology
(Top view)
• Up to 30% improvement in density with lower power per equipment rack
• Delivers cold air directly to the equipment for optimum rack cooling efficiency
• Returns air at a comfortable temperature to the room for the chillers
• The back-to-back rack configuration saves floor space in the datacenter and eliminates hot aisles
• FRU replacement and maintenance are done from the front of the blade rack cabinet

Xtreme-X Supercomputer :: Power/Cooling Efficiency & Density
(Side view: back-to-back configuration of two racks of sub-racks, with cold air from the chillers delivered under the floor)
30% improvement in space by eliminating hot aisles with the back-to-back rack configuration.

Xtreme-X Supercomputer :: Based on Scalable Units
• A new way to approach building supercomputing Linux clusters
• Multiple clusters of various sizes can be built and deployed into production very rapidly
• The scalable unit (SU) concept is very flexible, accommodating clusters from very small node counts to thousands of nodes
• This approach yields many benefits that dramatically lower total cost of ownership
Awarded Supercomputing Online 2007 Product of the Year

Appro Cluster Engine

Appro Cluster Engine :: ACE: Scalable SW Architecture
• Issues with today's cluster environments:
  – Changing operating systems easily and quickly
  – Running jobs on the cluster efficiently
  – Monitoring nodes for health
  – Simple cluster management (e.g. IP address allocation)
Cluster management software turns a collection of servers into a functional, usable, reliable, available computing system.

Appro Cluster Engine
• Total management package: network, servers, host, scheduling, RAS
• Two-tier management architecture:
  – Scalable to large numbers of servers
  – Minimal overhead
  – Complete remote lights-out control
• Redundancy provides both high availability and high performance:
  – Redundant management servers
  – Redundant networks
  – Active-active operation supported

Appro Cluster Engine :: ACE: Scalable SW Architecture
(Architecture diagram: management and operation servers, storage servers and a global file system, with InfiniBand for computing, 10GbE for storage, and GbE/2×GbE for operation)
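The active/standby management-node pair described above can be modeled with a simple heartbeat watchdog: the standby promotes itself when the active node's heartbeat goes stale. The node names, timeout value, and promotion logic below are illustrative assumptions, not ACE's actual implementation:

```python
# Toy model of an active/standby management-node pair: the standby's
# watchdog promotes it when the active node's heartbeat goes stale.
# Names and timing are hypothetical, not taken from ACE.
import time

class ManagementPair:
    def __init__(self, timeout=3.0):
        self.timeout = timeout           # seconds before a heartbeat is stale
        self.active = "mgmt-a"
        self.standby = "mgmt-b"
        self.last_heartbeat = time.monotonic()

    def heartbeat(self):
        """Called periodically by the active node while it is healthy."""
        self.last_heartbeat = time.monotonic()

    def check(self, now=None):
        """Standby's watchdog: fail over if the heartbeat is stale."""
        now = time.monotonic() if now is None else now
        if now - self.last_heartbeat > self.timeout:
            # Promote the standby; the old active becomes the new standby.
            self.active, self.standby = self.standby, self.active
            self.last_heartbeat = now
        return self.active

pair = ManagementPair(timeout=3.0)
pair.heartbeat()
print(pair.check())                          # prints "mgmt-a": still healthy
print(pair.check(now=time.monotonic() + 10)) # prints "mgmt-b": failover
```

The same stale-heartbeat idea generalizes to the active/active sub-management pairs, except that each node already carries live load and only absorbs its peer's share on failure.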