Wrangling Rogues: a Case Study on Managing Experimental Post-Moore Architectures Will Powell Jeffrey S

Total Page:16

File Type:pdf, Size:1020Kb

Wrangling Rogues: a Case Study on Managing Experimental Post-Moore Architectures Will Powell Jeffrey S Wrangling Rogues: A Case Study on Managing Experimental Post-Moore Architectures Will Powell Jeffrey S. Young Jason Riedy Thomas M. Conte [email protected] [email protected] [email protected] [email protected] School of Computational Science and Engineering School of Computer Science Georgia Institute of Technology Georgia Institute of Technology Atlanta, Georgia Atlanta, Georgia Abstract are not always foreseen in traditional data center environments. The Rogues Gallery is a new experimental testbed that is focused Our Rogues Gallery of novel technologies has produced research on tackling rogue architectures for the post-Moore era of computing. results and is being used in classes both at Georgia Tech as well as While some of these devices have roots in the embedded and high- at other institutions. performance computing spaces, managing current and emerging The key lessons from this testbed (so far) are the following: technologies provides a challenge for system administration that • Invest in rogues, but realize some technology may be are not always foreseen in traditional data center environments. short-lived. We cannot invest too much time in configuring We present an overview of the motivations and design of the and integrating a novel architecture when tools and software initial Rogues Gallery testbed and cover some of the unique chal- stacks may be in an unstable state of development. Rogues lenges that we have seen and foresee with upcoming hardware may be short-lived; they may not achieve their goals. Find- prototypes for future post-Moore research. Specifically, we cover ing their limits is an important aspect of the Rogues Gallery. networking, identity management, scheduling of resources, and Early users of these platforms are technically sophisticated tools and sensor access aspects of the Rogues Gallery along with and relatively friendly, so not all components of the support- techniques we have developed to manage these new platforms. We ing infrastructure need be user friendly. argue that current tools like the Slurm scheduler can support new • Physical hardware resources not dedicated to rogues rogues without major infrastructure changes. should be kept to a minimum. As we explain in Section 4, most functionality should be satisfied by VMs and containers 1 Challenges for Post-Moore Testbeds rather than dedicating a physical server to manage each As we look to the end of easy and cost-effective transistor scal- rogue piece of hardware. ing, we enter the post-Moore era[33] and reach a turning point in • Collaboration and commiseration is key. Rogues need a computer system design and usage. Accelerators like GPUs have community to succeed. If they cannot establish a community created a pronounced shift in the high-performance computing and (users, vendors interested in providing updates, manageabil- machine learning application spaces, but there is a wide variety ity from an IT perspective), they will disappear. We promote of possible architectural choices for the post-Moore era, including community with our testbed through a documentation wiki, memory-centric, neuromorphic, quantum, and reversible comput- mailing lists, and other communication channels for users ing. These revolutionary research fields combined with alternative as well as close contact with vendors, where appropriate. materials-based approaches to silicon-based hardware have given • Licensing and appropriate identity management are us a bewildering array of options and rogue devices for the post- tough but necessary challenges. Managing access to pro- Moore era. However, there currently is limited guidance on how to totypes and licensed software are tricky for academic institu- evaluate this novel hardware for tomorrow’s application needs. tions, but we must work with collaborators like start-ups to arXiv:1808.06334v4 [cs.AR] 2 Aug 2019 Creating one-off testbeds for each new post-Moore technology provide a secure environment with appropriate separation of increases the cost of evaluating these new technologies. We present privileges to protect IP as necessary. Many software-defined an in-progress case study of a cohesive user environment that en- network (SDN) environments assume endpoint implementa- ables researchers to perform experiments across different novel tions that may not be possible on first-generation hardware. architectures that may be in different stages of acceptance by the Documentation restrictions present a particular challenge wider computing community. The Rogues Gallery is a new exper- while trying to build a community. imental testbed focusing on opening access to rogue architectures Here we describe the initial design of the testbed as well as the that may play a larger role in the Post-Moore era of computing. high-level system management strategy that we use to maintain While some of these devices have roots in the embedded and high- and tie together novel architectures. We use existing tools as much performance computing spaces, managing current and emerging as possible; we need not re-invent the systems integration wheel technologies provides a challenge for system administration that to support early novel architectures, which may be few in number and specialized. ,, 2019. ACM ISBN 978-x-xxxx-xxxx-x/YY/MM...$15.00 https://doi.org/10.1145/nnnnnnn.nnnnnnn Figure 1: Initial rogues: FPGAs, the Emu Chick, and the field programmable analog array (FPAA) 2 The Rogues Gallery Benchmarks and Data Sets for The Rogues Gallery was initiated by Georgia Tech’s Center for Irregular and ML applications Tools and Resources Research into Novel Computing Hierarchies (CRNCH) in late 2017. Portability APIs – Kokkos, GraphBLAS, Neuromorphic APIs The Gallery’s focus is to acquire new and unique hardware (the Rogues Gallery Training materials and Tutorials rogues) from vendors, research labs, and start-ups and make this Hosted Hardware hardware widely available to students, faculty, and industry collab- Near-memory Computation Neuromorphic orators within a managed data center environment. By exposing and Data Rearrangement Accelerators students and researchers to this set of unique hardware, we hope to Emu foster cross-cutting discussions about hardware designs that will Chick FPAA (GT) Others.. Non-volatile drive future performance improvements in computing long after Memory Programmable Interconnection the Moore’s Law era of cheap transistors ends. Networks High-bandwidth Memory The primary goal of the Rogues Gallery is to make first-generation Future Devices Traditional Computation start-up or research lab prototypes available to a wide audience and Prototype Accelerators RQL Devices of researchers and developers as soon as it is moderately stable Quantum Metastrider for testing, application development, and benchmarking. Examples RISC-V FPGA of such cutting-edge hardware include Emu Technology’s Chick, FPGA-based memory-centric platforms, field programmable analog Figure 2: High-level overview of the CRNCH Rogues Gallery devices (FPAAs), and others. This mirrors how companies like Intel and IBM are investigating novel hardware, Loihi and TrueNorth respectively, but the Rogues Gallery adds student and collaborator access. to play with ideas. Georgia Tech has a history of providing such The Rogues Gallery provides a coherent, managed environment early access through efforts such as the Sony-Toshiba-IBM Center for exposing new hardware and supporting software and tools to of Competence for the Cell Broadband Engine Processor[2], the students, researchers, and collaborators. This environment allows NSF Keeneland GPU system[34], and work with early high core users to perform critical architecture, systems, and HPC research, count Intel-based platforms[24]. and enables them to train with new technologies so they can be more competitive in today’s rapidly changing job market. As part 3 Initial Rogues of this goal, CRNCH and Georgia Tech are committed to partner- Our initial Rogues Gallery shown in Figure 1 consists of vari- ing with vendors to define a flexible access management model ous field programmable gate arrays (FPGAs), an Emu Chick, anda that allows for a functional and distinctive research experience for field-programmable analog array (FPAA). The gallery also currently both Georgia Tech and external users, while protecting sensitive includes power monitoring infrastructure for embedded-style sys- intellectual property and technologies. tems (NVIDIA Tegra), but we will not go into depth on that more Not all rogues become long-term products. Some fade away typical infrastructure. within a few years (or are acquired by companies that fail to pro- FPGAs for Memory System Exploration Reconfigurable devices ductize the technology). The overall infrastructure of a testbed like FPGAs are not novel in themselves, but the gallery includes focused on rogues must minimize up-front investment to limit the FPGAs for prototyping near-memory and in-network computing. cost of “just trying out” new technology. As these early-access and Currently the Rogues Gallery includes prototype platforms change, the infrastructure must also adapt. • two Nallatech 385 PCIe cards with mid-grade Intel Arria 10 And even if the technology is fantastic, rogues that do not de- FPGAs and 8 GiB of memory; velop communities do not last. In our opinion, initial communities • a Nallatech 520N PCIe card with a Intel Stratix 10, 32 GiB of grow by easing use and access. Some systems, like the Emu Chick, memory, and multiple 100 GBps network ports; and require
Recommended publications
  • Jack Dongarra: Supercomputing Expert and Mathematical Software Specialist
    Biographies Jack Dongarra: Supercomputing Expert and Mathematical Software Specialist Thomas Haigh University of Wisconsin Editor: Thomas Haigh Jack J. Dongarra was born in Chicago in 1950 to a he applied for an internship at nearby Argonne National family of Sicilian immigrants. He remembers himself as Laboratory as part of a program that rewarded a small an undistinguished student during his time at a local group of undergraduates with college credit. Dongarra Catholic elementary school, burdened by undiagnosed credits his success here, against strong competition from dyslexia.Onlylater,inhighschool,didhebeginto students attending far more prestigious schools, to Leff’s connect material from science classes with his love of friendship with Jim Pool who was then associate director taking machines apart and tinkering with them. Inspired of the Applied Mathematics Division at Argonne.2 by his science teacher, he attended Chicago State University and majored in mathematics, thinking that EISPACK this would combine well with education courses to equip Dongarra was supervised by Brian Smith, a young him for a high school teaching career. The first person in researcher whose primary concern at the time was the his family to go to college, he lived at home and worked lab’s EISPACK project. Although budget cuts forced Pool to in a pizza restaurant to cover the cost of his education.1 make substantial layoffs during his time as acting In 1972, during his senior year, a series of chance director of the Applied Mathematics Division in 1970– events reshaped Dongarra’s planned career. On the 1971, he had made a special effort to find funds to suggestion of Harvey Leff, one of his physics professors, protect the project and hire Smith.
    [Show full text]
  • A Peer-To-Peer Framework for Message Passing Parallel Programs
    A Peer-to-Peer Framework for Message Passing Parallel Programs Stéphane GENAUD a;1, and Choopan RATTANAPOKA b;2 a AlGorille Team - LORIA b King Mongkut’s University of Technology Abstract. This chapter describes the P2P-MPI project, a software framework aimed at the development of message-passing programs in large scale distributed net- works of computers. Our goal is to provide a light-weight, self-contained software package that requires minimum effort to use and maintain. P2P-MPI relies on three features to reach this goal: i) its installation and use does not require administra- tor privileges, ii) available resources are discovered and selected for a computation without intervention from the user, iii) program executions can be fault-tolerant on user demand, in a completely transparent fashion (no checkpoint server to config- ure). P2P-MPI is typically suited for organizations having spare individual comput- ers linked by a high speed network, possibly running heterogeneous operating sys- tems, and having Java applications. From a technical point of view, the framework has three layers: an infrastructure management layer at the bottom, a middleware layer containing the services, and the communication layer implementing an MPJ (Message Passing for Java) communication library at the top. We explain the de- sign and the implementation of these layers, and we discuss the allocation strategy based on network locality to the submitter. Allocation experiments of more than five hundreds peers are presented to validate the implementation. We also present how fault-management issues have been tackled. First, the monitoring of the infras- tructure itself is implemented through the use of failure detectors.
    [Show full text]
  • Simulating Low Precision Floating-Point Arithmetic
    SIMULATING LOW PRECISION FLOATING-POINT ARITHMETIC Higham, Nicholas J. and Pranesh, Srikara 2019 MIMS EPrint: 2019.4 Manchester Institute for Mathematical Sciences School of Mathematics The University of Manchester Reports available from: http://eprints.maths.manchester.ac.uk/ And by contacting: The MIMS Secretary School of Mathematics The University of Manchester Manchester, M13 9PL, UK ISSN 1749-9097 SIMULATING LOW PRECISION FLOATING-POINT ARITHMETIC∗ NICHOLAS J. HIGHAMy AND SRIKARA PRANESH∗ Abstract. The half precision (fp16) floating-point format, defined in the 2008 revision of the IEEE standard for floating-point arithmetic, and a more recently proposed half precision format bfloat16, are increasingly available in GPUs and other accelerators. While the support for low precision arithmetic is mainly motivated by machine learning applications, general purpose numerical algorithms can benefit from it, too, gaining in speed, energy usage, and reduced communication costs. Since the appropriate hardware is not always available, and one may wish to experiment with new arithmetics not yet implemented in hardware, software simulations of low precision arithmetic are needed. We discuss how to simulate low precision arithmetic using arithmetic of higher precision. We examine the correctness of such simulations and explain via rounding error analysis why a natural method of simulation can provide results that are more accurate than actual computations at low precision. We provide a MATLAB function chop that can be used to efficiently simulate fp16 and bfloat16 arithmetics, with or without the representation of subnormal numbers and with the options of round to nearest, directed rounding, stochastic rounding, and random bit flips in the significand. We demonstrate the advantages of this approach over defining a new MATLAB class and overloading operators.
    [Show full text]
  • Design of a Modern Distributed and Accelerated Linear Algebra Library
    SLATE: Design of a Modern Distributed and Accelerated Linear Algebra Library Mark Gates Jakub Kurzak Ali Charara [email protected] [email protected] [email protected] University of Tennessee University of Tennessee University of Tennessee Knoxville, TN Knoxville, TN Knoxville, TN Asim YarKhan Jack Dongarra [email protected] [email protected] University of Tennessee University of Tennessee Knoxville, TN Knoxville, TN Oak Ridge National Laboratory University of Manchester ABSTRACT USA. ACM, New York, NY, USA, 13 pages. https://doi.org/10.1145/3295500. The SLATE (Software for Linear Algebra Targeting Exascale) library 3356223 is being developed to provide fundamental dense linear algebra capabilities for current and upcoming distributed high-performance systems, both accelerated CPU–GPU based and CPU based. SLATE will provide coverage of existing ScaLAPACK functionality, includ- ing the parallel BLAS; linear systems using LU and Cholesky; least 1 INTRODUCTION squares problems using QR; and eigenvalue and singular value The objective of SLATE is to provide fundamental dense linear problems. In this respect, it will serve as a replacement for Sca- algebra capabilities for the high-performance computing (HPC) LAPACK, which after two decades of operation, cannot adequately community at large. The ultimate objective of SLATE is to replace be retrofitted for modern accelerated architectures. SLATE uses ScaLAPACK [7], which has become the industry standard for dense modern techniques such as communication-avoiding algorithms, linear algebra operations in distributed memory environments. Af- lookahead panels to overlap communication and computation, and ter two decades of operation, ScaLAPACK is past the end of its life task-based scheduling, along with a modern C++ framework.
    [Show full text]
  • R00456--FM Getting up to Speed
    GETTING UP TO SPEED THE FUTURE OF SUPERCOMPUTING Susan L. Graham, Marc Snir, and Cynthia A. Patterson, Editors Committee on the Future of Supercomputing Computer Science and Telecommunications Board Division on Engineering and Physical Sciences THE NATIONAL ACADEMIES PRESS Washington, D.C. www.nap.edu THE NATIONAL ACADEMIES PRESS 500 Fifth Street, N.W. Washington, DC 20001 NOTICE: The project that is the subject of this report was approved by the Gov- erning Board of the National Research Council, whose members are drawn from the councils of the National Academy of Sciences, the National Academy of Engi- neering, and the Institute of Medicine. The members of the committee responsible for the report were chosen for their special competences and with regard for ap- propriate balance. Support for this project was provided by the Department of Energy under Spon- sor Award No. DE-AT01-03NA00106. Any opinions, findings, conclusions, or recommendations expressed in this publication are those of the authors and do not necessarily reflect the views of the organizations that provided support for the project. International Standard Book Number 0-309-09502-6 (Book) International Standard Book Number 0-309-54679-6 (PDF) Library of Congress Catalog Card Number 2004118086 Cover designed by Jennifer Bishop. Cover images (clockwise from top right, front to back) 1. Exploding star. Scientific Discovery through Advanced Computing (SciDAC) Center for Supernova Research, U.S. Department of Energy, Office of Science. 2. Hurricane Frances, September 5, 2004, taken by GOES-12 satellite, 1 km visible imagery. U.S. National Oceanographic and Atmospheric Administration. 3. Large-eddy simulation of a Rayleigh-Taylor instability run on the Lawrence Livermore National Laboratory MCR Linux cluster in July 2003.
    [Show full text]
  • Mathematics People
    NEWS Mathematics People or up to ten years post-PhD, are eligible. Awardees receive Braverman Receives US$1 million distributed over five years. NSF Waterman Award —From an NSF announcement Mark Braverman of Princeton University has been selected as a Prizes of the Association cowinner of the 2019 Alan T. Wa- terman Award of the National Sci- for Women in Mathematics ence Foundation (NSF) for his work in complexity theory, algorithms, The Association for Women in Mathematics (AWM) has and the limits of what is possible awarded a number of prizes in 2019. computationally. According to the Catherine Sulem of the Univer- prize citation, his work “focuses on sity of Toronto has been named the Mark Braverman complexity, including looking at Sonia Kovalevsky Lecturer for 2019 by algorithms for optimization, which, the Association for Women in Math- when applied, might mean planning a route—how to get ematics (AWM) and the Society for from point A to point B in the most efficient way possible. Industrial and Applied Mathematics “Algorithms are everywhere. Most people know that (SIAM). The citation states: “Sulem every time someone uses a computer, algorithms are at is a prominent applied mathemati- work. But they also occur in nature. Braverman examines cian working in the area of nonlin- randomness in the motion of objects, down to the erratic Catherine Sulem ear analysis and partial differential movement of particles in a fluid. equations. She has specialized on “His work is also tied to algorithms required for learning, the topic of singularity development in solutions of the which serve as building blocks to artificial intelligence, and nonlinear Schrödinger equation (NLS), on the problem of has even had implications for the foundations of quantum free surface water waves, and on Hamiltonian partial differ- computing.
    [Show full text]
  • Co 2019 Vivekanandan Balasubramanian ALL RIGHTS
    c 2019 Vivekanandan Balasubramanian ALL RIGHTS RESERVED A PROGRAMMING MODEL AND EXECUTION SYSTEM FOR ADAPTIVE ENSEMBLE APPLICATIONS ON HIGH PERFORMANCE COMPUTING SYSTEMS by VIVEKANANDAN BALASUBRAMANIAN A dissertation submitted to the School of Graduate Studies Rutgers, The State University of New Jersey In partial fulfillment of the requirements For the degree of Doctor of Philosophy Graduate Program in Electrical and Computer Engineering Written under the direction of Shantenu Jha and Matteo Turilli and approved by New Brunswick, New Jersey October, 2019 ABSTRACT OF THE DISSERTATION A programming model and execution system for adaptive ensemble applications on high performance computing systems by Vivekanandan Balasubramanian Dissertation Directors: Shantenu Jha and Matteo Turilli Traditionally, advances in high-performance scientific computing have focused on the scale, performance, and optimization of an application with a large, single task, and less on applications comprised of multiple tasks. However, many scientific problems are expressed as applications that require the collective outcome of one or more ensembles of computational tasks in order to provide insight into the problem being studied. Depending on the scientific problem, a task of an ensemble can be any type of a program: from a molecular simulation, to a data analysis or a machine learning model. With different communication and coordination patterns, both within and across ensembles, the number and type of applications that can be formulated as ensembles is vast and spans many scientific domains, including biophysics, climate science, polar science and earth science. The performance of ensemble applications can be improved further by using partial results to adapt the application at runtime. Partial results of ongoing executions can be analyzed with multiple methods to adapt the application to focus on relevant portions of the problem space or reduce the time to execution of the application.
    [Show full text]
  • Michel Foucault Ronald C Kessler Graham Colditz Sigmund Freud
    ANK RESEARCHER ORGANIZATION H INDEX CITATIONS 1 Michel Foucault Collège de France 296 1026230 2 Ronald C Kessler Harvard University 289 392494 3 Graham Colditz Washington University in St Louis 288 316548 4 Sigmund Freud University of Vienna 284 552109 Brigham and Women's Hospital 5 284 332728 JoAnn E Manson Harvard Medical School 6 Shizuo Akira Osaka University 276 362588 Centre de Sociologie Européenne; 7 274 771039 Pierre Bourdieu Collège de France Massachusetts Institute of Technology 8 273 308874 Robert Langer MIT 9 Eric Lander Broad Institute Harvard MIT 272 454569 10 Bert Vogelstein Johns Hopkins University 270 410260 Brigham and Women's Hospital 11 267 363862 Eugene Braunwald Harvard Medical School Ecole Polytechnique Fédérale de 12 264 364838 Michael Graetzel Lausanne 13 Frank B Hu Harvard University 256 307111 14 Yi Hwa Liu Yale University 255 332019 15 M A Caligiuri City of Hope National Medical Center 253 345173 16 Gordon Guyatt McMaster University 252 284725 17 Salim Yusuf McMaster University 250 357419 18 Michael Karin University of California San Diego 250 273000 Yale University; Howard Hughes 19 244 221895 Richard A Flavell Medical Institute 20 T W Robbins University of Cambridge 239 180615 21 Zhong Lin Wang Georgia Institute of Technology 238 234085 22 Martín Heidegger Universität Freiburg 234 335652 23 Paul M Ridker Harvard Medical School 234 318801 24 Daniel Levy National Institutes of Health NIH 232 286694 25 Guido Kroemer INSERM 231 240372 26 Steven A Rosenberg National Institutes of Health NIH 231 224154 Max Planck
    [Show full text]
  • CV January 2001 Geoffrey Charles
    Geoffrey Charles Fox Phone Email Cell: 8122194643 [email protected] Informatics: 8128567977 [email protected] Lab: 8128560927 [email protected] Home: 8123239196 Web Site: http://www.infomall.org Addresses ..................................................................................................... 2 Date of Birth ................................................................................................. 2 Education ...................................................................................................... 2 Professional Experience .................................................................................. 2 Awards and Honors ........................................................................................ 3 Board Membership ......................................................................................... 3 Industry Experience ....................................................................................... 3 Current Journal Editor..................................................................................... 3 Previous Journal Responsibility ........................................................................ 4 PhD Theses supervised ................................................................................... 4 Research Activities ......................................................................................... 7 Conferences Organized ................................................................................... 8 Publications: Introduction ..............................................................................
    [Show full text]
  • NHBS Backlist Bargains 2011 Main Web Page NHBS Home Page
    www.nhbs.com [email protected] T: +44 (0)1803 865913 The Migration Biochemistry How To Identify How and Why Collins Field Guide: The Iberian Lynx: Ecology of Birds £51.99 £30.50 Trees in Southen Species Multiply: Birds of the Extinction or Africa The Radiation of Palearctic - Non- Recovery? £83.00 £48.99 £14.99 £8.99 Darwin's Finches Passerines £6.99 £4.50 £27.95 £16.50 £24.99 £12.99 For Love of Insects Field Guide to the The Status and Atlas of Seeds and The Biology of A Primer of £18.95 £11.50 Birds of The Distribution of Fruits of Central African Savannahs Ecological Statistics Gambia and Dragonflies of the and East-European £34.95 £20.50 £32.99 £19.50 Senegal Mediterranean Flora £24.99 £14.99 Region £359.00 £209.99 £9.99 £6.50 Browse the subject pages Dear Customers, Mammals Birds Welcome to the 2011 Backlist Bargains Catalogue! Every year we offer you the Reptiles & Amphibians chance to update your library collections, top up on textbooks or explore new interests, at greatly reduced prices. This year we have nearly 5000 books at up Fishes to 50% off. Invertebrates Palaeontology You'll find books from across our range of scientific and environmental subjects, Marine & Freshwater Biology from heavyweight science and monographs to field guides and natural history General Natural History writing. Regional & Travel Botany & Plant Science Please enjoy browsing the catalogue. The NHBS Backlist Bargains sale ends March Animal & General Biology 31st 2011. Take advantage of these great discounts - Order Now! Evolutionary Biology Ecology Happy reading and buying, Habitats & Ecosystems Conservation & Biodiversity Nigel Massen Managing Director Environmental Science Physical Sciences Sustainable Development Using the Backlist Bargains Catalogue Data Analysis Reference Now that NHBS Catalogues are online there are many ways to access the title information that you need.
    [Show full text]
  • Studying the Regulatory Landscape of Flowering Plants
    Studying the Regulatory Landscape of Flowering Plants Jan Van de Velde Promoter: Prof. Dr. Klaas Vandepoele Co-Promoter: Prof. Dr. Jan Fostier Ghent University Faculty of Sciences Department of Plant Biotechnology and Bioinformatics VIB Department of Plant Systems Biology Comparative and Integrative Genomics Research funded by a PhD grant of the Institute for the Promotion of Innovation through Science and Technology in Flanders (IWT Vlaanderen). Dissertation submitted in fulfilment of the requirements for the degree of Doctor in Sciences:Bioinformatics. Academic year: 2016-2017 Examination Commitee Prof. Dr. Geert De Jaeger (chair) Faculty of Sciences, Department of Plant Biotechnology and Bioinformatics, Ghent University Prof. Dr. Klaas Vandepoele (promoter) Faculty of Sciences, Department of Plant Biotechnology and Bioinformatics, Ghent University Prof. Dr. Jan Fostier (co-promoter) Faculty of Engineering and Architecture, Department of Information Technology (INTEC), Ghent University - iMinds Prof. Dr. Kerstin Kaufmann Institute for Biochemistry and Biology, Potsdam University Prof. Dr. Pieter de Bleser Inflammation Research Center, Flanders Institute of Biotechnology (VIB) and Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium Dr. Vanessa Vermeirssen Faculty of Sciences, Department of Plant Biotechnology and Bioinformatics, Ghent University Dr. Stefanie De Bodt Crop Science Division, Bayer CropScience SA-NV, Functional Biology Dr. Inge De Clercq Department of Animal, Plant and Soil Science, ARC Centre of Excellence in Plant Energy Biology, La Trobe University and Faculty of Sciences, Department of Plant Biotechnology and Bioinformatics, Ghent University iii Thank You! Throughout this PhD I have received a lot of support, therefore there are a number of people I would like to thank. First of all, I would like to thank Klaas Vandepoele, for his support and guidance.
    [Show full text]
  • Jack Dongarra University of Tennessee and Oak Ridge National Laboratory
    Seminario EEUU/Venezuela de Computación de Alto Rendimiento 2000 US/Venezuela Workshop on High Performance Computing 2000 - WoHPC 2000 4 al 6 de Abril del 2000 Puerto La Cruz Venezuela Jack Dongarra University of Tennessee and Oak Ridge National Laboratory http://www.cs.utk.edu/~dongarra/ 1 ♦ Look at trends in HPC Ø Top500 statistics ♦ NetSolve Ø Example of grid middleware ♦ Performance on today’s architecture Ø ATLAS effort ♦ Tools for performance evaluation Ø Performance API (PAPI) 2 - Started in 6/93 by JD,Hans W. Meuer and Erich Strohmaier - Basis for analyzing the HCP market - Quantify observations - Detection of trends (market, architecture, technology) 3 - Listing of the 500 most powerful Computers in the World - Yardstick: Rmax from LINPACK MPP Ax=b, dense problem TPP performance - Updated twice a year Rate Size SC‘xy in November Meeting in Mannheim in June - All data available from www.top500.org 4 ♦ Original classes of ♦ A way for machines tracking trends Ø Sequential Ø SMPs Øin performance Ø MPPs Ø in market Ø SIMDs Øin classes of ♦ Two new classes HPC systems Ø Beowulf-class ØArchitecture systems Ø ØTechnology Clustering of SMPs and DSMs ØRequires additional terminology 5 Ø “Constellation” ClusterCluster ofof ClustersClusters ♦ An ensemble of N nodes each comprising p computing elements ♦ The p elements are tightly bound shared memory (e.g. smp, dsm) ♦ The N nodes are loosely coupled, i.e.: distributed memory 4TF Blue Pacific SST 3 x 480 4-way SMP nodes ♦ p is greater than N 3.9 TF peak performance 2.6 TB memory ♦ Distinction
    [Show full text]