Pubtex Output 2003.06.03:1113

Total Page:16

File Type:pdf, Size:1020Kb

Pubtex Output 2003.06.03:1113 Cray Assembly Language (CAL) for Cray X1™ Systems Reference Manual S–2314–50 © 2001, 2003 Cray Inc. All Rights Reserved. This manual or parts thereof may not be reproduced in any form unless permitted by contract or by written permission of Cray Inc. U.S. GOVERNMENT RESTRICTED RIGHTS NOTICE The Computer Software is delivered as "Commercial Computer Software" as defined in DFARS 48 CFR 252.227-7014. All Computer Software and Computer Software Documentation acquired by or for the U.S. Government is provided with Restricted Rights. Use, duplication or disclosure by the U.S. Government is subject to the restrictions described in FAR 48 CFR 52.227-14 or DFARS 48 CFR 252.227-7014, as applicable. Technical Data acquired by or for the U.S. Government, if any, is provided with Limited Rights. Use, duplication or disclosure by the U.S. Government is subject to the restrictions described in FAR 48 CFR 52.227-14 or DFARS 48 CFR 252.227-7013, as applicable. Autotasking, CF77, Cray, Cray Ada, Cray Channels, Cray Chips, CraySoft, Cray Y-MP, Cray-1, CRInform, CRI/TurboKiva, HSX, LibSci, MPP Apprentice, SSD, SuperCluster, UNICOS, UNICOS/mk, and X-MP EA are federally registered trademarks and Because no workstation is an island, CCI, CCMT, CF90, CFT, CFT2, CFT77, ConCurrent Maintenance Tools, COS, Cray Animation Theater, Cray APP, Cray C90, Cray C90D, Cray CF90, Cray C++ Compiling System, CrayDoc, Cray EL, Cray Fortran Compiler, Cray J90, Cray J90se, Cray J916, Cray J932, CrayLink, Cray MTA, Cray MTA-2, Cray MTX, Cray NQS, Cray/REELlibrarian, Cray S-MP, Cray SSD-T90, Cray SV1, Cray SV1ex, Cray SV2, Cray SX-5, Cray SX-6, Cray T90, Cray T94, Cray T916, Cray T932, Cray T3D, Cray T3D MC, Cray T3D MCA, Cray T3D SC, Cray T3E, CrayTutor, Cray X1, Cray X-MP, Cray XMS, Cray-2, CSIM, CVT, Delivering the power . ., DGauss, Docview, EMDS, GigaRing, HEXAR, IOS, ND Series Network Disk Array, Network Queuing Environment, Network Queuing Tools, OLNET, RQS, SEGLDR, SMARTE, SUPERLINK, System Maintenance and Remote Testing Environment, Trusted UNICOS, UNICOS MAX, and UNICOS/mp are trademarks of Cray Inc. AIX and IBM are trademarks and RISC System/6000 is a product of International Business Machines Corporation. AXP and DEC are trademarks of Digital Equipment Corporation. DynaText, DynaVerse, DynaWeb, and EBT are trademarks of Electronic Book Technologies, Inc. HP and HP-UX are trademarks and HP 9000 is a product of Hewlett-Packard Company. IRIX, Mindshare, SGI, and Silicon Graphics are trademarks of Silicon Graphics, Inc. Netscape and Netscape Navigator are trademarks of Netscape Communications Corporation. NFS, Solaris, SPARC, Sun, and SunOS are trademarks of Sun Microsystems, Inc. Open Software Foundation, OSF, and OSF/1 are trademarks of Open Software Foundation, Inc. UNIX is a registered trademark in the United States and other countries, licensed exclusively through X/Open Company Limited. The UNICOS, UNICOS/mk, and UNICOS/mp operating systems are derived from UNIX System V. These operating systems are also based in part on the Fourth Berkeley Software Distribution (BSD) under license from The Regents of the University of California. Record of Revision Version Description 1.0 August 19, 2002 Draft version to support the Programming Environment 4.1 release. 1.1 June 2003 Supports the Cray Assember 1.1 and Programming Environment 5.0 releases. S–2314–50 i Contents Page Preface xv Accessing Cray Documentation . xv Error Message Explanations . xvi Typographical Conventions . xvi Ordering Documentation . xvii Reader Comments . xviii Introduction [1] 1 Capabilities . 1 Related Publications . 2 Cray X1 Instruction Set Overview [2] 3 Data Types . 3 Integer . 3 Floating-point . 4 Addresses . 4 Registers . 5 Address and Scalar Registers . 6 Vector Registers . 7 Vector Length Register . 8 Mask Registers . 8 Vector Carry Register . 8 Bit Matrix Multiply Register . 9 Control Registers . 9 Program Counter . 9 Performance Counters . 9 CAL Source Statement Format . 10 S–2314–50 iii Cray Assembly Language (CAL) for Cray X1™ Systems Reference Manual Page Scalar Instructions . 11 Immediate Loads . 11 Scalar Memory References . 12 Branches and Jumps . 13 Scalar Integer Functions . 14 Scalar Floating-point Functions . 17 Bit Matrix Multiplication . 18 Byte and Halfword Access . 19 Scalar Atomic Memory Operations . 20 Other Scalar Instructions . 21 Vector Instructions . 22 Elemental Vector Operations . 22 Vector Memory References . 22 Elemental Vector Functions . 23 Mask Operations . 25 Other Vector Instructions . 27 Memory Ordering . 28 Summary of Rules . 30 Special Syntax Forms . 31 CAL Pseudo Instruction Overview [3] 33 Pseudo Instruction Format . 34 Program Control . 34 Loader Linkage . 34 Mode Control . 35 Section Control . 35 Message Control . 35 Listing Control . 36 Symbol Definition . 36 Data Definition . 36 Conditional Assembly . 37 iv S–2314–50 Contents Page Micro Definition . 37 Defined Sequences . 39 File Control . 40 CAL Program Organization [4] 41 Program Modules . 41 Global Definitions and Local Definitions . 41 Program Segments . 42 Program . 42 Sections . 44 Examples . 44 Example 1: Global and Local Definitions . 44 Example 2: Sections and Qualifiers . 45 CAL Assembler Invocation [5] 47 Assembler Command Line . 47 Environment Variables . 52 ASDEF Environment Variable . 53 LPP Environment Variable . 53 TMPDIR Environment Variable . 53 TARGET Shell Variable . 53 Assembler Execution . 53 Reading Source Files . 54 Using Binary Definition Files . 54 CPU Compatibility Checking . 55 Multiple References to a Definition . 55 Included Files . 57 Source Statement Listing File . 57 Source Statement Listing . 58 Cross-reference Listing . 60 Diagnostic Messages . 62 S–2314–50 v Cray Assembly Language (CAL) for Cray X1™ Systems Reference Manual Page Diagnostic Message Listing File . 63 Object File . 63 Creating a Binary Definition File . 63 Symbols . 64 Macros . 64 Opdefs . 65 Opsyns . 65 Micros . 65 Linking . 65 CAL Source Statements [6] 67 Source Statement Format . 68 Label Field . 68 Result Field . 68 Operand Field . 68 Comment Field . 69 Old Format . 69 Case Sensitivity . 70 Symbols . 70 Symbol Qualification . 71 Unqualified Symbol . 71 Qualified Symbols . 72 Symbol Definition . 73 Symbol Attributes . 74 Type Attribute . 74 Relative Attributes . 74 Redefinable Attributes . 76 Symbol Reference . 76 Tags............................ 76 Constants . 78 Floating Constant . 78 vi S–2314–50 Contents Page Integer Constant . 80 Character Constants . 82 Data Items . 82 Floating Data Item . 83 Integer Data Item . 84 Character Data Item . 84 Literals . 85 Micros . 88 Location Elements . ..
Recommended publications
  • Cray Product Installation and Configuration
    Cray Product Installation and Configuration Scott E. Grabow, Silicon Graphics, 655F Lone Oak Drive, Eagan, MN 55121, [email protected] ABSTRACT: The Common Installation Tool (CIT) has had a large impact upon the installation process for Cray software. The migration to CIT for installation has yielded a large number of questions about the methodology for installing new software via CIT. This document discusses the installation process with CIT and how CIT’s introduction has brought about changes to our installation process and documentation. 1 Introduction • Provide a complete replacement approach rather than an incremental software approach or a mods based release The Common Installation Tool (CIT) has made a dramatic change to the processes and manuals for installation of UNICOS • Provide a better way to ensure that relocatable and source and asynchronous products. This document is divided into four files match on site after an install major sections dealing with the changes and the impacts of the • Reduce the amount of time to do an install installation process. • Support single source upgrade option for all systems 2 Changes made between UNICOS 9.0 and Some customers have expressed that the UNICOS 9.0 instal- UNICOS 10.0/UNICOS/mk lation process is rather hard to understand initially due to the incremental software approach, the number of packages needed With the release of UNICOS 9.2 and UNICOS/mk the pack- to be loaded, and when these packages need to be installed. Part aging and installation of Cray software has undergone major of the problem was the UNICOS Installation Guide[1] did not changes when compared to the UNICOS 9.0 packaging and present a clear path as what should be performed during a installation process.
    [Show full text]
  • Vector Vs. Scalar Processors: a Performance Comparison Using a Set of Computational Science Benchmarks
    Vector vs. Scalar Processors: A Performance Comparison Using a Set of Computational Science Benchmarks Mike Ashworth, Ian J. Bush and Martyn F. Guest, Computational Science & Engineering Department, CCLRC Daresbury Laboratory ABSTRACT: Despite a significant decline in their popularity in the last decade vector processors are still with us, and manufacturers such as Cray and NEC are bringing new products to market. We have carried out a performance comparison of three full-scale applications, the first, SBLI, a Direct Numerical Simulation code from Computational Fluid Dynamics, the second, DL_POLY, a molecular dynamics code and the third, POLCOMS, a coastal-ocean model. Comparing the performance of the Cray X1 vector system with two massively parallel (MPP) micro-processor-based systems we find three rather different results. The SBLI PCHAN benchmark performs excellently on the Cray X1 with no code modification, showing 100% vectorisation and significantly outperforming the MPP systems. The performance of DL_POLY was initially poor, but we were able to make significant improvements through a few simple optimisations. The POLCOMS code has been substantially restructured for cache-based MPP systems and now does not vectorise at all well on the Cray X1 leading to poor performance. We conclude that both vector and MPP systems can deliver high performance levels but that, depending on the algorithm, careful software design may be necessary if the same code is to achieve high performance on different architectures. KEYWORDS: vector processor, scalar processor, benchmarking, parallel computing, CFD, molecular dynamics, coastal ocean modelling All of the key computational science groups in the 1. Introduction UK made use of vector supercomputers during their halcyon days of the 1970s, 1980s and into the early 1990s Vector computers entered the scene at a very early [1]-[3].
    [Show full text]
  • UNICOS® Installation Guide for CRAY J90lm Series SG-5271 9.0.2
    UNICOS® Installation Guide for CRAY J90lM Series SG-5271 9.0.2 / ' Cray Research, Inc. Copyright © 1996 Cray Research, Inc. All Rights Reserved. This manual or parts thereof may not be reproduced in any form unless permitted by contract or by written permission of Cray Research, Inc. Portions of this product may still be in development. The existence of those portions still in development is not a commitment of actual release or support by Cray Research, Inc. Cray Research, Inc. assumes no liability for any damages resulting from attempts to use any functionality or documentation not officially released and supported. If it is released, the final form and the time of official release and start of support is at the discretion of Cray Research, Inc. Autotasking, CF77, CRAY, Cray Ada, CRAYY-MP, CRAY-1, HSX, SSD, UniChem, UNICOS, and X-MP EA are federally registered trademarks and CCI, CF90, CFr, CFr2, CFT77, COS, Cray Animation Theater, CRAY C90, CRAY C90D, Cray C++ Compiling System, CrayDoc, CRAY EL, CRAY J90, Cray NQS, CraylREELlibrarian, CraySoft, CRAY T90, CRAY T3D, CrayTutor, CRAY X-MP, CRAY XMS, CRAY-2, CRInform, CRIlThrboKiva, CSIM, CVT, Delivering the power ..., DGauss, Docview, EMDS, HEXAR, lOS, LibSci, MPP Apprentice, ND Series Network Disk Array, Network Queuing Environment, Network Queuing '!boIs, OLNET, RQS, SEGLDR, SMARTE, SUPERCLUSTER, SUPERLINK, Trusted UNICOS, and UNICOS MAX are trademarks of Cray Research, Inc. Anaconda is a trademark of Archive Technology, Inc. EMASS and ER90 are trademarks of EMASS, Inc. EXABYTE is a trademark of EXABYTE Corporation. GL and OpenGL are trademarks of Silicon Graphics, Inc.
    [Show full text]
  • Through the Years… When Did It All Begin?
    & Through the years… When did it all begin? 1974? 1978? 1963? 2 CDC 6600 – 1974 NERSC started service with the first Supercomputer… ● A well-used system - Serial Number 1 ● On its last legs… ● Designed and built in Chippewa Falls ● Launch Date: 1963 ● Load / Store Architecture ● First RISC Computer! ● First CRT Monitor ● Freon Cooled ● State-of-the-Art Remote Access at NERSC ● Via 4 acoustic modems, manually answered capable of 10 characters /sec 3 50th Anniversary of the IBM / Cray Rivalry… Last week, CDC had a press conference during which they officially announced their 6600 system. I understand that in the laboratory developing this system there are only 32 people, “including the janitor”… Contrasting this modest effort with our vast development activities, I fail to understand why we have lost our industry leadership position by letting someone else offer the world’s most powerful computer… T.J. Watson, August 28, 1963 4 2/6/14 Cray Higher-Ed Roundtable, July 22, 2013 CDC 7600 – 1975 ● Delivered in September ● 36 Mflop Peak ● ~10 Mflop Sustained ● 10X sustained performance vs. the CDC 6600 ● Fast memory + slower core memory ● Freon cooled (again) Major Innovations § 65KW Memory § 36.4 MHz clock § Pipelined functional units 5 Cray-1 – 1978 NERSC transitions users ● Serial 6 to vector architectures ● An fairly easy transition for application writers ● LTSS was converted to run on the Cray-1 and became known as CTSS (Cray Time Sharing System) ● Freon Cooled (again) ● 2nd Cray 1 added in 1981 Major Innovations § Vector Processing § Dependency
    [Show full text]
  • NQE Release Overview RO–5237 3.3
    NQE Release Overview RO–5237 3.3 Document Number 007–3795–001 Copyright © 1998 Silicon Graphics, Inc. and Cray Research, Inc. All Rights Reserved. This manual or parts thereof may not be reproduced in any form unless permitted by contract or by written permission of Silicon Graphics, Inc. or Cray Research, Inc. RESTRICTED RIGHTS LEGEND Use, duplication, or disclosure of the technical data contained in this document by the Government is subject to restrictions as set forth in subdivision (c) (1) (ii) of the Rights in Technical Data and Computer Software clause at DFARS 52.227-7013 and/or in similar or successor clauses in the FAR, or in the DOD or NASA FAR Supplement. Unpublished rights reserved under the Copyright Laws of the United States. Contractor/manufacturer is Silicon Graphics, Inc., 2011 N. Shoreline Blvd., Mountain View, CA 94043-1389. Autotasking, CF77, CRAY, Cray Ada, CraySoft, CRAY Y-MP, CRAY-1, CRInform, CRI/TurboKiva, HSX, LibSci, MPP Apprentice, SSD, SUPERCLUSTER, UNICOS, and X-MP EA are federally registered trademarks and Because no workstation is an island, CCI, CCMT, CF90, CFT, CFT2, CFT77, ConCurrent Maintenance Tools, COS, Cray Animation Theater, CRAY APP, CRAY C90, CRAY C90D, Cray C++ Compiling System, CrayDoc, CRAY EL, CRAY J90, CRAY J90se, CrayLink, Cray NQS, Cray/REELlibrarian, CRAY S-MP, CRAY SSD-T90, CRAY T90, CRAY T3D, CRAY T3E, CrayTutor, CRAY X-MP, CRAY XMS, CRAY-2, CSIM, CVT, Delivering the power . ., DGauss, Docview, EMDS, GigaRing, HEXAR, IOS, ND Series Network Disk Array, Network Queuing Environment, Network Queuing Tools, OLNET, RQS, SEGLDR, SMARTE, SUPERLINK, System Maintenance and Remote Testing Environment, Trusted UNICOS, UNICOS MAX, and UNICOS/mk are trademarks of Cray Research, Inc.
    [Show full text]
  • Unix and Linux System Administration and Shell Programming
    Unix and Linux System Administration and Shell Programming Unix and Linux System Administration and Shell Programming version 56 of August 12, 2014 Copyright © 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2009, 2010, 2011, 2012, 2013, 2014 Milo This book includes material from the http://www.osdata.com/ website and the text book on computer programming. Distributed on the honor system. Print and read free for personal, non-profit, and/or educational purposes. If you like the book, you are encouraged to send a donation (U.S dollars) to Milo, PO Box 5237, Balboa Island, California, USA 92662. This is a work in progress. For the most up to date version, visit the website http://www.osdata.com/ and http://www.osdata.com/programming/shell/unixbook.pdf — Please add links from your website or Facebook page. Professors and Teachers: Feel free to take a copy of this PDF and make it available to your class (possibly through your academic website). This way everyone in your class will have the same copy (with the same page numbers) despite my continual updates. Please try to avoid posting it to the public internet (to avoid old copies confusing things) and take it down when the class ends. You can post the same or a newer version for each succeeding class. Please remove old copies after the class ends to prevent confusing the search engines. You can contact me with a specific version number and class end date and I will put it on my website. version 56 page 1 Unix and Linux System Administration and Shell Programming Unix and Linux Administration and Shell Programming chapter 0 This book looks at Unix (and Linux) shell programming and system administration.
    [Show full text]
  • Cray Research Software Report
    Cray Research Software Report Irene M. Qualters, Cray Research, Inc., 655F Lone Oak Drive, Eagan, Minnesota 55121 ABSTRACT: This paper describes the Cray Research Software Division status as of Spring 1995 and gives directions for future hardware and software architectures. 1 Introduction single CPU speedups that can be anticipated based on code per- formance on CRAY C90 systems: This report covers recent Supercomputer experiences with Cray Research products and lays out architectural directions for CRAY C90 Speed Speedup on CRAY T90s the future. It summarizes early customer experience with the latest CRAY T90 and CRAY J90 systems, outlines directions Under 100 MFLOPS 1.4x over the next five years, and gives specific plans for ‘95 deliv- 200 to 400 MFLOPS 1.6x eries. Over 600 MFLOPS 1.75x The price/performance of CRAY T90 systems shows sub- 2 Customer Status stantial improvements. For example, LINPACK CRAY T90 price/performance is 3.7 times better than on CRAY C90 sys- Cray Research enjoyed record volumes in 1994, expanding tems. its installed base by 20% to more than 600 systems. To accom- plish this, we shipped 40% more systems than in 1993 (our pre- CRAY T94 single CPU ratios to vious record year). This trend will continue, with similar CRAY C90 speeds: percentage increases in 1995, as we expand into new applica- tion areas, including finance, multimedia, and “real time.” • LINPACK 1000 x 1000 -> 1.75x In the face of this volume, software reliability metrics show • NAS Parallel Benchmarks (Class A) -> 1.48 to 1.67x consistent improvements. While total incoming problem re- • Perfect Benchmarks -> 1.3 to 1.7x ports showed a modest decrease in 1994, our focus on MTTI (mean time to interrupt) for our largest systems yielded a dou- bling in reliability by year end.
    [Show full text]
  • The Gemini Network
    The Gemini Network Rev 1.1 Cray Inc. © 2010 Cray Inc. All Rights Reserved. Unpublished Proprietary Information. This unpublished work is protected by trade secret, copyright and other laws. Except as permitted by contract or express written permission of Cray Inc., no part of this work or its content may be used, reproduced or disclosed in any form. Technical Data acquired by or for the U.S. Government, if any, is provided with Limited Rights. Use, duplication or disclosure by the U.S. Government is subject to the restrictions described in FAR 48 CFR 52.227-14 or DFARS 48 CFR 252.227-7013, as applicable. Autotasking, Cray, Cray Channels, Cray Y-MP, UNICOS and UNICOS/mk are federally registered trademarks and Active Manager, CCI, CCMT, CF77, CF90, CFT, CFT2, CFT77, ConCurrent Maintenance Tools, COS, Cray Ada, Cray Animation Theater, Cray APP, Cray Apprentice2, Cray C90, Cray C90D, Cray C++ Compiling System, Cray CF90, Cray EL, Cray Fortran Compiler, Cray J90, Cray J90se, Cray J916, Cray J932, Cray MTA, Cray MTA-2, Cray MTX, Cray NQS, Cray Research, Cray SeaStar, Cray SeaStar2, Cray SeaStar2+, Cray SHMEM, Cray S-MP, Cray SSD-T90, Cray SuperCluster, Cray SV1, Cray SV1ex, Cray SX-5, Cray SX-6, Cray T90, Cray T916, Cray T932, Cray T3D, Cray T3D MC, Cray T3D MCA, Cray T3D SC, Cray T3E, Cray Threadstorm, Cray UNICOS, Cray X1, Cray X1E, Cray X2, Cray XD1, Cray X-MP, Cray XMS, Cray XMT, Cray XR1, Cray XT, Cray XT3, Cray XT4, Cray XT5, Cray XT5h, Cray Y-MP EL, Cray-1, Cray-2, Cray-3, CrayDoc, CrayLink, Cray-MP, CrayPacs, CrayPat, CrayPort, Cray/REELlibrarian, CraySoft, CrayTutor, CRInform, CRI/TurboKiva, CSIM, CVT, Delivering the power…, Dgauss, Docview, EMDS, GigaRing, HEXAR, HSX, IOS, ISP/Superlink, LibSci, MPP Apprentice, ND Series Network Disk Array, Network Queuing Environment, Network Queuing Tools, OLNET, RapidArray, RQS, SEGLDR, SMARTE, SSD, SUPERLINK, System Maintenance and Remote Testing Environment, Trusted UNICOS, TurboKiva, UNICOS MAX, UNICOS/lc, and UNICOS/mp are trademarks of Cray Inc.
    [Show full text]
  • Forschungszentrum Jülich Gmbh High Speed Supercomputer Communications in Broadband Networks FZJ-ZAM-IB-9903
    Forschungszentrum Jülich GmbH Zentralinstitut für Angewandte Mathematik D-52425 Jülich, Tel. (02461) 61-6402 Interner Bericht High Speed Supercomputer Communications in Broadband Networks Helmut Grund*, Ferdinand Hommes*, Ralph Niederberger, Eva Pless* FZJ-ZAM-IB-9903 April 1999 (letzte Änderung: 20.04.99) (*) GMD – German National Research Center for Information Technology, Schloss Birlinghoven Preprint: To be published in: Proceedings of the TERENA-NORDUnet Networking Conference 1999 7-10 June 1999, Lund, Sweden II High Speed Supercomputer Communications in Broadband Networks Helmut Grund2, Ferdinand Hommes2 , Ralph Niederberger1, Eva Pless2 1 FZJ - Research Centre Jülich D - 52425 Jülich http://www.fz-juelich.de [email protected] 2 GMD – German National Research Center for Information Technology Schloss Birlinghoven, D-53754 Sankt Augustin http://www.gmd.de [email protected] , [email protected] , [email protected] Abstract High speed communication across wide area networks is currently of great interest in computer com- munications industry. It is needed wherever a huge amount of data will be created and made available to remote computer systems. Areas of investigation in computer science are the coupling of super- computers (metacomputing), high-resolution visualisation of data and distributed access to huge amounts of data. Within the Gigabit Testbed West (GTB-West) a high speed communication link has been installed between the research center Forschungszentrum Jülich GmbH (FZJ) and the National Research Center for Information (GMD) at Sankt Augustin. Amongst the research fields Metacomputing, Visualization and distributed usage of huge amount of data the installation, management and performance evaluation of the data communication links are main points of interest to the GTBW project.
    [Show full text]
  • CRAY T90 Series IEEE Floating Point Migration Issues and Solutions
    CRAY T90 Series IEEE Floating Point Migration Issues and Solutions Philip G. Garnatz, Cray Research, Inc., Eagan, Minnesota, U.S.A. ABSTRACT: Migration to the new CRAY T90 series with IEEE floating-point arith- metic presents a new challenge to applications programmers. More precision in the mantissa and less range in the exponent will likely raise some numerical differences issues. A step-by-step process will be presented to show how to isolate these numerical differences. Data files will need to be transferred and converted from a Cray format PVP system to and from a CRAY T90 series system with IEEE. New options to the assign command allow for transparent reading and writing of files from the other types of system. Introduction This paper is intended for the user services personnel and 114814 help desk staff who will be asked questions by programmers and exp mantissa users who will be moving code to the CRAY T90 series IEEE from another Cray PVP architecture. The issues and solutions Exponent sign presented here are intended to be a guide to the most frequent problems that programmers may encounter. Mantissa sign What is the numerical model? Figure2: Cray format floating-point frames and other computational entities such as workstations and New CRAY T90 series IEEE programmers might notice graphic displays. Other benefits are as follows: different answers when running on the IEEE machine than on a system with traditional Cray format floating-point arithmetic. • Greater precision. An IEEE floating-point number provides The IEEE format has the following characteristics: approximately 16 decimal digits of precision; this is about one and a half digits more precise than Cray format numbers.
    [Show full text]
  • Shared-Memory Vector Systems Compared
    Shared-Memory Vector Systems Compared Robert Bell, CSIRO and Guy Robinson, Arctic Region Supercomputing Center ABSTRACT: The NEC SX-5 and the Cray SV1 are the only shared-memory vector computers currently being marketed. This compares with at least five models a few years ago (J90, T90, SX-4, Fujitsu and Hitachi), with IBM, Digital, Convex, CDC and others having fallen by the wayside in the early 1990s. In this presentation, some comparisons will be made between the architecture of the survivors, and some performance comparisons will be given on benchmark and applications codes, and in areas not usually presented in comparisons, e.g. file systems, network performance, gzip speeds, compilation speeds, scalability and tools and libraries. KEYWORDS: SX-5, SV1, shared-memory, vector systems averaged 95% on the larger node, and 92% on the smaller 1. Introduction node. HPCCC The SXes are supported by data storage systems – The Bureau of Meteorology and CSIRO in Australia SAM-FS for the Bureau, and DMF on a Cray J90se for established the High Performance Computing and CSIRO. Communications Centre (HPCCC) in 1997, to provide ARSC larger computational facilities than either party could The mission of the Arctic Region Supercomputing acquire separately. The main initial system was an NEC Centre is to support high performance computational SX-4, which has now been replaced by two NEC SX-5s. research in science and engineering with an emphasis on The current system is a dual-node SX-5/24M with 224 high latitudes and the Arctic. ARSC provides high Gbyte of memory, and this will become an SX-5/32M performance computational, visualisation and data storage with 224 Gbyte in July 2001.
    [Show full text]
  • Appendix G Vector Processors in More Depth
    G.1 Introduction G-2 G.2 Vector Performance in More Depth G-2 G.3 Vector Memory Systems in More Depth G-9 G.4 Enhancing Vector Performance G-11 G.5 Effectiveness of Compiler Vectorization G-14 G.6 Putting It All Together: Performance of Vector Processors G-15 G.7 A Modern Vector Supercomputer: The Cray X1 G-21 G.8 Concluding Remarks G-25 G.9 Historical Perspective and References G-26 Exercises G-29 G Vector Processors in More Depth Revised by Krste Asanovic Massachusetts Institute of Technology I’m certainly not inventing vector processors. There are three kinds that I know of existing today. They are represented by the Illiac-IV, the (CDC) Star processor, and the TI (ASC) processor. Those three were all pioneering processors.…One of the problems of being a pioneer is you always make mistakes and I never, never want to be a pioneer. It’s always best to come second when you can look at the mistakes the pioneers made. Seymour Cray Public lecture at Lawrence Livermore Laboratorieson on the introduction of the Cray-1 (1976) G-2 ■ Appendix G Vector Processors in More Depth G.1 Introduction Chapter 4 introduces vector architectures and places Multimedia SIMD extensions and GPUs in proper context to vector architectures. In this appendix, we go into more detail on vector architectures, including more accurate performance models and descriptions of previous vector architectures. Figure G.1 shows the characteristics of some typical vector processors, including the size and count of the registers, the number and types of functional units, the number of load-store units, and the number of lanes.
    [Show full text]