A Parallel Processing Architecture for CAD of Integrated Circuits

Total Page:16

File Type:pdf, Size:1020Kb

A Parallel Processing Architecture for CAD of Integrated Circuits s.\21õ A Parallel Processing Architecture for CAD of Integrated Circuits Bruce A. Tonkin, B.E.(Hons.) Departrnent of Electrical ancl Electronic Engineering The University of Aclelaicle Aclelaicle, South Australia. September 1990 -¿.f û. v,=) a- * å.". Å \t 1,'l Contents Abstract x Declaration xi Acknorr-ledgements xii 1 Introduction 1 1.1 Computer Aided Design of Integrated Circuits 1 I.2 Synthesis Tooìs 6 I.2.1 Algorithms 6 n I.2J..I Heuristic Algorithms I 1.2.1.2 Simulated Annealing 8 1.2.1.3 Iinosledge-based Systems 10 1.2.2 Examples 10 1.2.2.1 Cellgenerators 10 7.2.2.2 Compactors 11 1.2.2.3 Routers 72 7.2.2.4 Placement tools 12 I.2.2.5 Silicon comPilers t2 1.3 Static Analysis Tools 13 1.3.1 Algoritìims 13 1.3.1.1 Layor.rt analysis algorithms . 13 1.3.1.2 Netu'ork analysis algorithms t4 1.3.1.3 Test generation algorithms . 15 1.3.1.4 Flielarchical analYsis t7 1.3.2 Examples t7 7.3.2.1 Cilcuit extractors 17 1.3.2.2 Netu'ork comParators. - 18 1.3.2.3 Design rule checkers 18 7.3.2.4 Electrical rule checkers 19 1.3.2.5 Tiningverifiers 19 1.3.2.6 Test generation tools 20 r.4 D¡-namic Analr-sis Tools 20 1.-l.1 Algorithrns 2I 1.4.1.i Direct nethods 2l I.4.I.2 Relaration methods 24 1.4.1.3 Er¡ent driven simulators 25 1.4.1.4 Device moclels 26 1.1.2 Examples 26 1.4.2.I Circuit simulators 27 I.4.2.2 Timing simulators 27 L4.2.3 Logic simulatols 28 L4.2.4 Fault simulation 29 1.4.2.5 Functional simula'tors 30 I.4.2.6 Behavioural simulators 31 1.4.2-7 i\{ixed ler-el simulators 31 1.4.2.8 Nlixed mode simulat'ors 31 1.5 Summary 32 2 Background óÐ 2.f Real Tirne Suìrsysten-r (RT'S) 36 2.1 .1 Introcluction 36 2.1.2 Real Tirne veÌsns General Purpose ComputeÌ Systems 38 It 2.I.2.L Real tirne computer systems 3E 2.I.2.2 Gener-al purpose compttter srl'stems 38 2.r.3 Real Tirne System Overview 4I 2.7.4 Conclusions 44 2.2 Parallel Circuit Extraction 48 2.2.1 Intr^oduction 48 2.2.2 Overview of Circuit Extlaction 50 2.2.3 Cìoalie 51 2.2.4 Strategy fol Parallel Circuit Extraction 55 2.2.5 Parallel Goalie 58 2.2.5.I General purpose multicomputer' 58 2.2.5.2 Serial disk accesses 61 2.2.5.3 Parallel disk accesses 61 2.2.5.4 Cilcuit paltitioning . ' . 64 2.2.5.5 Layout analYsis 65 2.2.5.6 l\'Ierging region numbers bC) 2.'2.6 Results 70 2.2.7 Conclusions 77 2.2.8 Acknowledgements 8r 3 Approaches to Accelerating Computer-Aided Design 82 3.1 Introcluction . 82 3.2 Strategies for High Performance 84 3.2.1 SISD . 85 3.2.1.1 RISC versus CISC öD 3.2.2 SIMD 86 3.2.2.7 Pt-ocessor alra}S 87 3.2.2.2 \/ector pÌocessors 8S 3.2.2.3 1\{ulti-oPeration . 89 3.2.3 l\{IlvID 91 iii :1.2.4 Snmmarr- o.) 3.3 C'haractelistics of À'IIN{D architectures ot 3.3.1 Circuit TechnologY 93 3.3.2 Nlultiprocessor 95 3.3.3 Nlulticomputer' 96 3.3.4 Grain Size 97 .1.3.5 T)'p" of Plocessing \ode 99 3.3.6 ]nterconnection \etrtork 100 3.4 ParalleÌ plocessing fol CAD 103 3.1.1 S1'nthesis Tools - Simulated Annealing 103 3..1.1.1 Summaly 111 3.+.2 Static Analysis Tools r12 3.1.2.1 Design rule checking IT2 3.1.2.2 Cilcuitertlaction 115 3.+.2.3 Testgeneration 115 3.4.2.4 Summary L).7 3.1.3 Dvnamic Analvsis Tools 119 3.1.3.1 Cilcuit simulation 1r9 3.1.3.2 Tiningsimulation r23 3.4.3.3 Logicsimulation 121 3.+.3.4 Faultsimulation 132 3.4.3.5 Sunmarr- 133 3.5 C'onclusions 134 4 Computer architecture for \rLSI CAD r37 4.1 Introduction . r37 1.2 \\:olkstations , 140 4.3 C'omputer \ode Architectule . t45 +.3.1 \,'LSI N4icroprocessor Technology . 145 1.3.2 Floating Point SuPPolt . 148 1V i.:J.3 \:irtual \lemor'¡' \f anagement ' 148 4.3.4 Local t\4e:-r-ror¡, ' 149 4.4 hlerconnection\etrvork . ' ' 150 -1-l.l Nectar' "'151 4.+.2 Netrvolk Topolog¡- " " 156 i.+.3 Scalable Cohelent Interconnect (SCI) ' ' I57 i.J.4 High Speecl Chanlel (HSC) standard ' ' 161 l.+.5 Fiber Dis:ributed Data Lìterface (FDDI) . ' ' 162 -1,1.6 InterconrrectionNetrvorkRequirements ' ' ' ' ' 163 4.5 Seconclar¡'storace ' ' ' ' 166 4.51 DiskArr¿,,-s ' " '167 !.i.2 Briclge Fì.e S1''ster:r " " 772 i.õ.3 Secondar-,-stolage Requirements " ' 173 4.6 Operatings)'st'en " '177 :.6.1 Arnoeba "'178 +.6.2 \lach ""181 :.6.3 OperatinlSl'stenRequirements- ' ' ' 183 4.7 PalallelPrograruning ' ' ' 187 :. t.I \lessage ..'er.sus SLared Variable Programming ParadigmslSS :-.t.2 Sirar.ecl r-¿.riables ia a Loosely Coupled Environment . 190 :.7.3 Lincla "'i90 :.1.4 Distlibutecl Share,l \-irtual ìvlemoly ' ' ' 192 :.7.5 Shaled Data Obiet-t \{odel ' ' ' ' 195 +.7.6 Computer-aidedPlogrammittg. " "198 1.7.6.1 cAPtrR .....199 1. r.6.2 H,r'pertool " " 200 :.t.7 Proglanr:ning Retiuirements ' " 201 5 Conclttsions 206 A Real Time Subsystem Architecture 2ll 4.1 Comnercial Alternatives . .211 4.2 Specification for the Real Time Subsystem . .214 4.3 \rchitecture of the Real Time Subsystem . ...218 -\.3.1 VMEbus BackPlane . .2r9 -\.3.2 Central Processing Unit . .220 -\.3.3 On-board Software . .223 -\.3.4 A/D Conver-ter Module . .223 -\.3.5 D/A Converter Module ' . .225 -\.3.6 Timer/Parallel Interface Module . .227 -\.3.7 Data Link . .227 -\.3.8 Softu'ale Suppolt ...231 -\.3.9 Surlma.-y ...231 .'232 -\.3. 1 0 Àcknotvledgements . Bibliography 233 VI List of Figures 1.1 Design Hierarchy 2 7.2 Design Representation Hieralchy J 1.3 Icleal Design Cycle b 1..4 Simulated Annealing Algorithm I 1.5 Direct Nlethod Algorithm 23 2.7 System Overvierv 42 2.2 Real Time Subs¡'stem (RTS) Architecture 43 2.3 Example of a region melge 54 2.4 P altitioning strategies bb 2.5 HPC Architectule 59 2.6 Programmer's tr'Iodel of the HPC 60 2.7 Programming N'Iodel used in Parallel Goalie 63 2.8 Example of mer-ging acloss slice boundaries br 2.9 Example of mer-ging across slice boundaries (ctd.) 68 2.70 Performance results for the detection and numbering of tran- sistor regions 70 2.1r Partitioning for circuit 3 rvith 12 processors T3 2.r2 Load variation for circuit 3 rvith 12 processors t.t 2.13 Number of scanline stops lbr circuit 3 rvith 12 processors 74 2.1.4 Partitioning for cilcuit 2 rvith 12 processors 75 2.15 Loacl valiation for circuit 2 l'ith 12 processors 75 2.16 Nurnbel of scanline stops lbl cilcuit 2 rvith 12 processols 76 vll 4.1. Systenl Ar-chi tecture . 138 4.2 Leoparcl multiprocessor wolkstation [Ashenden et al., 1989] . r42 4.3 Intel iE60 microplocessor layout fl{ohn & Margulis, 19Eg] . r47 4.4 Single HUB Cluster of Nectar netrvork [Arnould et al., i989] . 155 4.5 SCI applications . r59 4.6 SCI Tlansaction phases [IEBE P1596, 1990] . 160 4.7 HSC Interface signals [Chlamtac & Franta, 1990] . t62 4.8 Interconnection system . 164 4.9 Disk arlay . 168 4.10 Intelleavecl parity sectors [I(atz et al., 19E9a] . 170 4.11 Secondary storage architecture . 176 4.1 \,IC500 Vlinicomputer Architecture [lvlasscomp, ] 2r2 4.2 System Ovelview 215 4.3 Real Time Subsystern (RTS) Architecture 219 4.4 Data Link structur-e 229 vlu List of Tables 2.1 Charactelistics of the cir-cuits in Figure 2.10 7l 2.2 Times fol the slorvest nocle out of 12 nodes 72 IX Abstract Designers use computer-aided-design (CAD) software to check that an integrated circuit rvill function correctly and meet performance specifications. The software must produce results cluickly enough for a designer to interactively evaluate many design iterations before fabricating the circuit. For large circuits, present rvork- stations are not fast enough to run the softrvare at the rate needed by designers. This has lead to research into parallel processing techniclues to reduce the running time of C-\D softrvare. The goal of this thesis is to describe a general purpose parallel computing systern that shoulcl be capable of supporting the full range of design tools, and offel a substantial speed up to the design verification step in the design cycle. The thesis begins by discussing the computational characteristics of modern CAI) software. It then discusses initial ideas for th.e hardware and so{tware attributes of a multiple plocessor computer system suitable for running CAD software. These ideas derive frorn the author's expeLiences rvith the design and construction of a high speed processing system, rvhich can accluire and generate analog signals in real time. and the implementation of a parallel circuit extraction algorithm on a multiconpLrter. The thesis goes on to revierv different architectural approaches to designing faster computers. before recommending a \,Iultiple Instruction stream / Multiple Data strea,m (\II\ID) architecture because it offers the flexibility required by CAD algo- rithms. It then analyses recent research into parallel CAD algorithms for running on \,IilVID architectures.
Recommended publications
  • Overview of the SPEC Benchmarks
    9 Overview of the SPEC Benchmarks Kaivalya M. Dixit IBM Corporation “The reputation of current benchmarketing claims regarding system performance is on par with the promises made by politicians during elections.” Standard Performance Evaluation Corporation (SPEC) was founded in October, 1988, by Apollo, Hewlett-Packard,MIPS Computer Systems and SUN Microsystems in cooperation with E. E. Times. SPEC is a nonprofit consortium of 22 major computer vendors whose common goals are “to provide the industry with a realistic yardstick to measure the performance of advanced computer systems” and to educate consumers about the performance of vendors’ products. SPEC creates, maintains, distributes, and endorses a standardized set of application-oriented programs to be used as benchmarks. 489 490 CHAPTER 9 Overview of the SPEC Benchmarks 9.1 Historical Perspective Traditional benchmarks have failed to characterize the system performance of modern computer systems. Some of those benchmarks measure component-level performance, and some of the measurements are routinely published as system performance. Historically, vendors have characterized the performances of their systems in a variety of confusing metrics. In part, the confusion is due to a lack of credible performance information, agreement, and leadership among competing vendors. Many vendors characterize system performance in millions of instructions per second (MIPS) and millions of floating-point operations per second (MFLOPS). All instructions, however, are not equal. Since CISC machine instructions usually accomplish a lot more than those of RISC machines, comparing the instructions of a CISC machine and a RISC machine is similar to comparing Latin and Greek. 9.1.1 Simple CPU Benchmarks Truth in benchmarking is an oxymoron because vendors use benchmarks for marketing purposes.
    [Show full text]
  • Computer Architectures an Overview
    Computer Architectures An Overview PDF generated using the open source mwlib toolkit. See http://code.pediapress.com/ for more information. PDF generated at: Sat, 25 Feb 2012 22:35:32 UTC Contents Articles Microarchitecture 1 x86 7 PowerPC 23 IBM POWER 33 MIPS architecture 39 SPARC 57 ARM architecture 65 DEC Alpha 80 AlphaStation 92 AlphaServer 95 Very long instruction word 103 Instruction-level parallelism 107 Explicitly parallel instruction computing 108 References Article Sources and Contributors 111 Image Sources, Licenses and Contributors 113 Article Licenses License 114 Microarchitecture 1 Microarchitecture In computer engineering, microarchitecture (sometimes abbreviated to µarch or uarch), also called computer organization, is the way a given instruction set architecture (ISA) is implemented on a processor. A given ISA may be implemented with different microarchitectures.[1] Implementations might vary due to different goals of a given design or due to shifts in technology.[2] Computer architecture is the combination of microarchitecture and instruction set design. Relation to instruction set architecture The ISA is roughly the same as the programming model of a processor as seen by an assembly language programmer or compiler writer. The ISA includes the execution model, processor registers, address and data formats among other things. The Intel Core microarchitecture microarchitecture includes the constituent parts of the processor and how these interconnect and interoperate to implement the ISA. The microarchitecture of a machine is usually represented as (more or less detailed) diagrams that describe the interconnections of the various microarchitectural elements of the machine, which may be everything from single gates and registers, to complete arithmetic logic units (ALU)s and even larger elements.
    [Show full text]
  • R00456--FM Getting up to Speed
    GETTING UP TO SPEED THE FUTURE OF SUPERCOMPUTING Susan L. Graham, Marc Snir, and Cynthia A. Patterson, Editors Committee on the Future of Supercomputing Computer Science and Telecommunications Board Division on Engineering and Physical Sciences THE NATIONAL ACADEMIES PRESS Washington, D.C. www.nap.edu THE NATIONAL ACADEMIES PRESS 500 Fifth Street, N.W. Washington, DC 20001 NOTICE: The project that is the subject of this report was approved by the Gov- erning Board of the National Research Council, whose members are drawn from the councils of the National Academy of Sciences, the National Academy of Engi- neering, and the Institute of Medicine. The members of the committee responsible for the report were chosen for their special competences and with regard for ap- propriate balance. Support for this project was provided by the Department of Energy under Spon- sor Award No. DE-AT01-03NA00106. Any opinions, findings, conclusions, or recommendations expressed in this publication are those of the authors and do not necessarily reflect the views of the organizations that provided support for the project. International Standard Book Number 0-309-09502-6 (Book) International Standard Book Number 0-309-54679-6 (PDF) Library of Congress Catalog Card Number 2004118086 Cover designed by Jennifer Bishop. Cover images (clockwise from top right, front to back) 1. Exploding star. Scientific Discovery through Advanced Computing (SciDAC) Center for Supernova Research, U.S. Department of Energy, Office of Science. 2. Hurricane Frances, September 5, 2004, taken by GOES-12 satellite, 1 km visible imagery. U.S. National Oceanographic and Atmospheric Administration. 3. Large-eddy simulation of a Rayleigh-Taylor instability run on the Lawrence Livermore National Laboratory MCR Linux cluster in July 2003.
    [Show full text]
  • Solaris X Window System Reference Manual
    Solaris X Window System Reference Manual Sun Microsystems, Inc. 2550 Garcia Avenue Mountain View, CA 94043 U.S.A. 1995 Sun Microsystems, Inc., 2550 Garcia Avenue, Mountain View, California 94043-1100 USA. All rights reserved. This product or document is protected by copyright and distributed under licenses restricting its use, copying, distribution and decompilation. No part of this product or document may be reproduced in any form by any means without prior written authorization of Sun and its licensors, if any. Portions of this product may be derived from the UNIX system, licensed from UNIX System Laboratories, Inc., a wholly owned subsidiary of Novell, Inc., and from the Berkeley 4.3 BSD system, licensed from the University of California. Third- party software, including font technology in this product, is protected by copyright and licensed from Sun's Suppliers. RESTRICTED RIGHTS LEGEND: Use, duplication, or disclosure by the government is subject to restrictions as set forth in subparagraph (c)(1)(ii) of the Rights in Technical Data and Computer Software clause at DFARS 252.227-7013 and FAR 52.227-19. The product described in this manual may be protected by one or more U.S. patents, foreign patents, or pending applications. TRADEMARKS Sun, Sun Microsystems, the Sun logo, SunSoft, the SunSoft logo, Solaris, SunOS, OpenWindows, DeskSet, ONC, ONC+, and NFS are trademarks or registered trademarks of Sun Microsystems, Inc. in the United States and other countries. UNIX is a registered trademark in the United States and other countries, exclusively licensed through X/Open Company, Ltd. OPEN LOOK is a registered trademark of Novell, Inc.
    [Show full text]
  • Icase Interim Report 6
    NASA Contractor Report 181764 ICASE INTERIM REPORT 6 A BIBLIOGRAPHY ON PARALLEL AND VECTOR NUMERICAL ALGORITHMS James M. Ortega, Robert G. Voigt, and Charles H. Romine - NASA Contract No. NAS1-18107, NASI-18605 December 1988 (EASA-CR- 181764) A BXELIOGEIFHY 08 PARALLEL N89-163Sb A10 VLCICG IUBBBICAL ALGORIILt5 Iinal Report OASA) 90 CSCL 0913 Unclar G3/61 0169713 INSTITUTE FOR COMPUTER APPLICATIONS IN SCIENCE AND ENGINEERING NASA Langley Research Center, Hampton, Virginia 23665 Operated by the Universities Space Research Association National Aeronautics and Space Ad ministration Langley Research Center Hampton, Virginia 23665-5225 ICASE INTERIM REPORTS ICASE has introduced a new report series to be called ICASE lntcrim Reports. The series will complement the more familiar blue ICASE reports that have becn distributed for many years. The blue reports are intended as preprints of research that has been submitted for publication in either rcferccd journals or conference proceedings. In general, the green Interim Report will not be submit- ted for publication, at least not in its printed form. It will be used for research that has reached a certain level of maturity but needs additional refinement, for technical reviews or position statements, for bibliographies, and for computer software. The Interim Reports will receive the same distribution as the ICASE Reports. They will be available upon request in the future, and they may be referenced in other publications. Robert G. Voigt Director i A BIBLIOGRAPHY ON PARALLEL AND VECTOR NUMERICAL ALGORITHMS JAMES M. ORTEGA', ROBERT G. VOIGTt AND CHARLES H. ROlllINEt Since parallel and vector computation is expanding rapidly, we hope that the refer- ences we have collected over the years will be of some value to researchers entering the field.
    [Show full text]
  • Openwindows Reference Manual
    OpenWindows Reference Manual Sun Microsystems, Inc. 2550 Garcia Avenue Mountain View, CA 94043 U.S.A. 1994 Sun Microsystems, Inc. All rights reserved. 2550 Garcia Avenue, Mountain View, California 94043-1100 U.S.A. This product and related documentation are protected by copyright and distributed under licenses restricting its use, copying, distribution, and decompilation. No part of this product or related documentation may be reproduced in any form by any means without prior written authorization of Sun and its licensors, if any. Portions of this product may be derived from the UNIX and Berkeley 4.3 BSD systems, licensed from UNIX Systems Laboratories, Inc., a wholly owned subsidiary of Novell, Inc., and the University of California, respectively. Third-party software, including font technology, in this product is protected by copyright and licensed from Sun's Suppliers. RESTRICTED RIGHTS LEGEND: Use, duplication, or disclosure by the government is subject to restrictions as set forth in subparagraph (c)(1)(ii) of the Rights in Technical Data and Computer Software clause at DFARS 252.227-7013 and FAR 52.227-19. This product or the products described herein may be protected by one or more U.S., foreign patents, or pending applications. TRADEMARKS Sun, Sun Microsystems, the Sun Logo, SunSoft, Sun Microsystems Computer Corporation and Solaris, are trademarks or registered trademarks of Sun Microsystems, Inc. in the U.S. and certain other countries. UNIX is a registered trademark of Novell, Inc., in the United States and other countries; X/Open Company, Ltd., is the exclusive licensor of such trademark. OPEN LOOK is a registered trademark of Novell, Inc.
    [Show full text]
  • Japanese Semiconductor Industry Service
    Japanese Semiconductor Industry Service Volume II Technology & Government Dataquest nn a company of The Dun & Bradstreet Corporation 1290 Ridder Park Drive San Jose, California 95131-2398 (408) 437-8000 Telex: 171973 Fax: (408) 437-0292 Sales/Service offices: UNITED KINGDOM GERMANY Dataquest UK Limited Dataquest GmbH 13th Floor, Centrepoint Rosenkavalierplatz 17 103 New Oxford Street D-8000 Munich 81 London WCIA IDD West Germany England (089)91 10 64 01-379-6257 Telex: 5218070 Telex: 266195 Fax: (089)91 21 89 Fax: 01-240-3653 FRANCE JAPAN Dataquest SARL Dataquest Japan, Ltd. 100, avenue Charles de Gaulle Taiyo Ginza Building/2nd Floor 92200 Neuilly-sur-Seine 7-14-16 Ginza, Chuo-ku France Tokyo 104 Japan (01)4738.13.12 (03)546-3191 Telex: 611982 Telex: 32768 Fax: (01)4738.11.23 Fax: (03)546-3198 The content of this report represents our interpretation and analysis of information generally available to the public or released by responsible individuals in the subject com­ panies, but is not guaranteed as to accuracy or completeness. It does not contain material provided to us in confidence by our clients. This information is not furnished in connection with a sale or offer to sell securities, or in connection with the solicitation of an offer to buy securities. This firm and its par­ ent and/or their officers, stockholders, or members of their families may, from time to time, have a long or short position in the securities mentioned and may sell or buy such securities. Printed in the United States of America. All rights reserved.
    [Show full text]
  • The Spare ™ Architecture Manual Version 7
    The SPARe™ Architecture Manual Version 7 SOLBOURNE COMPUTER, Inc. 2190 Miller Drive Longmont, Colorado 80501 303 772·3400 For Solboume Support Call: 1·800-447·2861 The X Window System is a trademark of MIT. Sun Mlcrosystems and Sun Workstation are registered trademarks of Sun Microsystems, Inc. Sun-3, Sun-4, SPARe, and DVMA are trademarks of Sun Microsystems, Inc. Part Number: 101482-00 September 1988 Reprinted by permission. Copyright 1988 by Solbourne Computer, Inc. All rights reserved. No part of this publication may be reproduced, stored in any media or in any type of retrieval system, transmitted in any form (e.g., electronic, mechanical, photocopying, recording) or translated into any language or computer language without the prior written permission of Solbourne Computer, Inc., 2190 Miller Drive, Longmont, Colorado 80501. There is no right to reverse engineer, decompile, or disassemble the information contained herein or in the accompanying software. Solbourne Computer, Inc. reserves the right to revise this publication and to make changes from time to time without obligation to notify any person of such reviSions or changes. TABLE OF CONTENTS SECTION 1: INTRODUCTION •••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••• 1-1 1.1 Introduction 1-1 1.2 Architecture and Implementation ••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••• 1-1 1.3 Features •••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••• 1-1 1.4 Using This Manual •••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••
    [Show full text]
  • Towards the Teraflop in CFD
    NASA-CR-197947 Research Institute for Advanced Computer Science NASA Ames Research Center Towards the Teraflop in CFD Robert Schreiber and Horst D. Simon (NASA-CR-197947) TOWAROS THE N95-235_4 TERAFLOP CFO (Research Inst. for Advanced Computer Science) 35 p Unclas G3/62 0043880 RIACS Technical Report TR 92.12 May 1, 1992 To appear in Parallel CFD - Implementations and Results Using Parallel Computers, Horst D. Simon editor, MIT Press, 1992. Towards the Teraflop in CFD Robert Schreiber and Horst D. Simon The Research Institute of Advanced Computer Science is operated by Universities Space Research Association, The American City Building, Suite 311, Columbia, MD 244, (301)730-2656 Work reported herein was supported in part by the NAS Systems Division of NASA via Cooperative Agree- ment NCC 2-387 between NASA and the University Space Research Association (USRA). Work was per- formed at the Research Institute for Advanced Computer Science (RIACS), NASA Ames Research Center, Moffett Field, CA 94035. Towards the Teraflop in CFD Robert Schreiber 1 and Horst D. Simon 2 Mail Stop T045-1 NASA Ames Research Center Moffett Field, CA 94035 May 1, 1992 Abstract We are surveying current projects in the area of parallel super- computers. The machines considered here will become commercially available in the 1990 - 1992 time frame. All are suitable for exploring the critical issues in applying parallel processors to large scale scientific computations, in particular CFD calculations. This chapter presents an overview of the surveyed machines, and a detailed analysis of the various architectural and technology approaches taken. Particular em- phasis is placed on the feasibility of a Teraflops capability following the paths proposed by various developers.
    [Show full text]
  • Icase Interim Report 3
    NASA Contractor Report 178335 ICASE INTERIM REPORT 3 A BIBLIOGRAPHY ON PARALLEL AND VECTOR NUMERICAL ALGORIlTMS J. M. Ortega andR. G. Voigt NASA Contract No. NAS1-18107 July 1987 (kASA<€i-178335) A BIELICGfiAEBP Gbl PABBLLEL ~a7-2t570 ABD VECTOR NUMERICAL AIGCIRJlhLS Final Report (&ASA) 125 p Avail: RTIS EIC AC6/WP A01 CSCL 09B Unclas ~3161 ooa76zi INSTITUTE FOR COMPUTER APPLICATIONS IN SCIENCE AND ENGINEERING NASA Langley Research Center, Hampton, Virginia 23665 Operated by the Universities Space Research Association National Aeronautics and Space Administration ~ROS-tCtlC6llta Hampton,Virginla 23665 ICASE INTERIM REPORTS ICASE has introduced a new report series to be called ICASE Interim Rcports. The series will complement the more familiar blue ICASE reports that have bccn distributed for many years. The blue reports are intended as preprints of research that has been submitted for publication in either refereed journals or conference proceedings. In general, the green Interim Report will not be submit- ted for publication, at least not in its printed form. It will be used for research that has reached a certain level of maturity but needs additional refincment, for technical reviews or position statements, for bibliographies, and for computer software. The Interim Reports will receive the same distribution as the ICASE Reports. They will be available upon request in the future, and they may bc referenced in other publications. Robcrt G. Voigt Di rcc t or i A Bibliography on Parallel and Vector Numerical Algorithms Since parallel and vector computation is expanding rapidly, we hope that the references we have collected over the years will be of some value to researchers entering the field.
    [Show full text]
  • The First Vector Computers
    Lawrence Berkeley National Laboratory Recent Work Title 20 Years Supercomputer Market Analysis Permalink https://escholarship.org/uc/item/4gh5g1cw Author Strohmaier, Erich Publication Date 2005-05-28 eScholarship.org Powered by the California Digital Library University of California 20 Years Supercomputer Market Analysis Erich Strohmaier, Future Technology Group, Lawrence Berkeley National Laboratory, 50A1182, Berkeley, CA 94720; e-mail: [email protected] May 2005 Abstract Since the very beginning of the International Supercomputer Conference (ISC) series, one important focus has been the analysis of the supercomputer market. For 20 years, statistics about this marketplace have been published at ISC. Initially these were based on simple market surveys and since 1993, they are based on the TOP500 project, which has become the accepted standard for such data. We take the occasion of the 20th anniversary of ISC to combine and extend several previously published articles based on these data. We analyze our methodologies for collecting our statistics and illustrate the major developments, trends and changes in the High Performance Computing (HPC) marketplace and the supercomputer market since the introduction of the Cray 1 system. The introduction of vector computers started the area of modern ‘Supercomputing’. The initial success of vector computers in the seventies and early eighties was driven by raw performance. In the second half of the eighties, the availability of standard development environments and of application software packages became more important. Next to performance, these criteria determined the success of MP vector systems especially at industrial customers. Massive Parallel Systems (MPP) became successful in the early nineties due to their better price/performance ratios, which was enabled by the attack of the ‘killer-micros’.
    [Show full text]
  • The SPARC Architecture Manual Version 8
    The SPARC Architecture Manual Version 8 Revision SAV080SI9308 SPARC International Inc. 535 Middlefield Road, Suite 210 Menlo Park, CA 94025 415-321-8692 SPARC International, Inc. SPARC is a registered trademark of SPARC International, Inc. The SPARC logo is a registered trademark of SPARC International, Inc. UNIX and OPEN LOOK are registered trademarks of UNIX System Labora- tories, Inc. Copyright 1991,1992 SPARC International, Inc. − Printed in U.S.A. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without the prior permission of the copyright owners. Restricted rights legend: use, duplication, or disclosure by the U.S. government is subject to restrictions set forth in subparagraph (c)(1)(ii) of the Rights in Technical Data and Computer Software clause at DFARS 52.227-7013 and in similar clauses in the FAR and NASA FAR Supplement. The SPARC Architecture Manual Version 8 Revision SAV080SI9308 1 Introduction This document specifies Version 8 of the Scalable Processor ARChitecture, or SPARC. 1.1. SPARC Attributes SPARC is a CPU instruction set architecture (ISA), derived from a reduced instruction set computer (RISC) lineage. As an architecture, SPARC allows for a spectrum of chip and system implementations at a variety of price/performance points for a range of applications, including scientific/engineering, programming, real-time, and commercial. Design Goals SPARC was designed as a target for optimizing compilers and easily pipelined hardware implementations. SPARC implementations provide exceptionally high execution rates and short time-to-market development schedules.
    [Show full text]