TABLE DRIVEN ADAPTIVE, EFFECTIVELY HETEROGENEOUS MULTI-CORE ARCHITECTURE by SURPRIYA TIKE Bachelor of Science in Electronics

Total Page:16

File Type:pdf, Size:1020Kb

TABLE DRIVEN ADAPTIVE, EFFECTIVELY HETEROGENEOUS MULTI-CORE ARCHITECTURE by SURPRIYA TIKE Bachelor of Science in Electronics TABLE DRIVEN ADAPTIVE, EFFECTIVELY HETEROGENEOUS MULTI-CORE ARCHITECTURE By SURPRIYA TIKE Bachelor of Science in Electronics and Telecommunication Pune University Pune, India 2007 Submitted to the Faculty of the Graduate College of the Oklahoma State University in partial fulfillment of the requirements for the Degree of MASTER OF SCIENCE December, 2011 TABLE DRIVEN ADAPTIVE, EFFECTIVELY HETEROGENEOUS MULTI-CORE ARCHITECTURE Thesis Approved: Dr. James E. Stine Jr. Thesis Adviser Dr. Chris Hutchens Dr. Louis G. Johnson Dr. Sheryl A. Tucker Dean of the Graduate College ii TABLE OF CONTENTS Chapter Page 1. INTRODUCTION .............................................................................................................. 1 2. BACKGROUND ................................................................................................................ 1 2.1. Processor based platforms ........................................................................................... 1 2.1.1. OpenSPARC ..................................................................................................... 1 2.1.2. SimpleScalar .................................................................................................... 2 2.1.3. SPIM ................................................................................................................ 2 2.1.4. Simics ............................................................................................................... 2 2.2. Other Related Work .................................................................................................... 3 2.3. Proposed Solution ....................................................................................................... 4 3. IMPLEMENTATION ......................................................................................................... 6 3.1. Architecture Top Level description - .......................................................................... 6 3.2. Performance Counters ............................................................................................... 10 3.3. Single-threaded core ................................................................................................. 10 3.3.1. Datapath ........................................................................................................ 11 3.3.2. Hazard Unit ................................................................................................... 21 3.3.3. Coprocessor0 ................................................................................................. 21 3.4. L2 Data and Instruction Cache .................................................................................. 21 3.5. Floating Point Units .................................................................................................. 22 3.6. Arbiter Logic ............................................................................................................. 25 3.7. Asynchronous FIFO .................................................................................................. 27 3.8. Data Coherency ......................................................................................................... 29 3.8.1. Snooping ........................................................................................................ 29 3.8.2. Directory-based ............................................................................................. 30 3.8.3. Snarfing .......................................................................................................... 30 4. TOOL FLOW AND DESIGN METHODOLOGY .......................................................... 34 iii 4.1. ModelSim .................................................................................................................. 34 4.1.1. Unit testing of module .................................................................................... 37 4.1.2. Benchmark Testing ......................................................................................... 37 4.1.3. Challenge with Benchmark Testing ............................................................... 37 4.1.4. Solution .......................................................................................................... 38 4.1.5. Script and Setup ............................................................................................. 38 4.2. Design Compiler ....................................................................................................... 38 4.2.1. Design Compiler flow .................................................................................... 38 4.2.2. Design Compiler file formats ......................................................................... 39 4.2.3. Script and Setup ............................................................................................. 41 4.3. Formal Verification ................................................................................................... 41 4.3.1. Need for Formal Verification ......................................................................... 42 4.3.2. Script and Setup ............................................................................................. 43 4.4. IC Compiler .............................................................................................................. 43 4.4.1. IC Compiler flow ............................................................................................ 44 4.4.2. Script and Setup ............................................................................................. 56 5. CONCLUSION ................................................................................................................. 57 REFERENCES ........................................................................................................................ 59 APPENDICES ......................................................................................................................... 62 iv LIST OF TABLES Table Page Table 1 - Implemented Instruction Set ........................................................................................... 12 Table 2 - Arbiter Bus - [Level1_Icache - Arbiter - Level2_Icache] .............................................. 25 Table 3 - Arbiter Bus - [Level1_Dcache - Arbiter - Level2_Dcache] ........................................... 25 Table 4 - Arbiter Bus - [Core_CX - Arbiter - Floating Point Unit] ............................................... 25 v LIST OF FIGURES Figure Page Figure 1 - Top Level Architecture ................................................................................................... 7 Figure 2 - 5 Stage pipelined datapath............................................................................................... 9 Figure 3 - Register File Set ............................................................................................................ 13 Figure 4 - Integer Radix16 Divider (fast) ...................................................................................... 14 Figure 5 - Integer Radix4 Divider (slow)....................................................................................... 15 Figure 6 - Level1 Dcache architecture flexibility .......................................................................... 16 Figure 7 - Memory Subsystem adapted into different architectures .............................................. 17 Figure 8 - Memory moving from lower to higher set associativity ............................................... 18 Figure 9 - Memory moving from higher to lower set associativity – encounters invalid entries .. 19 Figure 10 - multi_match penalty evaluation .................................................................................. 20 Figure 11 - Pipelined Floating Point Unit ...................................................................................... 23 Figure 12 - Arbiter Logic ............................................................................................................... 26 Figure 13 - Asynchronous FIFO with Gray code read/write pointers ........................................... 27 Figure 14 - Snooping based cache coherency ................................................................................ 28 Figure 15 - Directory Structure ...................................................................................................... 31 Figure 16 - Tool Flow Methodology.............................................................................................. 35 Figure 17 - ModelSim Simulation of RTL..................................................................................... 36 Figure 18 - Design Vision showing the synthesized memory stage from datapath ....................... 40 Figure 19 - Formality – Successful Verification with post-synthesis netlist for Core1 ................. 42 Figure 20 - Initialize floorplan for complete design ...................................................................... 45 vi Figure 21 – Creating plan-groups for hierarchical sub-modules ................................................... 45 Figure 22 – Placement of plan-groups along with hard macros..................................................... 46 Figure 23 - Committing final plan-groups ..................................................................................... 46 Figure 24 - Plan-group after placement ......................................................................................... 47 Figure
Recommended publications
  • Day 2, 1640: Leveraging Opensparc
    Leveraging OpenSPARC ESA Round Table 2006 on Next Generation Microprocessors for Space Applications G.Furano, L.Messina – TEC-EDD OpenSPARC T1 • The T1 is a new-from-the-ground-up SPARC microprocessor implementation that conforms to the UltraSPARC architecture 2005 specification and executes the full SPARC V9 instruction set. Sun has produced two previous multicore processors: UltraSPARC IV and UltraSPARC IV+, but UltraSPARC T1 is its first microprocessor that is both multicore and multithreaded. • The processor is available with 4, 6 or 8 CPU cores, each core able to handle four threads. Thus the processor is capable of processing up to 32 threads concurrently. • Designed to lower the energy consumption of server computers, the 8-cores CPU uses typically 72 W of power at 1.2 GHz. G.Furano, L.Messina – TEC-EDD 72W … 1.2 GHz … 90nm … • Is a cutting edge design, targeted for high-end servers. • NOT FOR SPACE USE • But, let’s see which are the potential spin-in … G.Furano, L.Messina – TEC-EDD Why OPEN ? On March 21, 2006, Sun made the UltraSPARC T1 processor design available under the GNU General Public License. The published information includes: • Verilog source code of the UltraSPARC T1 design, including verification suite and simulation models • ISA specification (UltraSPARC Architecture 2005) • The Solaris 10 OS simulation images • Diagnostics tests for OpenSPARC T1 • Scripts, open source and Sun internal tools needed to simulate the design and to do synthesis of the design • Scripts and documentation to help with FPGA implementation
    [Show full text]
  • Sparc Enterprise T5440 Server Architecture
    SPARC ENTERPRISE T5440 SERVER ARCHITECTURE Unleashing UltraSPARC T2 Plus Processors with Innovative Multi-core Multi-thread Technology White Paper July 2009 TABLE OF CONTENTS THE ULTRASPARC T2 PLUS PROCESSOR 0 THE WORLD'S FIRST MASSIVELY THREADED SYSTEM ON A CHIP (SOC) 0 TAKING CHIP MULTITHREADED DESIGN TO THE NEXT LEVEL 1 ULTRASPARC T2 PLUS PROCESSOR ARCHITECTURE 3 SERVER ARCHITECTURE 8 SYSTEM-LEVEL ARCHITECTURE 8 CHASSIS DESIGN INNOVATIONS 13 ENTERPRISE-CLASS MANAGEMENT AND SOFTWARE 19 SYSTEM MANAGEMENT TECHNOLOGY 19 SCALABILITY AND SUPPORT FOR INNOVATIVE MULTITHREADING TECHNOLOGY21 CONCLUSION 28 0 The UltraSPARC T2 Plus Processors Chapter 1 The UltraSPARC T2 Plus Processors The UltraSPARC T2 and UltraSPARC T2 Plus processors are the industry’s first system on a chip (SoC), supplying the most cores and threads of any general-purpose processor available, and integrating all key system functions. The World's First Massively Threaded System on a Chip (SoC) The UltraSPARC T2 Plus processor eliminates the need for expensive custom hardware and software development by integrating computing, security, and I/O on to a single chip. Binary compatible with earlier UltraSPARC processors, no other processor delivers so much performance in so little space and with such small power requirements letting organizations rapidly scale the delivery of new network services with maximum efficiency and predictability. The UltraSPARC T2 Plus processor is shown in Figure 1. Figure 1. The UltraSPARC T2 Plus processor with CoolThreads technology 1 The UltraSPARC
    [Show full text]
  • SPIM MIPS Simulator
    SPIM MIPS Simulator SPIM A MIPS32 Simulator James Larus [email protected] Microsoft Research Formerly: Professor, Computer Sciences Department, University of Wisconsin-Madison spim is a self-contained simulator that will run MIPS32 assembly language programs. It reads and executes assembly language programs written for this processor. spim also provides a simple debugger and minimal set of operating system services. spim does not execute binary (compiled) programs. spim implements almost the entire MIPS32 assembler-extended instruction set. (It omits most floating point comparisons and rounding modes and the memory system page tables.) The MIPS architecture has several variants that differ in various ways (e.g., the MIPS64 architecture supports 64-bit integers and addresses), which means that spim will not run programs compiled for all types of MIPS processors. MIPS compilers also generate a number of assembler directives that spim cannot process. These directives usually can be safely deleted. Earlier versions of spim (before 7.0) implemented the MIPS-I instruction set used on the MIPS R2000/R3000 computers. This architecture is obsolete (though, has never been surpassed for its simplicity and elegance). spim now supports the more modern MIPS32 architecture, which is the MIPS-I instruction set augmented with a large number of occasionally useful instructions. MIPS code from earlier versions of SPIM should run without changes, except code that handles exceptions and interrupts. This part of the architecture changed over time (and was poorly implemented in earlier versions of spim). Code of this sort need to be updated. Examples of the new code are in exceptions.s and Tests/tt.io.s.
    [Show full text]
  • Opensparc – an Open Platform for Hardware Reliability Experimentation
    OpenSPARC – An Open Platform for Hardware Reliability Experimentation Ishwar Parulkar and Alan Wood Sun Microsystems, Inc. James C. Hoe and Babak Falsafi Carnegie Mellon University Sarita V. Adve and Josep Torrellas University of Illinois at Urbana- Champaign Subhasish Mitra Stanford University IEEE SELSE 4 - March 26, 2008 www.OpenSPARC.net Outline 1.Chip Multi-threading (CMT) 2.OpenSPARC T2 and T1 processors 3.Reliability in OpenSPARC processors 4.What is available in OpenSPARC 5.Current university research using OpenSPARC 6.Future research directions IEEE SELSE 4 – March 26, 2008 2 www.OpenSPARC.net World's First 64-bit Open Source Microprocessor OpenSPARC.net Governed by GPLv2 Complete processor architecture & implementation Register Transfer Level (RTL) Hypervisor API Verification suite and architectural models Simulation model for operating system bringup on s/w IEEE SELSE 4 – March 26, 2008 3 www.OpenSPARC.net Chip Multithreading (CMT) Instruction- Low Low Low Medium Low High level Parallelism Thread-level Parallelism High High High High High Instruction/Data Large Large Medium Large Large Working Set Data Sharing Low Medium High Medium High Medium IEEE SELSE 4 – March 26, 2008 4 www.OpenSPARC.net Memory Bottleneck Relative Performance 10000 CPU Frequency DRAM Speeds 1000 2 Years 100 Every Gap 2x -- CPU 6 10 -- 2x Every DRAM Years 1 1980 1985 1990 1995 2000 2005 Source: Sun World Wide Analyst Conference Feb. 25, 2003 IEEE SELSE 4 – March 26, 2008 5 www.OpenSPARC.net Single Threading HURRY Up to 85% Cycles Waiting for Memory
    [Show full text]
  • MIPS Assembly Language Programming Using Qtspim
    MIPS Assembly Language Programming using QtSpim Ed Jorgensen, Ph.D. Version 1.1.50 July 2019 Cover image: MIPS R3000 Custom Chip http://commons.wikimedia.org/wiki/File:RCP-NUS_01.jpg Spim is copyrighted by James Larus and distributed under a BSD license. Copyright (c) 1990-2011, James R. Larus. All rights reserved. Copyright © 2013, 2014, 2015, 2016, 2017 by Ed Jorgensen You are free: To Share — to copy, distribute and transmit the work To Remix — to adapt the work Under the following conditions: Attribution — you must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). Noncommercial — you may not use this work for commercial purposes. Share Alike — if you alter, transform, or build upon this work, you may distribute the resulting work only under the same or similar license to this one. Table of Contents 1.0 Introduction...........................................................................................................1 1.1 Additional References.........................................................................................1 2.0 MIPS Architecture Overview..............................................................................3 2.1 Architecture Overview........................................................................................3 2.2 Data Types/Sizes.................................................................................................4 2.3 Memory...............................................................................................................4
    [Show full text]
  • Computer Architectures an Overview
    Computer Architectures An Overview PDF generated using the open source mwlib toolkit. See http://code.pediapress.com/ for more information. PDF generated at: Sat, 25 Feb 2012 22:35:32 UTC Contents Articles Microarchitecture 1 x86 7 PowerPC 23 IBM POWER 33 MIPS architecture 39 SPARC 57 ARM architecture 65 DEC Alpha 80 AlphaStation 92 AlphaServer 95 Very long instruction word 103 Instruction-level parallelism 107 Explicitly parallel instruction computing 108 References Article Sources and Contributors 111 Image Sources, Licenses and Contributors 113 Article Licenses License 114 Microarchitecture 1 Microarchitecture In computer engineering, microarchitecture (sometimes abbreviated to µarch or uarch), also called computer organization, is the way a given instruction set architecture (ISA) is implemented on a processor. A given ISA may be implemented with different microarchitectures.[1] Implementations might vary due to different goals of a given design or due to shifts in technology.[2] Computer architecture is the combination of microarchitecture and instruction set design. Relation to instruction set architecture The ISA is roughly the same as the programming model of a processor as seen by an assembly language programmer or compiler writer. The ISA includes the execution model, processor registers, address and data formats among other things. The Intel Core microarchitecture microarchitecture includes the constituent parts of the processor and how these interconnect and interoperate to implement the ISA. The microarchitecture of a machine is usually represented as (more or less detailed) diagrams that describe the interconnections of the various microarchitectural elements of the machine, which may be everything from single gates and registers, to complete arithmetic logic units (ALU)s and even larger elements.
    [Show full text]
  • Marrying Prediction and Segmentation to Identify Sales Leads
    Marrying Prediction and Segmentation to Identify Sales Leads Jim Porzak, The Generations Network Alex Kriney, Sun Microsystems February 19, 2009 1 Business Challenge ● In 2005 Sun Microsystems began the process of open-sourcing and making its software stack freely available ● Many millions of downloads per month* ● Heterogeneous registration practices ● Multichannel (web, email, phone, in- person) and multitouch marketing strategy ● 15%+ response rates; low contact-to-lead ratio ● Challenge: Identify the sales leads * Solaris, MySQL, GlassFish, NetBeans, OpenOffice, OpenSolaris, xVM Virtualbox, JavaFX, Java, etc. Sun Confidential: Internal Only 2 The Project ● Focus on Solaris 10 (x86 version) ● Data sources: download registration, email subscriptions, other demographics databases, product x purchases ● What are the characteristics of someone with a propensity to purchase? ● How can we become more efficient at identifying potential leads? ● Apply learnings to marketing strategy for all products and track results Sun Confidential: Internal Only 3 Predictive Models Employed ● Building a purchase model using random forests ● Creating prospect persona segmentation using cluster analysis Sun Confidential: Internal Only 4 Random Forest Purchase Model Sun Confidential: Internal Only 5 Random Forests • Developed by Leo Breiman of Cal Berkeley, one of the four developers of CART, and Adele Cutler, now at Utah State University. • Accuracy comparable with modern machine learning methods. (SVMs, neural nets, Adaboost) • Built in cross-validation using “Out of Bag” data. (Prediction error estimate is a by product) • Large number candidate predictors are automatically selected. (Resistant to over training) • Continuous and/or categorical predicting & response variables. (Easy to set up.) • Can be run in unsupervised for cluster discovery. (Useful for market segmentation, etc.) • Free Prediction and Scoring engines run on PC’s, Unix/Linux & Mac’s.
    [Show full text]
  • Industry Perspective on Chip Multi-Threading: Bridging the Gap with Academia Using Opensparc Dwayne Lee Opensparc Community Manager
    Industry perspective on Chip Multi-Threading: Bridging the gap with academia using OpenSPARC Dwayne Lee OpenSPARC Community Manager Shrenik Mehta Senior Director, Frontend Technologies & OpenSPARC Program Microelectronics Sun Microsystems, Inc. www.opensparc.net Agenda 1.Chip Multi-Threading (CMT) Era 2.Microarchitecture of OpenSPARC T1 3.OpenSPARC T1 Program 4.OpenSPARC in Academia www.opensparc.net Workshop on Computer Architecture Education, June 9 2007, San Diego 2 Making the Right Waves Chip Multi-threading (CoolThreadsTM) Symmetrical Multi-processing (SMP) e c n Reduced Instruction Set a m Computing (RISC) r o f r e P / e c i r P d e v o r p m I 1980 1990 2000 2010 www.opensparc.net Workshop on Computer Architecture Education, June 9 2007, San Diego 3 The Processor Growth Single Chip Multiprocessors Multiple cores Integer Unit Data Center >350M xstors <20K gates 1982 1990 1998 2005 RISC SMP RAS SWAP 32 bit 64 bit CMT VIS 1.0 Hypervisor Source: Sun Network San Francisco, NC03Q3, Sep. 17, 2003 The Big Bang Is Happening— Four Converging Trends Network Computing Is Moore’s Law Thread Rich A fraction of the die can Web services, JavaTM already build a good applications, database processor core; how am I transactions, ERP . going to use a billion transistors? Worsening Growing Complexity Memory Latency of Processor Design It’s approaching 1000s Forcing a rethinking of of CPU cycles! Friend or foe? processor architecture – modularity, less is more, time-to-market www.opensparc.net Workshop on Computer Architecture Education, June 9 2007,
    [Show full text]
  • Pipenightdreams Osgcal-Doc Mumudvb Mpg123-Alsa Tbb
    pipenightdreams osgcal-doc mumudvb mpg123-alsa tbb-examples libgammu4-dbg gcc-4.1-doc snort-rules-default davical cutmp3 libevolution5.0-cil aspell-am python-gobject-doc openoffice.org-l10n-mn libc6-xen xserver-xorg trophy-data t38modem pioneers-console libnb-platform10-java libgtkglext1-ruby libboost-wave1.39-dev drgenius bfbtester libchromexvmcpro1 isdnutils-xtools ubuntuone-client openoffice.org2-math openoffice.org-l10n-lt lsb-cxx-ia32 kdeartwork-emoticons-kde4 wmpuzzle trafshow python-plplot lx-gdb link-monitor-applet libscm-dev liblog-agent-logger-perl libccrtp-doc libclass-throwable-perl kde-i18n-csb jack-jconv hamradio-menus coinor-libvol-doc msx-emulator bitbake nabi language-pack-gnome-zh libpaperg popularity-contest xracer-tools xfont-nexus opendrim-lmp-baseserver libvorbisfile-ruby liblinebreak-doc libgfcui-2.0-0c2a-dbg libblacs-mpi-dev dict-freedict-spa-eng blender-ogrexml aspell-da x11-apps openoffice.org-l10n-lv openoffice.org-l10n-nl pnmtopng libodbcinstq1 libhsqldb-java-doc libmono-addins-gui0.2-cil sg3-utils linux-backports-modules-alsa-2.6.31-19-generic yorick-yeti-gsl python-pymssql plasma-widget-cpuload mcpp gpsim-lcd cl-csv libhtml-clean-perl asterisk-dbg apt-dater-dbg libgnome-mag1-dev language-pack-gnome-yo python-crypto svn-autoreleasedeb sugar-terminal-activity mii-diag maria-doc libplexus-component-api-java-doc libhugs-hgl-bundled libchipcard-libgwenhywfar47-plugins libghc6-random-dev freefem3d ezmlm cakephp-scripts aspell-ar ara-byte not+sparc openoffice.org-l10n-nn linux-backports-modules-karmic-generic-pae
    [Show full text]
  • QEMU As a Platform for PLC Virtualization an Analysis from a Cyber Security Perspective
    QEMU as a platform for PLC virtualization An analysis from a cyber security perspective HANNES HOLM, MATS PERSSON FOI Swedish Defence Research Agency Phone: +46 8 555 030 00 www.foi.se FOI-R--4576--SE SE-164 90 Stockholm Fax: +46 8 555 031 00 ISSN 1650-1942 April 2018 Hannes Holm, Mats Persson QEMU as a platform for PLC virtualization An analysis from a cyber security perspective Bild/Cover: Hannes Holm FOI-R--4576--SE Titel QEMU as a platform for PLC virtualization Title Virtualisering av PLC:er med QEMU Rapportnr/Report no FOI-R--4576--SE Månad/Month April Utgivningsår/Year 2018 Antal sidor/Pages 36 ISSN 1650-1942 Kund/Customer MSB Forskningsområde 4. Informationssäkerhet och kommunikation FoT-område Projektnr/Project no E72086 Godkänd av/Approved by Christian Jönsson Ansvarig avdelning Ledningssytem Detta verk är skyddat enligt lagen (1960:729) om upphovsrätt till litterära och konstnärliga verk, vilket bl.a. innebär att citering är tillåten i enlighet med vad som anges i 22 § i nämnd lag. För att använda verket på ett sätt som inte medges direkt av svensk lag krävs särskild överenskommelse. This work is protected by the Swedish Act on Copyright in Literary and Artistic Works (1960:729). Citation is permitted in accordance with article 22 in said act. Any form of use that goes beyond what is permitted by Swedish copyright law, requires the written permission of FOI. FOI-R--4576--SE Sammanfattning IT-säkerhetsutvärderingar är ofta svåra att genomföra inom operativa industriella informations- och styrsystem (ICS) då de medför risk för avbrott, vilket kan få mycket stor konsekvens om tjänsten som ett system realiserar är samhällskritisk.
    [Show full text]
  • Philip Machanick
    MIPS2C programming from the machine up Philip Machanick MIPS2C: PROGRAMMING FROM THE MACHINE UP First edition, 2015 Minor corrections: March 2017, April 2019, October 2020 Copyright © Philip Machanick 2014, 2015, 2016, 2017, 2018, 2019, 2020 Published by Philip Machanick in the RAMpage Research imprint under an Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) licence: http://creativecommons.org/licenses/by-nc/4.0/ The quick summary: free to use however you like but not for commercial purposes. SPIM documentation: Appendix E is copyright to the author as indicated on the first page and using this material does not imply endorsement by James Larus of this book. Picture credits: all illustrations are either by the author or from public domain sources, as acknowledged in the text. Author: Machanick, Philip, 1957- Title: Mips2C: programming from the machine up / Philip Machanick Edition: 1st ed. Publisher: Grahamstown, South Africa : RAMpage Research, 2015. ISBN: 978-0-8681048-7-4 (pbk.) LoC classification : QA76 Last typeset 27 October 2020 Preface HY THIS BOOK? Some years ago I took part in a panel discussion titled “Programming Early Considered Harmful” at the SIGCSE 2001 W conference [Hitchner et al. 2001]. Once of those present was Yale Patt, whom I had met briefly on a sabbatical at University of Michigan, where he was at the time a professor working in computer architecture. His role on the panel was to proselytise his book, Introduction to Computing Systems: From bits & gates to C & beyond [Patt and Patel 2013], which introduced programming from the low level up. I found the idea intriguing particularly as I also was concerned with the problem that students tend to stick with the first thing they learn.
    [Show full text]
  • Kutztown CSC Multiprocessors, Page 1 Using The
    Using the Three Multiprocessor Servers from the 2009 Sun Microsystems Grant Dale E. Parson, http://faculty.kutztown.edu/parson, Summer 2010 Version 2, January 2012 I. Introduction Acknowledgements January 2012: This originally unplanned extension of the summer 2010 report, completed during winter break 2011-2012, has been made possible by a Parallelism in the Classroom Microgrant from Intel Corporation. The updates to the benchmarks reported in this document represent one component of the work completed thanks to the Intel grant. Other curricular components relating to the CSC 580 Special Topics course in Multiprocessor Programming from spring 2011, and the CSC 480 Special Topics course in Multiprocessor Programming from spring 2012, have benefited from the Intel grant as well. They are reported in separate documents, since they were not parts of the summer 2010 study. In addition, thanks go to the NVIDIA Corporation for the donation of a C2070 Tesla GPU (graphical processing unit) card to our department. I will be using that card housed within an Intel-based multicore PC purchased by Kutztown University to perform algorithm research reported outside the scope of this report. All code and documentation reported herein will be contributed to Intel’s Academic Communities' Educational Exchange under the Creative Commons 3 open-source copyright. The major addition in this January 2012 revision of this report is the inclusion of measurements from an Intel-based multiprocessor PC in the benchmark reports. Subsections tagged with January 2012 contain most of the added information. Start of Original Report This document is an outcome of a study that I conducted over the 2009-2010 academic year and the summer of 2010, with the summer work funded by a PASSHE Faculty Professional Development Grant, in order to determine ways to integrate the three multiprocessor servers that we received from Sun Microsystems at the end of summer 2009 into the computer science curriculum at Kutztown University.
    [Show full text]