The Design and Utility of the ML-RSIM System Simulator, Lambert

Total Page:16

File Type:pdf, Size:1020Kb

The Design and Utility of the ML-RSIM System Simulator, Lambert Journal of Systems Architecture 52 (2006) 283–297 www.elsevier.com/locate/sysarc The design and utility of the ML-RSIM system simulator Lambert Schaelicke a,*, Mike Parker b a Intel Corporation, 3400 Harmony Road, Fort Collins, CO 80528, USA b Cray, Inc., PO Box 6000, Chippewa Falls, WI 54729, USA Received 24 July 2004; accepted 21 July 2005 Available online 25 October 2005 Abstract Execution-driven simulation has become the primary method for evaluating architectural techniques as it facilitates rapid design space exploration without the cost of building prototype hardware. To date, most simulation systems have either focused on the cycle-accurate modeling of user-level code while ignoring operating system and I/O effects, or have modeled complete systems while abstracting away many cycle-accurate timing details. The ML-RSIM simulation system presented here combines detailed hardware models with the ability to simulate user-level as well as operating system activ- ity, making it particularly suitable for exploring the interaction of applications with the operating system and I/O activity. This paper provides an overview of the design of the simulation infrastructure and discusses its strengths and weaknesses in terms of accuracy, flexibility, and performance. A validation study using LMBench microbenchmarks shows a good cor- relation for most of the architectural characteristics, while operating system effects show a larger variability. By quantifying the accuracy of the simulation tool in various areas, the validation effort not only helps gauge the validity of simulation results but also allows users to assess the suitability of the tool for a particular purpose. Ó 2005 Elsevier B.V. All rights reserved. Keywords: Simulator validation; Execution-driven simulator; Simulator accuracy 1. Introduction not provided by real hardware. However, the design and implementation of a suitable and robust Simulation has become one of the predominant simulator is in itself a major undertaking that tools for computer architecture research. It allows requires a combination of architectural expertise researchers to rapidly explore novel techniques and software development skills. For this reason, and architectures without incurring the engineering a number of readily available and proven simula- effort and expense of implementing prototype hard- tion systems such as SimpleScalar [2], SimOS ware. In addition, it affords a flexibility normally [16], SimICS [12] and RSIM [15] have found wide- spread use. Understanding the performance, accu- racy, and limitations of simulators is critical when * Corresponding author. selecting a tool to answer a particular research E-mail addresses: [email protected] (L. Schaelicke), [email protected] (M. Parker). question. URLs: http://www.cs.utah.edu/~lambert (L. Schaelicke), This paper describes the design and validation of http://www.cs.utah.edu/~map (M. Parker). ML-RSIM, an execution-driven system simulator 1383-7621/$ - see front matter Ó 2005 Elsevier B.V. All rights reserved. doi:10.1016/j.sysarc.2005.07.001 284 L. Schaelicke, M. Parker / Journal of Systems Architecture 52 (2006) 283–297 that combines detailed hardware models of modern The following section briefly describes the design workstation and server-class machines with a Unix- of ML-RSIM, focusing on the I/O subsystem and like operating system [18]. The ML-RSIM environ- operating system infrastructure. Section 3 presents ment, based on RSIM [15], extends the system the validation methodology used to gauge the accu- features normally represented by simulators to racy of ML-RSIM. Section 4 presents the validation include I/O devices and an operating system with- results and discusses how important design deci- out sacrificing fidelity of the processor model and sions affect the simulatorÕs utility and accuracy. Sec- with only a small impact on simulation speed. A tion 5 contrasts this work with other simulation detailed characterization of the accuracy of the infrastructures and validation efforts. Finally, Sec- hardware models as well as the simulator operating tion 6 summarizes and outlines ongoing and future system highlights key strengths and weaknesses of work. ML-RSIM and helps determine the toolÕs suitability for specific research tasks. 2. ML-RSIM design Simulator design is a trade-off between develop- ment effort, simulation speed, modeling scope, and 2.1. Machine model accuracy. As such, most simulators are targeted towards specific application domains and research The ML-RSIM processor model is based on agendas. For instance, architectural simulators like RSIM v1.0 [15] with extensions to simulate operat- SimpleScalar [2] and RSIM [15] focus on the ing system code and memory-mapped access to I/O cycle-accurate modeling of user-level instructions devices. Other hardware models include a two-level by providing highly detailed models of the processor cache hierarchy, a coherent memory bus, a memory and memory hierarchy. These tools are invaluable controller, and several I/O devices. Combined, these for experiments involving compute-intensive appli- components model a contemporary workstation or cations, but they ignore many system-level effects server-class machine, as shown in Fig. 1. such as virtual memory, system calls, interrupt han- The ML-RSIM CPU model implements the dling, and other I/O related effects. System-level SPARC V8 instruction set [21] on a MIPS R12000- simulators like SimOS [16] and SimICS [12],on like dynamically scheduled superscalar processor the other hand, provide complete system models core [14]. It also includes an instruction cache model, and thus account for application, operating system, TLB models, exception handling capabilities, and and I/O effects. However, to achieve acceptable sim- support for privileged instructions and registers. ulation speed, hardware models are usually simpli- Both user-level and privileged instructions are simu- fied and not cycle-accurate. Such system-level lated, so simulation sessions account for both appli- simulators are able to model the interaction of hard- cation and kernel effects. ware, system software, and applications. However, The cache hierarchy is modeled as a conventional due to the abstract hardware models, only coarse- two-level structure with support for snooping cache grain effects can be observed. coherency. An uncached buffer handles load and In contrast, ML-RSIM combines detailed hard- store instructions to I/O addresses, and supports ware models of a microprocessor, memory hierar- store combining to reduce system bus utilization. chy, and I/O subsystem with a simulated operating The system bus implements snooping cache coher- system. As such, it supports detailed explorations ency through a MESI protocol. The memory con- of the interaction of applications, operating sys- troller supports SDRAM or RAMBUS memory in tems, and I/O devices. Hardware models include a a variety of configurations and accurately models dynamically scheduled processor with a two-level queueing delays, bank contention, and DRAM pre- cache hierarchy, coherent system bus, and a multi- charge and refresh overhead. bank memory controller. These components are The I/O subsystem is based on the PCI bus and augmented with an I/O subsystem consisting of a consists of a PCI bridge with autoconfiguration sup- PCI bridge, SCSI adapter, SCSI bus, and hard disk. port, a real-time clock, and a number of SCSI host The fully-simulated ‘‘Lamix’’ operating system is adapters with hard disks. The real-time clock is based on NetBSD source code and includes a sys- modeled after a MOSTEK 48T02 clock chip and tem call layer, multiple file systems with buffer implements common date and time functionality cache, device drivers, and multitasking capabilities as well as two independent periodic interrupt [18]. sources. The SCSI adapter is a variation of an L. Schaelicke, M. Parker / Journal of Systems Architecture 52 (2006) 283–297 285 CPU L1 I-Cache L1 D-Cache L2 Cache System Bus Disk Memory Controller PCI Config PCI Bridge DRAM . DRAM Realtime SCSI Clock Adapter SCSI Bus Fig. 1. ML-RSIM system model. Adaptec AIC 7770-based SCSI host adapter and ances the need to accurately model a realistic system supports multiple outstanding requests, disconnect with the cost of implementing the minute and often and reconnect transactions, and request queueing obscure details of a particular commercial system. [1]. Each adapter controls one SCSI bus that accounts for arbitration delays, idle times, and data 2.2. Lamix kernel transfer. Each SCSI bus connects an adapter to a configurable number of SCSI disks. These disks The Lamix kernel is a Unix-compatible multi- model seek and transfer times as well as an on- tasking operating system specifically designed to board DRAM buffer. Seek times are calculated run on the ML-RSIM simulator. It consists of an using one of four algorithms: (a) a fitted curve based independently developed multitasking core and sig- on single-track, average, and full-stroke seek times, nificant portions of NetBSD source code that imple- (b) a three-point linear curve based on the same ment file system and device driver functionality. The parameters, (c) a constant seek time, or (d) instanta- system ABI is Solaris-compatible; thus applications neous seeks with zero latency [8,11]. The sector- compiled for Lamix can, in most cases, execute on based disk cache is used for data staging, prefetch- native Solaris hosts without modifications. Lamix ing and optional write buffering. The disk model only requires that applications are statically
Recommended publications
  • SGI® Octane III®
    Making ® ® Supercomputing SGI Octane III Personal™ KEY FEATURES Supercomputing Gets Personal Octane III takes high-performance computing out Office Ready of the data center and puts it at the deskside. It A pedestal, one foot by two combines the immense power and performance broad HPC application support and arrives ready foot form factor and capabilities of a high-performance cluster with the for immediate integration for a smooth out-of-the- quiet operation portability and usability of a workstation to enable box experience. a new era of personal innovation in strategic science, research, development and visualization. Octane III allows a wide variety of single and High Performance dual-socket node choices and a wide selection of Up to 120 high- In contrast with standard dual-processor performance, storage, integrated networking, and performance cores and workstations with only eight cores and moderate graphics and compute GPU options. The system nearly 2TB of memory memory capacity, the superior design of is available as an up to ten node deskside cluster Broad HPC Application Octane III permits up to 120 high-performance configuration or dual-node graphics workstation Support cores and nearly 2TB of memory. Octane III configurations. Accelerated time-to-results significantly accelerates time-to-results for over with support for over 50 50 HPC applications and supports the latest Supported operating systems include: ® ® ® HPC applications Intel processors to capitalize on greater levels SUSE Linux Enterprise Server and Red Hat of performance, flexibility and scalability. Pre- Enterprise Linux. All configurations are available configured with system software, cluster set up is with pre-loaded SGI® Performance Suite system Ease of Use a breeze.
    [Show full text]
  • OCTANE Technical Report
    OCTANE Technical Report Silicon Graphics, Inc. ANY DUPLICATION, MODIFICATION, DISTRIBUTION, PUBLIC PERFORMANCE, OR PUBLIC DISPLAY OF THIS DOCUMENT WITHOUT THE EXPRESS WRITTEN CONSENT OF SILICON GRAPHICS, INC. IS STRICTLY PROHIBITED. THE RECEIPT OR POSSESSION OF THIS DOCUMENT DOES NOT CONVEY ANY RIGHTS TO REPRODUCE, DISCLOSE OR DISTRIBUTE ITS CONTENTS, OR TO MANUFACTURE, USE, OR SELL ANYTHING THAT IT MAY DESCRIBE, IN WHOLE OR IN PART. Cop VED Table of Contents Section 1 Introduction .....................................................................................................1-1 1.1 Manufacturing.............................................................................................................. 1-3 1.1.1 Industrial Design .....................................................................................1-3 1.1.2 CAD/CAM Solid Modeling ....................................................................1-4 1.1.3 Analysis...................................................................................................1-5 1.1.4 Digital Prototyping..................................................................................1-6 1.2 Entertainment............................................................................................................... 1-8 1.2.1 3D Animation..........................................................................................1-8 1.2.2 Film/Video/Audio ...................................................................................1-9 1.2.3 Publishing..............................................................................................1-10
    [Show full text]
  • Interactive Rendering with Coherent Ray Tracing
    EUROGRAPHICS 2001 / A. Chalmers and T.-M. Rhyne Volume 20 (2001), Number 3 (Guest Editors) Interactive Rendering with Coherent Ray Tracing Ingo Wald, Philipp Slusallek, Carsten Benthin, and Markus Wagner Computer Graphics Group, Saarland University Abstract For almost two decades researchers have argued that ray tracing will eventually become faster than the rasteri- zation technique that completely dominates todays graphics hardware. However, this has not happened yet. Ray tracing is still exclusively being used for off-line rendering of photorealistic images and it is commonly believed that ray tracing is simply too costly to ever challenge rasterization-based algorithms for interactive use. However, there is hardly any scientific analysis that supports either point of view. In particular there is no evidence of where the crossover point might be, at which ray tracing would eventually become faster, or if such a point does exist at all. This paper provides several contributions to this discussion: We first present a highly optimized implementation of a ray tracer that improves performance by more than an order of magnitude compared to currently available ray tracers. The new algorithm makes better use of computational resources such as caches and SIMD instructions and better exploits image and object space coherence. Secondly, we show that this software implementation can challenge and even outperform high-end graphics hardware in interactive rendering performance for complex environments. We also provide an brief overview of the benefits of ray tracing over rasterization algorithms and point out the potential of interactive ray tracing both in hardware and software. 1. Introduction Ray tracing is famous for its ability to generate high-quality images but is also well-known for long rendering times due to its high computational cost.
    [Show full text]
  • Computing @SERC Resources,Services and Policies
    Computing @SERC Resources,Services and Policies R.Krishna Murthy SERC - An Introduction • A state-of-the-art Computing facility • Caters to the computing needs of education and research at the institute • Comprehensive range of systems to cater to a wide spectrum of computing requirements. • Excellent infrastructure supports uninterrupted computing - anywhere, all times. SERC - Facilities • Computing - – Powerful hardware with adequate resources – Excellent Systems and Application Software,tools and libraries • Printing, Plotting and Scanning services • Help-Desk - User Consultancy and Support • Library - Books, Manuals, Software, Distribution of Systems • SERC has 5 floors - Basement,Ground,First,Second and Third • Basement - Power and Airconditioning • Ground - Compute & File servers, Supercomputing Cluster • First floor - Common facilities for Course and Research - Windows,NT,Linux,Mac and other workstations Distribution of Systems - contd. • Second Floor – Access Stations for Research students • Third Floor – Access Stations for Course students • Both the floors have similar facilities Computing Systems Systems at SERC • ACCESS STATIONS *SUN ULTRA 20 Workstations – dual core Opteron 4GHz cpu, 1GB memory * IBM INTELLISTATION EPRO – Intel P4 2.4GHz cpu, 512 MB memory Both are Linux based systems OLDER Access stations * COMPAQ XP 10000 * SUN ULTRA 60 * HP C200 * SGI O2 * IBM POWER PC 43p Contd... FILE SERVERS 5TB SAN storage IBM RS/6000 43P 260 : 32 * 18GB Swappable SSA Disks. Contd.... • HIGH PERFORMANCE SERVERS * SHARED MEMORY MULTI PROCESSOR • IBM P-series 690 Regatta (32proc.,256 GB) • SGI ALTIX 3700 (32proc.,256GB) • SGI Altix 350 ( 16 proc.,16GB – 64GB) Contd... * IBM SP3. NH2 - 16 Processors WH2 - 4 Processors * Six COMPAQ ALPHA SERVER ES40 4 CPU’s per server with 667 MHz.
    [Show full text]
  • OCTANE® Workstation Owner's Guide
    OCTANE® Workstation Owner’s Guide Document Number 007-3435-003 CONTRIBUTORS Written by Charmaine Moyer Production by Linda Rae Sande Illustrated by Kwong Liew Engineering contributions by Jim Bergman, Brian Bolich, Bob Cook, Mark Glusker, John Hahn, Steve Manzi, Ted Marsh, Donna McMaster, Jim Pagura, Michael Poimboeuf, Brad Reger, Jose Reinoso, Bob Sanders, Chris Wheaton, Michael Wright, and many others on the OCTANE engineering and business team. St. Peter’s Basilica image courtesy of ENEL SpA and InfoByte SpA. Disk Thrower image courtesy of Xavier Berenguer, Animatica. © 1997 - 1999, Silicon Graphics, Inc.— All Rights Reserved The contents of this document may not be copied or duplicated in any form, in whole or in part, without the prior written permission of Silicon Graphics, Inc. LIMITED AND RESTRICTED RIGHTS LEGEND Use, duplication, or disclosure by the Government is subject to restrictionsas set forth in the Rights in Data clause at FAR 52.227-14 and/or in similar orsuccessor clauses in the FAR, or in the DOD, DOE or NASA FAR Supplements.Unpublished rights reserved under the Copyright Laws of the United States.Contractor/manufacturer is Silicon Graphics, Inc., 2011 N. Shoreline Blvd., Mountain View, CA 94043-1389. Silicon Graphics, IRIS, IRIX, and OCTANE are registered trademarks and the Silicon Graphics logo, IRIX Interactive Desktop, Power Fortran Accelerator, IRIS InSight, and Stereoview are trademarks of Silicon Graphics, Inc. ADAT is a registered trademark of Alesis Corporation. Centronics is a registered trademark of Centronics Data Computer Corporation. Envi-ro-tech is a trademark of TECHSPRAY. Macintosh is a registered trademark of Apple Computer, Inc.
    [Show full text]
  • AVS on UNIX WORKSTATIONS INSTALLATION/ RELEASE NOTES
    _________ ____ AVS on UNIX WORKSTATIONS INSTALLATION/ RELEASE NOTES ____________ Release 5.5 Final (50.86 / 50.88) November, 1999 Advanced Visual Systems Inc.________ Part Number: 330-0120-02 Rev L NOTICE This document, and the software and other products described or referenced in it, are con®dential and proprietary products of Advanced Visual Systems Inc. or its licensors. They are provided under, and are subject to, the terms and conditions of a written license agreement between Advanced Visual Systems and its customer, and may not be transferred, disclosed or otherwise provided to third parties, unless oth- erwise permitted by that agreement. NO REPRESENTATION OR OTHER AFFIRMATION OF FACT CONTAINED IN THIS DOCUMENT, INCLUDING WITHOUT LIMITATION STATEMENTS REGARDING CAPACITY, PERFORMANCE, OR SUI- TABILITY FOR USE OF SOFTWARE DESCRIBED HEREIN, SHALL BE DEEMED TO BE A WARRANTY BY ADVANCED VISUAL SYSTEMS FOR ANY PURPOSE OR GIVE RISE TO ANY LIABILITY OF ADVANCED VISUAL SYSTEMS WHATSOEVER. ADVANCED VISUAL SYSTEMS MAKES NO WAR- RANTY OF ANY KIND IN OR WITH REGARD TO THIS DOCUMENT, INCLUDING BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PUR- POSE. ADVANCED VISUAL SYSTEMS SHALL NOT BE RESPONSIBLE FOR ANY ERRORS THAT MAY APPEAR IN THIS DOCUMENT AND SHALL NOT BE LIABLE FOR ANY DAMAGES, INCLUDING WITHOUT LIMI- TATION INCIDENTAL, INDIRECT, SPECIAL OR CONSEQUENTIAL DAMAGES, ARISING OUT OF OR RELATED TO THIS DOCUMENT OR THE INFORMATION CONTAINED IN IT, EVEN IF ADVANCED VISUAL SYSTEMS HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. The speci®cations and other information contained in this document for some purposes may not be com- plete, current or correct, and are subject to change without notice.
    [Show full text]
  • Running.1997.2.Pdf
    SEMI-ANNUAL REPORT NASA CONTRACT NAS5-31368 FOR MODIS TEAM MEMBER STEVEN W. RUNNING ASSOC. TEAM MEMBER RAMAKRISHNA R. NEMANI SOFTWARE ENGINEER JOSEPH GLASSY July 15, 1997 PRE-LAUNCH TASKS PROPOSED IN OUR CONTRACT OF DECEMBER 1991 We propose, during the pre-EOS phase to: (1) develop, with other MODIS Team Members, a means of discriminating different major biome types with NDVI and other AVHRR-based data. (2) develop a simple ecosystem process model for each of these biomes, BIOME-BGC (3) relate the seasonal trend of weekly composite NDVI to vegetation phenology and temperature limits to develop a satellite defined growing season for vegetation; and (4) define physiologically based energy to mass conversion factors for carbon and water for each biome. Our final core at-launch product will be simplified, completely satellite driven biome specific models for net primary production. We will build these biome specific satellite driven algorithms using a family of simple ecosystem process models as calibration models, collectively called BIOME-BGC, and establish coordination with an existing network of ecological study sites in order to test and validate these products. Field datasets will then be available for both BIOME-BGC development and testing, use for algorithm developments of other MODIS Team Members, and ultimately be our first test point for MODIS land vegetation products upon launch. We will use field sites from the National Science Foundation Long-Term Ecological Research network, and develop Glacier National Park as a major site for intensive validation. OBJECTIVES: We have defined the following near-term objectives for our MODIS contract based on the long term objectives stated above: -Organization of an EOS ground monitoring network with collaborating U.S.
    [Show full text]
  • Translating Research Into Business
    THE STATE OF SÃO PAULO RESEARCH FOUNDATION Translating Research into Business Ten years promoting technological innovation THE STATE OF SÃO PAULO RESEARCH FOUNDATION Carlos Vogt President Marcos Macari Vice-president BOARD OF TRUSTEES Adilson Avansi de Abreu Carlos Vogt Celso Lafer Hermann Wever Horácio Lafer Piva Hugo Aguirre Armelin José Arana Varela Marcos Macari Nilson Dias Vieira Júnior Vahan Agopyan Yoshiaki Nakano EXECUTIVE BOARD Ricardo Renzo Brentani Chief Executive Carlos Henrique de Brito Cruz Scientific Director Joaquim José de Camargo Engler Administrative Director Translating Research into Business Ten years promoting technological innovation Projects supported by FAPESP in the Partnership for Technological Innovation and Technological Innovation in Small Businesses Programs 2005 Catalogação-na-publicação elaborada pelo Centro de Documentação e Informação da FAPESP The State of São Paulo Research Foundation. Translating research into business : ten years promoting technological innovation : projects supported by FAPESP in the Partnership for Technological Innovation and Technological Innovation in Small Businesses programs / The State of São Paulo Research Foundation – São Paulo : FAPESP, 2005. 256 p. : il. ; 28 cm. Tradução de: A pesquisa traduzida em negócios : dez anos de incentivo à inovação tecnológica : projetos apoiados pela FAPESP nos programas Parceria para Inovação Tecnológica e Inovação Tecnológica em Pequenas Empresas. I. Título II. Título: Ten years promoting technological innovation. III. Título: Projects supported by FAPESP in the Partnership for Technological Innovation and Technological Innovation in Small Businesses programs. 1.FAPESP 2. Pesquisa e desenvolvimento – São Paulo 3. Ciência 4. Tecnologia 5. Inovação tecnológica 6. Inovação Tecnológica em Pequenas Empresas 7. PIPE 8. Parceria para Inovação Tecnológica 9. PITE 04/05 CDD 507.208161 Depósito Legal na Biblioteca Nacional, conforme Lei n.º 10.994, de 14 de dezembro de 2004.
    [Show full text]
  • SGI™ Origin 3000 Series Technical Configuration Owner's Guide
    SGI™ Origin 3000 Series Technical Configuration Owner’s Guide Document Number 007-4311-002 CONTRIBUTORS Written by Dick Brownell Illustrated by Dan Young Production by Diane Ciardelli Engineering contributions by SN1 Electrical, Mechanical, Manufacturing, Marketing, and Site Planning teams COPYRIGHT © 2000 Silicon Graphics, Inc. All rights reserved; provided portions may be copyright in third parties, as indicated elsewhere herein. No permission is granted to copy, distribute, or create derivative works from the contents of this electronic documentation. The contents of this document may not be copied or duplicated in any form, in whole or in part, without the prior written permission of Silicon Graphics, Inc. LIMITED RIGHTS LEGEND The electronic (software) version of this document was developed at private expense; if acquired under an agreement with the USA government or any contractor thereto, it is acquired as “commercial computer software” subject to the provisions of its applicable license agreement, as specified in (a) 48 CFR 12.212 of the FAR; or if acquired for Department of Defense units, (b) 48 CFR 227-7202 of the DoD FAR Supplement; or sections succeeding thereto. Contractor/manufacturer is Silicon Graphics, Inc., 1600 Amphitheatre Pkwy. 2E, Mountain View, CA 94043-1351. TRADEMARKS AND ATTRIBUTIONS Silicon Graphics is a registered trademark and SGI, the SGI logo, Origin, Onyx3, and IRIS InSight are trademarks of Silicon Graphics, Inc. PostScript is a trademark of Adobe Systems, Inc. MIPS is a registered trademark of MIPS Technologies, Inc. Windows and Windows NT are registered trademarks of Microsoft Corporation. DVCPRO is a trademark of Panasonic, Inc. Linux is a registered trademark of Linus Torvalds.
    [Show full text]
  • 5.3.1 Chyby Na SGI O2
    VYSOKÉ UČENÍ TECHNICKÉ V BRNĚ BRNO UNIVERSITY OF TECHNOLOGY FAKULTA INFORMAČNÍCH TECHNOLOGIÍ ÚSTAV INFORMAČNÍCH SYSTÉMŮ FACULTY OF INFORMATION TECHNOLOGY DEPARTMENT OF INFORMATION SYSTEMS TESTOVÁNÍ PAMĚTI NA ARCHITEKTUŘE SGI/MIPS BAKALÁŘSKÁ PRÁCE BACHELOR‘S THESIS AUTOR PRÁCE Karol Rydlo AUTHOR BRNO 2009 VYSOKÉ UČENÍ TECHNICKÉ V BRNĚ BRNO UNIVERSITY OF TECHNOLOGY FAKULTA INFORMAČNÍCH TECHNOLOGIÍ ÚSTAV POČÍTAČOVÝCH SYSTÉMŮ FACULTY OF INFORMATION TECHNOLOGY DEPARTMENT OF COMPUTER SYSTEMS TESTOVÁNÍ PAMĚTI NA ARCHITEKTUŘE SGI/MIPS MEMORY TESTING ON SGI/MIPS ARCHITECTIRE BAKALÁŘSKÁ PRÁCE BACHELOR‘S THESIS AUTOR PRÁCE Karol Rydlo AUTHOR VEDOUCÍ PRÁCE Ing. Tomáš Kašpárek SUPERVISOR BRNO 2009 Abstrakt Moje bakalářská práce se zabývá zprovozněním a vytvořením vlastních testů paměti na grafických stanicích SGI O2, což sebou přináší seznámení se s architekturou procesorů MIPS a pokouší se najít ideální prostředí pro provádění těchto testů. S tím úzce souvisí hledání vhodného způsobu spouštění a překladu aplikací pro stanice SGI O2, kde se zabývá také využitím křížových kompilátorů. Abstract Work is engaged in making solution for creating own memory tests on graphical station SGI O2. This thesis produces work on MIPS processor architecture and it try to find the ideal environments for testing memory and with it is nearly related looking for chances of start and compile application for SGI O2. Part of my thesis is also target using cross-compilers, for effective and useful work with program for other architecture. Klíčová slova Testování paměti, RAM,
    [Show full text]
  • An Investigation Into the Cost Factors of Special Effects in Computer Graphics
    An Investigation into the Cost Factors Of Special Effects in Computer Graphics Submitted in partial fulfilment of degree Bachelor of Science (Honours) By Adrian Charteris Rhodes University Computer Science Honours Adrian Charteris Computer Science Honours 2000 November 2000 Abstract Here within we explore one of the current benchmarks that exist in the market place today, namely SPEC, we follow it up with a case study of their main benchmark products. We also explore and discuss the philosophies behind general benchmarking including some of the pitfalls or challenges that face benchmark developers and outline various benchmark categories identified throughout our research. After presenting the related work in this field, we move on to our own benchmark experiment. In our experiment we hope to gain a richer understanding or appreciation of the benchmarking process through the development of our own practical implementation. We identify a number of fields within performance measurement that we feel need further exploration. These fields include the measuring of cost factors incurred when introducing graphical special effects. We record the performance of local graphics accelerators when these special effects are introduced. This includes the related strengths and weaknesses of each accelerator. The tests introduce the following special effects z- buffering, blending, fog and light. We apply these effects to the rendering of three basic primitives, namely the point, the line and the polygon. We measure the increases in performance time with the introduction of these special effects in order to determine the performance of various graphics accelerators. We rank both the special effects and the graphics accelerators. We also measure the difference between 2D and 3D rendering.
    [Show full text]
  • Applications of Pixel Textures in Visualization and Realistic Image Synthesis
    Applications of Pixel Textures in Visualization and Realistic Image Synthesis Wolfgang Heidrich, Rudiger¨ Westermann, Hans-Peter Seidel, Thomas Ertl Computer Graphics Group University of Erlangen fheidrich,wester,seidel,[email protected] Abstract which can be used for volume rendering, and the imaging subset, a set of extensions useful not only for image-processing, have been With fast 3D graphics becoming more and more available even on added in this version of the specification. Bump mapping and pro- low end platforms, the focus in developing new graphics hardware cedural shaders are only two examples for features that are likely to is beginning to shift towards higher quality rendering and addi- be implemented at some point in the future. tional functionality instead of simply higher performance imple- On this search for improved quality it is important to identify a mentations of the traditional graphics pipeline. On this search for powerful set of orthogonal building blocks to be implemented in improved quality it is important to identify a powerful set of or- hardware, which can then be flexibly combined to form new algo- thogonal features to be implemented in hardware, which can then rithms. We think that the pixel texture extension by Silicon Graph- be flexibly combined to form new algorithms. ics [9, 12] is a building block that can be useful for many applica- Pixel textures are an OpenGL extension by Silicon Graphics that tions, especially when combined with the imaging subset. fits into this category. In this paper, we demonstrate the benefits of In this paper, we use pixel textures to implement four different this extension by presenting several different algorithms exploiting algorithms for applications from visualization and realistic image its functionality to achieve high quality, high performance solutions synthesis: fast line integral convolution (Section 3), shadow map- for a variety of different applications from scientific visualization ping (Section 4), realistic fog models (Section 5), and finally en- and realistic image synthesis.
    [Show full text]