(12) United States Patent (10) Patent No.: US 9,612,835 B2 Palanca Et Al

Total Page:16

File Type:pdf, Size:1020Kb

(12) United States Patent (10) Patent No.: US 9,612,835 B2 Palanca Et Al USOO961.2835B2 (12) United States Patent (10) Patent No.: US 9,612,835 B2 Palanca et al. (45) Date of Patent: *Apr. 4, 2017 (54) MFENCE AND LFENCE (58) Field of Classification Search MICRO-ARCHITECTURAL None IMPLEMENTATION METHOD AND SYSTEM See application file for complete search history. (75) Inventors: Salvador Palanca, Folsom, CA (US); (56) References Cited Stephen A. Fischer, Gold River, CA (US); Subramaniam Maiyuran, Gold U.S. PATENT DOCUMENTS River, CA (US); Shekoufeh Qawami, El Dorado Hills, CA (US) 5,265,233 A 11/1993 Frailong 5,636,374. A 6/1997 Rodgers et al. (73) Assignee: Intel Corporation, Santa Clara, CA (Continued) (US) FOREIGN PATENT DOCUMENTS (*) Notice: Subject to any disclaimer, the term of this patent is extended or adjusted under 35 WO WO-9708608 3, 1997 U.S.C. 154(b) by 0 days. This patent is Subject to a terminal dis OTHER PUBLICATIONS claimer. Alpha 21164 Microprocessor Hardware Reference Manual (Dec. (21) Appl. No.: 13/619,919 1998, pp. 1-435).* (Continued) (22) Filed: Sep. 14, 2012 Primary Examiner — George Giroux (65) Prior Publication Data (74) Attorney, Agent, or Firm — Vecchia Patent Agent, US 2013 FOO672OO A1 Mar. 14, 2013 LLC Related U.S. Application Data (57) ABSTRACT (63) Continuation of application No. 13/440,096, filed on A system and method for fencing memory accesses. Apr. 5, 2012, now Pat. No. 9,383,998, which is a Memory loads can be fenced, or all memory access can be (Continued) fenced. The system receives a fencing instruction that sepa (51) Int. Cl. rates memory access instructions into older accesses and G06F 5/00 (2006.01) newer accesses. A buffer within the memory ordering unit is allocated to the instruction. The access instructions newer G06F 9/30 (2006.01) than the fencing instruction are stalled. The older access (Continued) instructions are gradually retired. When all older memory (52) U.S. Cl. accesses are retired, the fencing instruction is dispatched CPC ...... G06F 9/30145 (2013.01); G06F 9/30043 from the buffer. (2013.01); G06F 9/30047 (2013.01); (Continued) 24 Claims, 5 Drawing Sheets FUFETCHES ID DECOCES INTO ETRY IS STORE BUFFERS FENCE STORE DATA ALOCATED INT ACATED IN ACR0 STORE ADDRESSMFENCE RESERVATION EORY INSTRUCTION MICRO-OPERATIONS STATION ORDERING UNI 302 38 34 304 306 310 308 A-RETREVENT AL PREWOS SAL DISPATCHES DISPACHTOL INSERUCTIONS IN PROGRA FENCE OFALNSRUCTIONS CACHE CONTROLER ORDER RETIRED FROM THE READY TO FOONG EORY SBSYSTE SPATCH FENCEN ONSOREPOR PROGRA. ORDER ESPATCH AND RETIRE PREVIOUSNSTRUCTIONS OTSANDING ANY WRE BUFFERS (READ & WRITE COENING BUFFERSN IN CACHE CONTROLLER CACHE CONTROLLERON NOTEE GEOBAY ECTION PROCESS 326 EWCA 524 DEALLOCATEFENCE FROSTORE BUFFER IN MEMORY OUTSANDING WRE COMBINING ORDERING UNIT. CACHE CONTROLLERTREATSAFENCE ASA NOP. RETRENFENCE FROL CACHE CONTROLER. BUFFERS US 9,612,835 B2 Page 2 Related U.S. Application Data Bernstein et al. (Solutions and Debugging for Data Consistency in Multiprocessors with Noncoherent Caches, Feb. 1995, pp. continuation of application No. 10/654,573, filed on 83-103).* Sep. 2, 2003, now Pat. No. 8,171,261, which is a Alpha 21264 Microprocessor Hardware Reference Manual, continuation of application No. 10/194,531, filed on COMPAQ Computer Corporation, Order No. EC-RJRZA-TE, (Jul. Jul. 12, 2002, now Pat. No. 6,651,151, which is a 1999). continuation of application No. 09/475.363, filed on Goodman, "Cache Consistency and Sequential Consistency'. Dec. 30, 1999, now Pat. No. 6,678,810. (1989), pp. 1-4. Thakkar, Shreekant, “The Internet Streaming SIMD Extensions'. (51) Int. Cl. Intel Technology Journal, Retrieved on Nov. 11, 1999, Legal G06F 9/40 (2006.01) Information (C) 1999, pp. 1-8, from http://support.intel.com/technol G06F 9/38 (2006.01) ogy/it q21999/articles/art 1a.htm. (52) U.S. Cl. Advanced Micro Devices, Inc., “AMD-3D Technology Manual”. CPC ........ G06F 9/30087 (2013.01); G06F 9/3834 (Feb. 1998), pp. i-x, 1-58. Barad, Haim, et al., “Intel's Multimedia Architecture Extension', (2013.01); G06F 9/3836 (2013.01); G06F Nineteenth Convention of Electrical and Electronics Engineers in 9/3855 (2013.01); G06F 9/3857 (2013.01) Israel. (1996), pp. 148-151. Control Data Corporation, “Control Data 6400/6500/6600 Com (56) References Cited puter Systems Reference Manual”. Publication No. 601 00000, (1967), 159 pages. U.S. PATENT DOCUMENTS Convex Computer Corporation, “C4/XA Architecture Overview”. 5,675,724. A 10, 1997 Beal et al. Convex Technical Marketing, (Feb. 1994), 279 pages. 5,694,553 A * 12/1997 Abramson et al. ............. 71.2/23 Intel Corporation, “i860 Microprocessor Family Programmer's Ref 5,694,574 A 12/1997 Abramson et al. erence Manual”. (1992), Ch. 1, 3, 8 & 12. 5,724,536 A 3, 1998 Abramson et al. Intel Corporation, “Intel 80386 Programmer's Reference Manual”. 5,751,996 A 5, 1998 Glew et al. (1986), 421 pages. 5,778,245 A 7/1998 Papworth et al. Intel Corporation, "Pentium Processor User's Manual”, vol. 3: 5,790,398 A 8, 1998 Horie Architecture and Programming Manual. (1993), Ch. 1, 3-4, 6, 8 & 5,802,575 A 9/1998 Greenley et al. 18. 5,802,757 A 9, 1998 Duval et al. Kohn, L., et al., “The Visual Instruction Set (VIS) in UltraSPARC, 5,826, 109 A 10, 1998 Abramson et al. SPARC Technology Business-Sun Microsystems, Inc.(1995), pp. 5,860,126 A 1, 1999 Mittal 462-469. 5,881,262 A 3, 1999 Abramson et al. 5,898,854 A 4, 1999 Abramson et al. Lawrence Livermore Laboratory, "S-1 Uniprocessor Architecture'. 5,903,740 A 5, 1999 Walker et al. Apr. 21, 1983, 386 pages. 6,006,325 A * 12/1999 Burger et al. ................. T12/214 Lawrence Livermore Laboratory, “vol. I: Architecture-The 1979 6,038,646 A 3/2000 Sproull Annual Report-The S-1 Project', (1979), 443 pages. 6,047,334 A 4/2000 Langendorf et al. Lawrence Livermore Laboratory, “vol. II: Hardware-The 1979 6,073,210 A 6, 2000 Palanca et al. Annual Report-The S-1 Project', (1979), 366 pages. 6,088,771 A 7/2000 Steely, Jr. et al. Motorola, Inc., “MC88110 Second Generation RISC Microproces 6,088,772 A 7/2000 Harrimon et al. sor User's Manual”, MC811OUM/AD, (1991), 619 pages. 6,148,394 A 11/2000 Tung et al. Philips Electronics, “TriMedia TM1000 Preliminary Data Book”, 6,189,089 B1 2/2001 Walker et al. (1997), 496 pages. 6,216.215 B1 4/2001 Palanca et al. Samsung Electronics, “21164 Alpha Microprocessor Data Sheet”, 6,223,258 B1 4/2001 Palanca et al. 6,233,657 B1 5/2001 Ramagopal et al. (1997), 121 pages. 6,266,767 B1 7/2001 Feiste et al. Shipnes, J., “Graphics Processing with the 88.110 RISC Micropro 6,286,095 B1 9, 2001 Morris et al. cessor", IEEE, (1992), pp. 169-174. 6,356,270 B2 3, 2002 Pentkovski et al. Sun Microsystems, Inc., “VIS Visual Instruction Set User's 6,546,462 B1 4/2003 Palanca et al. Manual”. Part #805-1394-01, (Jul 1997), pp. i-xii. pp. 1-136. 6,636,950 B1 10/2003 Mithal et al. Sun Microsystems, Inc., “Visual Instruction Set (VIS) User's 6,651,151 B2 11/2003 Palanca et al. Guide'. Version 1.1. (Mar. 1997), pp. ixii, pp. 1-127. 6,678,810 B1 1/2004 Palanca et al. Texas Instruments, “TMS320C2X User's Guide', (1993), pp. 3:2- 6,708,269 B1 3, 2004 Tiruvallur et al. 3:11; 3:28-3:34; 4:1-4:22; 4:41; 4:103; 4:119-4:120; 4:122; 4:150 6,754,751 B1 6, 2004 Wilke 4:151. 6,862,679 B2 3/2005 Hacking et al. Texas Instruments, “TMS320C80 (MVP) Master Processor User's Guide”. (1995), 595 pages. OTHER PUBLICATIONS Texas Instruments, “TMS320C80 (MVP) Parallel Processor User's Guide”. (1995), 705 pages. Weaver et al. (The SPARC Architecture Manual: Version 9, Pub lished 1994, pp. 1-369).* * cited by examiner U.S. Patent Apr. 4, 2017 Sheet 1 of 5 US 9,612,835 B2 102 MACRO-INSTRUCTIONS: E. MFENCE, FENCE, 104 MASKMOVO, PREFETCH. NSTRUCTION FETCH UNIT NSTRUCTION DECODER UNIT MICRO-OPERATIONS:LE, STORE ADDRESSMFENCE UOP & STORE DATA MFENCE UOP, LFENCE UOP, MASKED WRITE ADDRESS UOP MASKED WRITE DATA UOP, LOAD HINT UOP. 108 106 RESERVATION STATION STORE PORT 10 REORDER BUFFER LOAD PORT: FENCE UOP, MFENCEUOPS. REGISTER FLE LOAD HINTUOP, - MEMORY ORDERING CACHEABLE 112 LOAD UNIT LOAD HIT (TO BUFFER RECISTER FILE) UNCACHEABLE LOADS CACHEABLE LOADS AND STORES CACHEABLE CACHEABLE STORE LOAD MISS LOAD HTS CACHEABLE UNCACHEABLE STORE MISS STORES L1 CACHE ? RITE COMBINING WCSTORES BUFFERS (WEAKLY WRITE ORDERED) BUFFERS UNCACHEABLE REQUESTS AND CACHEABLE MISSES UNCACHEABLE CACHEABLE TO IN-ORDER Y REQUESTS/ OUT OF ORDERY REQUESTS L2 CACHE FSB QUEUE o QUEUE11a. 150 ( Bus CONTROLER CCHAELEMSSES ) 142 144 146 140 FIG.1 U.S. Patent US 9,612,835 B2 U.S. Patent Apr. 4, 2017 Sheet 3 of 5 US 9,612,835 B2 909) SEHOIHHfliji U.S. Patent Apr. 4, 2017 Sheet 4 of 5 US 9,612,835 B2 807 ZZ; U.S. Patent Apr. 4, 2017 Sheet 5 of 5 US 9,612,835 B2 FU fetches E) decodes into "fence", Entries are LFENCE and and "store data" & allocated into MFENCE in acro "store address infence" reservation instructions micro-operations station 502 MFENCE infrediately after FENCE FENCE stals dispatch of all Load and store FENCE stalls dispatch buffer entries are ready to subsequent Ticao-operations, of MFENCE in program order allocated in memory dispatch ordering unit SO A picvious loads in program order retired from the memory subsyster, and load buffer tail pointer points to FENCE? Dispatch and retire YES previous pads “At-retirement" 518 dispatch to Ll cache controller A read buffers in L.
Recommended publications
  • Implementing Elliptic Curve Cryptography (A Narrow Survey)
    Implementing Elliptic Curve Cryptography (a narrow survey) Institute of Computing – UNICAMP Campinas, Brazil April 2005 Darrel Hankerson Auburn University Implementing ECC – 1/110 Overview Objective: sample selected topics of practical interest. Talk will favor: I Software solutions on general-purpose processors rather than dedicated hardware. I Techniques with broad applicability. I Methods targeted to standardized curves. Goals: I Present proposed methods in context. I Limit coverage of technical details (but “implementing” necessarily involves platform considerations). Implementing ECC – 2/110 Focus: higher-performance processors “Higher-performance” includes processors commonly associated with workstations, but also found in surprisingly small portable devices. Sun and IBM workstations RIM pager circa 1999 SPARC or Intel x86 (Pentium) Intel x86 (custom 386) 256 MB – 8 GB 2 MB “disk”, 304 KB RAM 0.5 GHz – 3 GHz 10 MHz, single AA battery heats entire building fits in shirt pocket Implementing ECC – 3/110 Optimizing ECC Elliptic Curve Digital Signature Algorithm (ECDSA) Random number Big number and Curve generation modular arithmetic arithmetic Fq field arithmetic General categories of optimization efforts: 1. Field-level optimizations. 2. Curve-level optimizations. 3. Protocol-level optimizations. Constraints: security requirements, hardware limitations, bandwidth, interoperability, and patents. Implementing ECC – 4/110 Optimizing ECC... 1. Field-level optimizations. I Choose fields with fast multiplication and inversion. I Use special-purpose hardware (cryptographic coprocessors, DSP, floating-point, SIMD). 2. Curve-level optimizations. I Reduce the number of point additions (windowing). I Reduce the number of field inversions (projective coords). I Replace point doubles (endomorphism methods). 3. Protocol-level optimizations. I Develop efficient protocols. I Choose methods and protocols that favor your computations or hardware.
    [Show full text]
  • Computer Architectures an Overview
    Computer Architectures An Overview PDF generated using the open source mwlib toolkit. See http://code.pediapress.com/ for more information. PDF generated at: Sat, 25 Feb 2012 22:35:32 UTC Contents Articles Microarchitecture 1 x86 7 PowerPC 23 IBM POWER 33 MIPS architecture 39 SPARC 57 ARM architecture 65 DEC Alpha 80 AlphaStation 92 AlphaServer 95 Very long instruction word 103 Instruction-level parallelism 107 Explicitly parallel instruction computing 108 References Article Sources and Contributors 111 Image Sources, Licenses and Contributors 113 Article Licenses License 114 Microarchitecture 1 Microarchitecture In computer engineering, microarchitecture (sometimes abbreviated to µarch or uarch), also called computer organization, is the way a given instruction set architecture (ISA) is implemented on a processor. A given ISA may be implemented with different microarchitectures.[1] Implementations might vary due to different goals of a given design or due to shifts in technology.[2] Computer architecture is the combination of microarchitecture and instruction set design. Relation to instruction set architecture The ISA is roughly the same as the programming model of a processor as seen by an assembly language programmer or compiler writer. The ISA includes the execution model, processor registers, address and data formats among other things. The Intel Core microarchitecture microarchitecture includes the constituent parts of the processor and how these interconnect and interoperate to implement the ISA. The microarchitecture of a machine is usually represented as (more or less detailed) diagrams that describe the interconnections of the various microarchitectural elements of the machine, which may be everything from single gates and registers, to complete arithmetic logic units (ALU)s and even larger elements.
    [Show full text]
  • 2 the VIS Instruction Set Pdist Instruction
    The VISTM Instruction Set Version 1.0 June 2002 A White Paper This document provides an overview of the Visual Instruction Set. ® 4150 Network Circle Santa Clara, CA 95054 USA www.sun.com Copyright © 2002 Sun Microsystems, Inc. All Rights reserved. THE INFORMATION CONTAINED IN THIS DOCUMENT IS PROVIDED"AS IS" WITHOUT ANY EXPRESS REPRESENTATIONS OR WARRANTIES. IN ADDITION, SUN MICROSYSTEMS, INC. DISCLAIMS ALL IMPLIED REPRESENTATIONS AND WARRANTIES, INCLUDING ANY WARRANTY OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, OR NON-INFRINGEMENT OF THIRD PARTY INTELLECTUAL PROPERTY RIGHTS. This document contains proprietary information of Sun Microsystems, Inc. or under license from third parties. No part of this document may be reproduced in any form or by any means or transferred to any third party without the prior written consent of Sun Microsystems, Inc. Sun, Sun Microsystems, the Sun Logo, VIS, Java, and mediaLib are trademarks or registered trademarks of Sun Microsystems, Inc. in the United States and other countries. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the United States and other countries. Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc. UNIX is a registered trademark in the United States and other countries, exclusively licensed through x/Open Company Ltd. The information contained in this document is not designed or intended for use in on-line control of aircraft, air traffic, aircraft navigation or aircraft communications; or in the design, construction, operation or maintenance of any nuclear facility. Sun disclaims any express or implied warranty of fitness for such uses.
    [Show full text]
  • SIMD-Swift: Improving Performance of Swift Fault Detection
    Master thesis SIMD-Swift: Improving Performance of Swift Fault Detection Oleksii Oleksenko 15. November 2015 Technische Universität Dresden Department of Computer Science Systems Engineering Group Supervisor: Prof. Christof Fetzer Adviser: MSc. Dmitrii Kuvaiskii Declaration Herewith I declare that this submission is my own work and that, to the best of my knowledge, it contains no material previously published or written by another person nor material which to a substantial extent has been accepted for the award of any other degree or diploma of the university or other institute of higher education, except where due acknowledgment has been made in the text. Dresden, 9. November 2015 Oleksii Oleksenko Abstract The general tendency in modern hardware is an increase in fault rates, which is caused by the decreased operation voltages and feature sizes. Previously, the issue of hardware faults was mainly approached only in high-availability enterprise servers and in safety-critical applications, such as transport or aerospace domains. These fields generally have very tight requirements, but also higher budgets. However, as fault rates are increasing, fault tolerance solutions are starting to be also required in applications that have much smaller profit margins. This brings to the front the idea of software-implemented hardware fault tolerance, that is, the ability to detect and tolerate hardware faults using software-based techniques in commodity CPUs, which allows to get resilience almost for free. Current solutions, however, are lacking in performance, even though they show quite good fault tolerance results. This thesis explores the idea of using the Single Instruction Multiple Data (SIMD) technology for executing all program’s operations on two copies of the same data.
    [Show full text]
  • Ultrasparc T1™ Supplement to the Ultrasparc Architecture 2005
    UltraSPARC T1™ Supplement to the UltraSPARC Architecture 2005 Draft D2.1, 14 May 2007 Privilege Levels: Hyperprivileged, Privileged, and Nonprivileged Distribution: Public Sun Microsystems, Inc. 4150 Network Circle Santa Clara, CA 95054 U.S.A. 650-960-1300 Part No.No: 8xx-xxxx-xx819-3404-05 ReleaseRevision: 1.0, Draft 2002 D2.1, 14 May 2007 ii UltraSPARC T1 Supplement • Draft D2.1, 14 May 2007 Copyright 2002-2006 Sun Microsystems, Inc., 4150 Network Circle • Santa Clara, CA 950540 USA. All rights reserved. This product or document is protected by copyright and distributed under licenses restricting its use, copying, distribution, and decompilation. No part of this product or document may be reproduced in any form by any means without prior written authorization of Sun and its licensors, if any. Third-party software, including font technology, is copyrighted and licensed from Sun suppliers. Parts of the product may be derived from Berkeley BSD systems, licensed from the University of California. UNIX is a registered trademark in the U.S. and other countries, exclusively licensed through X/Open Company, Ltd. For Netscape Communicator™, the following notice applies: Copyright 1995 Netscape Communications Corporation. All rights reserved. Sun, Sun Microsystems, the Sun logo, Solaris, and VIS are trademarks, registered trademarks, or service marks of Sun Microsystems, Inc. in the U.S. and other countries. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the U.S. and other countries. Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc The OPEN LOOK and Sun™ Graphical User Interface was developed by Sun Microsystems, Inc.
    [Show full text]
  • Idisa+: a Portable Model for High Performance Simd Programming
    IDISA+: A PORTABLE MODEL FOR HIGH PERFORMANCE SIMD PROGRAMMING by Hua Huang B.Eng., Beijing University of Posts and Telecommunications, 2009 a Thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in the School of Computing Science Faculty of Applied Science c Hua Huang 2011 SIMON FRASER UNIVERSITY Fall 2011 All rights reserved. However, in accordance with the Copyright Act of Canada, this work may be reproduced without authorization under the conditions for Fair Dealing. Therefore, limited reproduction of this work for the purposes of private study, research, criticism, review and news reporting is likely to be in accordance with the law, particularly if cited appropriately. APPROVAL Name: Hua Huang Degree: Master of Science Title of Thesis: IDISA+: A Portable Model for High Performance SIMD Pro- gramming Examining Committee: Dr. Kay C. Wiese Associate Professor, Computing Science Simon Fraser University Chair Dr. Robert D. Cameron Professor, Computing Science Simon Fraser University Senior Supervisor Dr. Thomas C. Shermer Professor, Computing Science Simon Fraser University Supervisor Dr. Arrvindh Shriraman Assistant Professor, Computing Science Simon Fraser University SFU Examiner Date Approved: ii Declaration of Partial Copyright Licence The author, whose copyright is declared on the title page of this work, has granted to Simon Fraser University the right to lend this thesis, project or extended essay to users of the Simon Fraser University Library, and to make partial or single copies only for such users or in response to a request from the library of any other university, or other educational institution, on its own behalf or for one of its users.
    [Show full text]
  • HPC-Event-Return-Of-Vector-20160801
    The OrionX Constellation: Data Center High Performance Computing The Return of Vector Processing Shahin Khan Digital transformation, cloud computing and application elasticity, open source software, and new app areas like AI and IoT are driving a renaissance in system architecture. This is an area of interest and research for us at OrionX. Vector processing was an interesting topic to re-emerge recently. First during the International Supercomputing Conference (ISC), and again in various announcements in implicit and explicit ways. On the Monday of the ISC conference, a new leader on the TOP500 list was announced. The Sunway TaihuLight system uses a new processor architecture that is Single-Instruction-Multiple-Data (SIMD) with a pipeline that can do eight 64-bit floating-point calculations per cycle. 10,649,600 computing cores comprising 40,960 nodes produce 93 petaflop/s running the LINPACK benchmark. Later that day, at the “ISC 2016 Vendor Showdown”, NEC had a presentation about its project “Aurora”. This project aims to combine x86 clusters and NEC’s vector processors in the same high bandwidth system. NEC has a long history of advanced vector processors with its SX architecture. Among many achievements, it built the Earth Simulator, a vector-parallel system that was #1 on the TOP500 list from 2002 to 2004. At its debut, it had a substantial (nearly 5x) lead over the previous #1. Vector Processing and Parallelism Vector processing is a time-honored system architecture that started the supercomputing market, starting with the CDC STAR-100 system (STrings and ARrays, performing at 100 million floating point operations per second), and then led by the legendary Seymour Cray and his superb team.
    [Show full text]
  • Université Batna 2 – Mostefa Ben Boulaïd Thèse Doctorat En
    République Algérienne Démocratique et Populaire Ministère de l’Enseignement Supérieur et de la Recherche Scientifique Université Batna 2 – Mostefa Ben Boulaïd Faculté de Technologie Département de Génie Industriel Thèse Préparée au sein du laboratoire d’Automatique & Productique Présentée pour l’obtention du diplôme de : Doctorat en Sciences en Génie Industriel Option : Génie Industriel Sous le Thème : An Optimized Approach to Software Security via Malware Analysis Présentée par : OURLIS Lazhar Devant le jury composé de : M. ABDELHAMID Samir MCA Université de Batna 2 Président M. BELLALA Djamel MCA Université de Batna 2 Rapporteur Mme. BOUAME Souhila MCA Université de Batna 2 Examinateur M.DJEFFAL Abdelhamid MCA Université de Biskra Examinateur M.KAHLOUL Laid Prof Université de Biskra Examinateur M. BENMOHAMMED Mohamed Prof Université de Constantine 2 Examinateur Novembre 2020 To my parents and family Contents List of Figures. .I List of Tables. .II Program Listings. III Abbreviations. IV Publications. .V Acknowledgements. .VI Abstract. VII Chapter 1 Introduction ...................................................................................................................... 1 1.1 Motivation for the research ............................................................................................ 3 1.2 Thesis scope .................................................................................................................... 3 1.3 Thesis outline .................................................................................................................
    [Show full text]
  • 10Th Gen Intel® Core™ Processor Families Datasheet, Vol. 1
    10th Generation Intel® Core™ Processor Families Datasheet, Volume 1 of 2 Supporting 10th Generation Intel® Core™ Processor Families, Intel® Pentium® Processors, Intel® Celeron® Processors for U/Y Platforms, formerly known as Ice Lake July 2020 Revision 005 Document Number: 341077-005 Legal Lines and Disclaimers You may not use or facilitate the use of this document in connection with any infringement or other legal analysis concerning Intel products described herein. You agree to grant Intel a non-exclusive, royalty-free license to any patent claim thereafter drafted which includes subject matter disclosed herein. No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document. Intel technologies' features and benefits depend on system configuration and may require enabled hardware, software or service activation. Performance varies depending on system configuration. No computer system can be absolutely secure. Check with your system manufacturer or retailer or learn more at intel.com. Intel technologies may require enabled hardware, specific software, or services activation. Check with your system manufacturer or retailer. The products described may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Intel disclaims all express and implied warranties, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non-infringement, as well as any warranty arising from course of performance, course of dealing, or usage in trade. All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest Intel product specifications and roadmaps Copies of documents which have an order number and are referenced in this document may be obtained by calling 1-800-548- 4725 or visit www.intel.com/design/literature.htm.
    [Show full text]
  • SPARC Assembly Language Reference Manual
    SPARC Assembly Language Reference Manual Part No: E36858 July 2014 Copyright © 1993, 2014, Oracle and/or its affiliates. All rights reserved. This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform, publish, or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is prohibited. The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing. If this is software or related documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, the following notice is applicable: U.S. GOVERNMENT END USERS. Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, delivered to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, shall be subject to license terms and license restrictions applicable to the programs. No other rights are granted to the U.S. Government. This software or hardware is developed for general use in a variety of information management applications.
    [Show full text]
  • SPARC M7™ Supplement to the Oracle SPARC Architecture 2015
    SPARC M7™ Supplement to the Oracle SPARC Architecture 2015 Draft D1.0, 30 Jun 2016 Privilege Levels: Privileged and Nonprivileged Distribution: Public Part No.No: 950-____-00 ReleaseRevision: 1.0, Draft 2002 D1.0, 30 Jun 2016 Oracle Corporation 4150 Network Circle Santa Clara, CA 95054 U.S.A. 650-960-1300 2 SPARC M7 Supplement • Draft D1.0, 30 Jun 2016 Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners. AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. UNIX is a registered trademark licensed through X/Open Company, Ltd. 4 SPARC M7 Supplement • Draft D1.0, 30 Jun 2016 Contents 1 SPARC M7 Basics. .9 1.1 Background . 9 1.2 SPARC M7 Overview . 10 1.3 SPARC M7 Components . 11 1.3.1 SPARC Physical Core . 11 1.3.1.1 Single-threaded and multi-threaded performance . 12 1.3.2 L3 Cache. 12 2Data Formats . 15 3Registers. 17 3.1 Floating-Point State Register (FSR) . .17 3.2 Ancillary State Registers (ASRs) . 17 3.2.1 Tick Register (TICK). 18 3.2.2 Program Counter (PC). 18 3.2.3 Floating-Point Registers State Register (FPRS) . 19 3.2.4 General Status Register (GSR) .
    [Show full text]
  • ASIC DESIGN of the OPENSPARC T1 PROCESSOR CORE By
    ASIC DESIGN OF THE OPENSPARC T1 PROCESSOR CORE By Mohamed Mahmoud Mohamed Farag A Thesis Submitted to the Faculty of Engineering at Cairo University in Partial Fulfillment of the Requirements for the Degree of MASTER OF SCIENCE in ELECTRONICS AND COMMUNICATIONS ENGINEERING FACULTY OF ENGINEERING, CAIRO UNIVERSITY GIZA, EGYPT 2013 I ASIC DESIGN OF THE OPENSPARC T1 PROCESSOR CORE By Mohamed Mahmoud Mohamed Farag A Thesis Submitted to the Faculty of Engineering at Cairo University in Partial Fulfillment of the Requirements for the Degree of MASTER OF SCIENCE in ELECTRONICS AND COMMUNICATIONS ENGINEERING Under the Supervision of Prof. Dr. Serag El-Din Habib Dr. Hossam A. H. Fahmy Professor of Electronics Associate Professor Electronics and Communications Electronics and Communications Department Department Faculty of Engineering, Cairo University Faculty of Engineering, Cairo University FACULTY OF ENGINEERING, CAIRO UNIVERSITY GIZA, EGYPT 2013 II ASIC DESIGN OF THE OPENSPARC T1 PROCESSOR CORE By Mohamed Mahmoud Mohamed Farag A Thesis Submitted to the Faculty of Engineering at Cairo University in Partial Fulfillment of the Requirements for the Degree of MASTER OF SCIENCE in ELECTRONICS AND COMMUNICATIONS ENGINEERING Approved by the Examining Committee ____________________________ Prof. Dr. El Sayed Mostafa Saad, External Examiner ____________________________ Prof. Dr. Ibrahim Mohamed Qamar, Internal Examiner ____________________________ Prof. Dr. Serag El-Din Habib, Thesis Main Advisor ____________________________ Dr. Hossam A. H. Fahmy, Thesis Advisor FACULTY OF ENGINEERING, CAIRO UNIVERSITY GIZA, EGYPT 2013 III Engineer’s Name: Mohamed Mahmoud Mohamed Farag Date of Birth: 29/12/1985 Nationality: Egyptian Insert photo here E-mail: [email protected] Phone: 01068823040 Address: 7 Ramzy Farag Street – Al Haram Registration Date: …./…./…….
    [Show full text]