RISC-V Bitmanip ISA Extensions, Hardware, Tools, Applications Thanks to Our Task Group Members!
Total Page:16
File Type:pdf, Size:1020Kb
Load more
										Recommended publications
									
								- 
												  Pdp11-40.Pdfprocessor handbook digital equipment corporation Copyright© 1972, by Digital Equipment Corporation DEC, PDP, UNIBUS are registered trademarks of Digital Equipment Corporation. ii TABLE OF CONTENTS CHAPTER 1 INTRODUCTION 1·1 1.1 GENERAL ............................................. 1·1 1.2 GENERAL CHARACTERISTICS . 1·2 1.2.1 The UNIBUS ..... 1·2 1.2.2 Central Processor 1·3 1.2.3 Memories ........... 1·5 1.2.4 Floating Point ... 1·5 1.2.5 Memory Management .............................. .. 1·5 1.3 PERIPHERALS/OPTIONS ......................................... 1·5 1.3.1 1/0 Devices .......... .................................. 1·6 1.3.2 Storage Devices ...................................... .. 1·6 1.3.3 Bus Options .............................................. 1·6 1.4 SOFTWARE ..... .... ........................................... ............. 1·6 1.4.1 Paper Tape Software .......................................... 1·7 1.4.2 Disk Operating System Software ........................ 1·7 1.4.3 Higher Level Languages ................................... .. 1·7 1.5 NUMBER SYSTEMS ..................................... 1-7 CHAPTER 2 SYSTEM ARCHITECTURE. 2-1 2.1 SYSTEM DEFINITION .............. 2·1 2.2 UNIBUS ......................................... 2-1 2.2.1 Bidirectional Lines ...... 2-1 2.2.2 Master-Slave Relation .. 2-2 2.2.3 Interlocked Communication 2-2 2.3 CENTRAL PROCESSOR .......... 2-2 2.3.1 General Registers ... 2-3 2.3.2 Processor Status Word ....... 2-4 2.3.3 Stack Limit Register 2-5 2.4 EXTENDED INSTRUCTION SET & FLOATING POINT .. 2-5 2.5 CORE MEMORY . .... 2-6 2.6 AUTOMATIC PRIORITY INTERRUPTS .... 2-7 2.6.1 Using the Interrupts . 2-9 2.6.2 Interrupt Procedure 2-9 2.6.3 Interrupt Servicing ............ .. 2-10 2.7 PROCESSOR TRAPS ............ 2-10 2.7.1 Power Failure ..............
- 
												  Lecture 04 Linear Structures SortAlgorithmics (6EAP) MTAT.03.238 Linear structures, sorting, searching, etc Jaak Vilo 2018 Fall Jaak Vilo 1 Big-Oh notation classes Class Informal Intuition Analogy f(n) ∈ ο ( g(n) ) f is dominated by g Strictly below < f(n) ∈ O( g(n) ) Bounded from above Upper bound ≤ f(n) ∈ Θ( g(n) ) Bounded from “equal to” = above and below f(n) ∈ Ω( g(n) ) Bounded from below Lower bound ≥ f(n) ∈ ω( g(n) ) f dominates g Strictly above > Conclusions • Algorithm complexity deals with the behavior in the long-term – worst case -- typical – average case -- quite hard – best case -- bogus, cheating • In practice, long-term sometimes not necessary – E.g. for sorting 20 elements, you dont need fancy algorithms… Linear, sequential, ordered, list … Memory, disk, tape etc – is an ordered sequentially addressed media. Physical ordered list ~ array • Memory /address/ – Garbage collection • Files (character/byte list/lines in text file,…) • Disk – Disk fragmentation Linear data structures: Arrays • Array • Hashed array tree • Bidirectional map • Heightmap • Bit array • Lookup table • Bit field • Matrix • Bitboard • Parallel array • Bitmap • Sorted array • Circular buffer • Sparse array • Control table • Sparse matrix • Image • Iliffe vector • Dynamic array • Variable-length array • Gap buffer Linear data structures: Lists • Doubly linked list • Array list • Xor linked list • Linked list • Zipper • Self-organizing list • Doubly connected edge • Skip list list • Unrolled linked list • Difference list • VList Lists: Array 0 1 size MAX_SIZE-1 3 6 7 5 2 L = int[MAX_SIZE]
- 
												  Specification of Bit Handling Routines AUTOSAR CP Release 4.3.1Specification of Bit Handling Routines AUTOSAR CP Release 4.3.1 Document Title Specification of Bit Handling Routines Document Owner AUTOSAR Document Responsibility AUTOSAR Document Identification No 399 Document Status Final Part of AUTOSAR Standard Classic Platform Part of Standard Release 4.3.1 Document Change History Date Release Changed by Change Description 2017-12-08 4.3.1 AUTOSAR Addition on mnemonic for boolean Release as “u8”. Management Editorial changes 2016-11-30 4.3.0 AUTOSAR Removal of the requirement Release SWS_Bfx_00204 Management Updation of MISRA violation com- ment format Updation of unspecified value range for BitPn, BitStartPn, BitLn and ShiftCnt Clarifications 1 of 42 Document ID 399: AUTOSAR_SWS_BFXLibrary - AUTOSAR confidential - Specification of Bit Handling Routines AUTOSAR CP Release 4.3.1 Document Change History Date Release Changed by Change Description 2015-07-31 4.2.2 AUTOSAR Updated SWS_Bfx_00017 for the Release return type of Bfx_GetBit function Management from 1 and 0 to TRUE and FALSE Updated chapter 8.1 for the defini- tion of bit addressing and updated the examples of Bfx_SetBit, Bfx_ClrBit, Bfx_GetBit, Bfx_SetBits, Bfx_CopyBit, Bfx_PutBits, Bfx_PutBit Updated SWS_Bfx_00017 for the return type of Bfx_GetBit function from 1 and 0 to TRUE and FALSE without changing the formula Updated SWS_Bfx_00011 and SWS_Bfx_00022 for the review comments provided for the exam- ples In Table 2, replaced Boolean with boolean In SWS_Bfx_00029, in example re- placed BFX_GetBits_u16u8u8_u16 with Bfx_GetBits_u16u8u8_u16 2014-10-31 4.2.1 AUTOSAR Correct usage of const in function Release declarations Management Editoral changes 2014-03-31 4.1.3 AUTOSAR Editoral changes Release Management 2013-10-31 4.1.2 AUTOSAR Improve description of how to map Release functions to C-files Management Improve the definition of error classi- fication Editorial changes 2013-03-15 4.1.1 AUTOSAR Change return value of Test Bit API Administration to boolean.
- 
												  RISC-V Bitmanip Extension Document Version 0.90RISC-V Bitmanip Extension Document Version 0.90 Editor: Clifford Wolf Symbiotic GmbH [email protected] June 10, 2019 Contributors to all versions of the spec in alphabetical order (please contact editors to suggest corrections): Jacob Bachmeyer, Allen Baum, Alex Bradbury, Steven Braeger, Rogier Brussee, Michael Clark, Ken Dockser, Paul Donahue, Dennis Ferguson, Fabian Giesen, John Hauser, Robert Henry, Bruce Hoult, Po-wei Huang, Rex McCrary, Lee Moore, Jiˇr´ıMoravec, Samuel Neves, Markus Oberhumer, Nils Pipenbrinck, Xue Saw, Tommy Thorn, Andrew Waterman, Thomas Wicki, and Clifford Wolf. This document is released under a Creative Commons Attribution 4.0 International License. Contents 1 Introduction 1 1.1 ISA Extension Proposal Design Criteria . .1 1.2 B Extension Adoption Strategy . .2 1.3 Next steps . .2 2 RISC-V Bitmanip Extension 3 2.1 Basic bit manipulation instructions . .4 2.1.1 Count Leading/Trailing Zeros (clz, ctz)....................4 2.1.2 Count Bits Set (pcnt)...............................5 2.1.3 Logic-with-negate (andn, orn, xnor).......................5 2.1.4 Pack two XLEN/2 words in one register (pack).................6 2.1.5 Min/max instructions (min, max, minu, maxu)................7 2.1.6 Single-bit instructions (sbset, sbclr, sbinv, sbext)............8 2.1.7 Shift Ones (Left/Right) (slo, sloi, sro, sroi)...............9 2.2 Bit permutation instructions . 10 2.2.1 Rotate (Left/Right) (rol, ror, rori)..................... 10 2.2.2 Generalized Reverse (grev, grevi)....................... 11 2.2.3 Generalized Shuffleshfl ( , unshfl, shfli, unshfli).............. 14 2.3 Bit Extract/Deposit (bext, bdep)............................ 22 2.4 Carry-less multiply (clmul, clmulh, clmulr)....................
- 
												  KTS-A Memory Management Control User's Guide Digital Equipment Corporation • Maynard, Massachusetts--- KTS-A memory management control user's guide E K- KTOSA-U G-001 digital equipment corporation • maynard, massachusetts 1st Edition , July 1978 Copyright © 1978 by Digital Equipment Corporation The material in this manual is for informational purposes and is subject to change without notice. Digital Equipment Corporation assumes no responsibility for any errors which may appear in this manual. Printed in U.S.A. This document was set on DIGITAL's DECset-8000 com puterized typesetting system. The following are trademarks of Digital Equipment Corporation, Maynard, Massachusetts: DIGITAL D ECsystem-10 MASSBUS DEC DECSYSTEM-20 OMNIBUS POP DIBOL OS/8 DECUS EDUSYSTEM RSTS UNI BUS VAX RSX VMS IAS CONTENTS Page CHAPTER 1 INTRODUCTION 1 . 1 SCOPE OF MANUAL ..................................................................................................................... 1-1 1.2 GENERAL DESCRI PTION ............................................................................................................. 1-1 1 .3 KT8-A SPECIFICATION S.............................................................................................................. 1-3 1.4 RELATED DOCUMENTS ............................................................................................................... 1-4 1 .5 SOFTWARE ..................................................................................................................................... 1-5 1.5.1 Diagnostic .............................................................................................................................
- 
												  Reference Guide for X86-64 CpusREFERENCE GUIDE FOR X86-64 CPUS Version 2019 TABLE OF CONTENTS Preface............................................................................................................. xi Audience Description.......................................................................................... xi Compatibility and Conformance to Standards............................................................ xi Organization....................................................................................................xii Hardware and Software Constraints...................................................................... xiii Conventions....................................................................................................xiii Terms............................................................................................................xiv Related Publications.......................................................................................... xv Chapter 1. Fortran, C, and C++ Data Types................................................................ 1 1.1. Fortran Data Types....................................................................................... 1 1.1.1. Fortran Scalars.......................................................................................1 1.1.2. FORTRAN real(2).....................................................................................3 1.1.3. FORTRAN 77 Aggregate Data Type Extensions.................................................. 3 1.1.4. Fortran 90 Aggregate Data Types (Derived
- 
												  The DEC PDP-8 VectorsThe ISP of the PDP-8 Pc is about the most trivial in the book. Chapter 5 — It has only a few data operators, namely, <—,+, (negate), -j, It on A, / 2, X 2, (optional) X, /, and normalize. operates words, The DEC PDP-8 vectors. there are microcoded integers, and boolean However, instructions, which allow compound instructions to be formed in Introduction 1 a single instruction. is the levels dis- The PDP-8 is a single-address, 12-bit-word computer of the second The computer straightforward and illustrates 1. look at it from the down." generation. It is designed for task environments with minimum cussed in Chap. We can easily "top arithmetic computing and small Mp requirements. For example, The C in PMS notation is it can be used to control laboratory devices, such as gas chromoto- 12 graphs or sampling oscilloscopes. Together with special T's, it is C('PDP-8; technology:transistors; b/w; programmed to be a laboratory instrument, such as a pulse height descendants:'PDP-8/S, 'PDP-8/I, 'PDP-8/L; antecedents: 'PDP-5; analyzer or a spectrum analyzer. These applications are typical Mp(core; #0:7; 4096 w; tc:1.5 /is/w); of the laboratory and process control requirements for which the ~ 4 it Pc(Mps(2 w); machine was designed. As another example, can serve as a instruction length:l|2 w message concentrator by controlling telephone lines to which : 1 address/instruction ; occasion- typewriters and Teletypes are attached. The computer operations on data/od:(<— , +, —\, A, —(negate), X 2, stands alone as a small-scale general-purpose computer.
- 
												  COSC 6385 Computer Architecture Instruction Set ArchitecturesCOSC 6385 Computer Architecture Instruction Set Architectures Edgar Gabriel Spring 2011 Edgar Gabriel Instruction Set Architecture (ISA) • Definition on Wikipedia: “Part of the Computer Architecture related to programming” • Defines – set of operations, – instruction format, – hardware supported data types, – named storage, – addressing modes, – sequencing • Includes a specification of the set of opcodes (machine language) - native commands implemented by a particular CPU design COSC 6385 – Computer Architecture Edgar Gabriel Instruction Set Architecture (II) • ISA to be distinguished from the micro-architecture – set of processor design techniques used to implement an ISA • Example: Intel Pentium and AMD Athlon support a nearly identical ISA, but have completely different micro-architecture COSC 6385 – Computer Architecture Edgar Gabriel ISA Specification • Relevant features for distinguishing ISA’s – Internal storage – Memory addressing – Type and size of operands – Operations – Instructions for Flow Control – Encoding of the IS COSC 6385 – Computer Architecture Edgar Gabriel Internal storage • Stack architecture: operands are implicitly on the top of the stack • Accumulator architecture: one operand is implicitly the accumulator • General purpose register architectures: operands have to be made available by explicit load operations – Dominant form in today’s systems COSC 6385 – Computer Architecture Edgar Gabriel Internal storage (II) • Example: C= A+B Stack: Accumulator: Load-Store: Push A Load A Load R1,A Push B Add B Load R2,B Add
- 
												  System V Application Binary Interface AMD64 Architecture Processor Supplement Draft Version 0.98System V Application Binary Interface AMD64 Architecture Processor Supplement Draft Version 0.98 Edited by Michael Matz1, Jan Hubickaˇ 2, Andreas Jaeger3, Mark Mitchell4 September 27, 2006 [email protected] [email protected] [email protected] [email protected] AMD64 ABI Draft 0.98 – September 27, 2006 – 9:24 Contents 1 Introduction 8 2 Software Installation 9 3 Low Level System Information 10 3.1 Machine Interface . 10 3.1.1 Processor Architecture . 10 3.1.2 Data Representation . 10 3.2 Function Calling Sequence . 13 3.2.1 Registers and the Stack Frame . 14 3.2.2 The Stack Frame . 14 3.2.3 Parameter Passing . 15 3.3 Operating System Interface . 22 3.3.1 Exception Interface . 22 3.3.2 Virtual Address Space . 22 3.3.3 Page Size . 24 3.3.4 Virtual Address Assignments . 24 3.4 Process Initialization . 25 3.4.1 Initial Stack and Register State . 25 3.4.2 Thread State . 29 3.4.3 Auxiliary Vector . 29 3.5 Coding Examples . 31 3.5.1 Architectural Constraints . 32 3.5.2 Conventions . 34 3.5.3 Position-Independent Function Prologue . 35 3.5.4 Data Objects . 36 3.5.5 Function Calls . 44 3.5.6 Branching . 46 1 AMD64 ABI Draft 0.98 – September 27, 2006 – 9:24 3.5.7 Variable Argument Lists . 49 3.6 DWARF Definition . 54 3.6.1 DWARF Release Number . 54 3.6.2 DWARF Register Number Mapping . 54 3.7 Stack Unwind Algorithm . 54 4 Object Files 58 4.1 ELF Header . 58 4.1.1 Machine Information .
- 
												  Accessing Tms320c54x Memory-Mapped Registers in C-C54xregs.HTMS320 DSP DESIGNER’S NOTEBOOK Accessing TMS320C54x Memory-Mapped Registers in C– C54XREGS.H APPLICATION BRIEF: SPRA260 Leor Brenman Digital Signal Processing Products Semiconductor Group Texas Instruments May 1994 IMPORTANT NOTICE Texas Instruments (TI) reserves the right to make changes to its products or to discontinue any semiconductor product or service without notice, and advises its customers to obtain the latest version of relevant information to verify, before placing orders, that the information being relied on is current. TI warrants performance of its semiconductor products and related software to the specifications applicable at the time of sale in accordance with TI’s standard warranty. Testing and other quality control techniques are utilized to the extent TI deems necessary to support this warranty. Specific testing of all parameters of each device is not necessarily performed, except those mandated by government requirements. Certain application using semiconductor products may involve potential risks of death, personal injury, or severe property or environmental damage (“Critical Applications”). TI SEMICONDUCTOR PRODUCTS ARE NOT DESIGNED, INTENDED, AUTHORIZED, OR WARRANTED TO BE SUITABLE FOR USE IN LIFE-SUPPORT APPLICATIONS, DEVICES OR SYSTEMS OR OTHER CRITICAL APPLICATIONS. Inclusion of TI products in such applications is understood to be fully at the risk of the customer. Use of TI products in such applications requires the written approval of an appropriate TI officer. Questions concerning potential risk applications should be directed to TI through a local SC sales office. In order to minimize risks associated with the customer’s applications, adequate design and operating safeguards should be provided by the customer to minimize inherent or procedural hazards.
- 
												  MINICOMPUTER CONCEPTS by BENEDICTO CACHO Bachelor OfMINICOMPUTER CONCEPTS By BENEDICTO CACHO II Bachelor of Science Southeastern Oklahoma State University Durant, Oklahoma 1973 Submitted to the Faculty of the Graduate College of the Oklahoma State University in partial fulfillment of the requirements for the Degre~ of MASTER OF SCIENCE July, 1976 . -<~ .i;:·~.~~·:-.~?. '; . .- ·~"' . ~ ' .. .• . ~ . .. ' . ,. , .. J:. MINICOMPUTER CONCEPTS Thesis Approved: fl. F. w. m wd/ I"'' ') 2 I"' e.: 9 tl d I a i. i PREFACE This thesis presents a study of concepts used in the design of minicomputers currently on the market. The material is drawn from research on sixteen minicomputer systems. I would like to thank my major adviser, Dr. Donald D. Fisher, for his advice, guidance, and encouragement, and other committee members, Dr. George E. Hedrick and Dr. James Van Doren, for their suggestions and assistance. Thanks are also due to my typist, Sherry Rodgers, for putting up with my illegible rough draft and the excessive number of figures, and to Dr. Bill Grimes and Dr. Doyle Bostic for prodding me on. Finally, I would like to thank members of my family for seeing me through it a 11 . iii TABLE OF CONTENTS Chapter Page I. INTRODUCTION 1 Objective ....... 1 History of Minicomputers 2 II. ELEMENTS OF MINICOMPUTER DESIGN 6 Introduction 6 The Processor . 8 Organization 8 Operations . 12 The Memory . 20 Input/Output Elements . 21 Device Controllers .. 21 I/0 Operations . 22 III. GENERAL SYSTEM DESIGNS ... 25 Considerations ..... 25 General Processor Designs . 25 Fixed Purpose Register Design 26 General Purpose Register Design 29 Multi-accumulator Design 31 Microprogramm1ng 34 Stack Structures 37 Bus Structures . 39 Typical System Options 41 IV.
- 
												  6.172 Performance Engineering of Software Systems, Lecture 3: Bit Hacks6.172 Performance !"##$* Engineering %&'&( of Software Systems "#)*+)$#)*+,*-./01* LECTURE 3 ! Bit Hacks Julian Shun 1 © 2008–2018 by the MIT 6.172 Lecturers Binary Representation Let x = ⟨xw–1xw–2…xO⟩ be a w-bit computer word. The x unsigned integer value stored in is Ob Y The prefix M designates a Z ZM Y Mį Boolean constant. For example, the 8-bit word Ob1OO1O11O represents ం the unsigned value 15O = 2 + 4 + 16 + 128. The signed integer (two’s complement) value stored in x is sign bit Y M Y Z ZM ZY Y Mį   For example, theৃ 8-ంbit wordৄ Ob1OO1O11O 1O6 2 4 16 128 represents the signed value – = + + – . 2 © 2008–2018 by the MIT 6.172 Lecturers Two’s Complement We have ObOO…O = O. What is the value of x = Ob11...1? Y M Y Z ZM ZY Mį  Y  ৃం M ৄY Mį  Y Y ৃం ৄ  Y    3 © 2008–2018 by the MIT 6.172 Lecturers Complementary Relationship Important identity Since we have x + ~x = -1, it follows that -x = ~x + 1. Example x = ObO11O11OOO ~x = Ob1OO1OO111 –x = Ob1OO1O1OOO 4 © 2008–2018 by the MIT 6.172 Lecturers Binary and Hexadecimal Decimal Hex Binary Decimal Hex Binary !! !!!! "" #!!! ## !!!# $ $ #!!# %% !!#! #! & #!#! '' !!## ## ( #!## )) !#!! #% * ##!! The prefix !1 + !#!# #' , ##!# designates a - !##! #) . ###! hex constant. / !### #+ 0 #### To translat from hex to binary, translate each hex digit to its binary equivalent, and concatenate the bits. Example: 0xDEC1DE2CODE4F00D is įįįįįįįįįįįįįįįįįįįįįįįįįįįįįįįį &'%&'%į&'(įį& 5 © 2008।।।।।।।।।।।।।।।।–2018 by the MIT 6.172 Lecturers C Bitwise Operators Operator Description & AND | OR ^ XOR (exclusive OR) ~ NOT (one’s complement) << shift left >> shift right Examples (8-bit word) A = Ob1O11OO11 B = ObO11O1OO1 A&B = ObOO1OOOO1 ~A = ObO1OO11OO A|B = Ob11111O11 A >> 3 = ObOOO1O11O A^B = Ob11O11O1O A << 2 = Ob11OO11OO 6 © 2008–2018 by the MIT 6.172 Lecturers Set the kth Bit Problem Set !th bit in a word "%to #.