MPC750 RISC Processor by Freescale

Freescale Semiconductor, Inc. MPC750/D (Motorola Order Number) 8/97 . C N , I R TO C U . D . N . O c IC M n E I S E , Advance Information L r A C o MPC750 RISC MicroprocessorES t E R c Technical SummaryF u Y B d This document gives an overviewED of the MPC750 and MPC740 microprocessor features, n V including a block diagram Ishowing the major functional components. It summarizes how o H ® the MPC750 and MPC740C processors implement the PowerPC architecture specification c R i A and describes processor-specific features not defined by the PowerPC architecture. m This document has two parts: e S • Part 1, “MPC750 Microprocessor Overview,” provides an overview of MPC750 features, including a block diagram showing the major functional components. e l • Part 2, “MPC750 Microprocessor: Implementation,” describes the PowerPC a architecture in general and provides specific details about the MPC750 as a low- c power, 32-bit implementation of the PowerPC architecture. s e Unless otherwise noted, references in this document to the MPC750 apply to the MPC740. e r The MPC750 differs from the MPC740 primarily in its extensive L2 cache support. F To locate any published errata or updates for this document, refer to the website at http://www.mot.com/powerpc/. MPC750 Technical Summary MPC750 Technical The PowerPC name is a registered trademark and the PowerPC logotype is a trademark of International Business Machines Corporation used by Motorola under license from International Business Machines Corporation. This document contains information on a new product under development by Motorola. Motorola reserves the right to change or discontinue this product without notice. © Motorola Inc. 1997. All rights reserved. For More Information On This Product, Go to: www.freescale.com Freescale Semiconductor, Inc. Part 1 MPC750 Microprocessor Overview This section describes the features and general operation of the MPC750 and provides a block diagram showing major functional units. The MPC750 is an implementation of the PowerPC microprocessor family of reduced instruction set computer (RISC) microprocessors. The MPC750 implements the 32-bit portion of the PowerPC architecture, which provides 32-bit effective addresses, integer data types of 8, 16, and 32 bits, and floating-point data types of 32 and 64 bits. The MPC750 is a superscalar processor that can complete two instructions simultaneously. It incorporates the following six execution units: • Floating-point unit (FPU) • Branch processing unit (BPU) . C • System register unit (SRU) N , I • Load/store unit (LSU) R • Two integer units (IUs): IU1 executes all integer instructions. IU2 executesTO all integer instructions C except multiply and divide instructions. U . D . N . The ability to execute several instructions in parallel and the use of simpleO instructions with rapid execution c IC times yield high efficiency and throughput for MPC750-based Msystems. Most integer instructions execute in n E I S one clock cycle. The FPU is pipelined, the tasks it performs are broken into subtasks, implemented as three E , successive stages. Typically, a floating-point instructionL can occupy only one of the three stages at a time, r A freeing the previous stage to work on the nextC floating-point instruction. Thus, three single-precision o ES t floating-point instructions can be in the FPUE execute stage at a time. Double-precision add instructions have R c a three-cycle latency; double-precision multiplyF and multiply-add instructions have a four-cycle latency. u Y B d Figure 1 shows the parallel organization of the execution units (shaded in the diagram). The instruction unit ED n fetches, dispatches, and predicts branch instructions. Note that this is a conceptual model that shows basic V I o features rather than attemptingH to show how features are implemented physically. C c R i A The MPC750 has independent on-chip, 32-Kbyte, eight-way set-associative, physically addressed caches m for instructions and data and independent instruction and data memory management units (MMUs). Each e MMU has a 128-entry, two-way set-associative translation lookaside buffer (DTLB and ITLB) that saves S recently used page address translations. Block address translation is done through the four-entry instruction and data block address translation (IBAT and DBAT) arrays, defined by the PowerPC architecture. During e l block translation, effective addresses are compared simultaneously with all four BAT entries. a c The L2 cache is implemented with an on-chip, two-way, set-associative tag memory, and with external, s synchronous SRAMs for data storage. The external SRAMs are accessed through a dedicated L2 cache port e that supports a single bank of up to 1 Mbyte of synchronous SRAMs. The L2 cache interface is not e implemented in the MPC740. r F The MPC750 has a 32-bit address bus and a 64-bit data bus. Multiple devices compete for system resources through a central external arbiter. The MPC750’s three-state cache-coherency protocol (MEI) supports the exclusive, modified, and invalid states, a compatible subset of the MESI (modified/exclusive/shared/invalid) four-state protocol, and it operates coherently in systems with four-state caches. The MPC750 supports single-beat and burst data transfers for memory accesses and memory-mapped I/O operations. The MPC750 has four software-controllable power-saving modes. Three static modes, doze, nap, and sleep, progressively reduce power dissipation. When functional units are idle, a dynamic power management mode causes those units to enter a low-power mode automatically without affecting operational performance, software execution, or external hardware. The MPC750 also provides a thermal assist unit (TAU) and a way to reduce the instruction fetch rate for limiting power dissipation. The MPC750 uses an advanced CMOS process technology and is fully compatible with TTL devices. 2 MPC750 RISC Microprocessor Technical Summary For More Information On This Product, Go to: www.freescale.com . C Freescale SemiconducNtor, Inc... , I R TO C U MPC750 RISC Microprocessor Technical Summary Technical MPC750 RISC Microprocessor D N O IC M E Instruction Unit 128-Bit S (4 Instructions) Branch Processing EFetcher L Unit A C Instruction MMU Additional Features S BTIC CTR E LR • Time Base Counter/Decrementer E Instruction Queue 64 Entry SRs • Clock Multiplier R (6 Word) (Shadow) IBAT F 32-Kbyte • JTAG/COP Interface BHT Tags Y Array I Cache • Thermal/Power ManagementB ITLB Figure 1.MPC750Microprocessor BlockDiagram • Performance MonitorD E Freescale Semiconductor,Inc. V F I H o 2 Instructions C 64-Bit r Dispatch Unit R M A (2 Instructions) o r G e o I n t f o Reservation Station o Reservation Station Reservation Station Reservation Station Reservation Station : r GPR File (2 Entry) FPR File w m w a Rename Buffers Rename Buffers t w (6) (6) i o . System Register 32-Bit Load/Store Unit 64-Bit 64-Bit Floating-Point f n Integer Unit 1 Integer Unit 2 r Unit Unit e O I + e n + x ÷ s (EA Calculation) ÷ c T + x + a h CR Store Queue FPSCR l i e s . 32-Bit 32-Bit c P o r o m d u PA EA c t 60x Bus Interface Unit , Completion Unit Data MMU 64-Bit Instruction Fetch Queue L2 Bus Interface Reorder Buffer SRs Unit (6 Entry) L1 Castout Queue (Original) DBAT L2 Castout Queue Array 32-Kbyte Tags D Cache Data Load Queue L2 Controller DTLB L2CR L2 Tags 32-Bit Address Bus 32-/64-Bit Data Bus Not in the MPC740 17-Bit L2 Address Bus 64-Bit L2 Data Bus 3 Freescale Semiconductor, Inc. 1.1 MPC750 Microprocessor Features This section lists features of the MPC750. The interrelationship of these features is shown in Figure 1. 1.1.1 Overview of the MPC750 Microprocessor Features Major features of the MPC750 are as follows: • High-performance, superscalar microprocessor — As many as four instructions can be fetched from the instruction cache per clock cycle — As many as two instructions can be dispatched per clock . C — As many as six instructions can execute per clock (including two integer instructions)N , I — Single-clock-cycle execution for most instructions R TO C • Six independent execution units and two register files U . D . N . — BPU featuring both static and dynamic branch predictionO c IC – 64-entry (16-set, four-way set-associative) branchM target instruction cache (BTIC), a cache n of branch instructions that have been encounteredE in branch/loop code sequences. If a target I S instruction is in the BTIC, it is fetched intoE the instruction queue a cycle sooner than it can , L r A be made available from the instructionC cache. Typically, if a fetch access hits the BTIC, it o provides the first two instructionsES in the target stream. t E R c – 512-entry branch historyF table (BHT) with two bits per entry for four levels of prediction— u not-taken, strongly not-taken,Y taken, strongly taken B d – Branch instructionsED that do not update the count register (CTR) or link register (LR) are n V removed fromI the instruction stream. o H C c — Two integerR units (IUs) that share thirty-two GPRs for integer operands i A – IU1 can execute any integer instruction. m e – IU2 can execute all integer instructions except multiply and divide instructions (multiply, S divide, shift, rotate, arithmetic, and logical instructions). Most instructions that execute in the IU2 take one cycle to execute. The IU2 has a single-entry reservation station. e l — Three-stage FPU a c – Fully IEEE 754-1985-compliant FPU for both single- and double-precision operations s – Supports non-IEEE mode for time-critical operations e – Hardware support for denormalized numbers e r – Single-entry reservation station F – Thirty-two 64-bit FPRs for single- or double-precision operands — Two-stage LSU – Two-entry reservation station – Single-cycle, pipelined cache access – Dedicated adder performs EA calculations – Performs alignment and precision conversion for floating-point data – Performs alignment and sign extension for integer data – Three-entry store queue – Supports both big- and little-endian modes 4 MPC750 RISC Microprocessor Technical Summary For More Information On This Product, Go to: www.freescale.com Freescale Semiconductor, Inc.

MPC750 RISC Processor by Freescale

Computer Organization

Review of Computer Architecture

V850ES/SA2, V850ES/SA3 32-Bit Single-Chip Microcontrollers

Testing and Validation of a Prototype Gpgpu Design for Fpgas Murtaza Merchant University of Massachusetts Amherst

The Microarchitecture of the Pentium 4 Processor

Implementation, Verification and Validation of an Openrisc-1200

Python Console Target Device 78K0 Microcontroller RL78 Family 78K0R Microcontroller V850 Family RX Family RH850 Family

Emcissonfrom L0 \ 001Mm: R" W \14

Value Prediction Design for High-Frequency Microprocessors 1

Computer Organization Structure of a Computer Registers Register

Section 3 Central Processing Unit

Ec603e™ Embedded RISC Microprocessor Technical Summary