EE382A Lecture 16A: Multi-Core Design Tradeoffs (Cont.)

Total Page:16

File Type:pdf, Size:1020Kb

EE382A Lecture 16A: Multi-Core Design Tradeoffs (Cont.) EE382A Lecture 16A: Multi-Core Design Tradeoffs (cont.) Department of Electrical Engineering Stanford University http://eeclass.stanford.edu/ee382a EE382A – Autumn 2009 Lecture 16A - 1 John P Shen Performance Iron Law and Multi-core Scalability • Law #1 - CPU (Latency) Performance • Law #2 – MP (Throughput) Performance • Law #3 - Multi-core Performance Scalability Architecture – A Performance Scalability Model Performance • Law #4 – Algorithm and Performance Algorithm • Law #5 – Power and Performance Power EE382A – Autumn 2009 Lecture 16A - 2 John P Shen Law #1 – CPU (Latency) Performance [John DeVale & Bryan Black, 2005] Deeper pipelining Wider pipeline Increased CPI penalty Cycle Time Time Cycle Increased Increased EE382A – Autumn 2009 Lecture 16A - 3 John P Shen Latency vs. Throughput Performance • Reduce Latency of Application – Uni-processor, Single Program – Target Single-Thread Performance – Examples: SPEC, PC and Workstations • Increase Throughput of System – Multi-processors, Many Threads/Tasks – Target Multi-threaded/Multi-tasking Throughput – Example: Database Transaction Processing EE382A – Autumn 2009 Lecture 16A - 4 John P Shen Law #2 – MP (Throughput) Performance • Multi-Core Performance: • Can Improve PerfMC by: – Increasing: n (no. of CPUs or cores) – Increasing: Frequency (CPU clock frequency) – Decreasing: PL (dynamic instruction count) – Decreasing: CPI (cycles/instruction) EE382A – Autumn 2009 Lecture 16A - 5 John P Shen Law #3 – Multi-Core Performance Scalability • Multi-Core Speedup: • A Rigorous Scalability Model: EE382A – Autumn 2009 Lecture 16A - 6 John P Shen Scalability Degradation [Carole Dulong (x+y)=0.10 et al., 2005] 1p-16p 1-(x R^2 (x+y)=0.20 scaling +y) (x+y)=0.30 SEMPHY 0.993 0.999 PLSA 0.963 0.999 Rsearch 0.931 0.997 SVM-RFE 0.786 0.970 SNPs 0.685 0.967 GeneNet 0.642 0.983 EE382A – Autumn 2009 Lecture 16A - 7 John P Shen Path-Length Breakdown EE382A – Autumn 2009 Lecture 16A - 8 John P Shen CPI Breakdown EE382A – Autumn 2009 Lecture 16A - 9 John P Shen Scalability Headroom ? Reduce PL(n) Reduce CPI(n) (x+y)=0.35 EE382A – Autumn 2009 Lecture 16A - 10 John P Shen Conspiring Forces Against MC Scaling Three Forms of Scalability Impedance • Architecture – Increase of Path-Length Undermines Scalability – Increase of CPI Undermines Scalability • Algorithm – Limitation of Language and Algorithm – Tyranny of Amdahl’s Law (sequential bottleneck) • Power/Thermal – Increased Complexity and Inefficiency of Design – Super-linear Power Scaling Relative to Performance EE382A – Autumn 2009 Lecture 16A - 11 John P Shen Law #4 – Algorithm and Performance (Amdahl’s Law & Gustafson’s Law) f = sequential % Execution Time Execution Time Execution Time Execution Time f Parallelism (n) f* Parallelism (n) EE382A – Autumn 2009 Lecture 16A - 12 John P Shen Two Distinct & Correlated Dimensions of Performance Scalability Impedance Architecture Scalability: Algorithm Scalability: (Amdahl’s Law) EE382A – Autumn 2009 Lecture 16A - 13 John P Shen Combined Effect on Actual Speedup (Algorithm and Architecture Scalability) 10X SU 20X SU n > 13 n > 33 EE382A – Autumn 2009 Lecture 16A - 14 John P Shen Impact of Single-thread Performance on Multi-core Performance Scalability 10X SU 20X SU n > 17 n > 48 EE382A – Autumn 2009 Lecture 16A - 15 John P Shen Impact of Single-thread Performance on Multi-core Performance Scalability 10X SU 20X SU n > 14 n > 36 EE382A – Autumn 2009 Lecture 16A - 16 John P Shen Law #5 – Power and Performance EE382A – Autumn 2009 Lecture 16A - 17 John P Shen Power and Performance Landscape Pentium Pentium EE Pentium M Power (W) 120 100 80 100 300 500 700 1100 900 1300 1500 1700 60 1900 1 40 0.8 0.6 20 Spec2K/MHz 0.4 0 0.2 0 500 1000 1500 2000 2500 3000 0 3500 4000 Frequency (Hz) [John DeVale & Bryan Black, 2005] EE382A – Autumn 2009 Lecture 16A - 18 John P Shen Power and Throughput Performance [Ed Grochowski, 2005] 30 Pentium 4 (Psc) 25 CPU EPI Pentium 4 (Wmt) I486 7 nj 20 (1.74) powerpower = =perf perf (1.74) P5 10 nj 15 Scalar/Latency Throughput P6 17 nj Performance Performance 10 P4P-wmt 27 nj Relative Power Pentium Pro Pentium M P4P-psc 29 nj 5 Pentium (1.0) i486 power = perf ? Pentium M 9 nj 0 0 2 4 6 8 Low EPI Relative Performance EE382A – Autumn 2009 Lecture 16A - 19 John P Shen • The issue is not small vs. big cores, nor in-order vs. out-of-order cores. The key metric is EPI. • The ideal core: ultra-low EPI with best possible single-thread or single-core performance and CPU EPI SU highly efficient power/performance scaling. i486 7 nj 1 P5 10 nj 2 P6 17 nj 3.5 P4P-wmt 27 nj 6 ? P4P-psc 29 nj 6.5 P4P: 27 nj Pentium M 9 nj 5.5 ? Neo-Core 5 nj 6.5 Pentium 4 (Psc) Pentium 4 (Wmt) P5: 10 nj P-M: 9 nj? Pentium Pro Pentium M Neo: 5 nj i486 Pentium Neo-Core ? EE382A – Autumn 2009 Lecture 16A - 20 John P Shen CPU EPI SU i486 7 nj 1 P5 10 nj 2 P6 17 nj 3.5 • The MC scaling goal is not maximizing the number of cores but achieving the maximum throughput P4P-wmt 27 nj 6 within fixed power envelop (using fewest cores). P4P-psc 29 nj 6.5 • The key issue is not the power scaling of replicated Pentium M 9 nj 5.5 cores, but the un-core power scaling that may push total power scaling towards the square law again. Neo core? 5 nj 6.5 EPI= 29nj • Assume a large- scale CMP with potentially many cores. EPI= • Replication of 10nj cores results in Pentium 4 (Psc) proportional Pentium 4 (Wmt) EPI= increases in both Pentium M 9nj Pentium Pro EPI= throughput i486 Pentium 5nj performance and power (hopefully). EE382A – Autumn 2009 Lecture 16 - 21 John P Shen Holy Grail: <1 nj building–block core and near-linear un-core power scaling EPI: CPU Cores Prog. Accelerators Fixed Function Units 10nj 1nj 0.1nj 0.01nj 100W Power Envelope NP/DSP/GPU EPI IXP2800 1 nj 20x Performance Increase TMS320C6713 0.7 nj GeF7800GTX 0.6 nj EE382A – Autumn 2009 Lecture 16A - 22 John P Shen Power scaling challenges: • Need low EPI cores • Control un-core scaling ? ? Speedup scaling challenges: • Algorithm • Sequential % • Architecture • PL scaling • CPI scaling EE382A – Autumn 2009 Lecture 16A - 23 John P Shen Quo Vadis? • MC Scalability Strategies: – Algorithm – Languages & Specialized Parallelism – Architecture – CPI and Path Length Reduction – Power/Thermal – EPI Reduction & Scalable Un-core • Power/Thermal is the most critical MC scalability wall • Research Challenges: – Sequential % mitigation for compelling workloads – Ultra-low EPI core with great ST performance – Un-core fabric with near-linear power scaling • Un-core scaling is the new power goblin EE382A – Autumn 2009 Lecture 16A - 24 John P Shen .
Recommended publications
  • The Microarchitecture of the Pentium 4 Processor
    The Microarchitecture of the Pentium 4 Processor Glenn Hinton, Desktop Platforms Group, Intel Corp. Dave Sager, Desktop Platforms Group, Intel Corp. Mike Upton, Desktop Platforms Group, Intel Corp. Darrell Boggs, Desktop Platforms Group, Intel Corp. Doug Carmean, Desktop Platforms Group, Intel Corp. Alan Kyker, Desktop Platforms Group, Intel Corp. Patrice Roussel, Desktop Platforms Group, Intel Corp. Index words: Pentium® 4 processor, NetBurst™ microarchitecture, Trace Cache, double-pumped ALU, deep pipelining provides an in-depth examination of the features and ABSTRACT functions of the Intel NetBurst microarchitecture. This paper describes the Intel® NetBurst™ ® The Pentium 4 processor is designed to deliver microarchitecture of Intel’s new flagship Pentium 4 performance across applications where end users can truly processor. This microarchitecture is the basis of a new appreciate and experience its performance. For example, family of processors from Intel starting with the Pentium it allows a much better user experience in areas such as 4 processor. The Pentium 4 processor provides a Internet audio and streaming video, image processing, substantial performance gain for many key application video content creation, speech recognition, 3D areas where the end user can truly appreciate the applications and games, multi-media, and multi-tasking difference. user environments. The Pentium 4 processor enables real- In this paper we describe the main features and functions time MPEG2 video encoding and near real-time MPEG4 of the NetBurst microarchitecture. We present the front- encoding, allowing efficient video editing and video end of the machine, including its new form of instruction conferencing. It delivers world-class performance on 3D cache called the Execution Trace Cache.
    [Show full text]
  • Multiprocessing Contents
    Multiprocessing Contents 1 Multiprocessing 1 1.1 Pre-history .............................................. 1 1.2 Key topics ............................................... 1 1.2.1 Processor symmetry ...................................... 1 1.2.2 Instruction and data streams ................................. 1 1.2.3 Processor coupling ...................................... 2 1.2.4 Multiprocessor Communication Architecture ......................... 2 1.3 Flynn’s taxonomy ........................................... 2 1.3.1 SISD multiprocessing ..................................... 2 1.3.2 SIMD multiprocessing .................................... 2 1.3.3 MISD multiprocessing .................................... 3 1.3.4 MIMD multiprocessing .................................... 3 1.4 See also ................................................ 3 1.5 References ............................................... 3 2 Computer multitasking 5 2.1 Multiprogramming .......................................... 5 2.2 Cooperative multitasking ....................................... 6 2.3 Preemptive multitasking ....................................... 6 2.4 Real time ............................................... 7 2.5 Multithreading ............................................ 7 2.6 Memory protection .......................................... 7 2.7 Memory swapping .......................................... 7 2.8 Programming ............................................. 7 2.9 See also ................................................ 8 2.10 References .............................................
    [Show full text]
  • The Intel X86 Microarchitectures Map Version 2.0
    The Intel x86 Microarchitectures Map Version 2.0 P6 (1995, 0.50 to 0.35 μm) 8086 (1978, 3 µm) 80386 (1985, 1.5 to 1 µm) P5 (1993, 0.80 to 0.35 μm) NetBurst (2000 , 180 to 130 nm) Skylake (2015, 14 nm) Alternative Names: i686 Series: Alternative Names: iAPX 386, 386, i386 Alternative Names: Pentium, 80586, 586, i586 Alternative Names: Pentium 4, Pentium IV, P4 Alternative Names: SKL (Desktop and Mobile), SKX (Server) Series: Pentium Pro (used in desktops and servers) • 16-bit data bus: 8086 (iAPX Series: Series: Series: Series: • Variant: Klamath (1997, 0.35 μm) 86) • Desktop/Server: i386DX Desktop/Server: P5, P54C • Desktop: Willamette (180 nm) • Desktop: Desktop 6th Generation Core i5 (Skylake-S and Skylake-H) • Alternative Names: Pentium II, PII • 8-bit data bus: 8088 (iAPX • Desktop lower-performance: i386SX Desktop/Server higher-performance: P54CQS, P54CS • Desktop higher-performance: Northwood Pentium 4 (130 nm), Northwood B Pentium 4 HT (130 nm), • Desktop higher-performance: Desktop 6th Generation Core i7 (Skylake-S and Skylake-H), Desktop 7th Generation Core i7 X (Skylake-X), • Series: Klamath (used in desktops) 88) • Mobile: i386SL, 80376, i386EX, Mobile: P54C, P54LM Northwood C Pentium 4 HT (130 nm), Gallatin (Pentium 4 Extreme Edition 130 nm) Desktop 7th Generation Core i9 X (Skylake-X), Desktop 9th Generation Core i7 X (Skylake-X), Desktop 9th Generation Core i9 X (Skylake-X) • Variant: Deschutes (1998, 0.25 to 0.18 μm) i386CXSA, i386SXSA, i386CXSB Compatibility: Pentium OverDrive • Desktop lower-performance: Willamette-128
    [Show full text]
  • A Performance Analysis Tool for Intel SGX Enclaves
    sgx-perf: A Performance Analysis Tool for Intel SGX Enclaves Nico Weichbrodt Pierre-Louis Aublin Rüdiger Kapitza IBR, TU Braunschweig LSDS, Imperial College London IBR, TU Braunschweig Germany United Kingdom Germany [email protected] [email protected] [email protected] ABSTRACT the provider or need to refrain from offloading their workloads Novel trusted execution technologies such as Intel’s Software Guard to the cloud. With the advent of Intel’s Software Guard Exten- Extensions (SGX) are considered a cure to many security risks in sions (SGX)[14, 28], the situation is about to change as this novel clouds. This is achieved by offering trusted execution contexts, so trusted execution technology enables confidentiality and integrity called enclaves, that enable confidentiality and integrity protection protection of code and data – even from privileged software and of code and data even from privileged software and physical attacks. physical attacks. Accordingly, researchers from academia and in- To utilise this new abstraction, Intel offers a dedicated Software dustry alike recently published research works in rapid succession Development Kit (SDK). While it is already used to build numerous to secure applications in clouds [2, 5, 33], enable secure network- applications, understanding the performance implications of SGX ing [9, 11, 34, 39] and fortify local applications [22, 23, 35]. and the offered programming support is still in its infancy. This Core to all these works is the use of SGX provided enclaves, inevitably leads to time-consuming trial-and-error testing and poses which build small, isolated application compartments designed to the risk of poor performance.
    [Show full text]
  • Intel PSU Cage Replacement Process Support Guide
    Intel® PSU Cage Replacement Process Support Guide A Guide for Technically Qualified Assemblers of Intel® 2U ATX Products Document No.: PSU-01 Revision No.: 002 Disclaimer Information in this document is provided in connection with Intel® products. No license, express or implied, by estoppel or otherwise, to any intellectual property rights is granted by this document. Except as provided in Intel's Terms and Conditions of Sale for such products, Intel assumes no liability whatsoever, and Intel disclaims any express or implied warranty, relating to sale and/or use of Intel products including liability or warranties relating to fitness for a particular purpose, merchantability, or infringement of any patent, copyright or other intellectual property right. Intel products are not designed, intended or authorized for use in any medical, life saving, or life sustaining applications or for any other application in which the failure of the Intel product could create a situation where personal injury or death may occur. Intel may make changes to specifications and product descriptions at any time, without notice. Intel server boards contain a number of high-density VLSI and power delivery components that need adequate airflow for cooling. Intel's own chassis are designed and tested to meet the intended thermal requirements of these components when the fully integrated system is used together. It is the responsibility of the system integrator that chooses not to use Intel developed server building blocks to consult vendor datasheets and operating parameters to determine the amount of airflow required for their specific application and environmental conditions. Intel Corporation can not be held responsible if components fail or the server board does not operate correctly when used outside any of their published operating or non-operating limits.
    [Show full text]
  • Intel® Itanium® Architecture Assembly Language Reference Guide
    Intel® Itanium® Architecture Assembly Language Reference Guide Copyright © 2000 - 2003 Intel Corporation. All rights reserved. Order Number: 248801-004 World Wide Web: http://developer.intel.com Intel(R) Itanium(R) Architecture Assembly Lanuage Reference Guide Page 2 Disclaimer and Legal Information Information in this document is provided in connection with Intel products. No license, express or implied, by estoppel or otherwise, to any intellectual property rights is granted by this document. EXCEPT AS PROVIDED IN INTEL'S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. Intel products are not intended for use in medical, life saving, or life sustaining applications. This Intel® Itanium® Architecture Assembly Language Reference Guide as well as the software described in it is furnished under license and may only be used or copied in accordance with the terms of the license. The information in this manual is furnished for informational use only, is subject to change without notice, and should not be construed as a commitment by Intel Corporation. Intel Corporation assumes no responsibility or liability for any errors or inaccuracies that may appear in this document or any software that may be provided in association with this document. Designers must not rely on the absence or characteristics of any features or instructions marked "reserved" or "undefined." Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them.
    [Show full text]
  • Node-Level Coordinated Power-Performance Management Rathijit Sen David A
    Node-level Coordinated Power-Performance Management Rathijit Sen David A. Wood Department of Computer Sciences University of Wisconsin-Madison Abstract Apart from cores, cache and memory also consume sig- Pareto-optimal power-performance system configurations nificant power and need to be managed for power/energy- can improve performance for a power budget or reduce efficient performance. Intel’s Core Duo processor allows power for a performance target, both leading to energy sav- shutting down portions of its last-level cache (LLC) [35]. In- ings. We develop a new power-performance management tel’s Ivy Bridge microarchitecture provides more fine-grained framework that seeks to operate at the pareto-optimal frontier control: a subset of the ways of its set-associative LLC can for dynamically reconfigurable systems, focusing on single- be turned off [28]. IBM’s Power7 enables runtime throttling node reconfigurations in this paper. of memory traffic [44]. Felter et al. [23] demonstrated ben- Our framework uses new power-performance state descrip- efits through power-shifting using core DVFS and memory tors, Π-states, that improve upon the traditional ACPI-state throttling. Deng et al. [21] and David et al. [19] proposed descriptors by taking into account interactions among multi- using DFS/DVFS for main memory. Moreover, to avoid ple reconfigurable components, quantifying system-wide im- inconsistent/under-performing configurations, resource man- pact and predicting totally ordered pareto-optimal states. agement must be coordinated. Deng et al. [20] demonstrated We demonstrate applications of our framework for both coordinated management of core DVFS and memory DVFS. simulated systems and real hardware.
    [Show full text]
  • PRIMEQUEST Servers: Mission-Critical Servers for Linux and Windows
    WHITE PAPER PRIMEQUEST Servers: Mission-Critical Servers for Linux and Windows March 2006 PREPARED FOR Executive Summary Fujitsu A year ago, Fujitsu expanded its portfolio of robust enterprise servers with the introduction of the PRIMEQUEST family. Fujitsu had observed that a growing number of customers were attracted to Linux and Windows operating environments and to servers employing Intel processors. The Itanium 2-powered PRIMEQUEST TABLE OF CONTENTS line addresses that customer desire with high performance servers designed for Executive Summary .................................1 mission-critical Linux and Windows environments. Introduction...............................................1 With the addition of an eight-processor model 420, Fujitsu’s PRIMEQUEST family Industry Standard, now offers Itanium-based servers that span the range from midrange to high end. but with a Unique Approach....................2 The PRIMEQUEST models complement Fujitsu’s SPARC/Solaris PRIMEPOWER systems in mission-critical environments. Currently available with Madison 9M PRIMEQUEST Family Overview ..............3 chips, the midrange PRIMEQUEST 420 joins the PRIMEQUEST 440 with 16 processors and the PRIMEQUEST 480, which scales up to 32 processors. PRIMEQUEST Configurations .................5 Data Mirroring ...........................................7 Employing standard Intel processors certainly should not imply that PRIMEQUEST is merely another Itanium-based platform. Quite to the contrary, Fujitsu has Flexible Partitioning .................................8
    [Show full text]
  • Product Brief: Intel® Atom™ Processor E3900 Series, Intel® Celeron
    Product brief Intel® IoT Technology Intel Atom®, Intel® Pentium®, and Intel® Celeron® Processors Achieving New Levels of CPU Performance, Fast Graphics and Media Processing, Image Processing, and Security The latest generation of Intel Atom®, Pentium®, and Celeron® processors empowers real-time computing in digital surveillance, new in-vehicle experiences, advancements in industrial and ofce automation, new solutions for retail and medical, and more. Meeting the demands of the rapidly growing Internet of Things The number of connected machines has grown by approximately 300 percent1 in recent years and is expected to continue to explode. By 2020, 330 billion devices will create 35 trillion GB of data annually,2 and will require greatly increased processing power at the edge in order to maintain viability. Intel is supporting this rapid development and the growing complexity of IoT infrastructures with the release of the new Intel Atom® processor E3900 series, Intel® Pentium® processor N4200, and Intel® Celeron® processor N3350. They deliver the ability to handle more tasks. From manufacturing machines that can see, to intelligent video systems that can analyze data, these new processors enable amazing new possibilities. Powerful edge intelligence for IoT These new Intel Atom, Pentium, and Celeron processors ofer enhanced processing power in compact, low-power packages. They are now available with a dual- or quad- core processor running at up to 2.5 GHz and memory speeds up to LPDDR4 2400. All of this performance resides in a compact fip chip ball grid array (FCBGA), utilizing 14 nm silicon technology, making it an excellent ft for a wide range of IoT applications when space and power are at a premium.
    [Show full text]
  • The Intel X86 Microarchitectures Map Version 2.2
    The Intel x86 Microarchitectures Map Version 2.2 P6 (1995, 0.50 to 0.35 μm) 8086 (1978, 3 µm) 80386 (1985, 1.5 to 1 µm) P5 (1993, 0.80 to 0.35 μm) NetBurst (2000 , 180 to 130 nm) Skylake (2015, 14 nm) Alternative Names: i686 Series: Alternative Names: iAPX 386, 386, i386 Alternative Names: Pentium, 80586, 586, i586 Alternative Names: Pentium 4, Pentium IV, P4 Alternative Names: SKL (Desktop and Mobile), SKX (Server) Series: Pentium Pro (used in desktops and servers) • 16-bit data bus: 8086 (iAPX Series: Series: Series: Series: • Variant: Klamath (1997, 0.35 μm) 86) • Desktop/Server: i386DX Desktop/Server: P5, P54C • Desktop: Willamette (180 nm) • Desktop: Desktop 6th Generation Core i5 (Skylake-S and Skylake-H) • Alternative Names: Pentium II, PII • 8-bit data bus: 8088 (iAPX • Desktop lower-performance: i386SX Desktop/Server higher-performance: P54CQS, P54CS • Desktop higher-performance: Northwood Pentium 4 (130 nm), Northwood B Pentium 4 HT (130 nm), • Desktop higher-performance: Desktop 6th Generation Core i7 (Skylake-S and Skylake-H), Desktop 7th Generation Core i7 X (Skylake-X), • Series: Klamath (used in desktops) 88) • Mobile: i386SL, 80376, i386EX, Mobile: P54C, P54LM Northwood C Pentium 4 HT (130 nm), Gallatin (Pentium 4 Extreme Edition 130 nm) Desktop 7th Generation Core i9 X (Skylake-X), Desktop 9th Generation Core i7 X (Skylake-X), Desktop 9th Generation Core i9 X (Skylake-X) • New instructions: Deschutes (1998, 0.25 to 0.18 μm) i386CXSA, i386SXSA, i386CXSB Compatibility: Pentium OverDrive • Desktop lower-performance: Willamette-128
    [Show full text]
  • AN1209 Motorola Burstram
    Order this document by AN1209/D AN1209 The Freescale BurstRAM Prepared by: James Garris This note introduces the MCM62486 32K x 9 Synchronous maintain synchronization with the i486 or other logic driving BurstRAM. The device was designed to provide a high-perfor- the cache. mance, secondary cache for the Intel i486 microprocessor and future microprocessors with burst protocol. Four of these USE WITH THE i486 PROCESSOR devices can supply a 128K byte direct-mapped bursting cache The 62486 requires an ASIC or discrete PAL type of cache with parity support. controller to work with the i486. This cache control logic must THE MCM62486 also include 8K x 16 of cache-tag comparator RAM and any other buffers needed for system operation. The 62486 is a synchronous device with input registers and Control signals are sourced as follows: K is driven by the . address counters surrounding a standard 32K x 9 FSRAM system clock (CLK); ADSP is an output from the micropro- . core. The additional circuitry in the periphery enables the cessor; and ADV, ADSC are generated from the cache control c memory to uniquely interface with the i486. Like the i486, the logic. The data bus and lower address bus may interface n directly with the 62486 or the address bus may be buffered to I timings are referenced to the rising edge of the clock (K). Sig- nals generated by the processor and control logic must be improve its drive to the rest of the system. A simple block dia- , r stable during all transitions of clock from low to high.
    [Show full text]
  • Agilent E2492E Probe Adapter for Intel. Pentium. III and Celeron
    Installation Guide Publication Number E2492-92007 December 1999 © Copyright Agilent Technologies, Inc. 1999 All Rights Reserved Agilent E2492E Probe Adapter for the Intel® Pentium® III and Celeron™ Processors Installation at a Glance This Installation Guide explains how to install Agilent Technologies, Inc. probe adapter for the Intel® Pentium® III or Celeron™ processors in the 370-pin PPGA package. The probe adapter provides a quick and reliable connection from a processor to the Agilent E2487C Analysis Probe. The analysis probe then connects to your Agilent logic analyzer for state analysis. Installation Overview • Remove the processor. • Install the probe adapter. • Connect the processor to the probe adapter. • Connect the probe adapter to the Agilent E2487C analysis probe. Installation 1 Remove the clip and heatsink from the processor on the target system. 2 Remove the processor from the socket on the target system. 3 Install the probe adapter on the target system. Ensure that the connector is fully seated. 4 Install the processor on the socket of the probe adapter. Ensure that the connector is fully seated. CAUTION: Bending the flexible cable beyond the minimum bend radius will shorten the life of the probe adapter. See the dimension diagram the end of this document for minimum bend radius. 5 Installation 5 Use the clip to attach the heatsink to the socket on the probe adapter. 6 Connect the probe adapter to the Agilent E2487C Analysis Probe The probe adapter has five high-density connectors, labeled Cable 1 through Cable 5. The Agilent E2487C Analysis Probe also has five high-density cabled connectors, labeled P1 through P5.
    [Show full text]