Clock Design and Measurement Issues in Pentium™ Systems

Clock Design and Measurement Issues in Pentium Systems Design difficulties in producing a statistically stable 66-MHz Pentium system are reviewed. The information is pertinent to many other new, high-speed processors as well. A new, more informed approach to designing well-timed systems in this performance class is proposed. Measurements that support this approach are examined, particularly those made with the HP 8133A pulse generator. by Michael K. Williams and Andreas M.R. Pfaff Clock rates in all classes of computational systems,Pulse from fidelity, PCs sometimes referred to as signal integrity, is that to supercomputers, have been escalating exponentiallypart for of highĆspeed digital design that is concerned with manĆ years. Computational systems formerly considered simpleraging the analog effects that prevent signals from being reliĆ have come to run at speeds that were previouslyably found recognized only at their destinations. This includes ensuring in more complex and aggressive systems. Before thisthat hapĆ edges arrive at their loads with proper edge speeds and pened, systems at the simpler end of this spectrumproper (PCs shapes, and and controlling the various types of noise workstations) operated at clock rates that don't present(crosstalk, very EMI, reflections, ground bounce, etc.) that can difficult clock distribution and reception problems. cause unreliable or false triggering. The extent to which these issues are important has increased dramatically in PC Recent introductions of new processor types have given PC and workstation designs. Wider buses and faster clocks and and workstation system designers new chips and chipsets edges (higher waveform spectral content) are the primary that enable system designs that deliver much higher levels of sources of these problems. The classic discussion of all of performance.1 Most of these devices employ internal strucĆ these analog effects can be found in reference 2. tures that come directly from the world of mainframes and supercomputers: pipelining, 64Ćbit data buses, onĆboardTiming, or clock distribution and reception, is the other critiĆ floatingĆpoint units, instruction prefetching, and sophistiĆcal facet of design in these faster systems, and is possibly the cated caching schemes. Many of these processors aremost sumĆ significant and least wellĆunderstood aspect of the deĆ marized in Table I. These new device families includesign. Intel'sTiming environment design is the process of specifying Pentium processor, Digital's Alpha, the Apple/IBM/Motorolahow the clock is to be distributed and received throughout PowerPC, and others. These ICs have clock ratesthe that system range such that the state architecture is reliably synchroĆ from tens to hundreds of MHz. Some are expectednized. eventually Reliable means that synchronization is guaranteed on to exceed 1 GHz. every cycle of every copy of the design that is manufactured, despite the presence of a variety of statistical tolerancing Table I mechanisms (skew, jitter, etc.) which reduce the precision Some New Processor Types and their Clock Rates with which the clock can be delivered. These tolerancing mechanisms are described in detail on page 70. When reliĆ Manufacturer Processor Clock Rate able synchronization is ensured by sound design practices, Intel Pentium 60 and 66 MHzthe design is said to be statistically stable. Intel P54C 99 MHz The question of exactly how to ensure this statistical stability Intel 486 66 and 99 MHzis one that each design team must face as they adopt these new devices into their designs. Success at answering this Apple/IBM/Motorola PowerPC 80 MHz question brings with it higher yields, fewer design turns, and Digital Equipment Corp. Alpha u100 MHz the elimination of extremely subtle timing failures. Methods for doing this, while relatively new to designs at the workĆ MIPS R4400SC 150 MHz station and PC level, have been commonplace in the design With all of the sophisticated internal structures andof faster higher capability systems (mainframes and supercomputĆ operating speeds comes a price to be paid by theers) design for many years. A descriptive term for the approach that team. Specifically, successful system design at theseis speeds common to all of these methods is informed design. requires very careful consideration of many mechanisms, such as timing and pulse fidelity, that are unimportant at lower speeds (16 to 33 MHz). 68December 1994 HewlettĆPackard Journal Hewlett-Packard Company 1994 Pentium or Cache Controller Clock Elemental Tolerances Performance Cost Devices and Functionality Targets Targets Interconnects Select Number Working <700 ps Technology of Loads Clock Mix Tolerances Skew, Jitter, and Pulse Width Distortion Clock to any Cache SRAM Engineering Judgement Fig. 2. The difference in arrival time between either the Pentium clock or the cache controller clock and the clock arriving at any Timing Scheme and SRAM must be less than 700 ps in every system on every cycle. Final Tolerance make 66ĆMHz systems ideal for the informed design apĆ Budget proach. Furthermore, the issues and methods presented are general and extend to other processor types as well. Fig. 1. Timing environment design process. DesignĆspecific and/or unrated parametric information must be incorporated intoPentium the Characteristics and Requirements engineering decision making process from the outset. An understanding of the difficulties of distributing a clock within a Pentium design must begin with an understanding Two results are produced by an informed approachof to Pentium the timing requirements. Our discussion of this asĆ design of a timing environment. The obvious onepect is a of speciĆ the design will be in summary form, and the reader fication of a clock distribution scheme. Equally important,is referred to the Intel documentation3Ć6 for a more complete however, is a detailed knowledge of the tolerancediscussion on the of requirements. Also, reference 7 discusses both arrival time of any clock waveform emerging fromthe any requirements outĆ and the various design decisions in much put of any copy of that network, on any cycledeeper of its detail operaĆ than can be done here. tion. This knowledge, that is, the tolerance budget, is used by the timing verification software to determine ifA the variety rest of of system configurations are supported by the the system is correctly timed. Obviously the qualityPentium of this processor. The clock rate can be either 60 or 66 determination is a function of the quality of theMHz. tolerance The system can use either no secondĆlevel caching, or budget. Informed design, as it applies to timing, canit can be have 256KĆbyte or 512KĆbyte cache memories. Systems viewed as the practice of ensuring that all of thewith mechaĆ 256KĆbyte caches can operate at either clock rate, while nisms that contribute to the overall tolerancing of512KĆbyte the clock systems are limited to 60 MHz. A typical" Pentium have been accurately assessed. design is expected to operate at 66 MHz and have a 256KĆ byte secondĆlevel cache. For such systems, there are 12 Measurement is used to characterize devices and printedclock loads within the CPU complex. Depending upon how circuit board processes to see how they tolerance.the This rest of the system is designed, the total number of clock deviceĆlevel tolerance data is used to compute theloads overall will typically be in the range of 15 to 20, although in tolerance on the system clock. And this systemĆlevelsome tolerĆ server systems, this number can range an order of ance is used within the timing verifier to ensuremagnitude the creation higher. of a statistically stable system. Fig. 1 illustrates where deviceĆ level parametric data fits into the overall decisionThe making Pentium specification dictates that the arrival times of process. the clock at the processor and at the cache controller never differ by more than 200 ps. It also states that the difference In this article we examine some of the difficultiesin a arrival designer times between the processor and any cache memĆ will encounter in specifying, analyzing, and verifyingory, a timing and the cache controller and any cache memory, can scheme for a 66ĆMHz Pentium system. This falls innever the exceed lower 700 ps (Fig. 2). These tolerance specifications speed range for the new round of processors. However,must be met at 0.8, 1.5, and 2.0 volts. In any design, there ultratight timing specifications coupled with the currentlywill be other tolerance requirements that state how much available implementation technologies (clock buffers, printeddifference in arrival time is permitted between clocks at circuit boards, etc.) make 66ĆMHz Pentium systemsloads among within the CPU complex and clocks at loads external the most difficult from a timing environment designto perspecĆ it (external loads). These requirements will always be tive. We will see, for example, that the timing withindirectly the determined by the design itself. However, the overall CPU complex (processor, cache controller, and cachetolerance RAMs) budget will usually be driven by the timing within is very sensitive to clock jitter. This sensitivity, andthe others, CPU complex. Hewlett-Packard Company 1994 December 1994 HewlettĆPackard Journal69 Tolerance Mechanisms in Clock Distribution Networks As described in the accompanying article, we are attempting to guard against a number of statistical tolerancing mechanisms, such as skew and jitter, that reduce the precision with which a clock signal can be delivered. Here

Clock Design and Measurement Issues in Pentium™ Systems

Design and Evaluation of a Clock Multiplexing Circuit for the SSRL Booster Accelerator Timing System

V850 Standby Modes

Generate a Clock Signal from a Crystal Oscillator

VLSI Digital Signal Processing

Tms320f280x, Tms320c280x, Tms320f2801x Digital Signal Processors

Phase Alignment of Asynchronous External

7. Latches and Flip-Flops

Introduction (Pdf)

MUX-56 Overview

ECE380 Digital Logic Flip-Flops

Digital Timing: Clock Signals, Jitter, Hystereisis, and Eye Diagrams

"Clock Distribution in Synchronous Systems"