Characterizing Processors for Energy and Performance Management

Total Page:16

File Type:pdf, Size:1020Kb

Characterizing Processors for Energy and Performance Management 2015 16th International Workshop on Microprocessor and SOC Test and Verification Characterizing Processors for Energy and Performance Management Harshit Goyal Vishwani D. Agrawal Department of Electrical and Computer Engineering Department of Electrical and Computer Engineering, Auburn University, Auburn University, Auburn, Alabama 36849 Auburn, Alabama 36849 Email: [email protected] Email: [email protected] Abstract—A processor executes a computing job in a certain II. TERMINOLOGY number of clock cycles. The clock frequency determines the time that the job will take. Another parameter, cycle efficiency A. Technology Benchmark or cycles per joule, determines how much energy the job will A technology benchmark is used to characterize the technol- consume. The execution time measures performance and, in ogy of the target processor whose operation is to be managed. combination with energy dissipation, influences power, thermal behavior, power supply noise and battery life. We describe a The circuit and technology level details of the benchmark method for power management of a processor. An Intel processor circuit should be known such that it can be simulated to in 32nm bulk CMOS technology is used as an illustrative exam- accurately determine its delay and energy versus the operating ple. First, we characterize the technology by H-spice simulation voltage. The circuit size may be reasonable for economy of a ripple carry adder for critical path delay, dynamic energy of the simulation effort, yet it may have an understandably and static power at a wide range of supply voltages. The adder data is then scaled based on the clock frequency, supply voltage, meaningful function. The benchmark used in the present work thermal design power (TDP) and other specifications of the is a 16-bit ripple carry adder. processor. To optimize the time and energy performance, voltage and clock frequency are determined showing 28% reduction both B. Energy per Cycle in execution time and energy dissipation. A power-delay product, known as energy per cycle Keywords-performance, cycle efficiency, energy per cycle, low (EPC) [16], measures the energy dissipated in a circuit per power, thermal design power. switching operation. Since the dynamic energy per switching event is fixed, the EPC describes a fundamental tradeoff between speed and power. Energy per cycle of a circuit is I. INTRODUCTION a key parameter for energy efficiency in ultra-low power Power consumption is proportional to the frequency of applications. Because computing workload is characterized execution and the square of the operating voltage, while energy in terms of clock cycles, this measure directly relates to consumption also depends on the total execution time [17]. the energy consumption of workload [5]–[7]. The total energy per cycle (Etotal) of a single gate is composed of Energy consumption has become one of the primary concerns E E in processor design due to the recent popularity of portable dynamic energy ( dyn) and leakage or static energy ( static): devices and cost concerns for desktops and servers. Notably, E α · C · V 2 battery capacities have improved rather moderately (a factor dyn = 0→1 L dd (1) of 2 to 4 over the last 30 years), while the computational demands have drastically increased over the same time frame. With a number of performance oriented devices emerging Estatic = Pstatic · td and a huge demand of power from a fixed capacity battery, = Ioff · Vdd · td using the battery wisely has assumed high importance. In 2 − Vdd K · C · V · S energy-constrained systems, low power design is essential = L dd 10 (2) for extending battery and system lifetime. Lowering voltage supply (Vdd) quadratically decreases energy dissipation, but also causes an increase in delay [3]. Optimizing performance Etotal = Edyn + Estatic Vdd and power simultaneously requires a thorough study of the − S 2 =(α0→1 + K · 10 ) · CL · V (3) available resources and possible trade-offs [8]. Hence, it be- dd comes important to define and use new or existing metrics. In where α0→1 is low to high transition activity for the gate this work, we have used a recently defined parameter, cycle output node and Pstatic is static leakage power. Ioff is leakage efficiency of processor (η) [11], [12], to investigate the cycles current, K is a constant, CL is load capacitance of the gate that could be run using a given amount of energy. and S is sub-threshold swing. 2332-5674/16 $31.00 © 2016 IEEE 67 DOI 10.1109/MTV.2015.22 C. Cycle Efficiency Cycle efficiency is defined as performance per unit of energy. To increase this efficiency it is required that the fundamental energy of operations be reduced. Further, power is defined as the rate of energy consumption (watts ≡ Joules/second) and is directly affected by the performance. This distinction between power and energy is important because what may seem like a trade-off may just be a Fig. 1. Benchmark circuit structure: an n-bit ripple carry adder. modulation in performance resulting in changes in power consumption. The performance (inverse of time) can be called voltage, maximum temperature and maximum signal loading time efficiency just as cycle efficiency (inverse of energy conditions. per cycle) is energy efficiency. If we regard the clock cycle Processor base frequency describes the rate at which the as a unit of work that a processor performs, then it means processor’s transistors open and close. The processor base work done in a time period 1/f, where f is the frequency in frequency is the operating point where TDP is defined. cycles per second or hertz (Hz). A clock cycle also means Turbo boost and overclocking are both essentially the same certain amount of energy or energy per cycle (EPC).We thing although they may work a little differently. Turbo define cycle efficiency, η =1/EP C, its unit being cycles per boost is a feature of Intel processors created to dynamically joule [11], [12]. Thus, a clock cycle means 1/f second in overclock a CPU, meaning the more you use your CPU, the time and 1/η joule in energy. Consider a program being run faster the CPU moves up to a certain point which is determined on a processor and suppose it takes c clock cycles to execute. by the manufacturer. Overclocking is a similar concept except Then we have, that it is not dynamic and is implemented manually, either c through software or through BIOS on newer motherboards. Execution time = f (4) III. TECHNOLOGY ASSESSMENT c A. Ripple Carry Adder Benchmark Circuit Energy consumed = (5) η The ripple carry adder circuit of Figure 1 is used to learn where, η is cycle efficiency of the processor in cycles per the energy and delay characteristics of the technology of the joule. Equation 4 gives the time performance of the processor processor [9]. In this paper, we use a 16-bit ripple carry as, adder that may a basic element in many digital computational systems. The design methodology emphasizes the operation 1 f Performance in time = = (6) of the adder in 32nm bulk PTM CMOS technology node. We Execution time c assume that the processor being characterized is large and a Similarly, Equation 5 gives the energy performance as, full scale gate level or transistor level model may not be readily available. Even if such a model was available, a detailed 1 c simulation at various voltages would be impractical for high Performance in energy = = η (7) Energy consumed reason of high complexity. However, operational data about Clearly, cycle efficiency (η) characterizes the energy the processor, such as voltage, maximum clock frequency and performance in a similar way as frequency (f) characterizes power consumption, is available. Also, the technology of the the time performance. These two performance parameters are device is specified. We, therefore, characterize the technology related to each other by the power being consumed, as follows: using known and easily analyzable adder benchmark. Then, f we scale this characterization to the processor. (8) Power = η B. Predictive Technology Model D. Base Frequency, Turbo Boost, Overclocking and Thermal Two versions of Predictive Technology Models (PTM) avail- Design Power able from an Arizona State University site that are widely Thermal design power (TDP) is the average maximum used to carry out research experiments are chosen to carry power in watts the processor dissipates when operating at out the characterization: Bulk MOSFET with conventional base frequency with all cores active under a manufacturer SiON/Polysilicon gate and Secondly high-k dielectrics with defined, high complexity workload. TDP is not the maximum metal gate technology, a combination known as HKMG (High- power the CPU may consume - there may be periods of time K, Metal Gate). Our experiments were carried out using 32nm when the CPU dissipates more power than that allowed by its Bulk CMOS PTM model. thermal design. TDP is usually 20%-30% lower than the CPU C. Simulation Tools Used maximum power dissipation. A 16 bit adder is designed using a VHDL code and its Peak power is the maximum power dissipated by the pro- compilation and simulation is carried out using Questa Sim cessor under the worst case conditions - at the maximum core to ensure correct operation before it is synthesized. Leonardo 68 TABLE I H-SPICE SIMULATION OF 16 BIT RIPPLE CARRY ADDER FOR 32NM TECHNOLOGY NODE IN BULK CMOS PTM AT DIFFERENT VOLTAGES (Vdd). Vdd Power from simulation Timing from simulation Energy per cycle Pavg Pdyn Pstatic Ppeak Critical path fmax Edyn Estatic Etotal volts μW μW μW μW delay, ps GHz fJ fJ fJ 1.20
Recommended publications
  • CFD Analyses of a Notebook Computer Thermal Management
    PREPRINT. 1 Ilker Tari and Fidan Seza Yalçin, "CFD Analyses of a Notebook Computer Thermal Management System and a Proposed Passive Cooling Alternative, IEEE Transactions on Components and Packaging Technologies, Vol. 33, No. 2, pp. 443-452 (2010). CFD Analyses of a Notebook Computer Thermal Management System and a Proposed Passive Cooling Alternative Ilker Tari, and Fidan Seza Yalcin H Fin height, mm. Abstract— A notebook computer thermal management system L Heat sink vertical length, mm. is analyzed using a commercial CFD software package (ANSYS Nu Nusselt Number. Fluent). The active and passive paths that are used for heat Pr Prandtl Number. dissipation are examined for different steady state operating Ra Rayleigh Number. conditions. For each case, average and hot-spot temperatures of Re Reynolds Number. the components are compared with the maximum allowable T Temperature, °C or K. operating temperatures. It is observed that when low heat W Heat sink width, mm. dissipation components are put on the same passive path, the 2 increased heat load of the path may cause unexpected hot spot g Gravitational acceleration, m/s . 2 temperatures. Especially, Hard Disk Drive (HDD) is susceptible h Convection heat transfer coefficient, W/(m ·K). to overheating and the keyboard surface may reach k Thermal conductivity, W/(m·K). ergonomically undesirable temperatures. Based on the analysis q Heat transfer rate, W. results and observations, a new component arrangement s Fin spacing, mm. considering passive paths and using the back side of the LCD screen is proposed and a simple correlation based thermal Greek Symbols analysis of the proposed system is presented.
    [Show full text]
  • Desktop 3Rd Generation Intel® Core™ Processor Family, Desktop Intel® Pentium® Processor Family, Desktop Intel® Celeron® Processor Family, and LGA1155 Socket
    Desktop 3rd Generation Intel® Core™ Processor Family, Desktop Intel® Pentium® Processor Family, Desktop Intel® Celeron® Processor Family, and LGA1155 Socket Thermal Mechanical Specifications and Design Guidelines (TMSDG) January 2013 Document Number: 326767-005 INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL'S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. A “Mission Critical Application” is any application in which failure of the Intel Product could result, directly or indirectly, in personal injury or death. SHOULD YOU PURCHASE OR USE INTEL'S PRODUCTS FOR ANY SUCH MISSION CRITICAL APPLICATION, YOU SHALL INDEMNIFY AND HOLD INTEL AND ITS SUBSIDIARIES, SUBCONTRACTORS AND AFFILIATES, AND THE DIRECTORS, OFFICERS, AND EMPLOYEES OF EACH, HARMLESS AGAINST ALL CLAIMS COSTS, DAMAGES, AND EXPENSES AND REASONABLE ATTORNEYS' FEES ARISING OUT OF, DIRECTLY OR INDIRECTLY, ANY CLAIM OF PRODUCT LIABILITY, PERSONAL INJURY, OR DEATH ARISING IN ANY WAY OUT OF SUCH MISSION CRITICAL APPLICATION, WHETHER OR NOT INTEL OR ITS SUBCONTRACTOR WAS NEGLIGENT IN THE DESIGN, MANUFACTURE, OR WARNING OF THE INTEL PRODUCT OR ANY OF ITS PARTS. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked “reserved” or “undefined”.
    [Show full text]
  • Thermal Guide: Intel® Xeon® Processor E5 V4 Product Family
    Intel® Xeon® Processor E5 v4 Product Family Thermal Mechanical Specification and Design Guide June 2016 Document Number: 333812-002 IntelLegal Lines and Disclaimerstechnologies’ features and benefits depend on system configuration and may require enabled hardware, software or service activation. Learn more at Intel.com, or from the OEM or retailer. No computer system can be absolutely secure. Intel does not assume any liability for lost or stolen data or systems or any damages resulting from such losses. You may not use or facilitate the use of this document in connection with any infringement or other legal analysis concerning Intel products described herein. You agree to grant Intel a non-exclusive, royalty-free license to any patent claim thereafter drafted which includes subject matter disclosed herein. No license (express or implied, by estoppal or otherwise) to any intellectual property rights is granted by this document. The products described may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Intel disclaims all express and implied warranties, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non-infringement, as well as any warranty arising from course of performance, course of dealing, or usage in trade. Intel® Turbo Boost Technology requires a PC with a processor with Intel Turbo Boost Technology capability. Intel Turbo Boost Technology performance varies depending on hardware, software and overall system configuration. Check with your PC manufacturer on whether your system delivers Intel Turbo Boost Technology. For more information, see http://www.intel.com/technology/turboboost Copies of documents which have an order number and are referenced in this document may be obtained by calling 1-800-548- 4725 or by visiting www.intel.com/design/literature.htm.
    [Show full text]
  • Summarizing CPU and GPU Design Trends with Product Data
    Summarizing CPU and GPU Design Trends with Product Data Yifan Sun, Nicolas Bohm Agostini, Shi Dong, and David Kaeli Northeastern University Email: fyifansun, agostini, shidong, [email protected] Abstract—Moore’s Law and Dennard Scaling have guided the products. Equipped with this data, we answer the following semiconductor industry for the past few decades. Recently, both questions: laws have faced validity challenges as transistor sizes approach • Are Moore’s Law and Dennard Scaling still valid? If so, the practical limits of physics. We are interested in testing the validity of these laws and reflect on the reasons responsible. In what are the factors that keep the laws valid? this work, we collect data of more than 4000 publicly-available • Do GPUs still have computing power advantages over CPU and GPU products. We find that transistor scaling remains CPUs? Is the computing capability gap between CPUs critical in keeping the laws valid. However, architectural solutions and GPUs getting larger? have become increasingly important and will play a larger role • What factors drive performance improvements in GPUs? in the future. We observe that GPUs consistently deliver higher performance than CPUs. GPU performance continues to rise II. METHODOLOGY because of increases in GPU frequency, improvements in the thermal design power (TDP), and growth in die size. But we We have collected data for all CPU and GPU products (to also see the ratio of GPU to CPU performance moving closer to our best knowledge) that have been released by Intel, AMD parity, thanks to new SIMD extensions on CPUs and increased (including the former ATI GPUs)1, and NVIDIA since January CPU core counts.
    [Show full text]
  • Large Cad Models
    Great decisions in product engineering with NX CAE. Learn more on page 11 DtopEng_banner_NXCAE.indd 1 August 2012 / deskeng.com 7/12/12 9:28:46 AM Workstation Upgrades P. 16 Monitor Technology P. 18 Review: HP Z1 All-in-One Workstation P. 20 TECHNOLOGY FOR DESIGN ENGINEERING FOCUS ON: Test & SPEED UP Measurement LARGE CAD MODELS DIRECT DIGITAL MANUFACTURING P. 32 IMPLEMENT MECHATRONIC TESTING P. 38 HOW TO CALIBRATE TEST SENSORS P. 42 CREATE BREAKTHROUGH DESIGNS BLAZING-FAST GEOMETRY PERFORMANCE FOR COMPLEX DESIGNS - THE NEW AMD FIREPRO TM WORKSTATION GRAPHICS RANGE AMD FIREPRO V4900 AMD FIREPRO V5900 AMD FIREPRO V7900 Maximum Performance The Perfect Balance Create Breakthrough Designs For Your Budget ● 2GB GDDR5 memory (64 GB/s) ● 2GB GDDR5 memory (160 GB/s) ● 1GB GDDR5 memory (64 GB/s) ● 512 Stream Processors ● 1280 Stream Processors ● 480 Stream Processors ● 2 x DisplayPort 1.2, 1 x DVI outputs ● 4 x DisplayPort 1.2 ● 2 x DisplayPort 1.2,1 x DVI outputs ● Support up to 3 displays ● Support up to 4 displays ● Support up to 3 displays ● GeometryBoost ● GeometryBoost ● AMD PowerTune ● AMD PowerTune ● 3D Stereoscopic support ● Framelock, Genlock support AMD FEATURED TECHNOLOGIES “I have been using three and ● Certified and optimized: for all major 3D applications for high four-monitor configurations for a performance and reliability while now, and I can tell you that once you try it, you won’t want ● GeometryBoost Technology: Helps ensure smooth handling of to go back to just two.” complex, highly tessellated models Jason Lewis, CG Channel ● AMD PowerTune Technology: Optimizes power usage in real-time based on current workflow demands ● AMD Eyefinity Technology: Run up to 4 monitors from a single graphics card, boosting productivity by viewing entire workflows Find out more at amd.com/firepronextgen © 2011 Advanced Micro Devices, Inc.
    [Show full text]
  • Intel® Atom Processor 200 Series Mechanical, and Thermal Design
    Intel® Atom™ Processor 200 Series Thermal and Mechanical Design Guidelines — Supporting nettop platform for 2008 June 2008 Document Number: 319979-001 INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL® PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL'S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. UNLESS OTHERWISE AGREED IN WRITING BY INTEL, THE INTEL PRODUCTS ARE NOT DESIGNED NOR INTENDED FOR ANY APPLICATION IN WHICH THE FAILURE OF THE INTEL PRODUCT COULD CREATE A SITUATION WHERE PERSONAL INJURY OR DEATH MAY OCCUR. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked "reserved" or "undefined." Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The information here is subject to change without notice. Do not finalize a design with this information. The products described in this document may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order.
    [Show full text]
  • Intel® Celeron™ P4000 and U3000 Mobile Processor Series
    Intel® Celeron™ P4000 and U3000 Mobile Processor Series External Design Specification — Volume One This is volume 1 of 2. Refer to Document 416057 for Volume 2 Rev 1.5 March 2010 Intel Confidential Document Number:442690 INFORMATIONLegal Lines and Disclaimers IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL® PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL'S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. UNLESS OTHERWISE AGREED IN WRITING BY INTEL, THE INTEL PRODUCTS ARE NOT DESIGNED NOR INTENDED FOR ANY APPLICATION IN WHICH THE FAILURE OF THE INTEL PRODUCT COULD CREATE A SITUATION WHERE PERSONAL INJURY OR DEATH MAY OCCUR. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked “reserved” or “undefined.” Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The information here is subject to change without notice. Do not finalize a design with this information. The products described in this document may contain design defects or errors known as errata which may cause the product to deviate from published specifications.
    [Show full text]
  • Intel® Celeron® D Processor for Embedded Applications Thermal Design Guide Contents
    Intel® Celeron® D Processor for Embedded Applications Thermal Design Guide June 2004 Reference Number: 302647-001 Contents INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL® PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL'S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. Intel products are not intended for use in medical, life saving, life sustaining applications. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked "reserved" or "undefined." Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The Intel® Celeron® D Processor for Embedded Applications may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order. Copies of documents which have an order number and are referenced in this document, or other Intel literature, may be obtained by calling1-800-548- 4725, or by visiting Intel's website at http://www.intel.com.
    [Show full text]
  • White Paper | ADVANCED POWER MANAGEMENT HELPS BRING
    White Paper | ADVANCED POWER MANAGEMENT HELPS BRING IMPROVED PERFORMANCE TO HIGHLY INTEGRATED X86 PROCESSORS TABLE OF CONTENTS THE IMPORTANCE OF POWER MANAGEMENT 3 THE X86 EXAMPLES 3 ESTABLISH A REALISTIC WORST-CASE FOR POWER 4 POWER LIMITS CAN TRANSLATE TO PERFORMANCE LIMITS 4 AMD TACKLES THE UNDERUSED TDP HEADROOM ISSUE 5 GOING ABOVE TDP 6 INTELLIGENT BOOST 7 CONFIGURABLE TDP 8 SUMMARY 9 Complex heterogeneous processors have the potential to leave a large amount of performance headroom untapped when workloads don’t utilize all cores. Advanced power management techniques for x86 processors are designed to reduce the power of underutilized cores while also allowing for dynamic allocation of the thermal budget between cores for improved performance. THE IMPORTANCE OF THE X86 EXAMPLE POWER MANAGEMENT Typical x86 processors widely used Those with experience implementing in both consumer and embedded microprocessors know the importance applications are a perfect example: of proper power management. Whether Integration of network and security for simple applications processors or engines, memory controllers, graphics high-end server processors, the ability processing units (GPUs), and video to down-clock, clock-gate, power-off, encode/decode engines has effectively or in some manner disable unused or turned them into heterogeneous underused hardware blocks is crucial in compute units that excel at a wide limiting power consumption. variety of workloads. Better power management benefits The notable thing about traditional range from energy savings within the reduction-based power management data center to improved battery life in is that a particular functional block is mobile devices. But don’t underestimate only turned off when unused, or down- the value of reducing power and clocked when higher performance is increasing efficiency.
    [Show full text]
  • Intel® Xeon® Processor E5 V3 Family Thermal Guide
    Intel ® Xeon® Processor E5-1600 / 2600 / 4600 v3 Product Families Thermal Mechanical Specification and Design Guide October 2015 Order No.: 330786-003US You may not use or facilitate the use of this document in connection with any infringement or other legal analysis concerning Intel products described herein. You agree to grant Intel a non-exclusive, royalty-free license to any patent claim thereafter drafted which includes subject matter disclosed herein. No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document. All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest Intel product specifications and roadmaps. The products described may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Copies of documents which have an order number and are referenced in this document may be obtained by calling 1-800-548-4725 or visit http:// www.intel.com/design/literature.htm. Basis, BlueMoon, BunnyPeople, Celeron, Centrino, Cilk, Flexpipe, Intel, the Intel logo, the Intel Anti-Theft technology logo, Intel AppUp, the Intel AppUp logo, Intel Atom, Intel CoFluent, Intel Core, Intel Inside, the Intel Inside logo, Intel Insider, Intel NetMerge, Intel NetStructure, Intel RealSense, Intel SingleDriver, Intel SpeedStep, Intel vPro, Intel Xeon Phi, Intel XScale, InTru, the InTru logo, the InTru Inside logo, InTru soundmark, Iris, Itanium, Kno, Look Inside., the Look Inside. logo, MCS, MMX, Pentium, picoArray, Picochip, picoXcell, Puma, Quark, SMARTi, smartSignaling, Sound Mark, Stay With It, the Engineering Stay With It logo, The Creators Project, The Journey Inside, Thunderbolt, the Thunderbolt logo, Transcede, Transrf, Ultrabook, VTune, Xeon, X-GOLD and XMM are trademarks of Intel Corporation in the U.S.
    [Show full text]
  • Intel® Pentium® 4 Processor with 512-KB L2 Cache on 0.13 Micron Process Thermal Design Guidelines Design Guide
    R Intel® Pentium® 4 Processor with 512-KB L2 Cache on 0.13 Micron Process Thermal Design Guidelines Design Guide Supporting the Intel® Pentium® 4 Processor with Hyper-Threading Technology1 November 2002 Document Number: 252161-001 R INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL® PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL’S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. Intel products are not intended for use in medical, life saving, or life sustaining applications. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked "reserved" or "undefined." Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The Intel® Pentium® 4 processor may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order. 1Hyper-Threading Technology requires a computer system with an Intel® Pentium® 4 processor at 3.06 GHz or higher, a chipset and BIOS that utilize this technology, and an operating system that includes optimizations for this technology.
    [Show full text]
  • Thermal Modeling and Management of Microprocessors
    Thermal Modeling and Management of Microprocessors A Dissertation Presented to the faculty of the School of Engineering and Applied Science University of Virginia In Partial Fulfillment of the requirements for the Degree Doctor of Philosophy Computer Science by Karthik Sankaranarayanan May 2009 Approvals This dissertation is submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy Computer Science Karthik Sankaranarayanan Approved: Kevin Skadron (Advisor) Jack W. Davidson (Chair) 7 Mircea R. Stan (Co-Advisor) James P. Cohoon John Lach Accepted by the School of Engineering and Applied Science: April 2009 Abstract The most recent, and arguably one of the most difficult obstacles to the exponential growth in transistor density predicted by Moore’s Law is that of removing the large amount of heat generated within the tiny area of a microprocessor. The exponential increase in power density and its direct relation to on-chip temperature have, in recent processors, led to very high cooling costs. Since temperature also has an exponential effect on lifetime reliability and leakage power, it has become a first-class design constraint in microprocessor development akin to performance. This dissertation describes work to address the temperature challenge from the perspective of the architecture of the microprocessor. It proposes both the infrastructure to model the problem and several mechanisms that form part of the solution. This research describes HotSpot, an efficient and extensible microarchitectural thermal modeling tool that is used to guide the design and evalua- tion of various thermal management techniques. It presents several Dynamic Thermal Management (DTM) schemes that distribute heat both over time and space by controlling the level of compu- tational activity.
    [Show full text]