Accurate Modeling of the Delay and Energy Overhead of Dynamic Voltage and Frequency Scaling in Modern Microprocessors

Total Page:16

File Type:pdf, Size:1020Kb

Accurate Modeling of the Delay and Energy Overhead of Dynamic Voltage and Frequency Scaling in Modern Microprocessors IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 32, NO. 5, MAY 2013 695 Accurate Modeling of the Delay and Energy Overhead of Dynamic Voltage and Frequency Scaling in Modern Microprocessors Sangyoung Park, Student Member, IEEE, Jaehyun Park, Student Member, IEEE, Donghwa Shin, Student Member, IEEE, Yanzhi Wang, Student Member, IEEE, Qing Xie, Student Member, IEEE, Massoud Pedram, Fellow, IEEE, and Naehyuck Chang, Fellow, IEEE Abstract—Dynamic voltage and frequency scaling (DVFS) has DVFS transition overhead may be negligible or significant been studied for well over a decade. Nevertheless, existing DVFS depending on how often we change the DVFS setting. Modern transition overhead models suffer from significant inaccuracies; microprocessors tend to change their DVFS setting rather for example, by incorrectly accounting for the effect of DC–DC converters, frequency synthesizers, voltage, and frequency change frequently in response to rapid changes in the application policies on energy losses incurred during mode transitions. Incor- behavior. In addition, DVFS is widely used for dynamic rect and/or inaccurate DVFS transition overhead models prevent thermal management (DTM), which requires frequent change one from determining the precise break-even time and thus forfeit of the DVFS setting (such as in a millisecond) to achieve some of the energy saving that is ideally achievable. This paper thermal stability. Incorrect DVFS transition overhead may introduces accurate DVFS transition overhead models for both energy consumption and delay. In particular, we redefine the cause failure in the thermal stability of the system. Correct DVFS transition overhead including the underclocking-related modeling of the DVFS transition overhead is not a trivial un- losses in a DVFS-enabled microprocessor, additional inductor dertaking since it requires detailed understanding of the DC– IR losses, and power losses due to discontinuous-mode DC–DC DC converter, frequency synthesizer, voltage and frequency conversion. We report the transition overheads for a desktop, transition policies, and so on. a mobile and a low-power representative processor. We also present DVFS transition overhead macromodel for use by high- Unfortunately, existing DVFS transition overhead models level DVFS schedulers. have limitations and are not applicable to modern DVFS setups. In particular, they are significantly simplified, contain Index Terms—Delay and energy overhead, dynamic voltage and frequency scaling (DVFS), macromodel. technical fallacies, or are limited to uncommon setups. In fact, among the 120 DVFS-related papers published in the last 10 I. Introduction years, only 17% of the DVFS papers have considered the tran- YNAMIC voltage and frequency scaling (DVFS) has sition overhead. The majority of DVFS studies simply ignore D proved itself as one of the most successful energy saving the transition overhead [3]–[5]. Among the 17% of DVFS techniques for a wide range of processors. DVFS is enabled by papers, 75% of papers are based on the analytical transition a programmable DC–DC converter and a programmable clock overhead models introduced in [6] and [7]. Some of the generator. These devices naturally incur overhead whenever previous work (e.g. [6], [8], and [9]) assume voltage controlled the system changes its voltage and frequency setting. Since oscillators for the clock generator, which is unusual in today’s the DVFS break-even time is strongly dependent on the DVFS microprocessors (or even in embedded microcontrollers). Sur- transition overhead [2], correct overhead estimation is crucial prisingly, more than a few prior work references have assumed in achieving the maximum DVFS benefit. that the microprocessor stops operation during the entire Manuscript received May 2, 2012; revised September 24, 2012; accepted voltage transition period, something that is neither desirable November 14, 2012. Date of current version April 17, 2013. This work was nor practical [10]. Most of all, a majority of the prior art papers supported under a grant from the Brain Korea 21 Project, and the National consider a DVFS transition overhead model that is based on Research Foundation of Korea (NRF) grant funded by the Korean Government (MEST), under Grant 20120005640. The ICT at Seoul National University incorrect assumptions. A recent work has raised this problem provided research facilities for this paper. Preliminary version of this paper and suggested the correct definition of DVFS transitions [1]. have been presented in part in the International Symposium on Lower Power Evidently, there is a strong need to construct a correct DVFS Electronics and Design (ISLPED) 2010 [1]. This paper was recommended by Associate Editor J. Cortadella. transition overhead model because even recent DVFS work is S. Park, J. Park, and N. Chang are with Seoul National University, still based on the previous models as shown in Section III. Seoul 151-744, Korea (e-mail: [email protected]; [email protected]; In this paper, we provide a formal definition of the DVFS [email protected]). D. Shin is with the Politecnico di Torino, Turin 10129, Italy (e-mail: transition overhead, analyze various components of the over- [email protected]). head, and finally construct a macromodel for DVFS transition Y. Wang, Q. Xie, and M. Pedram are with the University of Southern overhead. California, Los Angeles, CA 90089-2562 USA (e-mail: [email protected]; [email protected]; [email protected]). This paper takes into account all the major power and Digital Object Identifier 10.1109/TCAD.2012.2235126 performance loss components in the modern DVFS setups as 0278-0070/$31.00 c 2013 IEEE 696 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 32, NO. 5, MAY 2013 Fig. 3. Concept of underclocking loss and PLL lock time loss in a DVFS transition. scaling as shown in Fig. 3, which is a major source of Fig. 1. Breakdown of an upscaling DVFS transition overhead in energy and the delay overhead [92.4% of total delay overhead as delay. (Intel Core2 Duo E6850, upscaling from 1.05 to 1.3-V). (a) Energy shown in Fig. 1(b)]. overhead. (b) Delay overhead. 5) The microprocessor consumes static power even during the PLL lock time though it halts. This is another source of energy waste [3.2% of total energy overhead as shown in Fig. 1(a)]. The aforesaid observations are the key contributions of this paper, based on which we derive accurate, yet compact energy and delay overhead models for DVFS transitions. We present a relatively simple analytical model with parameters that can be easily acquired from the datasheets and/or passive component values (R, L and C values). We also provide case studies for three distinct and representative microprocessors, Intel Core2 Fig. 2. Upscaling and downscaling current flows. Duo E680, Samsung Exynos 4210, and TI MSP430. Some programmers who have no hardware knowledge may use the follows. numbers. Finally, we emphasize the importance of considering 1) Conventional DVFS transition models consider the PLL the DVFS transition overhead for a DTM example. lock time as the major delay (latency) overhead, and the energy required to charge and discharge the bulk II. Background capacitor as the energy overhead. Both assumptions are A. System Setup for DVFS incorrect (delay overhead due to PLL lock time accounts DVFS setups require a programmable voltage regulator for only 7.6% of the total delay overhead as shown in and a programmable clock generator. A microprocessor is Fig. 1). generally powered by a buck-type switching-mode DC–DC 2) Energy consumed for charging and discharging the bulk converter as shown in Fig. 2. The upper and lower MOSFETs capacitor does not fully account for the energy over- control the inductor current and the output voltage. The induc- head, especially for the discontinuous mode DC–DC tor current never changes abruptly, which results in adiabatic converters since they discharge the bulk capacitor by the charging and discharging to and from the bulk capacitor. In load current as shown in Fig. 2. The additionally stored other words, the bulk capacitor is not subject to switching loss energy to the bulk capacitor during the upscaling is that is proportional to the square of the terminal voltage. eventually used by the load during the next downscaling. DC–DC converters are subject to power loss aside from the 3) Voltage upscaling in a conventional DC–DC converter DVFS transition overhead, which includes conduction loss, requires more current to be fed through the inductor to switching loss, and controller loss [11]. Conduction loss is increase the bulk capacitor voltage as shown in Fig. 2. the IR loss in the MOSFET and inductor given by This in turn results in additional IR loss from the 2 · · − · inductor (78.2% of total energy overhead as shown in Pcdt = IO D RSW1 +(1 D) RSW2 + RL Fig. 1). 1 I 2 + · L · D · R +(1−D) · R +R +R (1) 4) During the DVFS transition, the microprocessor operates 3 2 SW1 SW2 L C at a higher supply voltage level than what is strictly where IO, D, RSW1, RSW2, RL, RC are output current, duty necessary. This results in energy waste. We call this ratio, upper switch on-resistance, lower switch on-resistance, phenomenon the underclocking-related loss, which is inductor resistance, and capacitor ESR, respectively. Switching a significant source of energy overhead during the loss is the power dissipation due to gate drive which is given mode change [energy overhead due to under clocking by accounts for 18.6% of total energy overhead as shown P = V · f · (Q + Q )(2) in Fig. 1(a)]. In addition, the underclocking causes the sw I s SW1 SW2 microprocessor to operate at a lower clock frequency where VI , fs, QSW1, and QSW2 are input voltage, switching than what is allowed during the voltage-frequency up- frequency, gate charges of the upper and lower MOSFET, PARK et al.: MODELING OF DYNAMIC VOLTAGE AND FREQUENCY SCALING IN MODERN MICROPROCESSORS 697 Fig.
Recommended publications
  • Computer Service Technician- CST Competency Requirements
    Computer Service Technician- CST Competency Requirements This Competency listing serves to identify the major knowledge, skills, and training areas which the Computer Service Technician needs in order to perform the job of servicing the hardware and the systems software for personal computers (PCs). The present CST COMPETENCIES only address operating systems for Windows current version, plus three older. Included also are general common Linux and Apple competency information, as proprietary service contracts still keep most details specific to in-house service. The Competency is written so that it can be used as a course syllabus, or the study directed towards the education of individuals, who are expected to have basic computer hardware electronics knowledge and skills. Computer Service Technicians must be knowledgeable in the following technical areas: 1.0 SAFETY PROCEDURES / HANDLING / ENVIRONMENTAL AWARENESS 1.1 Explain the need for physical safety: 1.1.1 Lifting hardware 1.1.2 Electrical shock hazard 1.1.3 Fire hazard 1.1.4 Chemical hazard 1.2 Explain the purpose for Material Safety Data Sheets (MSDS) 1.3 Summarize work area safety and efficiency 1.4 Define first aid procedures 1.5 Describe potential hazards in both in-shop and in-home environments 1.6 Describe proper recycling and disposal procedures 2.0 COMPUTER ASSEMBLY AND DISASSEMBLY 2.1 List the tools required for removal and installation of all computer system components 2.2 Describe the proper removal and installation of a CPU 2.2.1 Describe proper use of Electrostatic Discharge
    [Show full text]
  • Comptia A+ Complete Study Guide A+ Essentials (220-601) Exam Objectives
    4830bperf.fm Page 1 Thursday, March 8, 2007 10:03 AM CompTIA A+ Complete Study Guide A+ Essentials (220-601) Exam Objectives OBJECTIVE CHAPTER Domain 1.0 Personal Computer Components 1.1 Identify the fundamental principles of using personal computers 1 1.2 Install, configure, optimize and upgrade personal computer components 2 1.3 Identify tools, diagnostic procedures and troubleshooting techniques for personal computer components 2 1.4 Perform preventative maintenance on personal computer components 2 Domain 2.0 Laptops and Portable Devices 2.1 Identify the fundamental principles of using laptops and portable devices 3 2.2 Install, configure, optimize and upgrade laptops and portable devices 3 2.3 Identify tools, basic diagnostic procedures and troubleshooting techniques for laptops and portable devices 3 2.4 Perform preventative maintenance on laptops and portable devices 3 Domain 3.0 Operating Systems 3.1 Identify the fundamentals of using operating systems 4 3.2 Install, configure, optimize and upgrade operating systems 5 3.3 Identify tools, diagnostic procedures and troubleshooting techniques for operating systems 6 3.4 Perform preventative maintenance on operating systems 6 Domain 4.0 Printers and Scanners 4.1 Identify the fundamental principles of using printers and scanners 7 4.2 Identify basic concepts of installing, configuring, optimizing and upgrading printers and scanners 7 4.3 Identify tools, basic diagnostic procedures and troubleshooting techniques for printers and scanners 7 Domain 5.0 Networks 5.1 Identify the fundamental
    [Show full text]
  • Overclocking Assistant for the Intel® Desktop Board DZ87KLT-75K
    Overclocking Assistant for the Intel® Desktop Board DZ87KLT-75K June 2013 Overclocking Assistant for the Intel® Desktop Board DZ87KLT-75K WARNING Altering clock frequency and/or voltage may (i) reduce system stability and useful life of the system and processor; (ii) cause the processor and other system components to fail; (iii) cause reductions in system performance; (iv) cause additional heat or other damage; and (v) affect system data integrity. Intel has not tested and does not warranty the operation of the processor beyond its specifications. WARNING Altering PC memory frequency and/or voltage may (i) reduce system stability and useful life of the system, memory and processor; (ii) cause the processor and other system components to fail; (iii) cause reductions in system performance; (iv) cause additional heat or other damage; and (v) affect system data integrity. Intel assumes no responsibility that the memory included, if used with altered clock frequencies and/or voltages, will be fit for any particular purpose. Check with the memory manufacturer for warranty and additional details. INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL® PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL’S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. Intel products are not intended for use in medical, life saving, or life sustaining applications.
    [Show full text]
  • Temperature Adaptive Computing in a Real PC
    TEAPC: Temperature Adaptive Computing in a Real PC Augustus K. Uht and Richard J. Vaccaro University of Rhode Island Microarchitecture Research Institute Department of Electrical and Computer Engineering 4 East Alumni Ave. Kingston, RI 02864, USA {[email protected], [email protected]} Abstract* Herein we present an adaptive prototype, TEAPC, based on the standard IBM/Intel PC architecture and TEAPC is an IBM/Intel-standard PC realization realizing underclocking, that is, operating the system of a CPU temperature-adaptive feedback-control greatly below its specified frequency. We have system. The control system adjusts the CPU frequency implemented the entire TEAPC control system in and/or voltage to maintain a constant set-point software, in a Windows application. It runs in a temperature. TEAPC dynamically adapts to changing normal application mode on Windows 2000, CPU computation loads, as well as any other system multitasking normally with other applications. CPU phenomenon that affects the CPU temperature. For frequency and core voltage are changed dynamically, example, with the specific TEAPC hardware, should based on the output of a feedback control system using the CPU cooling fan fail, the control system detects the CPU chip temperature as its input. The control the increase in CPU temperature and automatically loop and underclocking models should be usable for reduces the CPU frequency and voltage to keep the any PC to be built, as well as many that already exist. CPU from overheating. In such a circumstance the This paper’s contributions are as follows: system continues to operate. All of the adaptation is Demonstration of temperature control of a COTS done dynamically, at runtime, on an unmodified (Commercial Off-The-Shelf) based PC via a modern standard operating system (Windows 2000) with a feedback control system; On-demand reliability and purely software-implemented feedback-control system.
    [Show full text]
  • Comptia A+ Certification Exam Prep
    Course 445 CompTIA A+ Certification Exam Prep 445/CN/H.3/709/H.2 Acknowledgments The author would like to thank the following people for their valuable help with this course: Teresa Shinn Carl Waldron Tim Watts Esteban Delgado Dave O’Neal Shirley Auguste Alicia Richardson © Learning Tree International, Inc. All rights reserved. Not to be reproduced without prior written consent. © LEARNING TREE INTERNATIONAL, INC. All rights reserved. All trademarked product and company names are the property of their respective trademark holders. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, or translated into any language, without the prior written permission of the publisher. Copying software used in this course is prohibited without the express permission of Learning Tree International, Inc. Making unauthorized copies of such software violates federal copyright law, which includes both civil and criminal penalties. Introduction and Overview Course Objectives The main objective of this course is to Prepare you to pass the CompTIA A+ 220-901 and 220-902 exams To accomplish this goal, you will learn how to Safely disassemble and reassemble a complete PC system Install and configure common PC motherboards and adapter cards Use an organized strategy to identify and troubleshoot common PC problems Select and install the correct type of RAM memory needed to upgrade a PC system Install and configure data storage systems, including hard disks, solid state drives (SSD), CD, and DVD-ROM drives Identify fundamental components of laptops and mobile devices and describe how to troubleshoot common problems © Learning Tree International, Inc.
    [Show full text]
  • Error Handling and Energy Estimation Framework for Error Resilient Near-Threshold Computing Rengarajan Ragavan
    Error Handling and Energy Estimation Framework For Error Resilient Near-Threshold Computing Rengarajan Ragavan To cite this version: Rengarajan Ragavan. Error Handling and Energy Estimation Framework For Error Resilient Near- Threshold Computing. Embedded Systems. Rennes 1, 2017. English. tel-01636803 HAL Id: tel-01636803 https://hal.inria.fr/tel-01636803 Submitted on 17 Nov 2017 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. No d’ordre : ANNÉE 2017 THÈSE / UNIVERSITÉ DE RENNES 1 sous le sceau de l’Université Bretagne Loire pour le grade de DOCTEUR DE L’UNIVERSITÉ DE RENNES 1 Mention : Traitement du Signal et Télécommunications École doctorale MATISSE présentée par Rengarajan RAGAVAN préparée à l’unité de recherche UMR6074 IRISA Institut de recherche en informatique et systèmes aléatoires - CAIRN École Nationale Supérieure des Sciences Appliquées et de Technologie Thèse soutenue à Lannion le 22 septembre 2017 devant le jury composé de : Edith BEIGNÉ Error Handling and Directrice de Recherche à CEA Leti / rapporteur Energy Estimation Patrick GIRARD Directrice de Recherche à CNRS, LIRMM / rappor- teur Framework For Lorena ANGHEL Error Resilient Professeur à Grenoble INP, TIMA / examinateur Daniel MÉNARD Near-Threshold Professeur à INSA Rennes, IETR / examinateur Olivier SENTIEYS Computing Professeur à l’Université de Rennes 1 / directeur de thèse Cédric KILLIAN Maître de Conférences à l’Université de Rennes 1 / co-directeur de thèse To My Parents, My Brother, and My Teachers iii Acknowledgements I sincerely express my gratitude to my thesis supervisor Prof.
    [Show full text]
  • Dynamic Voltage and Frequency Scaling with Multi-Clock Distribution Systems on SPARC Core" (2009)
    Rochester Institute of Technology RIT Scholar Works Theses Thesis/Dissertation Collections 5-1-2009 Dynamic voltage and frequency scaling with multi- clock distribution systems on SPARC core Michael Nasri Michael Follow this and additional works at: http://scholarworks.rit.edu/theses Recommended Citation Michael, Michael Nasri, "Dynamic voltage and frequency scaling with multi-clock distribution systems on SPARC core" (2009). Thesis. Rochester Institute of Technology. Accessed from This Thesis is brought to you for free and open access by the Thesis/Dissertation Collections at RIT Scholar Works. It has been accepted for inclusion in Theses by an authorized administrator of RIT Scholar Works. For more information, please contact [email protected]. Dynamic Voltage and Frequency Scaling with Multi-Clock Distribution Systems on SPARC Core by Michael Nasri Michael A Thesis Submitted in Partial Fulfillment of the Requirements for the Degree of Master of Science in Computer Engineering Supervised by Dr. Dhireesha Kudithipudi Department of Computer Engineering Kate Gleason College of Engineering Rochester Institute of Technology Rochester, NY May 2009 Approved By: Dr. Dhireesha Kudithipudi Assistant Professor, RIT Department of Computer Engineering Primary Advisor Dr. Ken Hsu Professor, RIT Department of Computer Engineering Dr. Muhammad Shaaban Associate Professor, RIT Department of Computer Engineering Thesis Release Permission Form Rochester Institute of Technology Kate Gleason College of Engineering Title: Dynamic Voltage and Frequency Scaling with Multi-Clock Distribution Systems on SPARC Core I, Michael Nasri Michael, hereby grant permission to the Wallace Memorial Library to reproduce my thesis in whole or part. Michael Nasri Michael Date Dedication To my parents for their pride and encouragement To my brother and sisters for their love and support Acknowledgements Thanks to Dr.
    [Show full text]
  • Geosciences and Climate on Marvell Thunderx2 White Paper
    WHITE PAPER Geosciences and Climate on Marvell ThunderX2 Server Processor Business Unit Marvell June 2019 ABSTRACT Earthquakes and climate change affect everyone on our planet. Increases in computational capability are letting scientists simulate these phenomena with increasingly finer resolution grids and with greater accuracy and urgency. In this white paper, we present why the Marvell® ThunderX2® Arm®-based server processor provides significant value for end users deploying applications in the field of weather modeling and geosciences. Marvell ThunderX2 – Designed for High-Performance Computing Before we dig deeper in to the ThunderX2 architecture, it is important to understand a typical way of categorizing high-performance computing (HPC) deployments. High-performance computing is broadly divided into two categoriesi. Capability computing refers to using a large- scale HPC installation to solve a single problem in the shortest possible time, for example simulating weather models on a Tier-0 HPC system. Capacity computing refers to optimizing system efficiency to solve as many mid-size or smaller problems as possible at the same time at the lowest possible cost, for example automobile manufacturers using rented (on-demand) HPC resources to simulate numerous drive models for their products. It is clear from the definitions above that an HPC customer must incorporate components of cost/CPU, # of CPUs and performance (execution time)/CPU. Thus, the cost of an HPC application can be summarized as cost α CPU hours where CPU hours = #cores x execution time The other important factor driving design decisions of an HPC system is the application suite running on a cluster. Although capability computing targets application runs with the lowest execution time, excessive application scaling may deliver diminishing returns in performance improvement while linearly increasing CPU-hours.
    [Show full text]
  • Error Handling and Energy Estimation for Error Resilient Near-Threshold Computing Rengarajan Ragavan
    Error handling and energy estimation for error resilient near-threshold computing Rengarajan Ragavan To cite this version: Rengarajan Ragavan. Error handling and energy estimation for error resilient near-threshold com- puting. Hardware Architecture [cs.AR]. Université Rennes 1, 2017. English. NNT : 2017REN1S038. tel-01654476 HAL Id: tel-01654476 https://tel.archives-ouvertes.fr/tel-01654476 Submitted on 4 Dec 2017 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. No d’ordre : ANNÉE 2017 THÈSE / UNIVERSITÉ DE RENNES 1 sous le sceau de l’Université Bretagne Loire pour le grade de DOCTEUR DE L’UNIVERSITÉ DE RENNES 1 Mention : Traitement du Signal et Télécommunications École doctorale MATISSE présentée par Rengarajan RAGAVAN préparée à l’unité de recherche UMR6074 IRISA Institut de recherche en informatique et systèmes aléatoires - CAIRN École Nationale Supérieure des Sciences Appliquées et de Technologie Thèse soutenue à Lannion le 22 septembre 2017 devant le jury composé de : Edith BEIGNÉ Error Handling and Directrice de Recherche à CEA Leti / rapporteur
    [Show full text]
  • Power Management in Tizen and Linux Kernel (Tizen 2.3)
    Power Management in Tizen and Linux Kernel (Tizen 2.3) Minsoo Ryu Real-Time Computing and Communications Lab. Hanyang University [email protected] Real-Time Computing and Communications Lab., Hanyang University http://rtcc.hanyang.ac.kr Outline Introduction ACPI (Advanced Configuration Power Interface) DVFS (Dynamic Voltage Frequency Scaling) Power Management in Tizen Lab Assignment Real-Time Computing and Communications Lab., Hanyang University http://rtcc.hanyang.ac.kr 22 Why Power Management? Battery-operated devices . Smartphones, digital cameras, and laptops use batteries . Power savings and battery run times are paramount Increasing energy use of servers . Servers and data centers in U.S. reached 1.5% of the national electricity consumption in 2006 . The electricity consumption of servers doubled between 2000 and 2006 Real-Time Computing and Communications Lab., Hanyang University http://rtcc.hanyang.ac.kr 33 Power Consumption in Smartphones Real-Time Computing and Communications Lab., Hanyang University http://rtcc.hanyang.ac.kr 44 Power Consumption of Memory In a server system . Memory consumes 19% of system power on average . Some work notes up to 40% of total system power Real-Time Computing and Communications Lab., Hanyang University http://rtcc.hanyang.ac.kr 55 Idle Power Consumption Only 30% of servers in data centers are fully utilized while keeping the other 70% in idle state . Idle servers consume between 60% and 66% of the peak load power consumption Real-Time Computing and Communications Lab., Hanyang University http://rtcc.hanyang.ac.kr 66 Two Dimensions of Power Management Power management when the system is idle . Select the most efficient idle state Power management when the system is active .
    [Show full text]
  • Beyond CPU: Considering Memory Power Consumption of Software
    Beyond CPU: Considering Memory Power Consumption of Software Hayri Acar1, Gülfem I. Alptekin2, Jean-Patrick Gelas3 and Parisa Ghodous1 1LIRIS, University of Lyon, Lyon, France 2Galatasaray University, Istanbul, Turkey 3ENS Lyon, LIP, UMR 5668, Lyon, France Keywords: Power Consumption, Sustainable Software, Energy Efficiency, Green IT. Abstract: ICTs (Information and Communication Technologies) are responsible around 2% of worldwide greenhouse gas emissions (Gartner, 2007). And according to the Intergovernmental Panel on Climate Change (IPPC) recent reports, CO2 emissions due to ICTs are increasing widely. For this reason, many works tried to propose various tools to estimate the energy consumption due to software in order to reduce carbon footprint. However, these studies, in the majority of cases, takes into account only the CPU and neglects all others components. Whereas, the trend towards high-density packaging and raised memory involve a great increased of power consumption caused by memory and maybe memory can become the largest power consumer in servers. In this paper, we model and then estimate the power consumed by CPU and memory due to the execution of a software. Thus, we perform several experiments in order to observe the behavior of each component. 1 INTRODUCTION more possible to obtain accurate and efficient results for energy consumption. However, using these types ICTs (Information and Communication of devices is complicated because it is necessary to Technologies) are responsible around 2% of have these devices and connect them to different worldwide greenhouse gas emissions (Gartner, components. What is more with this method, it is 2007). And according to the Intergovernmental Panel impossible to measure the energy consumed by on Climate Change (IPPC) recent reports, CO2 virtual machines and applications on process.
    [Show full text]
  • TEAPC: Adaptive Computing and Underclocking in a Real PC Augustus K
    From: The Proceedings of the First IBM P=ac2 Conference, October 6-8, 2004,IBM T.J. Watson Research Center, Yorktown Heights, NY, USA. Copyright 2004, A.K. Uht and R.J. Vaccaro. TEAPC: Adaptive Computing and Underclocking in a Real PC Augustus K. Uht and Richard J. Vaccaro University of Rhode Island Microarchitecture Research Institute Department of Electrical and Computer Engineering 4 East Alumni Ave. Kingston, RI 02864, USA {[email protected], [email protected]} real-time internal temperature. Since this is the major Abstract component of variation of timing within an Intel TEAPC is an IBM/Intel-standard PC realization of microprocessor (the CPU’s core voltage is highly the TEAtime performance “maximizing” adaptive regulated and approximately constant), it mirrors the computing algorithm, giving performance beyond original TEAtime philosophy. Further, it is possible worst-case-specifications. TEAPC goes beyond the to base the real-time performance on the real-time TEAtime algorithm by adapting to the current CPU power consumption of the entire PC; the latter is load. It is also the first machine to use extensive partially addressed in this paper. underclocking for disaster tolerance, low power consumption and high reliability. This is all done We have implemented the entire control system in dynamically, at runtime, on an unmodified standard software, in a Windows application. It runs in a operating system (Windows 2000) with a purely normal application mode on Windows 2000, software-implemented feedback-control based multitasking normally with other applications. CPU algorithm. frequency and core voltage are changed dynamically, based on the output of a feedback control system 1 Introduction using the CPU chip temperature as its primary input.
    [Show full text]