Thesis Title

No d’ordre : ANNÉE 2017 THÈSE / UNIVERSITÉ DE RENNES 1 sous le sceau de l’Université Bretagne Loire pour le grade de DOCTEUR DE L’UNIVERSITÉ DE RENNES 1 Mention : Traitement du Signal et Télécommunications École doctorale MATISSE présentée par Rengarajan RAGAVAN préparée à l’unité de recherche UMR6074 IRISA Institut de recherche en informatique et systèmes aléatoires - CAIRN École Nationale Supérieure des Sciences Appliquées et de Technologie Thèse soutenue à Lannion le 22 septembre 2017 devant le jury composé de : Edith BEIGNÉ Error Handling and Directrice de Recherche à CEA Leti / rapporteur Energy Estimation Patrick GIRARD Directrice de Recherche à CNRS, LIRMM / rapporteur Framework For Lorena ANGHEL Error Resilient Professeur à Grenoble INP, TIMA / examinateur Daniel MÉNARD Near-Threshold Professeur à INSA Rennes, IETR / examinateur Olivier SENTIEYS Computing Professeur à l’Université de Rennes 1 / directeur de thèse Cédric KILLIAN Maître de Conférences à l’Université de Rennes 1 / co-directeur de thèse To My Parents, My Brother, and My Teachers iii Acknowledgements I sincerely express my gratitude to my thesis supervisor Prof. Olivier Sentieys who gave me the opportunity to work him in prestigious INRIA/IRISA labs. His guidance and confidence in me gave extra boost and cultivated more interest in me to do this research work in a better way. His feedback and comments refined the quality of my research contributions and this thesis dissertation. I am very grateful to my thesis co-supervisor Dr. CédricKillian for his encour- agement and support. He was inspirational to me throughout my Ph.D. tenure. His suggestions and critical evaluation helped me to innovatively address various bottlenecks in this thesis work. I register my special thanks to Cédricfor his efforts in translating the long abstract of this thesis to French. I would like to thank my colleague Philippe Quémerais,Research Engineer, IRISA labs for his extensive support in configuring and tutoring various CAD tools. I acknowl- edge my fellow research student Benjamin Barrois for his help in modelling approximate arithmetic operators. I'm thankful to Mme. Laurence Gallas for her French lessons that helped me a lot during my stay in Lannion. Also, I would like to thank all the faculty members, researchers, and colleagues of ENSSAT who made my stay in Lannion most cherishable. This thesis work has seen the light of the day thanks to Universitéde Rennes 1, and IRISA laboratory for funding and facilitating this thesis work. Thanks are due to Nadia, Joelle, and Angelique for their support in administrative matters and during my missions abroad. I also express my gratitude to the entire jury panel for reading my thesis dissertation and taking part in the thesis defense. Also, special thanks to the reviewers for their valuable review comments that improved the quality of this dissertation. I offer my sincere thanks to all my friends and well-wishers who were supporting me all these years and looking for my betterment. Last but not least, I express my everlasting gratitude to my parents, my brother and almighty without them I cannot be what I am today. 10th October 2017 Rengarajan Lannion, France v Abstract Advancement in CMOS IC technology has improved the way in which integrated circuits are designed and fabricated over decades. Scaling down the size of transistors from micrometers to few nanometers has given the capability to place more tinier transistors in a unit of chip area that upheld Moore's law over the years. On the other hand, feature size scaling imposed various challenges like physical, material, power-thermal, technological, and economical that hinder the trend of scaling. To improve energy efficiency, Dynamic Voltage Scaling (DVS) is employed in most of the Ultra-Low Power (ULP) designs. In near-threshold region (NTR), the supply voltage of the transistor is slightly more than the threshold voltage of the transistor. In this region, the delay and energy both are roughly linear with the supply voltage. The Ion/Ioff ratio is higher compared to the sub-threshold region, which improves the speed and minimizes the leakage. Due to the reduced supply voltage, the circuits operating in NTR can achieve a 10× reduction in energy per operation at the cost of the same magnitude of reduction in the operating frequency. In the present trend of sub-nanometer designs, inherent low-leakage technologies like FD-SOI (Fully Depleted Silicon On Insulator), enhance the benefits of voltage scaling by utilizing techniques like Back-Biasing. However reduction in Vdd augments the impact of variability and timing errors in sub-nanometer designs. The main objective of this work is to handle timing errors and to formulate a framework to estimate energy consumption of error resilient applications in the context of near-threshold computing. It is highly beneficial to design a digital system by taking advantage of NTR, DVS, and FD-SOI. In spite of the advantages, the impact of variability and timing errors outweighs the above-mentioned benefits. There are existing methods that can predict and prevent errors, or detect and correct errors. But there is a need for a unified approach to handle timing errors in the near-threshold regime. In this thesis, Dynamic Speculation based error detection and correction is explored in the context of adaptive voltage and clock overscaling. Apart from error detection and correction, some errors can also be tolerated or, in other words, circuits can be pushed beyond their limits to compute incorrectly to achieve higher energy efficiency. The proposed error detection and correction method achieves 71% overclocking with 2% additional hardware cost. This work involves extensive study of design at gate level to understand the behaviour of gates under overscaling of supply voltage, bias voltage and clock frequency (collectively called as operating triads). A bottom-up approach is taken: by studying trends of energy vs. error of basic arithmetic operators at transistor level. Based on profiling of the arithmetic operators, a tool flow is formulated to estimate energy and error metrics for different operating triads. We achieve maximum energy efficiency of 89% for arithmetic operators like 8-bit and 16-bit adders at the cost of 20% faulty bits by operating in NTR. A statistical model is developed for the arithmetic operators to represent the behaviour of the operators for different variability impacts. This model is used for approximate computing of error resilient applications that can tolerate acceptable margin of errors. This method is further explored for execution unit of a VLIW processor. The proposed framework provides a quick estimation of energy and error metrics of benchmark programs by simple compilation in using C compiler. In the proposed energy estimation framework, characterization of arithmetic operators is done at transistor level, and the energy estimation is done at functional level. This hybrid approach makes energy estimation faster and accurate for different operating triads. The proposed framework estimates energy for different benchmark programs with 98% accuracy compared to SPICE simulation. Abbreviations AVS Adaptive Voltage sscaling BER Bit Error Rate BKA Brent-Kung Adder BOX Buried OXide CCS Conditional Carry Select CDM Critical Delay Margin CGRA Coarse-Grained Reconfigurable Architecture CMF Common Mode Failure CMOS Complementary Metal-Oxide-Semiconductor DDMR Diverse Double Modular Redundancy DMR Double Modular Redundancy DSP Digital Signal Processor DVFS Dynamic Voltage Frequency sscaling DVS Dynamic Voltage sscaling EDAP Energy Delay Area Product EDP Energy Delay Product FBB Forward Body Biasing FEHM Flexible Error Handling Module FF Flip-Flop FIR Finite Impulse Response FPGA Field-Programming Gate Array FDSOI Fully Depleted Silicon on Insulator FU Functional Unit GRAAL Global Reliability Architecture Approach for Logic HFM High Frequency Mode ix HPM Hardware Herformance Monitor IC Integrated Chip ILA Integrated Logic Analyzer ILEP Instruction Level Power Estimation ILP Instruction Level Parallelism IoT Internet of Things ISA Instruction Set Architecture LFBFF Linear Feed Back Flip Flop LFM Low Frequency Mode LFSR Linear Feedbacl Shift Register LSB Least Significant Bit LUT Look-Up Table LVDS Low-Voltage Differential Signaling LVT Low VT MMCM Mixed Mode Clock Manager MSB Most Significant Bit MSFF Master Slave Flip Flop NOP No OPeration NTR Near-Threshold Region PE Processing Element POFF Point of First Failure PUM Path Under Monitoring RBB Reverse Body Biasing RCA Ripple Carry Adder RTL Resistor- Transistor Logic RVT Regular VT SET Single Event Transients SEU Single Event Upset SNR Signal to Noise Ratio SNW Single N-Well SOI Silicon on Insulator SPICE Simulation Program with Integrated Circuit Emphasis SPW Single P-Well STA Static Timing Analysis STR Sub-Threshold Region SW Single Well TFD Timing Fault Detector TMR Triple Modular Redundancy UTBB Ultra-Thin Body and Box VEX ISA VEX Instruction Set Architecture VLIW Very Large Instruction Word VOS Voltage Over-Scaling Contents Acknowledgements v Abstract vii Abbreviations ix Contents 1 List of Figures 5 List of Tables 9 1 Résuméétendu 11 1.1 Introduction ................................... 11 1.2 Contributions .................................. 13 1.3 Gestion des erreurs temporelles ........................ 14 1.3.1 Gestion des erreurs avec une fenêtrede spéculationdynamique . 15 1.3.2 Boucle de retour pour augmentation ou diminution de la fréquence 17 1.3.3 Validation du concept sur FPGA Virtex de Xilinx . 18 1.4 Ajustement agressif de l'alimentation pour les applications résistantes aux erreurs ...................................... 23 1.4.1 Modélisationd'opérateurarithmétiqueavec gestion agressive de la tension d'alimentation ........................ 24 1.4.2 Validation du concept: modélisationd'additionneurs . 25 1.5 Estimation d'énergiedans un processeur ρ − V EX . 28 1.5.1 Caractérisationd'une unitéd'exécutiond'un cluster ρ − V EX . 30 1.5.2 Méthode proposéepour l'estimation de l'énergieau niveau instruction ................................... 34 1.5.3 Validation de la méthode ......................

Thesis Title

Computer Architectures an Overview

Open Access Proceedings Journal of Physics: Conference Series

Background Optimization in Full System Binary Translation

Oral History of Boris Babayan

Experience of Building and Deployment Debian on Elbrus Architecture

Chronologie De L'informatique Scientifique

Error Handling and Energy Estimation Framework for Error Resilient Near-Threshold Computing Rengarajan Ragavan

International Journal of Soft Computing and Engineering

Proceedings of the 7Th Spring/Summer Young Researchers’ Colloquium on Software Engineering

Computer  Pc Components, Features and Control  Components of System and Its Feature  Types of Computer  Software Types  Operating System

Design and Implementation of a Multimedia Extension for a RISC Processor

Architecture 1 CISC, RISC, VLIW, Dataﬂow Contents