Exploring the Ultimate Limits of Adiabatic Circuits

Total Page:16

File Type:pdf, Size:1020Kb

Exploring the Ultimate Limits of Adiabatic Circuits Special Session: Exploring the Ultimate Limits of Adiabatic Circuits Michael P. Frank Robert W. Brocato Thomas M. Conte Alexander H. Hsia Cognitive & Emerging RF Microsystems Dept. Schools of Computer Science and Sandia National Laboratories Computing Dept. Sandia National Laboratories Electrical & Computer Eng. Albuquerque, NM, USA Sandia National Laboratories Albuquerque, NM, USA Georgia Institute of Technology Albuquerque, NM, USA orcid:0000-0001-9751-1234 Atlanta, GA, USA Requiescat in pace [email protected] [email protected] Anirudh Jain Nancy A. Missert Karpur Shukla Brian D. Tierney School of Computer Science Nanoscale Sciences Department Laboratory for Emerging Radiation Hard CMOS Georgia Institute of Technology Sandia National Laboratories Technologies Technology Dept. Atlanta, GA, USA Albuquerque, NM, USA Brown University Sandia National Laboratories [email protected] orcid:0000-0003-2082-2282 Providence, RI, USA Albuquerque, NM, USA orcid:0000-0002-7775-6979 [email protected] Abstract—The field of adiabatic circuits is rooted in electronics I. INTRODUCTION know-how stretching all the way back to the 1960s and has poten- tial applications in vastly increasing the energy efficiency of far- In 1961, Rolf Landauer of IBM observed that there is a fun- future computing. But now, the field is experiencing an increased damental physical limit to the energy efficiency of logically ir- level of attention in part due to its potential to reduce the vulnera- reversible computational operations, meaning, those that lose bility of systems to side-channel attacks that exploit, e.g., unwan- known information to their thermal environment [1]. This ob- ted EM emissions, power supply fluctuations, and so forth. servation, now enshrined as Landauer’s Principle, is today un- In this context, one natural question is: Just how low can the derstood (with some important qualifications) as a rigorous the- energy dissipation from adiabatic circuits, and the associated ex- orem of fundamental statistical physics and information theory traneous signal emissions, be made to go? We argue that the ulti- [2], and is widely considered to be one of the most important mate limits of this approach lie much farther away than is com- results in the fundamental physics of computing. monly appreciated. Recent advances at Sandia National Labora- tories in the design of fully static, fully adiabatic CMOS logic styles During the 1970s, Landauer’s protégé Charles Bennett real- and high-quality energy-recovering resonant power-clock drivers ized that an alternative paradigm for computation based on log- offer the potential to reduce dynamic switching losses by multiple ically reversible operations, which could in principle sidestep orders of magnitude, and, particularly for cryogenic applications, Landauer’s efficiency limit, was theoretically coherent [3], and optimization of device structures can reduce the standby power developed a more sophisticated understanding of the thermody- consumption of inactive devices, and the ultimate dissipation li- namics of computation built on this insight [4], [5]. Early con- mits of the adiabatic approach, by multiple orders of magnitude cepts for engineering implementations of reversible computing as well. (e.g., [6], [7]) eventually led to Younis & Knight’s discovery (in In this paper, we review the above issues, and give a prelimi- the early 1990s) that it could even be achieved for pipelined se- nary overview of our group’s activities towards the demonstration quential CMOS, using their CRL [8] and SCRL [9] logic styles. of groundbreaking levels of energy efficiency for semiconductor- CRL was the first complete sequential CMOS logic family that based logic, together with a broader exploration of the ultimate is what we call truly, fully adiabatic, meaning that, in an ideal- limits of physically realizable techniques for approaching the the- ized, no-leakage limit, with negligible parasitic capacitances, the oretical ideal of perfect thermodynamic reversibility in comput- dissipation per logic operation can in principle be made arbitra- ing, and the study of the implications of this technology direction for practical computing architectures. rily small—even below Landauer’s limit of 퐸diss ≥ 푘푇 ln 2 for Keywords—Adiabatic circuits, adiabatic logic, CMOS, physical irreversible bit erasures. Due to Landauer’s Principle, we know limits of computing, low-power design, reversible computing that any physical mechanism capable of beating this limit must be logically reversible—although what that means, precisely, is misapprehended in much of the literature (see [10] for a correct This work is supported by the Advanced Simulation and Computing (ASC) account). program at the U.S. Department of Energy’s National Nuclear Security Admini- stration (NNSA). Sandia National Laboratories is a multi-mission laboratory Note that avoiding energy dissipation includes avoiding en- managed and operated by National Technology and Engineering Solutions of ergy emissions associated with side-channel attacks such as Dif- Sandia, LLC, a wholly owned subsidiary of Honeywell International, Inc., for ferential Power Analysis (DPA); this led to attention being paid NNSA under contract DE-NA0003525. This document describes objective to the possible security applications of adiabatic circuits and re- technical results and analysis. Any subjective views or opinions that might be versible computing as early as 2006 [11]. However, existing expressed in this document do not necessarily represent the views of the U.S. Department of Energy or the United States Government. Approved for public implementations of adiabatic circuits typically remain far from release, SAND2020-8419 C. the reversible ideal, for several reasons that we will discuss. XXX-X-XXXX-XXXX-X/XX/$XX.00 ©20XX IEEE In 2017, an effort informally termed the Adiabatic Circuits In parallel with this core engineering effort, the scope of our Feasibility Study (ACSF) began at Sandia National Laboratories effort has expanded over the last year to also include two univ- with the goal of pushing the limits of energy efficiency in ersity-based collaborations on related topics: physical implementations of adiabatic CMOS circuits, as a part of an effort to assess the long-term feasibility of these techniques 1) Karpur Shukla of Brown University is applying theoret- for possible use in future high-performance computing (HPC) ical tools from modern non-equilibrum quantum thermodyna- systems. Although the main goal of this effort is to attain more mics to derive fundamental, technology-independent lower li- power-efficient (and ultimately, more cost-efficient) computa- mits on energy dissipation as a function of delay for general tion, improving security can be a potentially beneficial side ef- quantum systems performing classical reversible computations. fect, as previously mentioned. This study may help us develop new insights into the design of innovative “Beyond CMOS” device technologies for reversible II. TECHNICAL APPROACH computing, so as to aspirationally achieve breakthrough reduc- tions in energy-delay product, and take the cost-efficient perf- As mentioned above, to avoid limiting the energy efficiency ormance of reversible computing far beyond the limits of any (i.e., to approach zero energy dissipation per operation), the phy- possible non-reversible technology. sical operation of our circuits must be truly, fully adiabatic, 2) Tom Conte and Anirudh Jain of Georgia Tech aim to which requires us to apply the principles of reversible comput- carry out a study of processor architectures based on reversible ing, albeit in the generalized sense discussed in [10]. Ignoring technology, including an assessment of the potential system- leakage for the moment, even just to approach reversible, adia- level impact of hypothetical new reversible device technologies batic operation in a switching circuit requires that the following for computational science applications. fundamental “design rules” must be obeyed [12]: 1) Avoid passing current through diodes. This is due to the III. PROGRESS TO DATE AND EARLY RESULTS “diode drop,” an intrinsic voltage drop that exists in a diode, A. Logic Family, Simulations, and Test Chip Layouts. which is unavoidable for fundamental thermodynamic reasons. As mentioned in the previous section, our approach is based 2) Avoid turning a switch (such as a FET) “on” when there on implementing truly, fully adiabatic CMOS logic styles. The is a (non-negligible) voltage difference 푉 between its drain DS first of these, historically, was Younis & Knight’s Charge Reco- and source terminals. Such an event would result in a substan- very Logic (CRL) [8]. A number of generalizations and varia- tial, non-adiabatic flow of current. tions on the techniques innovated by CRL were developed in 3) Avoid turning a switch “off” when there is a non-negli- subsequent literature (e.g., [9], [14], [15]). We selected for our gible current 퐼 between drain and source, unless these nodes DS study a version called 2LAL (two-level adiabatic logic) which is remain connected along an alternate path. The reasoning be- effectively just a slight generalization of CRL but was reinven- hind this rule is more subtle, and so it is not widely recognized, ted by one of us (Frank) in 2000. It is an attractive target for and is broken by many designs. Nevertheless, obeying this rule study because of its simple clock waveforms, which have the is essential for approaching physical reversibility [13]. shortest
Recommended publications
  • By Erika Azabache Villar a Thesis Submitted in Partial Fulfillment of The
    SMALLER/FASTER DELTA-SIGMA DIGITAL PIXEL SENSORS by Erika Azabache Villar A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Integrated Circuits and Systems Department of Electrical and Computer Engineering University of Alberta c Erika Azabache Villar, 2016 Abstract A digital pixel sensor (DPS) array is an image sensor where each pixel has an analog-to-digital converter (ADC). Recently, a logarithmic delta-sigma (ΔΣ) DPS array, using first-order ΔΣ ADCs, achieved wide dynamic range and high signal-to-noise-and-distortion ratios at video rates, requirements that are difficult to meet using conventional image sensors. However, this state-of-the-art ΔΣ DPS design is either too large for some applications, such as optical imag- ing, or too slow for others, such as gamma imaging. Consequently, this master’s thesis investi- gates smaller or faster ΔΣ DPS designs, relative to the state of the art. All designs are validated through simulations. Commercial image sensors, for optical and gamma imaging, are used as targeted baselines to establish competitive specifications. To achieve a smaller pixel, process scaling is exploited. Three logarithmic ΔΣ DPS designs are presented for 180, 130, and 65 nm fabrication processes, demonstrating a path to competitiveness for the optical imaging market. Decimator and readout circuits are improved, compared to previous work, while reducing area, and capacitors in the modulator prove to be the limiting factor in deep-submicron processes. Area trends are used to construct a roadmap to even smaller pixels. To achieve a faster pixel, a higher-order ΔΣ architecture is exploited.
    [Show full text]
  • An Area Efficient Sub-Threshold Voltage Level Shifter Using a Modified Wilson Current Mirror for Low Power Applications
    An Area Efficient Sub-threshold Voltage Level Shifter using a Modified Wilson Current Mirror for Low Power Applications Biswarup Mukherjee & Aniruddha Ghosal Published online: 22 May 2019. Submit your article to this journal Article views: 12 View Crossmark data An Area Efficient Sub-threshold Voltage Level Shifter using a Modified Wilson Current Mirror for Low Power Applications Biswarup Mukherjee1 and Aniruddha Ghosal2 1Neotia Institute of Technology, Management and Science, Jhinga, Diamond Harbour Road, 24Pgs. (S), West Bengal, India; 2Institute of Radio Physics and Electronics, University of Calcutta, West Bengal, India ABSTRACT KEYWORDS In the present communication, a new technique has been introduced for implementing low-power Current mirror; CAD area efficient sub-threshold voltage level shifter (LS) circuit. The proposed LS circuit consists of only simulation; Level shifter (LS); nine transistors and can operate up to 100 MHz of input frequency successfully. The proposed LS Low power; Sub-threshold is made of single threshold voltage transistors which show least complexity in fabrication and bet- operation; Typical transistor (TT) ter performance in terms of delay analysis and power consumption compared to other available designs. CAD tool-based simulation at TSMC 180 nm technology and comparison between the pro- posed design and other available designs show that the proposed design performs better than other state-of-the-art designs for a similar range of voltage conversion with the most area efficiency. 1. INTRODUCTION are used to reduce the circuit activity or the capacitance value in order to reduce the switching power consump- With the increasing demand of portable devices in elec- tion.
    [Show full text]
  • Overview of CMOS Sensors for Future Tracking Detectors
    instruments Review Overview of CMOS Sensors for Future Tracking Detectors Ricardo Marco-Hernández Instituto de Física Corpuscular (CSIC-UV), 46980 Valencia, Spain; rmarco@ific.uv.es Received: 23 October 2020; Accepted: 27 November 2020; Published: 30 November 2020 Abstract: Depleted Complementary Metal-Oxide-Semiconductor (CMOS) sensors are emerging as one of the main candidate technologies for future tracking detectors in high luminosity colliders. Their capability of integrating the sensing diode into the CMOS wafer hosting the front-end electronics allows for reduced noise and higher signal sensitivity, due to the direct collection of the sensor signal by the readout electronics. They are suitable for high radiation environments due to the possibility of applying high depletion voltage and the availability of relatively high resistivity substrates. The use of a CMOS commercial fabrication process leads to their cost reduction and allows faster construction of large area detectors. In this contribution, a general perspective of the state of the art of CMOS detectors for High Energy Physics experiments is given. The main developments carried out with regard to these devices in the framework of the CERN RD50 collaboration are summarized. Keywords: DMAPS; CMOS; radiation sensors; electronics 1. Introduction Current large pixel detectors in High Energy Physics, such as the ones which will be part of the A Toroidal LHC (Large Hadron Collider) ApparatuS (ATLAS) Tracker Detector [1] or the Compact Muon Solenoid (CMS) Tracker Detector [2] upgrades for the High-Luminosity Large Hadron Collider (HL-LHC), mostly follow a hybrid approach. In a hybrid pixel detector the sensor and the readout electronics are independent devices connected by means of bump-bonding, as can be seen in Figure1.
    [Show full text]
  • A Area VLSI Design (SCL)
    A Area VLSI Design (SCL) A1 Sub Area ASIC Design (SCL) A1.1 Design of Instrumentation Amplifier A High-Precision Instrumentation Amplifier with large CMRR is required for the Sensor Signal conditioning applications. Instrumentation amplifier is a versatile device used for amplification of small differential mode signals while rejecting large common mode signal at the same time. In a typical signal conditioning chain, the output of the sensor goes to the instrumentation amplifier. The instrumentation amplifier amplifies the small output signal of the sensor and gives it to the ADC for digitization. Target Specification: • Supply Voltage: 3.3V • Temperature Range: -40 degC to 125 degC • Programmable Gain: 1 to 1000 • Low noise < 0.3µV p-p at 0.1 Hz to 10 Hz • Low non linearity < 20ppm (Gain = 1) • High CMRR (Common Mode Rejection Ratio) > 90dB (Gain=1) • Low offset voltage < 100µV • 3 dB Bandwidth: 1MHz (at Gain = 1) The design of the proposed Instrumentation Amplifier is to be carried out in SCL 180 nm process. A1.2 Design of a Current Feedback Amplifier (CFA) Opamps come in two types: the voltage-feedback amplifier (VFA), for which the input error is a voltage; and the current-feedback amplifier (CFA), for which the input error is a current. The CFA noninverting amplifier is relatively free from a gain-bandwidth trade-off. Additional advantage of CFAs compared to VFAs is the absence of slew-rate limiting. CFA is primarily used where both high speed and low distortion signal processing is required. CFAs are mainly used in broadcast video systems, radar systems , IF and RF stages and other high speed circuits.
    [Show full text]
  • Progress in Particle Tracking and Vertexing Detectors Nicolas Fourches
    Progress in particle tracking and vertexing detectors Nicolas Fourches (CEA/IRFU): 22nd March 2020, University Paris-Saclay Abstract: This is part of a document, which is devoted to the developments of pixel detectors in the context of the International Linear Collider. From the early developments of the MIMOSAs to the proposed DotPix I recall some of the major progresses. The need for very precise vertex reconstruction is the reason for the Research and Development of new pixel detectors, first derived from the CMOS sensors and in further steps with new semiconductors structures. The problem of radiation effects was investigated and this is the case for the noise level with emphasis of the benefits of downscaling. Specific semiconductor processing and characterisation techniques are also described, with the perspective of a new pixel structure. TABLE OF CONTENTS: 1. The trend towards 1-micron point-to-point resolution and below 1.1. Gaseous detectors : 1.2. Liquid based detectors : 1.3. Solid state detectors : 2. The solution: the monolithically integrated pixel detector: 2.1. Advantages and drawbacks 2.2. Spatial resolution : experimental physics requirements 2.2.1. Detection Physics at colliders 2.2.1.1. Track reconstruction 2.2.1.2. Constraints on detector design a) Multiple interaction points in the incident colliding particle bunches b) Multiple hits in single pixels even in the outer layers c) Large NIEL (Non Ionizing Energy Loss) in the pixels leading to displacement defects in the silicon layers d) Cumulative ionization in the solid state detectors leading to a total dose above 1 MGy in the operating time of the machine a) First reducing the bunch length and beam diameter would significantly limit the number of spurious interaction points.
    [Show full text]
  • EP Activity Report 2015
    EUROPRACTICE IC SERVICE THE RIGHT COCKTAIL OF ASIC SERVICES EUROPRACTICE IC SERVICE OFFERS YOU A PROVEN ROUTE TO ASICS THAT FEATURES: · .QYEQUV#5+%RTQVQV[RKPI · (NGZKDNGCEEGUUVQUKNKEQPECRCEKV[HQTUOCNNCPFOGFKWOXQNWOGRTQFWEVKQPSWCPVKVKGU · 2CTVPGTUJKRUYKVJNGCFKPIYQTNFENCUUHQWPFTKGUCUUGODN[CPFVGUVJQWUGU · 9KFGEJQKEGQH+%VGEJPQNQIKGU · &KUVTKDWVKQPCPFHWNNUWRRQTVQHJKIJSWCNKV[EGNNNKDTCTKGUCPFFGUKIPMKVUHQTVJGOQUVRQRWNCT%#&VQQNU · 46.VQ.C[QWVUGTXKEGHQTFGGRUWDOKETQPVGEJPQNQIKGU · (TQPVGPF#5+%FGUKIPVJTQWIJ#NNKCPEG2CTVPGTU +PFWUVT[KUTCRKFN[FKUEQXGTKPIVJGDGPG«VUQHWUKPIVJG'74124#%6+%'+%UGTXKEGVQJGNRDTKPIPGYRTQFWEVFGUKIPUVQOCTMGV SWKEMN[CPFEQUVGHHGEVKXGN[6JG'74124#%6+%'#5+%TQWVGUWRRQTVUGURGEKCNN[VJQUGEQORCPKGUYJQFQP°VPGGFCNYC[UVJG HWNNTCPIGQHUGTXKEGUQTJKIJRTQFWEVKQPXQNWOGU6JQUGEQORCPKGUYKNNICKPHTQOVJG¬GZKDNGCEEGUUVQUKNKEQPRTQVQV[RGCPF RTQFWEVKQPECRCEKV[CVNGCFKPIHQWPFTKGUFGUKIPUGTXKEGUJKIJSWCNKV[UWRRQTVCPFOCPWHCEVWTKPIGZRGTVKUGVJCVKPENWFGU+% OCPWHCEVWTKPIRCEMCIKPICPFVGUV6JKU[QWECPIGVCNNHTQO'74124#%6+%'+%UGTXKEGCUGTXKEGVJCVKUCNTGCF[GUVCDNKUJGF HQT[GCTUKPVJGOCTMGV THE EUROPRACTICE IC SERVICES ARE OFFERED BY THE FOLLOWING CENTERS: · KOGE.GWXGP $GNIKWO · (TCWPJQHGT+PUVKVWVHWGT+PVGITKGTVG5EJCNVWPIGP (TCWPJQHGT++5 'TNCPIGP )GTOCP[ This project has received funding from the European Union’s Seventh Programme for research, technological development and demonstration under grant agreement N° 610018. This funding is exclusively used to support European universities and research laboratories. © imec FOREWORD Dear EUROPRACTICE customers, We are at the start of the
    [Show full text]
  • A Survey of Analog-To-Digital Converters for Operation Under Radiation Environments
    electronics Review A Survey of Analog-to-Digital Converters for Operation under Radiation Environments Ernesto Pun-García 1,*,† and Marisa López-Vallejo 2 1 R&D Department, Arquimea Ingeniería SLU, 28918 Leganés, Spain 2 IPTC, ETSI Telecomunicación, Universidad Politécnica de Madrid, 28040 Madrid, Spain; [email protected] * Correspondence: [email protected]; Tel.: +34-610-571-208 † Current address: Calle Margarita Salas 10, 28918 Leganés, Madrid, Spain. Academic Editors: Marko Laketic, Paisley Shi and Anna Li Received: 7 September 2020; Accepted: 3 October 2020; Published: 15 October 2020 Abstract: In this work, we analyze in depth multiple characteristic data of a representative population of radenv-ADCs (analog-to-digital converters able to operate under radiation). Selected ADCs behave without latch-up below 50 MeV·cm2/mg and are able to bear doses of ionizing radiation above 50 krad(Si). An exhaustive search of ADCs with radiation characterization data has been carried out throughout the literature. The obtained collection is analyzed and compared against the state of the art of scientific ADCs, which reached years ago the electrical performance that radenv-ADCs provide nowadays. In fact, for a given Nyquist sampling rate, radenv-ADCs require significantly more power to achieve lower effective resolution. The extracted performance patterns and conclusions from our study aim to serve as reference for new developments towards more efficient implementations. As tools for this purpose, we have conceived FOMTID and FOMSET, two new figures of merit to compare radenv-ADCs that consider electrical and radiation performance. Keywords: ADC performance patterns; ADC state of the art; ADC survey; analog-to-digital converters; applications under radiation environments; figure of merit under radiation; radenv-ADC design guidelines; radiation effects; radiation hardening; radiation test methods 1.
    [Show full text]
  • WP Microelectronics and Interconnections
    Advanced European Infrastructures for Detectors at Accelerators FirSecondst Ann Annualual MeMeetingeting: WPWP44 St Statusatus R Reporteport ChrisChristophetophe de LA TA ILdeLE (laCN TailleRS/IN2P(CNRS/IN2P3)3) Valerio RE (INFN ) Valerio Re (INFN/Univ. Bergamo) LPNHE, Paris, April 7, 2017 DESY, Hamburg, June 17, 2016 This project has received funding from the European Union’s Horizon 2020 Research and Innovation programme under Grant Agreement no. 654168. WP4: microelectronics and interconnections • WP Coordinators: Christophe de la Taille, Valerio Re • Goal : provide chips and interconnections to detectors developed by other WPs • Task 1: Scientific coordination (CNRS-OMEGA, INFN-UNIBG) • Task 2 : 65 nm chips for trackers (CERN) • Fine pitch, low power, advanced digital processing • Task 3 : SiGe 130nm for calorimeters/gaseous (IN2P3) • Highly integrated charge and time measurement • Task 4 : interconnections between 65 nm chips and pixel sensors (INFN) • TSVs in 65 nm CMOS wafers, bonding of 65 nm chips to sensors, exploration of fine pitch bonding processes AIDA WP4 milestones MS4.1 Architectural review of deliverable chips in 65nm run M14 (accomplished) MS4.2 Final design review of 65nm M30 (October 2017) MS4.3 Test report of deliverable D4.1 M46 (February 2019) MS4.4 Selection of SiGe foundry M14 (accomplished) MS4.5 Final design review of deliverable chips in SiGe run M30 (October 2017) MS4.6 Test report of deliverable D4.2 M46 MS4.7 Selection of TSV process M14 (accomplished) MS4.8 Final design review of deliverable D4.3 (TSV
    [Show full text]
  • On the Effects of Total Ionizing Dose in Silicon-Germanium Bicmos Platforms
    ON THE EFFECTS OF TOTAL IONIZING DOSE IN SILICON-GERMANIUM BICMOS PLATFORMS A Thesis Presented to The Academic Faculty by Zachary E. Fleetwood In Partial Fulfillment of the Requirements for the Degree Master's Thesis in the School of Electrical and Computer Engineering Georgia Institute of Technology December 2014 Copyright c 2014 by Zachary E. Fleetwood ON THE EFFECTS OF TOTAL IONIZING DOSE IN SILICON-GERMANIUM BICMOS PLATFORMS Approved by: Professor John D. Cressler, Advisor School of Electrical and Computer Engineering Georgia Institute of Technology Professor John Papapolymerou School of Electrical Engineering Georgia Institute of Technology Professor Douglas Yoder School of Electrical Engineering Georgia Institute of Technology Date Approved: 5 Dec 2014 ACKNOWLEDGEMENTS I would first like to thank my research advisor Dr. Cressler. His endless support and encouragement has made me a better researcher and has provided me with the perfect environment to continue my education. I would also like to thank the members of my reading committee: Dr. Papapolymerou and Dr. Yoder. I appreciate their taking the time to provide me with helpful feedback on this thesis in addition to being my teachers at Georgia Tech. I would also like to thank my Mom, Dad, and brothers: Aaron and Nathan. Your support and love over the years means everything to me. I would especially like to thank my father for taking the time to help provide me with feedback on my technical work | you have taught me so much. And I would also like to thank all my friends. Thank you to my Vandy friends Dylan McQuaide and Zac Diggins | I don't know where I would be right now without you guys.
    [Show full text]
  • Intel's Atom Lines 1. Introduction to Intel's Atom Lines 3. Atom-Based Platforms Targeting Entry Level Desktops and Notebook
    Intel’s Atom lines • 1. Introduction to Intel’s Atom lines • 2. Intel’s low-power oriented CPU micro-architectures • 3. Atom-based platforms targeting entry level desktops and notebooks • 4. Atom-based platforms targeting tablets • 5. Atom-based platforms targeting smartphones • 6. Intel’s withdrawal from the mobile market • 7. References 1. Introduction to Intel’s Atom lines • 1.1 The rapidly increasing importance of the mobile market space • 1.2 Related terminology • 1.3 Introduction to Intel’s low-power Atom series 1.1 The rapidly increasing importance of the mobile market space 1.1 The rapidly increasing importance of the mobile market space (1) 1.1 The rapidly increasing importance of the mobile market space Diversification of computer market segments in the 2000’s Main computer market segments around 2000 Servers Desktops Embedded computer devices E.g. Intel’s Xeon lines Intel’s Pentium 4 lines ARM’s lines AMD’s Opteron lines AMD’s Athlon lines Major trend in the first half of the 2000’s: spreading of mobile devices (laptops) Main computer market segments around 2005 Servers Desktops Mobiles Embedded computer devices E.g. Intel’s Xeon lines Intel’s Pentium 4 lines Intel’s Celeron lines ARM’s lines AMD’s Opteron lines AMD’s Athlon64 lines AMD’s Duron lines 1.1 The rapidly increasing importance of the mobile market space (2) Yearly worldwide sales and Compound Annual Growth Rates (CAGR) of desktops and mobiles (laptops) around 2005 [1] 350 300 Millions 250 CAGR 17% 200 Mobile 150 100 Desktop CAGR 5% 50 0 2003 2004 2005 2006 2007 2008
    [Show full text]
  • A Case for Packageless Processors
    A Case for Packageless Processors Saptadeep Pal∗, Daniel Petrisko†, Adeel A. Bajwa∗, Puneet Gupta∗, Subramanian S. Iyer∗, and Rakesh Kumar† ∗Department of Electrical and Computer Engineering, University of California, Los Angeles †Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign fsaptadeep,abajwa,s.s.iyer,[email protected], fpetrisk2,[email protected] Abstract—Demand for increasing performance is far out- significantly limit the number of supportable IOs in the pacing the capability of traditional methods for performance processor due to the large size and pitch of the package- scaling. Disruptive solutions are needed to advance beyond to-board connection relative to the size and pitch of on- incremental improvements. Traditionally, processors reside inside packages to enable PCB-based integration. We argue chip interconnects (∼10X and not scaling well). In addition, that packages reduce the potential memory bandwidth of a the packages significantly increase the interconnect distance processor by at least one order of magnitude, allowable thermal between the processor die and other dies. Eliminating the design power (TDP) by up to 70%, and area efficiency by package, therefore, has the potential to increase bandwidth a factor of 5 to 18. Further, silicon chips have scaled well by at least an order of magnitude(Section II). Similarly, while packages have not. We propose packageless processors - processors where packages have been removed and dies processor packages are much bigger than the processor itself directly mounted on a silicon board using a novel integra- (5 to 18 times bigger). Removing the processor package tion technology, Silicon Interconnection Fabric (Si-IF).
    [Show full text]
  • The Intel X86 Microarchitectures Map Version 3.2
    The Intel x86 Microarchitectures Map Version 3.2 8086 (1978, 3 µm) 80386 (1985, 1.5 to 1 µm) P6 (1995, 0.50 to 0.35 μm) Skylake (2015, 14 nm) NetBurst (2000 , 180 to 130 nm) Series: Alternative Names: iAPX 386, 386, i386 Alternative Names: i686 Alternative Names: SKL (Desktop and Mobile), SKX (Server) (Note : all U and Y processors are MCPs) P5 (1993, 0.80 to 0.35 μm) Alternative Names: Pentium 4, Pentium IV, P4 • 16-bit data bus: Series: Series: Pentium Pro (used in desktops and servers) Series: Alternative Names: Pentium, Series: 8086 (iAPX 86) • Desktop/Server: i386DX Variant: Klamath (1997, 0.35 μm) • Desktop: Desktop 6th Generation Core i5 (i5-6xxx, S-H) 80586, 586, i586 • Desktop: Pentium 4 1.x (except those with a suffix, Willamette, 180 nm), Pentium 4 2.0 (Willamette, 180 • 8-bit data bus: • Desktop lower-performance: i386SX Alternative Names: Pentium II, PII • Desktop higher-performance: Desktop 6th Generation Core i7 (i7-6xxx, S-H), Desktop 7th Generation Core i7 X (i7-7xxxX, X), Desktop 7th Series: nm) 8088 (iAPX 88) • Mobile: i386SL, 80376, i386EX, Series: Pentium II 233/266/300 steppings C0 and C1 (Klamath, used in desktops) Generation Core i9 X (i7-7xxxXE, i7-7xxxX, X), Desktop 9th Generation Core i7 X (i7-9xxxX, X, 14 nm++?), Desktop 9th Generation Core i9 X • Desktop/Server: P5, P54C • Desktop higher-performance: Pentium 4 1.xA (Northwood, 130 nm), Pentium 4 2.x (except 2.0, 2.40A, i386CXSA, i386SXSA, i386CXSB New instructions: Deschutes (1998, 0.25 to 0.18 μm) (i7-9xxxXE, i7-9xxxX, X, 14 nm++?), Xeon W-3175X (W, 14 nm++?)
    [Show full text]