An Outlook of Technology Scaling Beyond Moore's Law

Total Page:16

File Type:pdf, Size:1020Kb

An Outlook of Technology Scaling Beyond Moore's Law An Outlook of Technology Scaling Beyond Moore's Law Adrian M. Ionescu Ecole Polytechnique Fédérale de Lausanne Adrian Ionescu, October 2005 1 Summary • Introduction: Moore’s law… • More Moore, Beyond CMOS, More-than-More • Fundamental limits • Evolutionary and non-classical MOSFET (More Moore…) • Emerging nanoelectronics (… beyond CMOS) Single/few electron electronics & QCA Nanowires & carbon nanotubes Molecular electronics Spintronics • Conclusion Adrian Ionescu, October 2005 2 40 years of Moore’s law: 1965 - 2005 • 1965: a single transistor cost more than a dollar • 1975: the cost of a transistor had dropped to less than a penny, while transistor size allowed for almost 100,000 transistors on a single die • 1979 to 1999, processor performance went from about 1.5 million instructions per second (MIPS), to almost 50 MIPS on the i486™, to over 1,000 MIPS on the Intel® Pentium® III • Today's Intel® processors run at 3.2 GHz and higher, deliver over 10,000 MIPS, and can be manufactured in high volumes with transistors that cost less than 1/10,000th of a cent Adrian Ionescu, October 2005 3 How it started… Adrian Ionescu, October 2005 4 … and where we are… Source: http://www.intel.com/research/silicon/mooreslaw.htm Adrian Ionescu, October 2005 5 90 nm Intel’s processor First planar integrated Montecito (2004) circuit (1961) Itanium Processor Family Transistors: 1.72 Billion Frequency: >1.7GHz Power: ~100W Source: Intel Developer Forum, September, 2004 Adrian Ionescu, October 2005 6 Source: M. Bohr, Intel Development Forum, September 2004. Adrian Ionescu, October 2005 7 CMOS scaling (1) Alternative devices CMOS Nanotechnology benefits for 100 electronics and m) CMOS IC evolution μ 10 computing: 1 • nano-processors with CMOS 0.1μm in 2002 declining energy 0.1 • ultra-small terra-bit 0.01 level storage Feature Size ( Feature Transition Region Alternative 0.001 Quantum devices • > GHz frequency and devices bandwidth Atomic dimensions • integrated nano- 1960 2000 2020 2040 1980 sensors Year • novel functionalities Adrian Ionescu, October 2005 8 CMOS scaling (2) Classical Scaling Consequence (Denard) Voltage: V/α Higher density: α2 Oxide: tox/α Higher speed: α Wide width: W/α Power: 1/α2 Gate width: L/α Power density: const. Diffusion: xd/α Doping: αNA Deviations from classical scaling • gate leakage • reliability • performance at higher voltage Consequence • increase of power density Adapted after: H-S. P. Wong, et al., Proceedings of the IEEE, vol. 87, No. 4, pp. 537-70, 2001. Adrian Ionescu, October 2005 9 Gate dielectric @ fundamental scaling limit Practical limits on the thickness of the SiO2 gate oxide are crucial. Two fundamental considerations: • roughness to be controlled @ atomic scale: leakage of 1nm oxide increases by 10 every 0.1nm rms roughness • intrinsic transition region to reach bulk SiO2 properties: 0.3-0.5 nm Assume tolerable gate leakage is 100 Acm and gate area is 1% of a 1- cm chip and power-supply voltage of 1 V, the power dissipation due to the gate current is 1 W. J. D. Plummer and P. B. Griffin, Proceedings of the IEEE, Vol. 89, pp. 240 –258, 2001. D. A. Muller et al., Nature, pp. 758-761, June 1999. Adrian Ionescu, October 2005 10 Power is key limiting factor? • Active power • Passive power (gate & subthreshold leakage) Problems: • Server microprocessors cannot simultaneaously use all their transistors due to power limitations • Leakage power limits max usable μP frequency Gate Length (μm) Adrian Ionescu, October 2005 11 Cost limit Adrian Ionescu, October 2005 12 Fundamental limits From: thermodinamics, quantum-mechanics, electromagnetics • Limit on energy transfer during a binary switching: E(min) = (ln2) kT (=kTlogeN, N=2) (J. Neumann) • Heisenberg’s uncertainty principle: ΔE > h/Δt Æ forbidden region for power-delay • electromagnetics Æ τ > L/c0 (limited time of electromagnetic wave travelling across interconnects) Adrian Ionescu, October 2005 13 Why E(min) = kT x ln2 ? Binary signal discrimination: the slope of the static transfer curve of a (CMOS) binary logic gate must be greater than unity in absolute value at the transition point where input and output voltage levels are equal Æ CMOS inverter ⎡ Cfs ⎤ Cd Vdd(min) = 2[]kT / q ⎢1+ ⎥ ln(2 + ) ⎣ Cox + Cd ⎦ Cox kT kT Vdd(min) ≅ 2()ln 2 = 1.38 = 0.036V @T = 300K q q Min signal energy stored on gate: kT Es(min) = (1/ 2)Q Vdd = ()1/ 2 q x 2 ()ln 2 = kT x ln 2 == 0.693kT g q 2 εox Lmin ⎡ t ox ⎤ 2 1/ 2 with : Cg = ⇒ Lmin = ⎢ ⎥q /[]2()ln 2 kT = 9.3nm @ t ox = 1nm t ox ⎣εox ⎦ Source: J.D. Meindl, J. A. Davis, IEEE JSSC, Vol. 35, October 2000, pp. 1515-1516. Adrian Ionescu, October 2005 14 Fundamental limits Average power transfer during a binary transition, P, versus transition time, td. The red, orange, and green zones are forbidden by fundamental, silicon material, and 50-nm channel length transistor device level limits,respectively. Allowable design space Source: J. D. Meindl, Q. Chen, J. A. Davis, Science, Vol. 293, pp. 2044-2049, September 2001 Adrian Ionescu, October 2005 15 MOSFET @ nanoscale: what changes? 10µm Classical transport Gate (collisions, mobility) N+ N+ scattering 1021 ) ) -3 0.2µm 2,5 -3 1020 cm/s) 2 cm/s) classicalClassique 7 7 1,5 1019 Velocity overshoot 1 Quantique 0,5 1018 + 1D quantum effects Concentration d'électrons (cm quantum 0 17 Vitesse (x10 10 Electron conc. (cm Velocity (10 Velocity 0 0,05 0,1 0,0227 0,0237 0,0247 0,0257 Profondeur par rapport à l'interface (µm) 0.1µm Distance alongle long the du channel canal (µm) (μm) Coordinate from surface (μm) Gate Parameter (dopant) N+ N+ fluctuations 20nm Gate Ballistic transport N+ N+ + 2D quantum effects Adrian Ionescu, October 2005 16 Example: Process Variations (1) • main affected parameter: VT • both the discrete random dopant distribution in the channel region and the quantum effects in the inversion layer should be considered for < 100nm • due to channel dopant variations in individual devices, threshold voltage fluctuations in ULSI’s are estimated to exceed 0.1 V in the for less than 0.1 • μm if the doping is not optimized Source: T. Mizuno, IEEE Transactions on Electron Devices, Vol. 47, pp. 756–761, April 2000. A. Asenov et al., IEEE Transactions on Electron Devices, Vol. 48, pp. 722-729, April 2001. Adrian Ionescu, October 2005 17 Example: Process Variations (2) There is a most suitable NA to supress δVT • SOI: • bulk-Si: Design for NA vs Leff in nano-region ULSI: (a) SOI (b) bulk MOSFET’s. The shaded regions indicate suitable NA for ΔI /I < 10%, 6δV < 0.1 V, and suppressing the short channel effects. More design space exists for SOI! Adrian Ionescu, October 2005 18 The ideal MOS switch @ nanoscale: what should be improved? • Control of short channel Quasi-ideal switch effects Æ VT, Ioff • Subthreshold slope: better than 60mV/decade? , log) • Mobility, velocity saturation, lin ballistic transport Æ Ion I ( • Contact and series MOS switch resistances V • Impact of process variations Adrian Ionescu, October 2005 19 Solutions for better-than-60mV/decade subthreshold slope kT C C kT S = ln10 (1+ dep + ss ) → ln10 = 60mV / decade q Cox Cox q Various concepts: • I-MOS • Tunnel FET • NEM-MOSFET • Hybrid devices •… Adrian Ionescu, October 2005 20 I-MOS Subthreshold slope improvement by avalanche breakdown current in gated-diode Source: K. Gopalakrishnan et al., IEEE Transactions on Electron Devices, Vol. 52, Jan. 2005 pp. 69 – 76. Adrian Ionescu, October 2005 21 Si/SiGe Vertical Tunnel FET (1) • Why S < 60mV/decade in tunnel FET? B Sd= (log/ IdV )−1 < 60mV/dec IAV=⋅−ξ exp( ) dgs eff ξ 1 dVeff ξξ+ Bd −1 =+ln10(2 ) VdVeff gsξ dV gs < 60mV/dec / ln10 = 26mV/dec V 2 S = GS Where: 3/ 2 • Bkane depends on effective mass 2VGS + BKane Eg / D • D depends on tox, L, Vds, and dopings Adrian Ionescu, October 2005 22 Si/SiGe Vertical Tunnel FET (2) Subthreshold swing S vs. Ge mole fraction in the δp+ layer Source: K. K. Bhuwalka, J Schulze and I Eisele, J. J. of Appl Phys 43, 4073 (2004) Adrian Ionescu, October 2005 23 Simulation prediction: SS of tunnel FET Issues: • what physics and practical transistor • materials • fabrication S = 1mV/decade @ room temperature Source : P. F. Wang et al. Solid-State Electronics 48, 2281 (2004). Adrian Ionescu, October 2005 24 NEM or Suspended-Gate FET 10-6 0.14 Vd = 50mV 0.12 Source I leakage, Gate -7 (A) 10 d Pull-in 0.1 Drain 0.08 10-8 Suspended-Gate 20μm 0.06 g (pA) Drain current, I 0.04 10-9 0.02 -10 AlSi 1% 10 0 -6 -4 -2 0 2 4 6 8 10 Vg (V) 220nm • ultra-low gate leakage in off-state Gate oxide 40nm (quasi-negligible) Silicon 200nm • 10mV/decade SS demonstrated by EPFL Adrian Ionescu, October 2005 25 Evolutionary MOSFET: more Moore… Source: H-S. P. Wong, et al., Proceedings of the IEEE, vol. 87, No. 4, pp. 537-70, 2001. Adrian Ionescu, October 2005 26 Non-classical MOSFET (1): more Moore… Adrian Ionescu, October 2005 27 Non-classical MOSFET (2): more Moore… Adrian Ionescu, October 2005 28 Emerging (nano)technologies No mature and/or credible candidate for ‘beyond CMOS’ yet! 4-D parametrization of emerging nano-technologies: size x speed x energy x cost Source: J. A. Hutchby et al., IEEE Circuits and Devices Magazine, Vol. 18, pp. 28–41, March 2002. Adrian Ionescu, October 2005 29 Emerging logic devices (…after Moore) Source: ITRS 2003. Adrian Ionescu, October 2005 30 Single/few electron electronics Single Electron Transistor MOS transistor (SET) versus (MOSFET) SET MOSFET Gate Drain (D) Drain CTD , RD Gate Source Drain Gate Source Drain CG VDS n+ n+ Gate (G) CTS ,RS p V GS conductive conductive Source tunneling Source (S) pn+ n-channel junctions island junctions • electron conduction is one by one (!) • many electrons simultaneously • drain & gate control Coulomb blockade (CB) participate to the conduction • needs opaque junctions: • gate controls channel RT > RQ~26kΩ • junctions highly transparent • needs very small island (~1nm) for room • works inherently at room temperature temperature operation Adrian Ionescu, October 2005 31 HOW SET works? SET principle (orthodox theory): K.K.
Recommended publications
  • Evolution of Microprocessor Performance
    EvolutionEvolution ofof MicroprocessorMicroprocessor PerformancePerformance So far we examined static & dynamic techniques to improve the performance of single-issue (scalar) pipelined CPU designs including: static & dynamic scheduling, static & dynamic branch predication. Even with these improvements, the restriction of issuing a single instruction per cycle still limits the ideal CPI = 1 Multiple Issue (CPI <1) Multi-cycle Pipelined T = I x CPI x C (single issue) Superscalar/VLIW/SMT Original (2002) Intel Predictions 1 GHz ? 15 GHz to ???? GHz IPC CPI > 10 1.1-10 0.5 - 1.1 .35 - .5 (?) Source: John P. Chen, Intel Labs We next examine the two approaches to achieve a CPI < 1 by issuing multiple instructions per cycle: 4th Edition: Chapter 2.6-2.8 (3rd Edition: Chapter 3.6, 3.7, 4.3 • Superscalar CPUs • Very Long Instruction Word (VLIW) CPUs. Single-issue Processor = Scalar Processor EECC551 - Shaaban Instructions Per Cycle (IPC) = 1/CPI EECC551 - Shaaban #1 lec # 6 Fall 2007 10-2-2007 ParallelismParallelism inin MicroprocessorMicroprocessor VLSIVLSI GenerationsGenerations Bit-level parallelism Instruction-level Thread-level (?) (TLP) 100,000,000 (ILP) Multiple micro-operations Superscalar /VLIW per cycle Simultaneous Single-issue CPI <1 u Multithreading SMT: (multi-cycle non-pipelined) Pipelined e.g. Intel’s Hyper-threading 10,000,000 CPI =1 u uuu u u Chip-Multiprocessors (CMPs) u Not Pipelined R10000 e.g IBM Power 4, 5 CPI >> 1 uuuuuuu u AMD Athlon64 X2 u uuuuu Intel Pentium D u uuuuuuuu u u 1,000,000 u uu uPentium u u uu i80386 u i80286
    [Show full text]
  • Course #: CSI 440/540 High Perf Sci Comp I Fall ‘09
    High Performance Computing Course #: CSI 440/540 High Perf Sci Comp I Fall ‘09 Mark R. Gilder Email: [email protected] [email protected] CSI 440/540 This course investigates the latest trends in high-performance computing (HPC) evolution and examines key issues in developing algorithms capable of exploiting these architectures. Grading: Your grade in the course will be based on completion of assignments (40%), course project (35%), class presentation(15%), class participation (10%). Course Goals Understanding of the latest trends in HPC architecture evolution, Appreciation for the complexities in efficiently mapping algorithms onto HPC architectures, Familiarity with various program transformations in order to improve performance, Hands-on experience in design and implementation of algorithms for both shared & distributed memory parallel architectures using Pthreads, OpenMP and MPI. Experience in evaluating performance of parallel programs. Mark R. Gilder CSI 440/540 – SUNY Albany Fall '08 2 Grades 40% Homework assignments 35% Final project 15% Class presentations 10% Class participation Mark R. Gilder CSI 440/540 – SUNY Albany Fall '08 3 Homework Usually weekly w/some exceptions Must be turned in on time – no late homework assignments will be accepted All work must be your own - cheating will not be tolerated All references must be sited Assignments may consist of problems, programming, or a combination of both Mark R. Gilder CSI 440/540 – SUNY Albany Fall '08 4 Homework (continued) Detailed discussion of your results is expected – the program is only a small part of the problem Homework assignments will be posted on the class website along with all of the lecture notes ◦ http://www.cs.albany.edu/~gilder Mark R.
    [Show full text]
  • Itanium Processor
    IA-64 Microarchitecture --- Itanium Processor By Subin kizhakkeveettil CS B-S5….no:83 INDEX • INTRODUCTION • ARCHITECTURE • MEMORY ARCH: • INSTRUCTION FORMAT • INSTRUCTION EXECUTION • PIPELINE STAGES: • FLOATING POINT PERFORMANCE • INTEGER PERFORMANCE • CONCLUSION • REFERENCES Itanium Processor • First implementation of IA-64 • Compiler based exploitation of ILP • Also has many features of superscalar INTRODUCTION • Itanium is the brand name for 64-bit Intel microprocessors that implement the Intel Itanium architecture (formerly called IA-64). • Itanium's architecture differs dramatically from the x86 architectures (and the x86-64 extensions) used in other Intel processors. • The architecture is based on explicit instruction-level parallelism, in which the compiler makes the decisions about which instructions to execute in parallel Memory architecture • From 2002 to 2006, Itanium 2 processors shared a common cache hierarchy. They had 16 KB of Level 1 instruction cache and 16 KB of Level 1 data cache. The L2 cache was unified (both instruction and data) and is 256 KB. The Level 3 cache was also unified and varied in size from 1.5 MB to 24 MB. The 256 KB L2 cache contains sufficient logic to handle semaphore operations without disturbing the main arithmetic logic unit (ALU). • Main memory is accessed through a bus to an off-chip chipset. The Itanium 2 bus was initially called the McKinley bus, but is now usually referred to as the Itanium bus. The speed of the bus has increased steadily with new processor releases. The bus transfers 2x128 bits per clock cycle, so the 200 MHz McKinley bus transferred 6.4 GB/sand the 533 MHz Montecito bus transfers 17.056 GB/s.
    [Show full text]
  • Intel® Processor Architecture
    Intel® Processor Architecture January 2013 Software & Services Group, Developer Products Division Copyright © 2013, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Agenda •Overview Intel® processor architecture •Intel x86 ISA (instruction set architecture) •Micro-architecture of processor core •Uncore structure •Additional processor features – Hyper-threading – Turbo mode •Summary Software & Services Group, Developer Products Division Copyright © 2013, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. 2 Intel® Processor Segments Today Architecture Target ISA Specific Platforms Features Intel® phone, tablet, x86 up to optimized for ATOM™ netbook, low- SSSE-3, 32 low-power, in- Architecture power server and 64 bit order Intel® mainstream x86 up to flexible Core™ notebook, Intel® AVX, feature set Architecture desktop, server 32 and 64bit covering all needs Intel® high end server IA64, x86 by RAS, large Itanium® emulation address space Architecture Intel® MIC accelerator for x86 and +60 cores, Architecture HPC Intel® MIC optimized for Instruction Floating-Point Set performance Software & Services Group, Developer Products Division Copyright © 2013, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. 3 Itanium® 9500 (Poulson) New Itanium Processor Poulson Processor • Compatible with Itanium® 9300 processors (Tukwila) Core Core 32MB Core Shared Core • New micro-architecture with 8 Last Core Level Core Cores Cache Core Core • 54 MB on-die cache Memory Link • Improved RAS and power Controllers Controllers management capabilities • Doubles execution width from 6 to 12 instructions/cycle 4 Intel 4 Full + 2 Half Scalable Width Intel • 32nm process technology Memory QuickPath Interface Interconnect • Launched in November 2012 (SMI) Compatibility provides protection for today’s Itanium® investment Software & Services Group, Developer Products Division Copyright © 2013, Intel Corporation.
    [Show full text]
  • Itanium and Vnuma
    Virtualisation Case Study: Itanium and vNUMA Matthew Chapman [email protected] 1 ITANIUM ➜ High-performance processor architecture ➜ Also known as IA-64 ➜ Joint venture between HP and Intel ➜ Not the same instruction set as IA-32/AMD64/EM64T ➜ Very good floating-point performance 2000: Itanium“Merced” 180nm,upto800Mhz,4MBcache 2002: ItaniumII“McKinley” 180nm,upto1Ghz,3MBcache 2003: ItaniumII“Madison” 130nm,upto1.6Ghz,6MBcache 2004: Itanium II “Madison 9M” 130 nm, up to 1.67Ghz, 9MB cache 2006: Itanium II “Montecito” 90 nm, dual-core, up to 1.6Ghz, 24MB cache 2008: “Tukwila” 65nm,quad-core,upto2.5Ghz? ITANIUM 2 EXPLICITLY PARALLEL INSTRUCTION-SET COMPUTING (EPIC) ➜ Goal to increase instructions per cycle ➜ Itanium can have similar performance to x86 at a lower clock speed ➜ Based on Very Long Instruction Word (VLIW) ➜ Explicit parallelism in instruction set ➜ Simplified instruction decode and issue ➜ Scheduling decisions made by compiler Memory Branch Branch Branch Integer Integer Integer Integer MI;I bundle Integer Memory Memory Memory Memory Floating−point Floating−point Floating−point Floating−point VLIW Instruction MFI bundle Integer Execution Units Execution Units EPIC Instructions EXPLICITLY PARALLEL INSTRUCTION-SET COMPUTING (EPIC) 3 EXAMPLE Load and add three numbers in assembly code: ld8 r4 = [r1] ld8 r5 = [r2] ld8 r6 = [r3] ;; add r7 = r4, r5 ;; add r8 = r6, r7 ;; EXAMPLE 4 Resulting instructions: MMI ld8 r4=[r1] ld8 r5=[r2] nop.i 0 M;MI; ld8 r6 = [r3] ;; add r7=r4,r5 // Arithmetic on M too nop.i 0 ;; M;MI add r8 = r6, r7
    [Show full text]
  • Chapter 29 Itanium Architecture
    Chapter 29 Itanium Architecture Chapter 29 - Itanium Architecture July 2003 Chapter 29 / Page 1 Chapter 29 Itanium Architecture INDEX Introduction 3 Overview of the Itanium Architecture 3 Main features of EPIC....................................................................................................... 3 The Itanium processor family (IPF) 5 HP-UX Itanium Releases 6 Differences between HP-UX for PA and IPF ................................................................... 6 What needs to be considered on an IPF system 8 How an IPF system boots.................................................................................................. 8 Boot disk layout and EFI................................................................................................... 8 How to configure the Console input/output devices....................................................... 10 The boot process ............................................................................................................. 13 How to mirror the root disk............................................................................................. 18 How to replace a failed disk............................................................................................ 21 The ARIES emulator 22 IPF vs. PA-RISC Terminology 22 Additional Information 23 July 2003 Chapter 29 / Page 2 Chapter 29 Itanium Architecture Introduction When PA-RISC was released, HP began designing the architecture to replace it. Several years into the project, HP determined that the economics
    [Show full text]
  • Madison Processor
    TheThe RoadRoad toto BillionBillion TransistorTransistor ProcessorProcessor ChipsChips inin thethe NearNear FutureFuture DileepDileep BhandarkarBhandarkar RogerRoger GolliverGolliver Intel Corporation April 22th, 2003 Copyright © 2002 Intel Corporation. Outline yy SemiconductorSemiconductor TechnologyTechnology EvolutionEvolution yy Moore’sMoore’s LawLaw VideoVideo yy ParallelismParallelism inin MicroprocessorsMicroprocessors TodayToday yy MultiprocessorMultiprocessor SystemsSystems yy TheThe PathPath toto BillionBillion TransistorsTransistors yy SummarySummary ©2002, Intel Corporation Intel, the Intel logo, Pentium, Itanium and Xeon are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries *Other names and brands may be claimed as the property of others BirthBirth ofof thethe RevolutionRevolution ---- TheThe IntelIntel 40044004 IntroducedIntroduced NovemberNovember 15,15, 19711971 108108 KHz,KHz, 5050 KIPsKIPs ,, 23002300 1010μμ transistorstransistors 20012001 –– Pentium®Pentium® 44 ProcessorProcessor Introduced November 20, 2000 @1.5 GHz core, 400 MT/s bus 42 Million 0.18µ transistors August 27, 2001 @2 GHz, 400 MT/s bus 640 SPECint_base2000* 704 SPECfp_base2000* SourceSource:: hhtttptp:/://www/www.specbench.org/cpu2000/results/.specbench.org/cpu2000/results/ 3030 YearsYears ofof ProgressProgress yy40044004 toto PentiumPentium®® 44 processorprocessor –– TransistorTransistor count:count: 20,000x20,000x increaseincrease –– Frequency:Frequency: 20,000x20,000x increaseincrease
    [Show full text]
  • United States Securities and Exchange Commission Form
    UNITED STATES SECURITIES AND EXCHANGE COMMISSION Washington, D.C. 20549 FORM 8-K CURRENT REPORT Pursuant to Section 13 or 15(d) of the Securities Exchange Act of 1934 Date of Report: April 15, 2003 (Date of earliest event reported) INTEL CORPORATION (Exact name of registrant as specified in its charter) Delaware 0-6217 94-1672743 (State of (Commission (IRS Employer incorporation) File Number) Identification No.) 2200 Mission College Blvd., Santa Clara, California 95052-8119 (Address of principal executive offices) (Zip Code) (408) 765-8080 (Registrant's telephone number, including area code) Item 12. DISCLOSURE OF RESULTS OF OPERATIONS AND FINANCIAL CONDITION 12.1 Attached hereto as Exhibit 99.1 and incorporated by reference herein is financial information for Intel Corporation for the quarter ended March 29, 2003 and forward-looking statements relating to 2003 and the second quarter of 2003 as presented in a press release of April 15, 2003. The information in this report shall be deemed incorporated by reference into any registration statement heretofore or hereafter filed under the Securities Act of 1933, as amended, except to the extent that such information is superceded by information as of a subsequent date that is included in or incorporated by reference into such registration statement. The information in this report shall not be treated as filed for purposes of the Securities Exchange Act of 1934, as amended. Item 7. FINANCIAL STATEMENTS AND EXHIBITS (c) Exhibits 99.1 Financial information for Intel Corporation for the quarter and the year ended March 29, 2003 and forward-looking statements relating to 2003 and the second quarter of 2003 as presented in a press release of April 15, 2003.
    [Show full text]
  • Intel's Core 2 Family
    Intel’s Core 2 family - TOCK lines II Nehalem to Haswell Dezső Sima Vers. 3.11 August 2018 Contents • 1. Introduction • 2. The Core 2 line • 3. The Nehalem line • 4. The Sandy Bridge line • 5. The Haswell line • 6. The Skylake line • 7. The Kaby Lake line • 8. The Kaby Lake Refresh line • 9. The Coffee Lake line • 10. The Cannon Lake line 3. The Nehalem line 3.1 Introduction to the 1. generation Nehalem line • (Bloomfield) • 3.2 Major innovations of the 1. gen. Nehalem line 3.3 Major innovations of the 2. gen. Nehalem line • (Lynnfield) 3.1 Introduction to the 1. generation Nehalem line (Bloomfield) 3.1 Introduction to the 1. generation Nehalem line (Bloomfield) (1) 3.1 Introduction to the 1. generation Nehalem line (Bloomfield) Developed at Hillsboro, Oregon, at the site where the Pentium 4 was designed. Experiences with HT Nehalem became a multithreaded design. The design effort took about five years and required thousands of engineers (Ronak Singhal, lead architect of Nehalem) [37]. The 1. gen. Nehalem line targets DP servers, yet its first implementation appeared in the desktop segment (Core i7-9xx (Bloomfield)) 4C in 11/2008 1. gen. 2. gen. 3. gen. 4. gen. 5. gen. West- Core 2 Penryn Nehalem Sandy Ivy Haswell Broad- mere Bridge Bridge well New New New New New New New New Microarch. Process Microarchi. Microarch. Process Microarch. Process Process 45 nm 65 nm 45 nm 32 nm 32 nm 22 nm 22 nm 14 nm TOCK TICK TOCK TICK TOCK TICK TOCK TICK (2006) (2007) (2008) (2010) (2011) (2012) (2013) (2014) Figure : Intel’s Tick-Tock development model (Based on [1]) * 3.1 Introduction to the 1.
    [Show full text]
  • Parallel Computing 2Nd Edition by Grama Et
    High Performance Computing Course #: CSI 441/541 High Perf Sci Comp II Spring ‘10 Mark R. Gilder Email: [email protected] [email protected] CSI 441/541 This course investigates the latest trends in high-performance computing (HPC) evolution and examines key issues in developing algorithms capable of exploiting these architectures. Grading: Your grade in the course will be based on completion of assignments (40%), course project (35%), class presentation(15%), class participation (10%). Course Goals Understanding of the latest trends in HPC architecture evolution, Appreciation for the complexities in efficiently mapping algorithms onto HPC architectures, Hands-on experience in design and implementation of algorithms for both shared & distributed memory parallel architectures using MPI, Experience in evaluating performance of parallel programs. Mark R. Gilder CSI 441/541 – SUNY Albany Spring '10 2 Grades 40% Homework assignments 35% Final project 15% Class presentations 10% Class participation Mark R. Gilder CSI 441/541 – SUNY Albany Spring '10 3 Homework Usually weekly w/some exceptions Must be turned in on time – no late homework assignments will be accepted All work must be your own - cheating will not be tolerated All references must be sited Assignments may consist of problems, programming, or a combination of both Mark R. Gilder CSI 441/541 – SUNY Albany Spring '10 4 Homework (continued) Detailed discussion of your results is expected ◦ the actual program is only a small part of the problem Homework assignments will be posted on the class website along with all of the lecture notes ◦ http://www.cs.albany.edu/~gilder Mark R. Gilder CSI 441/541 – SUNY Albany Spring '10 5 Project Topic of general interest to the course Read three or four papers to get some ideas of latest research in HPC field Identify an application that can be implemented on our RIT cluster Implement and write a final report describing your application and results Present your results in class Mark R.
    [Show full text]
  • Dual-Core Intel® Itanium® 2 Processor: Reference Manual Update
    Dual-Core Update to the Intel® Itanium® 2 Processor Reference Manual For Software Development and Optimization Revision 0.9 January 2006 Document Number: 308065-001 Notice: This document contains information on products in the design phase of development. The information here is subject to change without notice. Do not finalize a design with this information. INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL® PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL'S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. Intel products are not intended for use in medical, life saving, or life sustaining applications. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked “reserved” or “undefined.” Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The Itanium 2 processor may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. The code name “Montecito” presented in this document is only for use by Intel to identify a product, technology, or service in development, that has not been made commercially available to the public, i.e., announced, launched or shipped.
    [Show full text]
  • Intel Multi-Core Presentation
    MultiMulti--CoreCore MicroprocessorMicroprocessor Chips:Chips: MotivationMotivation && ChallengesChallenges DileepDileep Bhandarkar,Bhandarkar, Ph.Ph. D.D. Architect at Large Digital Enterprise Group Intel Corporation May 2006 Copyright © 2006 Intel Corporation. 2006 Intel Distinguished Lecture Agenda yy SemiconductorSemiconductor TechnologyTechnology EvolutionEvolution yy DesignDesign ChallengesChallenges yy WhyWhy MultiMulti--CoreCore ProcessorProcessor Chips?Chips? yy Power/PerformancePower/Performance TradeTrade--OffsOffs yy CMPCMP DirectionsDirections yy BeyondBeyond CMPCMP yy SummarySummary ©2006, Intel Corporation Intel, the Intel logo, Pentium, Itanium and Xeon are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries *Other names and brands may be claimed as the property of others www.intel.com/education 2006 Intel Distinguished Lecture IntelIntel only:only: OnOn--timetime ““22--yearyear--cyclecycle”” 180nm 130nm 90nm 65nm 45nm Wafer Size (mm): 200 200/300 300 300 300 1st Production: 1999 2001 2003 2005 2007 Transistors: SiGe SiGe Interconnects: 100nm LG 70nm LG 50nm LG 35nm LG Details CoSi2 CoSi2 NiSi NiSi Coming! Strain Si Strain Si 6 Al 6 Cu 7 Cu 8 Cu SiOF SiOF Low-k Low-k 4545 nmnm LogicLogic ProcessProcess onon TrackTrack forfor DeliveryDelivery inin 20072007 Process Name P1262 P1264 P1266 P1268 Lithography 90 nm 65 nm 45 nm 32 nm 1st Production 2003 2005 2007 2009 Moore'sMoore's LawLaw continues!continues! IntelIntel continuescontinues toto developdevelop aa
    [Show full text]