Visual Computing:Computing: Graphicsgraphics Rere--Defineddefined

Total Page:16

File Type:pdf, Size:1020Kb

Visual Computing:Computing: Graphicsgraphics Rere--Defineddefined IntelIntel ArchitectureArchitecture PressPress BriefingBriefing StephenStephen L.L. SmithSmith Vice President Director, Digital Enterprise Group Operations RonakRonak SinghalSinghal Principal Engineer Digital Enterprise Group 17 March, 2008 TodayToday’’ss NewsNews IntelIntel Technology:Technology: DeliveringDelivering onon thethe PromisePromise Mission Critical Tick-Tock Expandable Server Nehalem Larrabee 2 Intel:Intel: TheThe ArchitectureArchitecture forfor LifeLife Internet Netbook HPC Workstation Smartphone MID Mobile PC Mission Critical Embedded Desktop PC Server Milli Watts IA Compatible and Scalable Peta FLOPs 3 Tukwila:Tukwila: DeliveringDelivering PerformancePerformance toto WorldWorld’’ss MostMost PowerfulPowerful ComputersComputers y Quad-core with 30 MB cache y 2 billion transistors y Multi-Threading Technology y Intel QuickPath interconnect y Dual Integrated Memory Controllers y Estimate >2x* performance y Mainframe-class RAS “With Intel’’s upcoming quad-core Tukwila processor, Windows Server solutions running on Itanium-based systems will provide an even more scalable, reliable, agile and dynamic datacenter foundation for our customers.” —Bill Laing, GM Windows Server & Solutions Division, Microsoft * Compared to Dual-core Itanium® Processor 9100 series 4 ProductProduct CadenceCadence forfor SustainedSustained LeadershipLeadership 20072007--0808 PenrynPenryn Processors 45nm45nm TICKTICK TOCKTOCK Delivering Products on Schedule and Moore’s Law 5 ExpandableExpandable andand Scalable:Scalable: QuadQuad--CoreCore IntelIntel®® XeonXeon®® processorprocessor 73007300 y Caneland platform built for virtualization and consolidation y Energy Efficient performance:: LeadingLeading inin benchmarksbenchmarks y Scalable y Enterprise proven reliability and investment protection y Great customer acceptance Industry’’s Virtualization Platform of Choice 6 ExpandableExpandable andand Scalable:Scalable: GetsGets BetterBetter withwith DunningtonDunnington y 6 core Processor y 1.9 billion transistors cores cores y 45nm Hi-K technology y 45nm Hi-K technology cache cache y 16 MB L3 cache system interface y Latest Intel virtualization capabilities cores y Socket compatible with cache Caneland platform y Available 2H’’08 7 EnergyEnergy Efficiency:Efficiency: TopTop SPECpowerSPECpower** ResultsResults SPECPower_ Rank Sponsor ssj2008 result Platform Processors 1 HP 778 DL180 G5 2x Intel® Xeon® E5450 2 Dell 719 PE 2950 III 2x Intel® Xeon® E5440 3 Dell 712 PE 1950 III 2x Intel® Xeon® E5440 4 HP 698 DL160 G5 2x Intel® Xeon® E5450 5 FSC 690 RX300 S4 2x Intel® Xeon® E5440 6 Dell 682 PE 2950 2x Intel® Xeon® E5440 7 FSC 667 TX150 S6 1x Intel® Xeon® X3220 8 HP 662 DL360 G5 2x Intel® Xeon® E5450 9 HP 546 DL580 G5 4x Intel® Xeon® L7345 10 Intel 468 SM 6025B 2x Intel® Xeon® L5335 First industry standard Energy Efficiency benchmark Public SPECpower results from http://www.spec.org/power_ssj2008/results/power_ssj2008.html as of Feb 28, 2008 SPECPower_ssj2008 results measured as ssj_ops/watt 8 *SPEC, the SPEC logo and SPECpower are registered trademarks of the Standard Performance Evaluation Corporation ProductProduct CadenceCadence forfor SustainedSustained LeadershipLeadership 20072007--0808 PenrynPenryn NehalemNehalem Processors Processors 45nm45nm 45nm45nm TICKTICK TOCKTOCK Driving Products to Deliver on Moore’s Law 9 NehalemNehalem MicroMicro--architecture:architecture: DynamicallyDynamically ScalableScalable andand InnovativeInnovative NewNew DesignDesign Scalable from 2 to 8 cores Micro-architecture enhancements (4 –wide) Integrated Memory Controller – 3 Ch DDR3 2-way simultaneous multi-threading Integrated memory controller Core 0 Core 1 Core 2 Core 3 QuickPath interconnect Q Shared and Inclusive Level-3 cache P Shared L3 Cache Dynamic power management I SSE 4.2 Production: Q4’’08 10 NehalemNehalem DesignDesign ScalableScalable ViaVia ModularityModularity Nehalem Building Ex: 4 Core Ex: 8 Core Block Library iGraphics IA Core QPI … … Cache … IMC Sample Range of Product Options 11 Block combinations are for illustration only and do not represent actual product plans. Block sizes are not indicative of die size contributions. Nehalem:Nehalem: CoreCore uArchuArch EnhancementsEnhancements Foundation: Intel® Core™ Microarchitecture Significant Performance and Efficiency Enhancements y Increased parallelism Concurrent uOps Possible 33% more ¾ 33% more micro-ops in flight possible 128 parallelism y Enhanced algorithms 112 96 ¾ Faster “unaligned” cache accesses 80 ¾ Faster synchronization primitives 64 48 y Further branch prediction enhancements 32 16 ¾ New 2nd level branch predictor 0 ¾ Renamed Return Stack Buffer Intel Core Intel Core 2 Nehalem BuildsBuilds uponupon IndustryIndustry LeadingLeading 44 InstructionInstruction issueissue IntelIntel®® CoreCore micromicro--architecturearchitecture 12 SimultaneousSimultaneous MultiMulti--ThreadingThreading (SMT)(SMT) y Each core able to execute two software threads simultaneously y Extremely power efficient y Enhanced with larger caches and more memory bandwidth y Benefits ¾ Highly threaded workloads (eg, multi-media apps, databases, search engines) ¾ Multi-Tasking scenarios SimultaneousSimultaneous MultiMulti--threadingthreading EnhancesEnhances PerformancePerformance andand EnergyEnergy EfficiencyEfficiency 13 EnhancedEnhanced CacheCache SubsystemSubsystem y New 3-level Cache Hierarchy 32K L1 32K L1 32K L1 32K L1 ¾ L1 cache same as Intel Core™ uArch I-cache I-cache I-cache I-cache 32K L1 32K L1 32K L1 32K L1 • 32 KB Instruction/32 KB Data D-Cache D-Cache D-Cache D-Cache ¾ New 256 KB/core, low latency L2 cache 256K 256K 256K 256K ¾ New Large 8MB fully-shared L3 cache L2 L2 L2 L2 Cache Cache Cache Cache • Inclusive Cache Policy - minimize snoop traffic 8 MB Last Level Cache y New 2-level TLB hierarchy ¾ Adds 2nd level 512 entry Translation Look-aside Buffer SuperiorSuperior multimulti--levellevel sharedshared cachecache extendsextends IntelIntel®® SmartSmart CacheCache technologytechnology 14 Nehalem/TylersburgNehalem/Tylersburg PlatformsPlatforms (High(High EndEnd DesktopDesktop andand Server/Workstation)Server/Workstation) Desktop Nehalem Nehalem Nehalem QPI QPI Tylersburg Tylersburg Tylersburg I/O Hub PCI Express* I/O Hub PCI Express* PCI Express* Gen 2 Gen 2 DMI Gen 2 DMI ICH Server ICH ICH y Intel® QuickPath Interconnect y Integrated DDR3 Memory ¾ New point to point Controller interconnect ¾ 3 channels per processor ¾ 2 links per CPU socket ¾ Massive amounts of Bandwidth ¾ Up to 25.6 Gb/sec total ¾ Significant Memory Latency bandwidth/link Reduction HugeHuge LatencyLatency DecreaseDecrease andand BandwidthBandwidth IncreaseIncrease overover PriorPrior GenerationGeneration 15 NehalemNehalem HighHigh EndEnd Desktop/ServerDesktop/Server IMCIMC Nehalem Nehalem y 3 channels per socket y Up to 3 DIMMs/channel Tylersburg y DDR3-800, 1066, 1333 ¾ Future scalability 2 Socket Memory Bandwidth* y Supports RDIMM and UDIMM 5 4 >4x y Very low latency Band- 3 width y Very high bandwidth 2 1 y Built-In RAS Features 0 y Built-In RAS Features Harpertown (1600 Nehalem(3xDDR3- FSB) 1333) LeadershipLeadership MemoryMemory BandwidthBandwidth 16 *Source: Intel internal measurement ProductProduct CadenceCadence forfor SustainedSustained LeadershipLeadership 20092009--1010 WestmereWestmere SandySandy BridgeBridge Processors Processors 3232 nmnm 3232 nmnm TICKTICK TOCKTOCK Continuing the Pace of Innovation 17 IntelIntelIntel®®® AdvancedAdvanced VectorVector ExtensionExtension (AVX)(AVX) 256256-bit-bit vectorvector extensionextension toto SSESSE foforr FPFP intensiveintensive applicationsapplications KEY FEATURES BENEFITS Wider Vectors Increased from 128 bit to 256 bit Up to 2x peak FLOPs output Enhanced Data Organize, access and pull only Rearrangement Use the new 256 bit primitives to necessary data broadcast, mask loads and do data more quickly and efficiently permutes Three Operand, Fewer register copies, better Non Destructive Syntax register use, more opportunities Designed for efficiency and future for parallel loads and compute extensibility operations, smaller code size 18 VisualVisual Computing:Computing: GraphicsGraphics ReRe--defineddefined Mainstream Graphics Visual Computing y Triangle / Rasterization y New life-like Rendering e.g. Global illumination y Rigid pipeline architecture y Programmable, ubiquitous architecture y Tools constrained by y High definition audio and video architecture processing y Inefficient for non-graphics y Combines with model based computing computing (e.g. Physics) 19 20 VisualVisual ComputingComputing Acquiring,Acquiring, Analyzing,Analyzing, ModelingModeling andand SynthesizingSynthesizing VisualVisual WorkloadsWorkloads Photorealistic Computational 3D Rendering Modeling Interactive High Definition User Interface Audio, Video 20 21 VisualVisual Computing:Computing: WhatWhat DoesDoes itit Take?Take? Intel Leadership • Platforms: Client, Workstation, Server • CPU, Graphics, Media Architecture • Process and Technology Leadership • Software, Tools & Developer Support Photorealistic Computational 3D Rendering Modeling Interactive High Definition User Interface Audio, Video 21 22 LarrabeeLarrabee:: VisualVisual ComputingComputing ArchitectureArchitecture y Many IA cores ¾ Scalable to TeraFLOPS y New cache architecture y New vector instruction set ¾ Vector memory operations ¾ Conditionals ¾ Integer and FP arithmetic y New vector processing unit / wide SIMD 22 IntelIntel SoftwareSoftware:: UnleashesUnleashes DeveloperDeveloper FreedomFreedom IndustryIndustry LeadingLeading IntelIntel®® SoftwareSoftware
Recommended publications
  • Introduction to the Poulson (Intel 9500 Series) Processor Openvms Advanced Technical Boot Camp 2015 Keith Parris / September 29, 2015
    Introduction to the Poulson (Intel 9500 Series) Processor OpenVMS Advanced Technical Boot Camp 2015 Keith Parris / September 29, 2015 © Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. Information on Poulson from Intel’s ISSCC Paper © Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. Poulson information from Intel’s ISSCC Paper http://www.intel.com/content/dam/www/public/us/en/documents/white-papers/itanium-poulson-isscc-paper.pdf 3 © Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. Poulson information from Intel’s ISSCC Paper http://www.intel.com/content/dam/www/public/us/en/documents/white-papers/itanium-poulson-isscc-paper.pdf 4 © Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. Poulson information from Intel’s ISSCC Paper http://www.intel.com/content/dam/www/public/us/en/documents/white-papers/itanium-poulson-isscc-paper.pdf 5 © Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. Poulson information from Intel’s ISSCC Paper • Intel presented a paper on Poulson at the International Solid-State Chips Conference (ISSCC) in July 2011. From this, we learned: • Poulson would be in a 32 nm process (2 process generations ahead from Tukwila, which was at 65 nm, skipping the 45 nm process) • The socket would be compatible with Tukwila • Poulson would have 8 cores, of a brand new core design • The front end (instruction fetch) would be decoupled from the back end (instruction execution) • Poulson could execute and retire as many as 12 instructions per cycle, double Tukwila’s 6 instructions http://www.intel.com/content/dam/www/public/us/en/documents/white-papers/itanium-poulson-isscc-paper.pdf 6 © Copyright 2015 Hewlett-Packard Development Company, L.P.
    [Show full text]
  • Multiprocessing Contents
    Multiprocessing Contents 1 Multiprocessing 1 1.1 Pre-history .............................................. 1 1.2 Key topics ............................................... 1 1.2.1 Processor symmetry ...................................... 1 1.2.2 Instruction and data streams ................................. 1 1.2.3 Processor coupling ...................................... 2 1.2.4 Multiprocessor Communication Architecture ......................... 2 1.3 Flynn’s taxonomy ........................................... 2 1.3.1 SISD multiprocessing ..................................... 2 1.3.2 SIMD multiprocessing .................................... 2 1.3.3 MISD multiprocessing .................................... 3 1.3.4 MIMD multiprocessing .................................... 3 1.4 See also ................................................ 3 1.5 References ............................................... 3 2 Computer multitasking 5 2.1 Multiprogramming .......................................... 5 2.2 Cooperative multitasking ....................................... 6 2.3 Preemptive multitasking ....................................... 6 2.4 Real time ............................................... 7 2.5 Multithreading ............................................ 7 2.6 Memory protection .......................................... 7 2.7 Memory swapping .......................................... 7 2.8 Programming ............................................. 7 2.9 See also ................................................ 8 2.10 References .............................................
    [Show full text]
  • A 65 Nm 2-Billion Transistor Quad-Core Itanium Processor
    18 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 44, NO. 1, JANUARY 2009 A 65 nm 2-Billion Transistor Quad-Core Itanium Processor Blaine Stackhouse, Sal Bhimji, Chris Bostak, Dave Bradley, Brian Cherkauer, Jayen Desai, Erin Francom, Mike Gowan, Paul Gronowski, Dan Krueger, Charles Morganti, and Steve Troyer Abstract—This paper describes an Itanium processor imple- mented in 65 nm process with 8 layers of Cu interconnect. The 21.5 mm by 32.5 mm die has 2.05B transistors. The processor has four dual-threaded cores, 30 MB of cache, and a system interface that operates at 2.4 GHz at 105 C. High speed serial interconnects allow for peak processor-to-processor bandwidth of 96 GB/s and peak memory bandwidth of 34 GB/s. Index Terms—65-nm process technology, circuit design, clock distribution, computer architecture, microprocessor, on-die cache, voltage domains. I. OVERVIEW Fig. 1. Die photo. HE next generation in the Intel Itanium processor family T code named Tukwila is described. The 21.5 mm by 32.5 mm die contains 2.05 billion transistors, making it the first two billion transistor microprocessor ever reported. Tukwila combines four ported Itanium cores with a new system interface and high speed serial interconnects to deliver greater than 2X performance relative to the Montecito and Montvale family of processors [1], [2]. Tukwila is manufactured in a 65 nm process with 8 layers of copper interconnect as shown in the die photo in Fig. 1. The Tukwila die is enclosed in a 66 mm 66 mm FR4 laminate package with 1248 total landed pins as shown in Fig.
    [Show full text]
  • Poulson: an 8 Core 32 Nm Next Generation Intel* Itanium* Processor
    Poulson: An 8 Core 32 nm Next Generation Intel® Itanium® Processor Steve Undy Intel® 1 I NTEL CONFIDENTIAL This slide MUST be used with any slides removed from this presentation Legal Disclaimer • INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL® PRODUCTS. NO LICENSE, EXPRESS OR IMPLI ED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL’S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR I MPLI ED WARRANTY, RELATING TO SALE AND/ OR USE OF INTEL® PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTI CULAR PURPOSE, MERCHANTABI LI TY, OR I NFRI NGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RI GHT. I NTEL PRODUCTS ARE NOT INTENDED FOR USE IN MEDICAL, LIFE SAVING, OR LIFE SUSTAINING APPLICATIONS. • Intel may make changes to specifications and product descriptions at any time, without notice. • Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. • Configurations: [describe config + what test used + who did testing]. For more information go to http://www.intel.com/performance • Intel does not control or audit the design or implementation of third party benchmarks or Web sites referenced in this document.
    [Show full text]
  • Intel Developer Forum Day 1 News Disclosures from Shanghai
    Intel Corporation 2200 Mission College Blvd. P.O. Box 58119 Santa Clara, CA 95052-8119 News Fact Sheet CONTACTS: Claudine Mangano George Alfs Connie Brown 408-765-0146 408-219-7588 503-791-2367 [email protected] [email protected] [email protected] INTEL DEVELOPER FORUM DAY 1 NEWS DISCLOSURES FROM SHANGHAI April 2, 2008: Intel Corporation is holding its Intel Developer Forum in Shanghai on April 2-3. Below are brief summaries of each executive’s Day 1 keynote speech and news highlights. Anand Chandrasekher, “The Heart of a New Generation” Intel Senior Vice President and General Manager, Ultra Mobility Group Describing how the Internet continues to grow and flourish globally, Chandrasekher said that personal mobility is a key catalyst driving people to embrace the Internet in new places and ways. Chandrasekher formally introduced five Intel® Atom™ processors (formerly codenamed “Silverthorne”) and Intel Centrino® Atom™ processor technology for Mobile Internet Devices (MIDs), outlined a series of innovations in the new platform and highlighted the growing momentum in the MID software and device ecosystem. Also covered were the benefits of the new Atom Microarchitecture, including performance features, Internet and software compatibility, and the drastic reductions that Intel has made in power and packaging to achieve this new technology platform. Chandrasekher also offered insight into the company’s future silicon roadmap. Intel Launches Centrino Atom Processor Technology – Chandrasekher introduced Intel Centrino Atom Processor Technology, the company’s first-generation low-power platform for MIDs designed to enable the best Internet experience in a device that fits in your pocket.
    [Show full text]
  • Intel® Processor Architecture
    Intel® Processor Architecture January 2013 Software & Services Group, Developer Products Division Copyright © 2013, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. Agenda •Overview Intel® processor architecture •Intel x86 ISA (instruction set architecture) •Micro-architecture of processor core •Uncore structure •Additional processor features – Hyper-threading – Turbo mode •Summary Software & Services Group, Developer Products Division Copyright © 2013, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. 2 Intel® Processor Segments Today Architecture Target ISA Specific Platforms Features Intel® phone, tablet, x86 up to optimized for ATOM™ netbook, low- SSSE-3, 32 low-power, in- Architecture power server and 64 bit order Intel® mainstream x86 up to flexible Core™ notebook, Intel® AVX, feature set Architecture desktop, server 32 and 64bit covering all needs Intel® high end server IA64, x86 by RAS, large Itanium® emulation address space Architecture Intel® MIC accelerator for x86 and +60 cores, Architecture HPC Intel® MIC optimized for Instruction Floating-Point Set performance Software & Services Group, Developer Products Division Copyright © 2013, Intel Corporation. All rights reserved. *Other brands and names are the property of their respective owners. 3 Itanium® 9500 (Poulson) New Itanium Processor Poulson Processor • Compatible with Itanium® 9300 processors (Tukwila) Core Core 32MB Core Shared Core • New micro-architecture with 8 Last Core Level Core Cores Cache Core Core • 54 MB on-die cache Memory Link • Improved RAS and power Controllers Controllers management capabilities • Doubles execution width from 6 to 12 instructions/cycle 4 Intel 4 Full + 2 Half Scalable Width Intel • 32nm process technology Memory QuickPath Interface Interconnect • Launched in November 2012 (SMI) Compatibility provides protection for today’s Itanium® investment Software & Services Group, Developer Products Division Copyright © 2013, Intel Corporation.
    [Show full text]
  • Impact of the New Generation of X86 on the Server Market
    Impact of the New Generation of x86 on the Server Market Gartner RAS Core Research Note G002012320, George J. Weiss, Jeffrey Hewitt, 3 June 2010, R3598 07042011 Intel and OEMs have driven performance and reliability of the Intel Xeon 7500 processor series (code-named Nehalem-EX) to a level overlapping what Gartner judges to be about 80% of the RISC/Itanium function/feature set, but at a much lower price point. The result will cause increased server technology consolidation toward a bipolar distribution: x86-based platforms and one primary market-share-leading RISC technology (IBM Power), complementing the prevailing legacy of mainframes. Key Findings • A bipolar technology distribution of one strong RISC (Power) and x86 will drive at least 70% to 80% of the market’s total revenue, while other technologies subsist with minor single-digit revenue share; beyond this decade, a unipolar market will exist in which x86 technology becomes the standard, predominant server technology. • Unix has receded by $4 billion in the past three years. Migrations from Unix to Linux or Windows continue, according to Gartner client discussions. • Users continue to optimize costs (such as through virtualization, lower hardware costs, open-source software and cloud computing) primarily on x86. • Of the three non-x86 technologies, IBM’s has had a recent track record of sustained market share growth. Recommendations • Calculate an approximate x86 total cost of ownership (TCO) equivalence or line item cost analysis for comparison as if these applications were to be hosted on x86 for similar performance/availability, when considering Unix upgrades. • Make the comparison to an approximately equivalent x86 configuration when evaluating price quotes.
    [Show full text]
  • Static Scheduling, VLIW, EPIC & Speculation Beating the IPC=1
    Static Scheduling, VLIW, EPIC & Speculation Today’s topics: HW support for better compiler scheduling VLIW/EPIC idea & IPF example School of Computing 1 University of Utah CS6810 Beating the IPC=1 Asymptote • Superscalar . static/compiler scheduled » common in embedded space MIPS & ARM . dynamic scheduled » HW scheduling via scoreboard/Tomasulo approach • VLIW . long instruction word contains set of independent ops . key – compiler schedule and hazard detection (εδ adv.) » each slot goes to a particular type of XU • similar to reservation station role . problem in high performance practice » need to be conservative w.r.t. run time activities • data dependent branch predicate » fix – add some HW to make less conservative but probable choice School of Computing 2 University of Utah CS6810 Page 1 VLIW History • As usual it’s not new . late 60’s early 70’s – microcode » same idea, different granularity . 80’s (textbook inaccurate on this) » Cydrome Cydra-5 (Rau/UIUC) & Multiflow (Fisher/Yale) • mini-super segment (Cray like performance on a budget) • killer micro ate them and the companies cratered • both Rau and Fisher go to HP to develop PA-WW – note both were compiler types (Fisher inspired by dataflow geeks) . 90’s » HP wants out of process business, Intel wants a server line » HP & Intel jointly develop and produce Itanium • 2001 first release of “Merced” & IA-64 . Now » AMD shocks x86 land w/ 64-bit architecture at MPF 2000 » poor IA-64 integer performance forces Intel to follow suit » IA-64 IPF still happening • now all Intel but “Itanic” problems persist School of Computing 3 University of Utah CS6810 “Itanic” • Interesting quotes: .
    [Show full text]
  • Intel's Core 2 Family
    Intel’s Core 2 family - TOCK lines II Nehalem to Haswell Dezső Sima Vers. 3.11 August 2018 Contents • 1. Introduction • 2. The Core 2 line • 3. The Nehalem line • 4. The Sandy Bridge line • 5. The Haswell line • 6. The Skylake line • 7. The Kaby Lake line • 8. The Kaby Lake Refresh line • 9. The Coffee Lake line • 10. The Cannon Lake line 3. The Nehalem line 3.1 Introduction to the 1. generation Nehalem line • (Bloomfield) • 3.2 Major innovations of the 1. gen. Nehalem line 3.3 Major innovations of the 2. gen. Nehalem line • (Lynnfield) 3.1 Introduction to the 1. generation Nehalem line (Bloomfield) 3.1 Introduction to the 1. generation Nehalem line (Bloomfield) (1) 3.1 Introduction to the 1. generation Nehalem line (Bloomfield) Developed at Hillsboro, Oregon, at the site where the Pentium 4 was designed. Experiences with HT Nehalem became a multithreaded design. The design effort took about five years and required thousands of engineers (Ronak Singhal, lead architect of Nehalem) [37]. The 1. gen. Nehalem line targets DP servers, yet its first implementation appeared in the desktop segment (Core i7-9xx (Bloomfield)) 4C in 11/2008 1. gen. 2. gen. 3. gen. 4. gen. 5. gen. West- Core 2 Penryn Nehalem Sandy Ivy Haswell Broad- mere Bridge Bridge well New New New New New New New New Microarch. Process Microarchi. Microarch. Process Microarch. Process Process 45 nm 65 nm 45 nm 32 nm 32 nm 22 nm 22 nm 14 nm TOCK TICK TOCK TICK TOCK TICK TOCK TICK (2006) (2007) (2008) (2010) (2011) (2012) (2013) (2014) Figure : Intel’s Tick-Tock development model (Based on [1]) * 3.1 Introduction to the 1.
    [Show full text]
  • Montecito Overview
    Tukwila – a Quad-Core Intel® Itanium® Processor Eric DeLano Intel® Corporation 1 26 August 2008– HotChips 20, Copyright 2008 Intel Corporation Agenda • Tukwila Overview • Exploiting Thread Level Parallelism • Improving Instruction Level Parallelism • Scalability and Headroom • Power and Frequency management • Enterprise RAS and Manageability • Conclusion 26 August 2008 – HotChips 20, Copyright 2008 Intel Corporation 2 Intel® Itanium® Processor Family Roadmap Intel® Itanium® Processor Tukwila Poulson Kittson Generation Processor 9000, 9100 Series Quad Core (2 Ultra Parallel Highlights Dual Core Billion Transistors) Micro-architecture • 30MB On-Die Cache, Hyper- • 24MB L3 cache • Advanced multi-core Threading Technology architecture • Hyper-Threading • QuickPath Interconnect 9th Technology • Hyper-threading Itanium® • Dual Integrated Memory • Intel® Virtualization enhancements Product Controllers, 4 Channels New Technology • Instruction-level Technologies • Mainframe-Class RAS • Intel® Cache Safe advancements • Enhanced Virtualization Technology • 32nm process technology • Common chipset w/ Next • Lock-step data integrity • Large On-Die Cache Gen Xeon® processor MP technologies (9100 series) • New RAS features • Voltage Frequency Mgmt • DBS Power Management • Up to 2x Perf Vs 9100 • Compatible with Tukwila Technology (9100 series) Series* platforms Targeted Enterprise Business (Database, Business Intelligence, ERP, HPC, …) Segments Availability 2006-07 End 2008 Future Future Intel and the Intel logo are trademarks or registered trademarks
    [Show full text]
  • DCG) Platform Roadmap Data Center Group Marketing (DCGM
    NDA Data Center Group (DCG) Platform Roadmap Data Center Group Marketing (DCGM) WW32 2010 Intel Confidential—NDA Platform Roadmap 11 Legal Information • All products, computer systems, dates, and figures specified are preliminary based on current expectations, and are subject to change without notice. This document contains information on products in the design phase of development. • Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. • Intel processor numbers are not a measure of performance. Processor numbers differentiate features within each processor family, not across different processor families. Click here for details • Hyper-Threading Technology requires a computer system with a processor supporting HT Technology and an HT Technology-enabled chipset, BIOS and operating system. Performance will vary depending on the specific hardware and software you use. For more information including details on which processors support HT Technology, see here • Intel® Turbo Boost Technology requires a PC with a processor with Intel Turbo Boost Technology capability. Intel Turbo Boost Technology performance varies depending on hardware, software and overall system configuration. Check with your PC manufacturer on whether your system delivers Intel Turbo Boost Technology. For more information, see http://www.intel.com/technology/turboboost. • No computer system can provide absolute security under all conditions. Intel® Trusted Execution Technology (Intel® TXT) requires a computer system with Intel® Virtualization Technology, an Intel TXT-enabled processor, chipset, BIOS, Authenticated Code Modules and an Intel TXT-compatible measured launched environment (MLE). The MLE could consist of a virtual machine monitor, an OS or an application.
    [Show full text]
  • How Intel® Itanium®-Based Servers Are Changing the Economics of Mission-Critical Computing
    Mainframe Reliability on Industry-Standard Servers : How Intel® Itanium®-based Servers are Changing the Economics of Mission-Critical Computing Michael EISA Intel Strategic Initiatives Manager Europe, Middle East & Africa HP EMEA TSG 16th October, 2008 11 Business Challenges Mainframe Environment Costs – Lack of agility incurs cost in lost opportunity and business responsiveness – High cost levels for maintenance and support – Poor price/performance – Poor choice of software, high proprietary licensing cost – Lack of Open Systems severely restricts options – Limited skills availability with an ever shrinking pool of talent – Big server footprint with high power and cooling costs Recession fears hit IT budgets* CIO Agenda 2008: The challenges for the year ahead... “The state of the economy and its potential impact on IT budgets will be the key challenge for IT chiefs this year along with the ongoing war for tech talent, according to silicon.com's exclusive CIO Agenda 2008 survey” For notes and disclaimers, see legal information slide at end of this presentation. Source: (1) IDC QST Analysis, October 2006. (2,3) Supporting Performance Benchmarks, System Configurations, and System Pricing in backup. (4) IDC WW Unix Migration Model, 2006. (5) IDC Server Tracker Q4’07 022608. *http://www.silicon.com/research/specialreports/cio-agenda-2008/0,3800014530,39182131,00.htm 2 * Other names and brands may be claimed as the property of others. Copyright © 2008, Intel Corporation. Mainframe customers are moving to Itanium® HPHP AnnouncesAnnounces OverOver
    [Show full text]