Intel Haswell Core

Intel Haswell Core

11/28/12 Today: Intel Haswell and smartphone/tablet processors CS 61C: • This material is not on final exam! Great Ideas in Computer Architecture • Intended for you to see modern day computer Case Studies: Server and Cellphone architectures. microprocessors Instructors: Krste Asanovic, Randy H. Katz hCp://inst.eecs.Berkeley.edu/~cs61c/fa12 Fall 2012 -- Lecture #38 1 Fall 2012 -- Lecture #38 2 Intel Haswell Core • Not yet in producVon, the next core aer Ivy Bridge! • Acknowledgements: Slides include material from Intel and David Kanter at Real World Technologies – Recommend site realworldtech.com for reading about new microprocessor architectures 3 Fall 2012 -- Lecture #38 4 FinFETs are a Berkeley EECS innovaon! Fall 2012 -- Lecture #38 5 Fall 2012 -- Lecture #38 6 1 11/28/12 How to run x86 code fast? Convert CISC to RISC Dynamically! • Translate complex x86 instrucVons into RISC-like micro- • x86 architecture evolved from 16-bit microprocessor operaons (µops) during instrucVon decode designed for CISC microcoded implementaon – e.g., “R ß R op Mem” translates into – 8086 introduced in 1978 (34 years old!) – Only older widely used ISA is IBM 360 architecture family – load T1, Mem # Load from Mem into temp reg introduced in 1964 (48 years old!) – R ß R op T1 # Operate using value in temp • Typical instrucVon: Reg = Reg op M[Reg] – Two-address format • Execute µops using speculave out-of-order superscalar – Register-memory operaons engine with register renaming – Few general-purpose registers (8 iniVally, 16 in 64-bit extension) – Both architectural and temporary registers are renamed from same pool • Many complex instrucVons with repeat prefixes • – Reconstruct whole x86 instrucVons during commit process String move in one instrucVon to report excepVons precisely • Variable-length instrucVons up to 16 bytes long • µop translaon introduced in PenVum Pro family • Added one instrucVon/week over lifeVme!! architecture (P6 family) in 1995, used in all subsequent x86 out-of-order processors 7 Haswell Rename/Reorder [Kanter] Haswell Front-End [Kanter] Fall 2012 -- Lecture #38 9 Fall 2012 -- Lecture #38 10 Haswell Execuon [Kanter] ALU Memory Fall 2012 -- Lecture #38 11 Fall 2012 -- Lecture #38 12 2 11/28/12 Fall 2012 -- Lecture #38 13 Fall 2012 -- Lecture #38 14 Fall 2012 -- Lecture #38 15 Fall 2012 -- Lecture #38 16 Fall 2012 -- Lecture #38 17 Fall 2012 -- Lecture #38 18 3 11/28/12 Administrivia Smartphone Processors • Final review session with TAs • Many companies and parts but some common – Wednesday December 5 features: – 12:00pm-3:00pm – ARM ISA for applicaon processors – 1 Pimentel – Lots of dedicated accelerator blocks, especially image processors for cameras and GPUs for graphics – Only ~2W max power dissipaon! Fall 2012 -- Lecture #38 19 Fall 2012 -- Lecture #38 20 NVIDIA Tegra 3 Tegra 3 Block Diagram • Used in Nexus 7 and many other phones, tablets, and Audi cars… Fall 2012 -- Lecture #38 21 Fall 2012 -- Lecture #38 22 Workload-Based Dynamic Enabling and Disabling of CPU Cores When the Battery Saver core is turned off and the mobile processor is using the main cores for processing, the CPU Governor and CPU management logic continues to monitor CPU workload and utilization of each of the main cores, dynamically enabling or disabling one to four of the main cores. For example, applications such as Email, basic games, or text messaging typically need the processing power of just one of the four main cores. For more demanding applications such as Flash-intensive Web browsing or heavy multitasking, the CPU manager may turn on two CPU cores. But to meet peak performance demands of applications such as console class gaming and media editing or creation, all four CPU cores would be turned on to deliver peak performanceTegra3 “4plus1” operaon demanded by the application(s). ARM Cortex A9 Background tasks, audio, video, Email syncs, social media syncs, etc. Single core performance for Email, 2D games, basic browsing, maps, etc. Dual core performance for Flash enabled browsing, multitasking, video chat, etc. Quad core performance for console class gaming, faster browsing, media processing Fall 2012 -- Lecture #38 23 Fall 2012 -- Lecture #38 24 Figure 5 CPU core management based on workload 9 4 Qualcomm Snapdragon S4 Mobile Processors 2 Snapdragon S4 System Overview First Mobile Introduction of 28nm Process Technology. Snapdragon S4 Processors: Superior Qualcomm designs its own custom CPU cores based on the Thermal Performance ARM architecture, which allows greater design ! exibility at Purpose Built for Mobile Applications. Snapdragon S4’s both the CPU and system level. Snapdragon S4 processors are compact design and next generation process technology the " rst mobile processors manufactured in 28nm process provides a highly e# cient thermal micro-architecture. technology using the ARMv7 Instruction Set Architecture Current generation ARM CPU designs cannot achieve the (ISA) to establish a new generation of performance. By same level of power e# ciency the new Snapdragon S4 migrating to a 28nm process, Qualcomm is able to deliver a class delivers. Snapdragon S4 bene" ts from 28nm pro- highly compact, e# cient design able to scale across multiple cess technology and newly developed cores which provide: form factors from mass market smartphones and tablets to compact laptops, delivering higher performance with less • Smaller size power. • Lower power • Improved thermal performance Next Generation Technology Today. The S4 class repre- sents the industry’s only fully custom system on a chip We expect to see S4 processors hold peak performance optimized for the mobile experience and incorporates in- longer than other processors under a constant workload novations such as per-core dynamic power management, due to the e# cient use of power. Less power lost to leak- next generation highly parallel and programmable GPU, age translates into better battery life. LTE world multimode modem, high speed dual channel Signi" cant Thermal Advantage. Figure 2 shows how S4 interleaved memory, and programmable DSPs. in 28nm process technology outperforms leading ARM Snapdragon S4 processors will be available in single, based designs in 40nm G process by a signi" cant margin, 11/28/12 dual and quad CPU con" gurations for maximum design allowing greater OEM design ! exibility. ! exibility. Customer samples are shipping now, approxi- mately six months ahead of any comparable solutions. Qualcomm Snapdragon MSM8960 Figure 1: MSM8960 Block Diagram Figure 2: Enhanced Thermal PerformanceApple A6 and A6X (32nm) Modem! Multicore! Multimedia! [Chipworks.com, 2012] Subsystem! Subsystem! Subsystem! Krait! Krait! LTE!World! CPU! CPU! Adreno! Modem! GPU! VeNum! VeNum! Audio/Video!HW! GPS/WiFi/BT/FM! L1!Cache! L1!Cache! Accelerators! Hexagon! Hexagon! Hexagon! Multimedia! DSP! DSP! L2!Cache! DSP! Processor! Snapdragon!System!Fabric! Dual!Channel!Memory! Snapdragon!Adaptive!Power!Technologies! Fall 2012 -- Lecture #38 25 Fall 2012 -- Lecture #38 26 Published October 7th, 2011 Copyright ©2011 Qualcomm, Inc. All Rights Reserved Apple A6X Summary • ConVnual rapid change in architecture – Mobile and server processors include large and increasing number of processors on single chip [chipworks.com] – More specialized processors common – New architectural concepts (transacVonal memory) • Covered basic ideas behind architectures in In iPad 4 76.8 GFLOPS Peak! CS61C, but to learn more take CS152 Fall 2012 -- Lecture #38 27 Fall 2012 -- Lecture #38 28 5 .

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    5 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us