Automatic Microprocessor Performance Bug Detection

Total Page:16

File Type:pdf, Size:1020Kb

Automatic Microprocessor Performance Bug Detection Automatic Microprocessor Performance Bug Detection Erick Carvajal Barboza∗, Sara Jacob∗, Mahesh Ketkary, Michael Kishinevskyy, Paul Gratz∗, Jiang Hu∗ ∗Texas A&M University fecarvajal, sarajacob 96, [email protected], [email protected] yIntel Corportation fmahesh.c.ketkar, [email protected] Abstract—Processor design validation and debug is a difficult snoop filter eviction bug [42]. Here the performance regression and complex task, which consumes the lion’s share of the was > 10% on several benchmarks. The bug took several design process. Design bugs that affect processor performance months of debugging effort before being fully characterized. rather than its functionality are especially difficult to catch, particularly in new microarchitectures. This is because, unlike In a competitive environment where time to market and functional bugs, the correct processor performance of new performance are essential, there is a critical need for new, microarchitectures on complex, long-running benchmarks is more automated mechanisms for performance debugging. typically not deterministically known. Thus, when performance To date, little prior research in performance debugging exists, benchmarking new microarchitectures, performance teams may despite its importance and difficulty. The few existing works [9], assume that the design is correct when the performance of the new microarchitecture exceeds that of the previous generation, [51], [53] take an approach of manually constructing a reference despite significant performance regressions existing in the design. model for comparison. However, as described by Ho et al. [26], In this work we present a two-stage, machine learning-based the same bug can be repeated in the model and thus cannot be methodology that is able to detect the existence of performance detected. Moreover, building an effective model is very time bugs in microprocessors. Our results show that our best technique consuming and labor intensive. Thus, as a first step we focus detects 91.5% of microprocessor core performance bugs whose average IPC impact across the studied applications is greater on automating bug detection. We note that, simply detecting a than 1% versus a bug-free design with zero false positives. When bug in the pre-silicon phase or early in the post-silicon phase evaluated on memory system bugs, our technique achieves 100% is highly valuable to shorten the validation process or to avoid detection with zero false positives. Moreover, the detection is costly steppings. Further, bug detection is a challenge by itself. automatic, requiring very little performance engineer time. Performance bugs, while potentially significant versus a bug- I.I NTRODUCTION free version of the given microarchitecture, may be less than the difference between microarchitectures. Consider the data shown Verification and validation are typically the largest com- in Figure 1. The figure shows the performance of several SPEC ponent of the design effort of a new processor. The effort CPU2006 benchmarks [2] on gem5 [6], configured to emulate can be broadly divided into two distinct disciplines, namely, two recent Intel microarchitectures: Ivybridge (3rd gen Intel functional and performance verification. The former is con- “Core” [23]) and Skylake (6th gen [19]), using the methodology cerned with design correctness and has rightfully received discussed in Sections III and IV. The figure shows that the significant attention in the literature. Even though challenging, baseline, bug-free Skylake microarchitecture provides nearly it benefits from the availability of known correct output to 1.7X the performance of the older Ivybridge, as expected given compare against. Alternately, performance verification, which is the greater core execution resources provided. typically concerned with generation-over-generation workload performance improvement, suffers from the lack of a known Ivybridge (Bug-Free) Skylake (Bug-Free) Skylake (Bug 1) Skylake (Bug 2) 3.00 correct output to check against. Given the complexity of computer systems, it is often extremely difficult to accurately 2.00 predict the expected performance of a given design on a given arXiv:2011.08781v2 [cs.AR] 19 Nov 2020 workload. Performance bugs, nevertheless, are a critical concern 1.00 as the cadence of process technology scaling slows, as they rob Speedup 0.00 processor designs of the primary performance and efficiency 403.gcc 433.milc gains to be had through improved microarchitecture. 401.bzip2 444.namd450.soplex 458.sjeng 400.perlbench Further, while simulation studies may provide an approxima- 436.cactusADM Geometric Mean tion of how long a given application might run on a real system, Fig. 1: Speedup of Skylake simulation with and without prior work has shown that these performance estimates will performance bugs, normalized against Ivybridge simulation. typically vary greatly from real system performance [8], [18], [44]. Thus, the process to detect performance bugs, in practice, In addition to a bug-free case, we collected performance is via tedious and complex manual analysis, often taking large data for two arbitrarily selected bug-cases for Skylake. Bug 1 is periods of time to isolate and eliminate a single bug. For an instruction scheduling bug wherein xor instructions are the example, consider the case of the Xeon Platinum 8160 processor only instructions selected to be issued when they are the oldest instruction in the instruction queue. As the figure shows, the overall impact of this bug is very low (< 1% in average across • Our methodology detects 91.5% of these processor core the given workloads). Bug 2 is another instruction scheduling performance bugs leading to ≥ 1% IPC degradation with bug which causes sub instructions to be incorrectly marked as 0% false positives. It also achieves 100% accuracy for synchronizing, thus all younger instructions must wait to be the evaluated performance bugs in memory systems. issued after any sub instruction has been issued and all older As an early work in this space, we limit our scope somewhat instructions must issue before any sub instruction, regardless to maintain tractablity. This work is focused mainly on the of dependencies and hardware resources. The impact of this processor core (the most complex component in the system), bug is larger, at ∼ 7% across the given workloads. however we show that the methodology can be generalized to Nevertheless, for both bugs, the impact is less than the other units by evaluating its usage on memory systems. Further, difference between Skylake and Ivybridge. Thus, from the here we focus on performance bugs that affect cycle count and designers perspective, working on the newer Skylake design consider timing issues to be out of scope. with Ivybridge in hand, if early performance models showed The goal of this work is to find bugs in new architectures the results shown for Bug 2 (even more so Bug 1) designers that are incrementally evolved from those used in the training might incorrectly assume that the buggy Skylake design did data, it may not work as well when there have been large shifts not have performance bugs since it was much better than in microarchitectural details, such as going from in-order to previous generations. Even as the performance difference out-of-order. However, major generalized microarchitectural between microarchitecture generations decreases, differentiating changes are increasingly rare. For example, Intel processors’ between performance bugs and expected behavior remains a last major microarchitecture refactoring occurred ∼ 15 years challenge given the variability between simulated performance ago, from P4 to “Core”. Over this period, there have been and real systems. several microarchitectures where our approach could provide Examining the figure, we note that each bug’s impact varies value. In the event of larger changes, our method can be from workload to workload. That said, in the case of Bug 1, no partially reused by augmenting it with additional features. single benchmark shows a performance loss of more than 1%. II.O BJECTIVE AND ABASELINE APPROACH This makes sense since the bug only impacts the performance of applications which have, relatively rare, xor instructions. The objective of this work is the detection of perfor- From this, one might conclude that Bug 1 is trivial and perhaps mance bugs in new microprocessor designs. We explicitly needs not be fixed at all. This assumption however, would be look at microarchitectural-level bugs, as opposed to logic or wrong, as many workloads do in fact rely heavily upon the architectural-level bugs. As grounds for the detection, one or xor instruction, enough so that a moderate impact on them multiple legacy microarchitecture designs are available and translates into significant performance impact overall. assumed to be bug-free or have only minor performance bugs In this work, we explore several approaches to this problem remaining. Given the extensive pre-silicon and post-silicon and ultimately develop an automated, two-stage, machine debug, this assumption generally holds on legacy designs. learning-based methodology that will allow designers to There are no assumptions about how a bug may look like. automatically detect the existence of performance bugs in As it is very difficult to define a theoretical coverage model for microprocessor designs. Even though this work focus is the performance debug, our methodology is a heuristic approach. processor core, we show that the methodology can be used We introduce
Recommended publications
  • A Superscalar Out-Of-Order X86 Soft Processor for FPGA
    A Superscalar Out-of-Order x86 Soft Processor for FPGA Henry Wong University of Toronto, Intel [email protected] June 5, 2019 Stanford University EE380 1 Hi! ● CPU architect, Intel Hillsboro ● Ph.D., University of Toronto ● Today: x86 OoO processor for FPGA (Ph.D. work) – Motivation – High-level design and results – Microarchitecture details and some circuits 2 FPGA: Field-Programmable Gate Array ● Is a digital circuit (logic gates and wires) ● Is field-programmable (at power-on, not in the fab) ● Pre-fab everything you’ll ever need – 20x area, 20x delay cost – Circuit building blocks are somewhat bigger than logic gates 6-LUT6-LUT 6-LUT6-LUT 3 6-LUT 6-LUT FPGA: Field-Programmable Gate Array ● Is a digital circuit (logic gates and wires) ● Is field-programmable (at power-on, not in the fab) ● Pre-fab everything you’ll ever need – 20x area, 20x delay cost – Circuit building blocks are somewhat bigger than logic gates 6-LUT 6-LUT 6-LUT 6-LUT 4 6-LUT 6-LUT FPGA Soft Processors ● FPGA systems often have software components – Often running on a soft processor ● Need more performance? – Parallel code and hardware accelerators need effort – Less effort if soft processors got faster 5 FPGA Soft Processors ● FPGA systems often have software components – Often running on a soft processor ● Need more performance? – Parallel code and hardware accelerators need effort – Less effort if soft processors got faster 6 FPGA Soft Processors ● FPGA systems often have software components – Often running on a soft processor ● Need more performance? – Parallel
    [Show full text]
  • Multiprocessing Contents
    Multiprocessing Contents 1 Multiprocessing 1 1.1 Pre-history .............................................. 1 1.2 Key topics ............................................... 1 1.2.1 Processor symmetry ...................................... 1 1.2.2 Instruction and data streams ................................. 1 1.2.3 Processor coupling ...................................... 2 1.2.4 Multiprocessor Communication Architecture ......................... 2 1.3 Flynn’s taxonomy ........................................... 2 1.3.1 SISD multiprocessing ..................................... 2 1.3.2 SIMD multiprocessing .................................... 2 1.3.3 MISD multiprocessing .................................... 3 1.3.4 MIMD multiprocessing .................................... 3 1.4 See also ................................................ 3 1.5 References ............................................... 3 2 Computer multitasking 5 2.1 Multiprogramming .......................................... 5 2.2 Cooperative multitasking ....................................... 6 2.3 Preemptive multitasking ....................................... 6 2.4 Real time ............................................... 7 2.5 Multithreading ............................................ 7 2.6 Memory protection .......................................... 7 2.7 Memory swapping .......................................... 7 2.8 Programming ............................................. 7 2.9 See also ................................................ 8 2.10 References .............................................
    [Show full text]
  • The Intel X86 Microarchitectures Map Version 2.0
    The Intel x86 Microarchitectures Map Version 2.0 P6 (1995, 0.50 to 0.35 μm) 8086 (1978, 3 µm) 80386 (1985, 1.5 to 1 µm) P5 (1993, 0.80 to 0.35 μm) NetBurst (2000 , 180 to 130 nm) Skylake (2015, 14 nm) Alternative Names: i686 Series: Alternative Names: iAPX 386, 386, i386 Alternative Names: Pentium, 80586, 586, i586 Alternative Names: Pentium 4, Pentium IV, P4 Alternative Names: SKL (Desktop and Mobile), SKX (Server) Series: Pentium Pro (used in desktops and servers) • 16-bit data bus: 8086 (iAPX Series: Series: Series: Series: • Variant: Klamath (1997, 0.35 μm) 86) • Desktop/Server: i386DX Desktop/Server: P5, P54C • Desktop: Willamette (180 nm) • Desktop: Desktop 6th Generation Core i5 (Skylake-S and Skylake-H) • Alternative Names: Pentium II, PII • 8-bit data bus: 8088 (iAPX • Desktop lower-performance: i386SX Desktop/Server higher-performance: P54CQS, P54CS • Desktop higher-performance: Northwood Pentium 4 (130 nm), Northwood B Pentium 4 HT (130 nm), • Desktop higher-performance: Desktop 6th Generation Core i7 (Skylake-S and Skylake-H), Desktop 7th Generation Core i7 X (Skylake-X), • Series: Klamath (used in desktops) 88) • Mobile: i386SL, 80376, i386EX, Mobile: P54C, P54LM Northwood C Pentium 4 HT (130 nm), Gallatin (Pentium 4 Extreme Edition 130 nm) Desktop 7th Generation Core i9 X (Skylake-X), Desktop 9th Generation Core i7 X (Skylake-X), Desktop 9th Generation Core i9 X (Skylake-X) • Variant: Deschutes (1998, 0.25 to 0.18 μm) i386CXSA, i386SXSA, i386CXSB Compatibility: Pentium OverDrive • Desktop lower-performance: Willamette-128
    [Show full text]
  • Intel® Omni-Path Architecture Overview and Update
    The architecture for Discovery June, 2016 Intel Confidential Caught in the Vortex…? Business Efficiency & Agility DATA: Trust, Privacy, sovereignty Innovation: New Economy Biz Models Macro Economic Effect Growth Enablers/Inhibitors Intel® Solutions Summit 2016 2 Intel® Solutions Summit 2016 3 Intel Confidential 4 Data Center Blocks Reduce Complexity Intel engineering, validation, support Data Center Blocks Speed time to market Begin with a higher level of integration HPC Cloud Enterprise Storage Increase Value VSAN Ready HPC Compute SMB Server Block Reduce TCO, value pricing Block Node Fuel innovation Server blocks for specific segments Focus R&D on value-add and differentiation Intel® Solutions Summit 2016 5 A Holistic Design Solution for All HPC Needs Intel® Scalable System Framework Small Clusters Through Supercomputers Compute Memory/Storage Compute and Data-Centric Computing Fabric Software Standards-Based Programmability On-Premise and Cloud-Based Intel Silicon Photonics Intel® Xeon® Processors Intel® Solutions for Lustre* Intel® Omni-Path Architecture HPC System Software Stack Intel® Xeon Phi™ Processors Intel® SSDs Intel® True Scale Fabric Intel® Software Tools Intel® Xeon Phi™ Coprocessors Intel® Optane™ Technology Intel® Ethernet Intel® Cluster Ready Program Intel® Server Boards and Platforms 3D XPoint™ Technology Intel® Silicon Photonics Intel® Visualization Toolkit Intel Confidential 14 Parallel is the Path Forward Intel® Xeon® and Intel® Xeon Phi™ Product Families are both going parallel How do we attain extremely high compute
    [Show full text]
  • Intel Atom Processor E3800 Product Families in Retail
    White Paper Intelligent Systems Intel® Celeron® Processor and Intel® Atom™ Processor E3800 Product Families Intel® Celeron® Processor and Intel® Atom™ Processor E3800 Product Families in Retail Transform user experiences in entry retail devices with full HD video decode, improved graphics, quad-core compute performance, and built-in security Introduction Advanced features include: Intelligent devices that provide HD • Media: Scalable full HD video playback video capability, compelling graphics, includes support for 10 or more responsive performance, and security are simultaneous video streams. transforming in-store retail experiences. • Graphics: Gen 7 Intel® Graphics Today’s retail customers expect POS Architecture enables enhanced visual systems, interactive kiosks, and digital processing over previous-generation signs to support rich media and graphics Intel Atom processors. experience for timely and visually compelling digital promotions and a range • Power and Form Factor: SoC with of choices at checkout, with confidence smaller package size and industrial “The Intel Celeron processor and that the device provides security to temperature range is ideal for thin, protect transactional and personal data. light and environmentally adaptive Intel Atom processor E3800 entry retail devices. The Intel® Celeron® processor and Intel® product families provide full Atom™ processor E3800 product families • Compute: Quad-core processing1 HD simultaneous video decode for intelligent systems help bring these enables improved out-of-order compute capabilities to entry retail devices. performance for more responsive user capability, delivering interactive Compared to previous-generation Intel experiences. Celeron and Atom processors, this new 2-D and 3-D graphics with much • Security: Built-in hardware-assisted processor family provides significantly security enhancements include Intel® improved playback enabling improved media and graphics performance AES New Instructions (Intel® AES NI)2 and enables smaller, more power-efficient immersive visual experiences and Secure Boot.
    [Show full text]
  • Intel® Atom™ Processor E3800 the Latest Low Power Platform E3800 Family Platform for Intelligent Systems
    Intel® Atom™ Processor E3800 The Latest Low Power Platform E3800 Family Platform for Intelligent Systems tŚŝůĞĚĞƐŝŐŶĞĚƚŽďĞĂƚƌƵĞƚĞƐƚŽĨ/ŶƚĞů͛ƐƉĞƌĨŽƌŵĂŶĐĞŝŶƚŚĞƵůƚƌĂŵŽďŝůĞƐƉĂĐĞ͕^ŝůǀĞƌŵŽŶƚŝƐƚŚĞĮƌƐƚƚƌƵĞĂƌĐŚŝƚĞĐƚƵƌĞ ƵƉĚĂƚĞƚŽ/ŶƚĞů͛ƐƚŽŵƉƌŽĐĞƐƐŽƌƐŝŶĐĞŝƚƐŝŶƚƌŽĚƵĐƟŽŶŝŶϮϬϬϴ͘>ĞǀĞƌĂŐŝŶŐ/ŶƚĞů͛ƐĮƌƐƚϮϮŶŵƉƌŽĐĞƐƐĂŶĚĂǀĞƌLJůŽǁƉŽǁĞƌͲ ŵŝĐƌŽĂƌĐŚŝƚĞĐƚƵƌĞ͕^ŝůǀĞƌŵŽŶƚĂŝŵƐƐƋƵĂƌĞůLJĂƚƚŚĞůĂƚĞƐƚ<ƌĂŝƚĐŽƌĞƐĨƌŽŵYƵĂůĐŽŵŵĂŶĚZD͛ƐŽƌƚĞdžϭϱ͘ĂƐĞĚŽŶ ^ŝůǀĞƌŵŽŶƚ͕/ŶƚĞůΠŝŶƚƌŽĚƵĐĞƐϯϴϬϬƉƌŽĚƵĐƚĨĂŵŝůLJ͕ĂƐĞƌŝĞƐŽĨƐLJƐƚĞŵŽŶĐŚŝƉ;^ŽͿĚĞƐŝŐŶĞĚĨŽƌůŽǁͲƉŽǁĞƌ͕ĨĞĂƚƵƌĞͲƌŝĐŚ ĂŶĚŚŝŐŚůLJͲĐĂƉĂďůĞĂƉƉůŝĐĂƟŽŶƐ͘ ϯϴϬϬƉƌŽĚƵĐƚĨĂŵŝůLJƚĂŬĞƐƵƉƚŽĨŽƵƌ^ŝůǀĞƌŵŽŶƚĐŽƌĞƐ͕ĂŶĚĨŽƌƚŚĞĮƌƐƚƟŵĞŝŶĂŶƵůƚƌĂŵŽďŝůĞ/ŶƚĞů^Ž͕ŝƐƉĂŝƌĞĚǁŝƚŚ /ŶƚĞů͛ƐŽǁŶŐƌĂƉŚŝĐƐ/W͘/ŶŽƚŚĞƌǁŽƌĚƐ͕ƌĂƚŚĞƌƚŚĂŶƵƐŝŶŐĂ'WhďůŽĐŬĨƌŽŵ/ŵĂŐŝŶĂƟŽŶdĞĐŚŶŽůŽŐŝĞƐ͕E3800 product family leverages the same GPU architecture as the 3rdŐĞŶĞƌĂƟŽŶ/ŶƚĞůŽƌĞƉƌŽĐĞƐƐŽƌƐ;ĐŽĚĞŶĂŵĞĚ/ǀLJƌŝĚŐĞͿ͘ Silvermont Core Highlights Better Performance Better Power Efficiency 22nm Architecture 200 250 150 300 100 350 50 400 0 450 500 Out-of-order execuon engine Wider dynamic operang range 3D Tri-gate transistors tuned for New mul-core and system fabric Enhanced acve and idle power SoC products architecture management Architecture and design co-opmized with the process New IA instrucons extensions (Intel Core Westmere Level) Bay Trail: Not just for Atoms anymore E3800 product family combines a CPU based on Intel’s new Silver- mont architecture with a GPU that is architecturally similar to (but 4xPCIe* less powerful than) the HD 4000 graphics engine integrated in the 3rdŐĞŶĞƌĂƟŽŶ/ŶƚĞůΠŽƌĞƉƌŽĐĞƐƐŽƌƐůĂƵŶĐŚĞĚŝŶĞĂƌůLJϮϬϭϮ͘dŚĞƐĞ
    [Show full text]
  • The Intel X86 Microarchitectures Map Version 2.2
    The Intel x86 Microarchitectures Map Version 2.2 P6 (1995, 0.50 to 0.35 μm) 8086 (1978, 3 µm) 80386 (1985, 1.5 to 1 µm) P5 (1993, 0.80 to 0.35 μm) NetBurst (2000 , 180 to 130 nm) Skylake (2015, 14 nm) Alternative Names: i686 Series: Alternative Names: iAPX 386, 386, i386 Alternative Names: Pentium, 80586, 586, i586 Alternative Names: Pentium 4, Pentium IV, P4 Alternative Names: SKL (Desktop and Mobile), SKX (Server) Series: Pentium Pro (used in desktops and servers) • 16-bit data bus: 8086 (iAPX Series: Series: Series: Series: • Variant: Klamath (1997, 0.35 μm) 86) • Desktop/Server: i386DX Desktop/Server: P5, P54C • Desktop: Willamette (180 nm) • Desktop: Desktop 6th Generation Core i5 (Skylake-S and Skylake-H) • Alternative Names: Pentium II, PII • 8-bit data bus: 8088 (iAPX • Desktop lower-performance: i386SX Desktop/Server higher-performance: P54CQS, P54CS • Desktop higher-performance: Northwood Pentium 4 (130 nm), Northwood B Pentium 4 HT (130 nm), • Desktop higher-performance: Desktop 6th Generation Core i7 (Skylake-S and Skylake-H), Desktop 7th Generation Core i7 X (Skylake-X), • Series: Klamath (used in desktops) 88) • Mobile: i386SL, 80376, i386EX, Mobile: P54C, P54LM Northwood C Pentium 4 HT (130 nm), Gallatin (Pentium 4 Extreme Edition 130 nm) Desktop 7th Generation Core i9 X (Skylake-X), Desktop 9th Generation Core i7 X (Skylake-X), Desktop 9th Generation Core i9 X (Skylake-X) • New instructions: Deschutes (1998, 0.25 to 0.18 μm) i386CXSA, i386SXSA, i386CXSB Compatibility: Pentium OverDrive • Desktop lower-performance: Willamette-128
    [Show full text]
  • Intel Launches Low-Power, High-Performance Silvermont Microarchitecture
    May 6, 2013 Intel Launches Low-Power, High-Performance Silvermont Microarchitecture NEWS HIGHLIGHTS: ● Intel announces Silvermont microarchitecture, a new design in Intel's 22nm Tri-Gate SoC process delivering significant increases in performance and energy efficiency. ● Silvermont microarchitecture delivers ~3x more peak performance or the same performance at ~5x lower power over current-generation Intel® Atom™ processor core.1 ● Silvermont to serve as the foundation for a breadth of 22nm products targeted at tablets, smartphones, microservers, network infrastructure, storage and other market segments including entry laptops and in-vehicle infotainment. SANTA CLARA, Calif., May 6, 2013 – Intel Corporation today took the wraps off its brand new, low-power, high-performance microarchitecture named Silvermont. The technology is aimed squarely at low-power requirements in market segments from smartphones to the data center. Silvermont will be the foundation for a range of innovative products beginning to come to market later this year, and will also be manufactured using the company's leading-edge, 22nm Tri-Gate SoC manufacturing process, which brings significant performance increases and improved energy efficiency. "Silvermont is a leap forward and an entirely new technology foundation for the future that will address a broad range of products and market segments," said Dadi Perlmutter, Intel executive vice president and chief product officer. "Early sampling of our 22nm SoCs, including "Bay Trail" and "Avoton" is already garnering positive feedback from our customers. Going forward, we will accelerate future generations of this low-power microarchitecture on a yearly cadence." The Silvermont microarchitecture delivers industry-leading performance-per-watt efficiency.2 The highly balanced design brings increased support for a wider dynamic range and seamlessly scales up and down in performance and power efficiency.
    [Show full text]
  • Silvermont CPU Overview
    Introducing Next Generation Low Power Microarchitecture: Silvermont Dadi Perlmutter Executive Vice President General Manager, Intel Architecture Group Chief Product Officer Risk Factors •Today’s presentations contain forward-looking statements. All statements made that are not historical facts are subject to a number of risks and uncertainties, and actual results may differ materially. Please refer to our most recent earnings release, Form 10-Q and 10-K filing available for more information on the risk factors that could cause actual results to differ. •If we use any non-GAAP financial measures during the presentations, you will find on our website, intc.com, the required reconciliation to the most directly comparable GAAP financial measure. Rev. 4/16/13 Legal Disclaimers Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to: http://www.intel.com/performance . Intel, Intel Atom and the Intel logo are trademarks of Intel Corporation in the United States and other countries. 1 Based on the geometric mean of a variety of power and performance measurements across various benchmarks. Benchmarks included in this geomean are measurements on browsing benchmarks and workloads including SunSpider* and page load tests on Internet Explorer*, FireFox*, & Chrome*; Dhrystone*; EEMBC* workloads including CoreMark*; Android* workloads including CaffineMark*, AnTutu*, Linpack* and Quadrant* as well as measured estimates on SPECint* rate_base2000 & SPECfp* rate_base2000; on Silvermont preproduction systems compared to Atom processor Z2580.
    [Show full text]
  • Intel's Atom Lines 1. Introduction to Intel's Atom Lines 3. Atom-Based Platforms Targeting Entry Level Desktops and Notebook
    Intel’s Atom lines • 1. Introduction to Intel’s Atom lines • 2. Intel’s low-power oriented CPU micro-architectures • 3. Atom-based platforms targeting entry level desktops and notebooks • 4. Atom-based platforms targeting tablets • 5. Atom-based platforms targeting smartphones • 6. Intel’s withdrawal from the mobile market • 7. References 1. Introduction to Intel’s Atom lines • 1.1 The rapidly increasing importance of the mobile market space • 1.2 Related terminology • 1.3 Introduction to Intel’s low-power Atom series 1.1 The rapidly increasing importance of the mobile market space 1.1 The rapidly increasing importance of the mobile market space (1) 1.1 The rapidly increasing importance of the mobile market space Diversification of computer market segments in the 2000’s Main computer market segments around 2000 Servers Desktops Embedded computer devices E.g. Intel’s Xeon lines Intel’s Pentium 4 lines ARM’s lines AMD’s Opteron lines AMD’s Athlon lines Major trend in the first half of the 2000’s: spreading of mobile devices (laptops) Main computer market segments around 2005 Servers Desktops Mobiles Embedded computer devices E.g. Intel’s Xeon lines Intel’s Pentium 4 lines Intel’s Celeron lines ARM’s lines AMD’s Opteron lines AMD’s Athlon64 lines AMD’s Duron lines 1.1 The rapidly increasing importance of the mobile market space (2) Yearly worldwide sales and Compound Annual Growth Rates (CAGR) of desktops and mobiles (laptops) around 2005 [1] 350 300 Millions 250 CAGR 17% 200 Mobile 150 100 Desktop CAGR 5% 50 0 2003 2004 2005 2006 2007 2008
    [Show full text]
  • Intel® 64 and IA-32 Architectures Software Developer's Manual
    Intel® 64 and IA-32 Architectures Software Developer’s Manual Volume 4: Model-Specific Registers NOTE: The Intel® 64 and IA-32 Architectures Software Developer's Manual consists of ten volumes: Basic Architecture, Order Number 253665; Instruction Set Reference A-L, Order Number 253666; Instruction Set Reference M-U, Order Number 253667; Instruction Set Reference V-Z, Order Number 326018; Instruction Set Reference, Order Number 334569; System Programming Guide, Part 1, Order Number 253668; System Programming Guide, Part 2, Order Number 253669; System Programming Guide, Part 3, Order Number 326019; System Programming Guide, Part 4, Order Number 332831; Model-Specific Registers, Order Number 335592. Refer to all ten volumes when evaluating your design needs. Order Number: 335592-065US December 2017 Intel technologies features and benefits depend on system configuration and may require enabled hardware, software, or service activation. Learn more at intel.com, or from the OEM or retailer. No computer system can be absolutely secure. Intel does not assume any liability for lost or stolen data or systems or any damages resulting from such losses. You may not use or facilitate the use of this document in connection with any infringement or other legal analysis concerning Intel products described herein. You agree to grant Intel a non-exclusive, royalty-free license to any patent claim thereafter drafted which includes subject matter disclosed herein. No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document. The products described may contain design defects or errors known as errata which may cause the product to deviate from published specifica- tions.
    [Show full text]
  • Intel® 64 and IA32 Architectures Performance Monitoring Events
    Intel® 64 and IA32 Architectures Performance Monitoring Events 2017 December Revision 1.0 Document Number:335279-001 Performance Monitoring Events No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document.Intel disclaims all express and implied warranties, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non infringement, as well as any warranty arising from course of performance, course of dealing, or usage in trade. This document contains information on products, services and/or processes in development. All information provided here is subject to change without notice.Contact your Intel representative to obtain the latest forecast, schedule, specifications and roadmaps. The products and services described may contain defects or errors known as errata which may cause deviations from published specifications. Current characterized errata are available on request. Intel technologies’ features and benefits depend on system configuration and may require enabled hardware, software or service activation. Performance varies depending on system configuration. No computer system can be absolutely secure. Check with your system manufacturer or retailer or learn more at http://intel.com/. Copies of documents which have an order number and are referenced in this document may be obtained by calling 1.800.548.4725 or by visiting www.intel.com/design/literature.htm. Intel, the Intel logo, and Xeon are trademarks of Intel Corporation
    [Show full text]