A Portable Kernel Abstraction for Low-Overhead Ephemeral Mapping Management Khaled Elmeleegy, Anupam Chanda, and Alan L

Total Page:16

File Type:pdf, Size:1020Kb

A Portable Kernel Abstraction for Low-Overhead Ephemeral Mapping Management Khaled Elmeleegy, Anupam Chanda, and Alan L A Portable Kernel Abstraction For Low-Overhead Ephemeral Mapping Management Khaled Elmeleegy, Anupam Chanda, and Alan L. Cox Department of Computer Science Rice University, Houston, Texas 77005, USA {kdiaa,anupamc,alc}@cs.rice.edu Willy Zwaenepoel School of Computer and Communication Sciences EPFL, Lausanne, Switzerland [email protected] Abstract of process tracing and debugging. To create an ephemeral mapping two actions are required: the Modern operating systems create ephemeral allocation of a temporary kernel virtual address virtual-to-physical mappings for a variety of pur- and the modification of the virtual-to-physical ad- poses, ranging from the implementation of inter- dress mapping. To date, these actions have been process communication to the implementation of performed through separate interfaces. This pa- process tracing and debugging. With succeed- per demonstrates the benefits of combining these ing generations of processors the cost of creat- actions under a single interface. ing ephemeral mappings is increasing, particu- This work is motivated by the increasing cost larly when an ephemeral mapping is shared by of ephemeral mapping creation, particularly, the multiple processors. increasing cost of modifications to the virtual-to- To reduce the cost of ephemeral mapping man- physical mapping. To see this trend, consider the agement within an operating system kernel, we latency in processor cycles for the invlpg in- introduce the sf buf ephemeral mapping in- struction across several generations of the IA32 terface. We demonstrate how in several kernel architecture. This instruction invalidates the subsystems — including pipes, memory disks, Translation Look-aside Buffer (TLB) entry for sockets, execve(), ptrace(), and the vnode the given virtual address. In general, the op- pager — the current implementation can be re- erating system must issue this instruction when placed by calls to the sf buf interface. it changes a virtual-to-physical mapping. When We describe the implementation of the this instruction was introduced in the 486, it took sf buf interface on the 32-bit i386 architecture 12 cycles to execute. In the Pentium, its latency and the 64-bit amd64 architecture. This imple- increased to 25 cycles. In the Pentium III, its mentation reduces the cost of ephemeral mapping latency increased to ∼100 cycles. Finally, in management by reusing wherever possible ex- the Pentium 4, its latency has reached ∼500 to isting virtual-to-physical address mappings. We ∼1000 cycles. So, despite a factor of three de- evaluate the sf buf interface for the pipe, mem- crease in the cycle time between a high-end Pen- ory disk and networking subsystems. Our results tium III and a high-end Pentium 4, the cost of a show that these subsystems perform significantly mapping change measured in wall clock time has better when using the sf buf interface. On a actually increased. multiprocessor platform interprocessor interrupts are greatly reduced in number or eliminated alto- Furthermore, on a multiprocessor, the cost of gether. ephemeral mapping creation can be significantly higher if the mapping is shared by two or more processors. Unlike data cache coherence, TLB 1 Introduction coherence is generally implemented in software by the operating system [2, 12]: The processor Modern operating systems create ephemeral initiating a mapping change issues an interpro- virtual-to-physical mappings for a variety of pur- cessor interrupt (IPI) to each of the processors poses, ranging from the implementation of in- that share the mapping; the interrupt handler that terprocess communication to the implementation is executed by each of these processors includes USENIX Association 2005 USENIX Annual Technical Conference 223 an instruction, such as invlpg, that invalidates lines of code reduction in an operating system that processor’s TLB entry for the mapping’s vir- kernel from using the sf buf interface. Sec- tual address. Consequently, a mapping change is tion 6 presents an experimental evaluation of the quite costly for all processors involved. sf buf interface. We present related work in In the past, TLB coherence was only an issue Section 7 and conclude in Section 8. for multiprocessors. Today, however, some im- plementations of Simultaneous Multi-Threading 2 Ephemeral Mapping Usage (SMT), such as the Pentium 4’s, require the op- erating system to implement TLB coherence in a We use FreeBSD 5.3 as an example to demon- single-processor system. strate the use of ephemeral mappings. FreeBSD To reduce the cost and complexity of 5.3 uses ephemeral mappings in a wide vari- ephemeral mapping management within an op- ety of places, including the implementation of erating system kernel, we introduce the sf buf pipes, memory disks, sendfile(),sockets, ephemeral mapping interface. Like Mach’s pmap execve(), ptrace(), and the vnode pager. interface [11], our objective is to provide a machine-independent interface enabling variant, machine-specific implementations. Unlike pmap, 2.1 Pipes our sf buf interface supports allocation of tem- Conventional implementations of Unix pipes per- porary kernel virtual addresses. We describe how form two copy operations to transfer the data various subsystems in the operating system ker- from the writer to the reader. The writer copies nel benefit from the sf buf interface. the data from the source buffer in its user address We present the implementation of the sf buf space to a buffer in the kernel address space, and interface on two representative architectures, the reader later copies this data from the kernel i386, a 32-bit architecture, and amd64, a 64-bit buffer to the destination buffer in its user address architecture. This implementation is efficient: it space. performs creation and destruction of ephemeral In the case of large data transfers that fill mappings in O(1) expected time on i386 and O(1) the pipe and block the writer, FreeBSD uses time on amd64.Thesf buf interface enables ephemeral mappings to eliminate the copy oper- the automatic reuse of ephemeral mappings so ation by the writer, reducing the number of copy that the high cost of mapping changes can be operations from two to one. The writer first de- amortized over several uses. In addition, this im- termines the set of physical pages underlying the plementation of the sf buf interface incorpo- source buffer, then wires each of these physical rates several techniques for avoiding TLB coher- pages disabling their replacement or page-out, ence operations, eliminating the need for costly and finally passes the set to the receiver through IPIs. the object implementing the pipe. Later, the We have evaluated the performance of the pipe, reader obtains the set of physical pages from the memory disk and networking subsystems using pipe object. For each physical page, it creates an the sf buf interface. Our results show that these ephemeral mapping that is private to the current subsystems benefit significantly from its use. For CPU and is not used by other CPUs. Henceforth, the bw pipe program from the lmbench bench- we refer to this kind of mapping as a CPU-private mark [10] the sf buf interface improves perfor- ephemeral mapping. The reader then copies the mance up to 168% on one of our test platforms. data from the kernel virtual address provided by In all of our experiments the number of TLB in- the ephemeral mapping to the destination buffer validations is greatly reduced or eliminated. in its user address space, destroys the ephemeral The rest of the paper is organized as follows. mapping, and unwires the physical page reen- The next two sections motivate this work from abling its replacement or page-out. two different perspectives: First, Section 2 de- scribes the many uses of ephemeral mappings in 2.2 Memory Disks an operating system kernel; second, Section 3 presents the execution costs for the machine-level Memory disks have a pool of physical pages. To operations used to implement ephemeral map- read from or write to a memory disk a CPU- pings. We define the sf buf interface and its private ephemeral mapping for the desired pages implementation on two representative architec- of the memory disk is created. Then the data is tures in Section 4. Section 5 summarizes the copied between the ephemerally mapped pages 224 2005 USENIX Annual Technical Conference USENIX Association and the read/write buffer provided by the user. 2.5 ptrace(2) After the read or write operation completes, the ephemeral mapping is freed. The ptrace(2) system call enables one pro- cess to trace or debug another process. It in- cludes the capability to read or write the memory 2.3 sendfile(2) and Sockets of the traced process. To read from or write to the traced process’s memory, the kernel creates CPU- The zero-copy sendfile(2) system call and private ephemeral mappings for the desired phys- zero-copy socket send use ephemeral mappings ical pages of the traced process. The kernel then in a similar way. For zero-copy send the copies the data between the ephemerally mapped kernel wires the physical pages correspond- pages and the buffer provided by the tracing pro- ing to the user buffer in memory and then cess. The kernel then frees the ephemeral map- creates ephemeral mappings for them. For pings. sendfile() it does the same for the pages of the file. The ephemeral mappings persist until the corresponding mbuf chain is freed, e.g., when 2.6 Vnode Pager TCP acknowledgments are received. The ker- nel then frees the ephemeral mappings and un- The vnode pager creates ephemeral mappings to wires the corresponding physical pages. These carry out I/O. These ephemeral mappings are not ephemeral mappings are not CPU-private be- CPU private. They are used for paging to and cause they need to be shared among all the CPUs from file systems with small block sizes.
Recommended publications
  • Nessus 8.3 User Guide
    Nessus 8.3.x User Guide Last Updated: September 24, 2021 Table of Contents Welcome to Nessus 8.3.x 12 Get Started with Nessus 15 Navigate Nessus 16 System Requirements 17 Hardware Requirements 18 Software Requirements 22 Customize SELinux Enforcing Mode Policies 25 Licensing Requirements 26 Deployment Considerations 27 Host-Based Firewalls 28 IPv6 Support 29 Virtual Machines 30 Antivirus Software 31 Security Warnings 32 Certificates and Certificate Authorities 33 Custom SSL Server Certificates 35 Create a New Server Certificate and CA Certificate 37 Upload a Custom Server Certificate and CA Certificate 39 Trust a Custom CA 41 Create SSL Client Certificates for Login 43 Nessus Manager Certificates and Nessus Agent 46 Install Nessus 48 Copyright © 2021 Tenable, Inc. All rights reserved. Tenable, Tenable.io, Tenable Network Security, Nessus, SecurityCenter, SecurityCenter Continuous View and Log Correlation Engine are registered trade- marks of Tenable,Inc. Tenable.sc, Tenable.ot, Lumin, Indegy, Assure, and The Cyber Exposure Company are trademarks of Tenable, Inc. All other products or services are trademarks of their respective Download Nessus 49 Install Nessus 51 Install Nessus on Linux 52 Install Nessus on Windows 54 Install Nessus on Mac OS X 56 Install Nessus Agents 58 Retrieve the Linking Key 59 Install a Nessus Agent on Linux 60 Install a Nessus Agent on Windows 64 Install a Nessus Agent on Mac OS X 70 Upgrade Nessus and Nessus Agents 74 Upgrade Nessus 75 Upgrade from Evaluation 76 Upgrade Nessus on Linux 77 Upgrade Nessus on Windows 78 Upgrade Nessus on Mac OS X 79 Upgrade a Nessus Agent 80 Configure Nessus 86 Install Nessus Home, Professional, or Manager 87 Link to Tenable.io 88 Link to Industrial Security 89 Link to Nessus Manager 90 Managed by Tenable.sc 92 Manage Activation Code 93 Copyright © 2021 Tenable, Inc.
    [Show full text]
  • I386-Engine™ Technical Manual
    i386-Engine™ C/C++ Programmable, 32-bit Microprocessor Module Based on the Intel386EX Technical Manual 1950 5 th Street, Davis, CA 95616, USA Tel: 530-758-0180 Fax: 530-758-0181 Email: [email protected] http://www.tern.com Internet Email: [email protected] http://www.tern.com COPYRIGHT i386-Engine, VE232, A-Engine, A-Core, C-Engine, V25-Engine, MotionC, BirdBox, PowerDrive, SensorWatch, Pc-Co, LittleDrive, MemCard, ACTF, and NT-Kit are trademarks of TERN, Inc. Intel386EX and Intel386SX are trademarks of Intel Coporation. Borland C/C++ are trademarks of Borland International. Microsoft, MS-DOS, Windows, and Windows 95 are trademarks of Microsoft Corporation. IBM is a trademark of International Business Machines Corporation. Version 2.00 October 28, 2010 No part of this document may be copied or reproduced in any form or by any means without the prior written consent of TERN, Inc. © 1998-2010 1950 5 th Street, Davis, CA 95616, USA Tel: 530-758-0180 Fax: 530-758-0181 Email: [email protected] http://www.tern.com Important Notice TERN is developing complex, high technology integration systems. These systems are integrated with software and hardware that are not 100% defect free. TERN products are not designed, intended, authorized, or warranted to be suitable for use in life-support applications, devices, or systems, or in other critical applications. TERN and the Buyer agree that TERN will not be liable for incidental or consequential damages arising from the use of TERN products. It is the Buyer's responsibility to protect life and property against incidental failure. TERN reserves the right to make changes and improvements to its products without providing notice.
    [Show full text]
  • DOS Virtualized in the Linux Kernel
    DOS Virtualized in the Linux Kernel Robert T. Johnson, III Abstract Due to the heavy dominance of Microsoft Windows® in the desktop market, some members of the software industry believe that new operating systems must be able to run Windows® applications to compete in the marketplace. However, running applications not native to the operating system generally causes their performance to degrade significantly. When the application and the operating system were written to run on the same machine architecture, all of the instructions in the application can still be directly executed under the new operating system. Some will only need to be interpreted differently to provide the required functionality. This paper investigates the feasibility and potential to speed up the performance of such applications by including the support needed to run them directly in the kernel. In order to avoid the impact to the kernel when these applications are not running, the needed support was built as a loadable kernel module. 1 Introduction New operating systems face significant challenges in gaining consumer acceptance in the desktop marketplace. One of the first realizations that must be made is that the majority of this market consists of non-technical users who are unlikely to either understand or have the desire to understand various technical details about why the new operating system is “better” than a competitor’s. This means that such details are extremely unlikely to sway a large amount of users towards the operating system by themselves. The incentive for a consumer to continue using their existing operating system or only upgrade to one that is backwards compatible is also very strong due to the importance of application software.
    [Show full text]
  • Enclave Security and Address-Based Side Channels
    Graz University of Technology Faculty of Computer Science Institute of Applied Information Processing and Communications IAIK Enclave Security and Address-based Side Channels Assessors: A PhD Thesis Presented to the Prof. Stefan Mangard Faculty of Computer Science in Prof. Thomas Eisenbarth Fulfillment of the Requirements for the PhD Degree by June 2020 Samuel Weiser Samuel Weiser Enclave Security and Address-based Side Channels DOCTORAL THESIS to achieve the university degree of Doctor of Technical Sciences; Dr. techn. submitted to Graz University of Technology Assessors Prof. Stefan Mangard Institute of Applied Information Processing and Communications Graz University of Technology Prof. Thomas Eisenbarth Institute for IT Security Universit¨atzu L¨ubeck Graz, June 2020 SSS AFFIDAVIT I declare that I have authored this thesis independently, that I have not used other than the declared sources/resources, and that I have explicitly indicated all material which has been quoted either literally or by content from the sources used. The text document uploaded to TUGRAZonline is identical to the present doctoral thesis. Date, Signature SSS Prologue Everyone has the right to life, liberty and security of person. Universal Declaration of Human Rights, Article 3 Our life turned digital, and so did we. Not long ago, the globalized commu- nication that we enjoy today on an everyday basis was the privilege of a few. Nowadays, artificial intelligence in the cloud, smartified handhelds, low-power Internet-of-Things gadgets, and self-maneuvering objects in the physical world are promising us unthinkable freedom in shaping our personal lives as well as society as a whole. Sadly, our collective excitement about the \new", the \better", the \more", the \instant", has overruled our sense of security and privacy.
    [Show full text]
  • MILITARY I386tm SX MICROPROCESSOR
    MILITARY i386TM SX MICROPROCESSOR Y Full 32-Bit Internal Architecture Y Virtual M8086 Mode Allows Execution Ð 8-, 16-, 32-Bit Data Types of M8086 Software in a Protected and Ð 8 General Purpose 32-Bit Registers Paged System Y Runs Intel386TM Software in a Cost Y High Speed Numerics Support with the Effective 16-Bit Hardware Environment Military i387TM SX Coprocessor Ð Runs Same Applications and O.S.'s Y On-Chip Debugging Support Including TM as the Military i386 DX Processor Breakpoint Registers Ð Object Code Compatible with M8086, M80186, M80286, and i386 Y Complete System Development Processors Support Ð Runs MS-DOS*, OS/2* and UNIX** Ð Software: C, PL/M, Assembler Ð Debuggers: PMON-i386 DX, Y Very High Performance 16-Bit Data Bus ICETM-i386 SX Ð 20 MHz Clock Ð Extensive Third-Party Support: C, Ð Two-Clock Bus Cycles Pascal, FORTRAN, BASIC, Ada*** on Ð 20 Megabytes/Sec Bus Bandwidth VAX, UNIX**, MS-DOS*, and Other Ð Address Pipelining Allows Use of Hosts Slower/Cheaper Memories Y High Speed CHMOS IV Technology Y Integrated Memory Management Unit Ð Virtual Memory Support Y 88-Lead Pin Grid Array Package Ð Optional On-Chip Paging (See Packaging Specification, Order Ý 231369) Ð 4 Levels of Hardware Enforced Y 100-Lead Plastic Flat Pack Package Protection Y Available in Four Product Grades: Ð MMU Fully Compatible with Those of Ð MIL-STD-883 (PGA), b55§Cto the M80286 and i386 DX CPUs a125§C(TC) Y Large Uniform Address Space Ð Military Temperature Only (PGA), Ð 16 Megabyte Physical b55§Ctoa125§C(TC) Ð 64 Terabyte Virtual Ð Extended Temperature (PGA), Ð 4 Gigabyte Maximum Segment Size b40§Ctoa110§C(TC) Ð Extended Temperature (PQFP), b20§Ctoa100§C(TC) The Military i386 SX Microprocessor is a 32-bit CPU with a 16-bit external data bus and a 24-bit external address bus.
    [Show full text]
  • Computer Architectures an Overview
    Computer Architectures An Overview PDF generated using the open source mwlib toolkit. See http://code.pediapress.com/ for more information. PDF generated at: Sat, 25 Feb 2012 22:35:32 UTC Contents Articles Microarchitecture 1 x86 7 PowerPC 23 IBM POWER 33 MIPS architecture 39 SPARC 57 ARM architecture 65 DEC Alpha 80 AlphaStation 92 AlphaServer 95 Very long instruction word 103 Instruction-level parallelism 107 Explicitly parallel instruction computing 108 References Article Sources and Contributors 111 Image Sources, Licenses and Contributors 113 Article Licenses License 114 Microarchitecture 1 Microarchitecture In computer engineering, microarchitecture (sometimes abbreviated to µarch or uarch), also called computer organization, is the way a given instruction set architecture (ISA) is implemented on a processor. A given ISA may be implemented with different microarchitectures.[1] Implementations might vary due to different goals of a given design or due to shifts in technology.[2] Computer architecture is the combination of microarchitecture and instruction set design. Relation to instruction set architecture The ISA is roughly the same as the programming model of a processor as seen by an assembly language programmer or compiler writer. The ISA includes the execution model, processor registers, address and data formats among other things. The Intel Core microarchitecture microarchitecture includes the constituent parts of the processor and how these interconnect and interoperate to implement the ISA. The microarchitecture of a machine is usually represented as (more or less detailed) diagrams that describe the interconnections of the various microarchitectural elements of the machine, which may be everything from single gates and registers, to complete arithmetic logic units (ALU)s and even larger elements.
    [Show full text]
  • Intel® Itanium™ Processor- Specific Application Binary Interface (ABI)
    Intel® Itanium™ Processor- specific Application Binary Interface (ABI) May 2001 Document Number: 245370-003 Information in this document is provided in connection with Intel® products. No license, express or implied, by estoppel or otherwise, to any intellectual property rights is granted by this document. Except as provided in Intel's Terms and Conditions of Sale for such products, Intel assumes no liability whatsoever, and Intel disclaims any express or implied warranty, relating to sale and/or use of Intel products including liability or warranties relating to fitness for a particular purpose, merchantability, or infringement of any patent, copyright or other intellectual property right. Intel products are not intended for use in medical, life saving, or life sustaining applications. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked “reserved” or “undefined.” Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The Itanium processor may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order. Copies of documents which have an order number and are referenced in this document, or other Intel literature, may be obtained by calling 1-800-548-4725, or by visiting Intel’s website at http://developer.intel.com/design/litcentr.
    [Show full text]
  • Evaluation of X32 ABI for Virtualization and Cloud
    Evaluation of X32 ABI for Virtualization and Cloud Jun Nakajima Intel Corporation 1 Wednesday, August 29, 12 1 Agenda • What is X32 ABI? • Benefits for Virtualization & Cloud • Evaluation of X32 • Summary 2 Wednesday, August 29, 12 2 x32 ABI Basics • x86-64 ABI but with 32-bit longs and pointers • 64-bit arithmetic • Fully utilize modern x86 • 8 additional integer registers (16 total) • 8 additional SSE registers • SSE for FP math • 64-bit kernel is required - Linux kernel 3.4 has support for x32 Same memory footprint as x86 with advantages of x86-64. No hardware changes are required. 3 Wednesday, August 29, 12 3 ABI Comparison i386 x86-64 x32 Integer registers 6 15 15 FP registers 8 16 16 Pointers 4 bytes 8 bytes 4 bytes 64-bit arithmetic No Yes Yes Floating point x87 SSE SSE Calling convention Memory Registers Registers PIC prologue 2-3 insn None None 4 Wednesday, August 29, 12 4 The x32 Performance Advantage • In the order of Expected Contribution: - Efficient Position Independent Code - Efficient Function Parameter Passing - Efficient 64-bit Arithmetic - Efficient Floating Point Operations X32 is expected to give a 10-20% performance boost for C, C++ 5 Wednesday, August 29, 12 5 Efficient Position Independent Code extern int x, y, z; void foo () { z = x * y; } i386 psABI x32 psABI call __i686.get_pc_thunk.cx movl x@GOTPCREL(%rip), %edx addl $_GLOBAL_OFFSET_TABLE_, %ecx movl y@GOTPCREL(%rip), %eax movl!y@GOT(%ecx), %eax movl (%rax), %rax movl!x@GOT(%ecx), %edx imull (%rdx), %rax movl!(%eax), %eax movl z@GOTPCREL(%rip), %edx imull!(%edx),
    [Show full text]
  • Intel® FPGA Software Installation and Licensing
    Intel® FPGA Software Installation and Licensing Updated for Intel® Quartus® Prime Design Suite: 21.2 Subscribe MNL-1065 | 2021.06.21 Send Feedback Latest document on the web: PDF | HTML Contents Contents 1. Introduction to Intel® FPGA Software Installation and Licensing................................... 4 1.1. Intel FPGA Download Center................................................................................... 4 1.2. Intel FPGA Self-Service Licensing Center...................................................................4 2. System Requirements and Prerequisites.........................................................................5 2.1. Minimum Hardware Requirements............................................................................5 2.2. Cable and Port Requirements.................................................................................. 5 2.3. Software Requirements.......................................................................................... 6 2.3.1. Installing Windows Subsystem for Linux* (WSL) on Windows.......................... 7 2.3.2. Intel High Level Synthesis Compiler Software Requirements............................ 7 2.3.3. ModelSim - Intel FPGA Edition Software Requirements....................................8 2.4. Interaction with Third-party Software....................................................................... 8 3. Downloading and Installing Intel FPGA Software........................................................... 9 3.1. Software Available in the Download Center................................................................9
    [Show full text]
  • Lecture Notes in Assembly Language Piotr Fulmański
    Uniwersytet Łódzki Wydział Matematyki i Informatyki Informatyka Lecture Notes in Assembly Language Short introduction to low-level programming Piotr Fulmański Łódź, 2013 Spis treści Spis treści iii 1 Before we begin1 1.1 Simple assembler.................................... 1 1.1.1 Excercise 1 ................................... 2 1.1.2 Excercise 2 ................................... 2 1.1.3 Excercise 3 ................................... 3 1.1.4 Excercise 4 ................................... 5 1.2 Improvements, part I.................................. 6 1.2.1 Excercise 5 ................................... 9 1.3 Improvements, part II ................................. 9 1.3.1 Solution 5.2.2 – bad second approach..................... 14 1.4 Improvements, part III................................. 16 1.4.1 Excercise 6 ................................... 17 1.5 Improvements, part IV................................. 19 1.5.1 Excercise 6 – second approach ........................ 19 1.5.2 Excercise 7 ................................... 19 1.5.3 Excercise 8 ................................... 20 1.6 Improvements, part V ................................. 20 1.6.1 Excercise 9 ................................... 20 1.6.2 Excercise 10................................... 21 1.7 Other excercises .................................... 21 1.7.1 Excercise 11................................... 21 1.7.2 Excercise x ................................... 22 iii iv SPIS TREŚCI 1.7.3 Excercise x ................................... 22 1.7.4 Excercise x ..................................
    [Show full text]
  • Intel Processor Identification and the CPUID Instruction
    E AP-485 APPLICATION NOTE Intel Processor Identification and the CPUID Instruction January 1999 Order Number : 241618-012 1/14/99 2:13 PM CPUID INTEL CONFIDENTIAL (until publication date) Information in this document is provided in connection with Intel products. No license, express or implied, by estoppel or otherwise, to any intellectual property rights is granted by this document. Except as provided in Intel’s Terms and Conditions of Sale for such products, Intel assumes no liability whatsoever, and Intel disclaims any express or implied warranty, relating to sale and/or use of Intel products including liability or warranties relating to fitness for a particular purpose, merchantability, or infringement of any patent, copyright or other intellectual property right. Intel products are not intended for use in medical, life saving, or life sustaining applications. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked “reserved” or “undefined.” Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. Intel’s Intel Architecture processors (e.g., Pentium® processor, Pentium processor with MMX™ technology, Pentium Pro processor, Pentium II processor, Pentium II Xeon™ processor, Intel Celeron™ processor, Pentium III processor and Pentium III Xeon processor) may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order.
    [Show full text]
  • Machine Checks on I386/X86-64 Andi Kleen, Suse Labs [email protected]
    Machine checks on i386/x86-64 Andi Kleen, SuSE Labs [email protected] What is a machine check? Hardware error Internal errors, Memory, Cache, IO, Busses But users have a hard time to recognize this Hardware is (mostly) error correcting 64bit is worse: more DIMMs, more bit flips Smaller/more transistors NMI, Thermal Can be panic or a logged event Talking only about x86-64 here, a bit i386 IA64, S390, PPC64 handle machine checks differently The OS is just the messenger Caused by hardware But always blamed on the software of course Sometimes drivers/BIOS can misprogram hardware Challenge is to explain to customers that their hardware is broken General classification Uncorrected Console log or a jpg Don't make it to disk Sometimes available in BIOS event log afterwards Sometimes can be recovered by kernel after reboot Run through mcelog --ascii first Corrected error but logged Logged in a binary log (/dev/mcelog) mcelog cronjob decodes them into /var/log/mcelog On i386 they just go to /var/log/messages Nothing really bad happened, but if it happens often hardware will likely fail A live example 2.4 AMD kernel: Northbridge Machine Check exception = b60ea00100000813 0 ecc error link number 0 err cpu1 uncorrected ecc error processor context corrupt error address valid error enable error uncorrected previous error lost error address 00000001826ac018 Address: 00000001826ac018 CPU 1: Machine Check Exception: 0000000000000000 Kernel panic: Unable to continue Another example 2.6 AMD fatal HARDWARE ERROR CPU 0: Machine Check Exception: 4 Bank 4: b605200100000813 TSC 24291bbb8104 ADDR fbf2c068 This is not a software problem! Run through mcelog --ascii to decode and contact your hardware vendor Kernel panic - not syncing: Machine check Intel example 2.6 Fatal CPU 1: Machine Check Exception: 0000000000000004 Bank 1: b200000000000115 Kernel panic: CPU context corrupt Another Intel example Non fatal mcelog entry STATUS 9000020110800e0f MCGSTATUS 2 MCE 0 HARDWARE ERROR.
    [Show full text]