Shared Memory Communication

Total Page:16

File Type:pdf, Size:1020Kb

Shared Memory Communication Design and Implementation of a Multi-purpose Cluster System Network Interface Unit by Boon Seong Ang Submitted to the Department of Electrical Engineering and Computer Science in partial fulfillment of the requirements for the degree of Doctor of Philosophy at the MASSACHUSETTS INSTITUTE OF TECHNOLOGY February 1999 © 1999 Massachusetts Institute of Technology. All rights reser MASSACHUS OF TEC Author............. ..... V.. .4....... Department of Electrical Engineerin Ind Computer c February 23, 1999 Certified by................ Arvind Johnson Professor of Computer Science /42 Thesis Supervisor Certified by.. ................. Larry Rudolph Principal Research Scientist Th sis Supervisor Accepted by............ A. C. Smith Chairman, Departmental Committee on Graduate Students Design and Implementation of a Multi-purpose Cluster System Network Interface Unit by Boon Seong Ang Submitted to the Department of Electrical Engineering and Computer Science on February 23, 1999, in partial fulfillment of the requirements for the degree of Doctor of Philosophy Abstract Today, the interface between a high speed network and a high performance com- putation node is the least mature hardware technology in scalable general purpose cluster computing. Currently, the one-interface-fits-all philosophy prevails. This ap- proach performs poorly in some cases because of the complexity of modern memory hierarchy and the wide range of communication sizes and patterns. Today's mes- sage passing NIU's are also unable to utilize the best data transfer and coordination mechanisms due to poor integration into the computation node's memory hierarchy. These shortcomings unnecessarily constrain the performance of cluster systems. Our thesis is that a cluster system NIU should support multiple communica- tion interfaces layered on a virtual message queue substrate in order to streamline data movement both within each node as well as between nodes. The NIU should be tightly integrated into the computation node's memory hierarchy via the cache- coherent snoopy system bus so as to gain access to a rich set of data movement operations. We further propose to achieve the goal of a large set of high performance communication functions with a hybrid NIU micro-architecture that combines custom hardware building blocks with an off-the-shelf embedded processor. These ideas are tested through the design and implementation of the StarT- Voyager NES, an NIU used to connect a cluster of commercial PowerPC based SMP's. Our prototype demonstrates that it is feasible to implement a multi-interface NIU at reasonable hardware cost. This is achieved by reusing a set of basic hardware building blocks and adopting a layered architecture that separates protected network sharing from software visible communication interfaces. Through different mechanisms, our 35MHz NIU (140MHz processor core) can deliver very low latency for very short messages (under 2ps), very high bandwidth for multi-kilobyte block transfers (167 MBytes/s bi-directional bandwidth), and very low processor overhead for multi-cast communication (each additional destination after the first incurs 10 processor clocks). We introduce the novel idea of supporting a large number of virtual message queues through a combination of hardware Resident message queues and firmware emulated Non-resident message queues. By using the Resident queues as firmware controlled caches, our implementation delivers hardware speed on the average while providing graceful degradation in a low cost implementation. Finally, we also demonstrate that an off-the-shelf embedded processor comple- ments custom hardware in the NIU, with the former providing flexibility and the latter performance. We identify the interface between the embedded processor and custom hardware as a critical design component and propose a command and com- pletion queue interface to improve the performance and reduce the complexity of embedded firmware. Thesis Supervisor: Arvind Title: Johnson Professor of Computer Science Thesis Supervisor: Larry Rudolph Title: Principal Research Scientist Acknowledgments This dissertation would not have been possible without the encouragement, support, patience and cooperation of many people. Although no words can adequately express my gratitude, an acknowledgement is the least I can do. First and foremost, I want to thank my wife, Wee Lee, and our families for standing by me all these years. They gave me the latitude to seek my calling, were patient as the years passed but I was no closer to enlightenment, and provided me a sanctuary to retreat to whenever my marathon-like graduate school career wore me thin. To you all, my eternity gratitude. I am greatly indebted to my advisors, Arvind and Larry, for their faith in my abilities and for standing by me throughout my long graduate student career. They gave me the opportunity to co-lead a large systems project, an experience which greatly enriched my systems building skills. To Larry, I want to express my gratitude for all the fatherly/brotherly advice and the cheering sessions in the last leg of my graduate school apprentice-ship. I would also like to thank the other members of my thesis committee: Frans and Anant, for helping to refine this work. I want to thank Derek Chiou for our partnership through graduate school, working together on Monsoon, StarT, StarT-NG and StarT-Voyager. I greatly enjoy bringing vague ideas to you, and jointly developing them into well thought out solutions. This work on StarT-Voyager NES is as much yours as it is mine. Thank you too for the encouragement and counselling you gave me all these years. The graduate students and staff in Computation Structures Group gave me a home away from home. Derek Chiou, Alex Caro, Andy Boughton, James Hoe, RPaul Johnson, Andy Shaw, Shail Aditya Gupta, Xiaowei Shen, Mike Ehrlich, Dan Rosen- band and Jan Maessen, thank you for the company in this long pilgrimage through graduate school. It was a pleasure working with all of you bright, hardworking, de- voted and, at one time, idealistic people. I am also indebted to many of you, especially Derek and Alex, for coming to my rescue whenever I painted myself into a corner. In my final years at MIT, I had the pleasure of working with the StarT-Voyager 5 team: Derek, Mike, Dan, Andy Boughton, Jack Constanza, Brad Bartley, and Wing- chung Ho. It was a tremendous education experience laboring along-side the other talented team members, especially Derek, Mike and Dan. I also want to thank our friends at IBM: Marc, Pratap, Eknath, Beng Hong, and Alan, for assisting us in the StarT-Voyager project. To all of you I mentioned above, and many others that I missed, thank you for the lessons and memories of these years. 6 Contents 1 Introduction 13 1.1 M otivation ... ... .. ... ... ... .. .. ...... 14 1.2 Proposed Cluster Communication Architecture . ... .. ... 17 1.3 A Flexible Network Interface Unit (NIU) Design .. ... ... 19 1.4 Related Work .. .. ... ... .. ... .. .. .. ... ... 22 1.5 Contributions ... ... .. ... .... ... ... ..... 25 1.6 Road M ap .. ... ... ... ... ... ... ... ..... 27 2 Design Requirements 29 2.1 Message Passing Communication . ..... 30 2.1.1 Channel vs Queues-and-network Model . ......... ... 30 2.1.2 Message Content Specification . ... ... ... ... ... 3 2 2.1.3 Message Buffer Management . .... .. ... ... ... .. 3 2 2.1.4 Message Reception Method .... .. ...... ....... 3 5 2.1.5 Communication Service Guarantee .. ..... ........ 36 2.1.6 Synchronization Semantics . .... .. ..... ........ 3 7 2.1.7 Sum m ary .. .... .... ... ... ............. 38 2.2 Shared Memory Communication .. ..... ............. 3 8 2.2.1 Caching and Coherence Granularity .. ............. 39 2.2.2 Memory Model ..... .... .... .. .... .... ... 4 0 2.2.3 Invalidate vs Update .. ....... ... ... ... ... 4 1 2.2.4 CC-NUMA and S-COMA ... .... .. ... ... ... .. 4 2 2.2.5 Atomic Access and Operation-aware Protocol 44 7 2.2.6 Performance Enhancement Hints ... .... .... ... 4 5 2.2.7 Summary ..... .... ... ... .. ... ... .. ... 4 5 2.3 System Requirements . ...... ... ... ... ... ... ... 4 5 2.3.1 Multi-tasking Model ... ... ... .... ... ... ... 4 6 2.3.2 Network Sharing Model ... ... .... ... ... .... 4 7 2.3.3 Fault Isolation . ...... .. ... .. ... ... .. ... .. 5 0 2.4 SMP Host System Restrictions .. .. ... .... .... .... ... 5 0 2.5 NIU Functionalities ........ .... .... .... .... ... 5 2 2.5.1 Interface to Host .... .... .... .... .... ... 5 2 2.5.2 Interface to Network ... .. .... .... .... ... ... 5 4 2.5.3 Data Path and Buffering . .. .... ... ... .... ... 5 4 2.5.4 Data Transport Reliability and Ordering .... ..... ... 55 2.5.5 Unilateral Remote Action . .... ... ... ... ... ... 5 6 2.5.6 Cache Protocol Engine ... ... ... ... ... .. ... 5 6 2.5.7 Home Protocol Engine ... ... .... ... ... ... 5 8 3 A Layered Network Interface Macro-architecture 61 3.1 Physical Network Layer .. ............. .. ... 62 3.1.1 Two Independent Networks ......... .. .. .. 63 3.1.2 Reliable Delivery Option .. ........ .. .. 63 3.1.3 Ordered Delivery Option and Ordering-set Concept . .. .. 64 3.1.4 Bounded Outstanding Packet Count .. .. ... .. .. 65 3.1.5 Programmable Send-rate Limiter ...... .. ... .. 65 3.2 Virtual Queues Layer . ...... ...... .... .. ... .. 66 3.2.1 Virtual Queue Names and Translation . .. .. .. .. 68 3.2.2 Dynamic Destination Buffer Allocation . .. .. .. 69 3.2.3 Reactive Flow-control ............. .. .. ... 72 3.2.4 Decoupled Process Scheduling and Message Queue Activity .. 75 3.2.5 Transparent Process
Recommended publications
  • Interconnect Solutions Short Form Catalog
    Interconnect Solutions Short Form Catalog How to Search this Catalog This digital catalog provides you with three quick ways to find the products and information you are looking for. Just point and click on the bookmarks to the left, the linked images on the next page or the labeled sections of the table of contents. You can also use the “search” function built into Adobe Acrobat to jump directly to any text reference in this document. Acrobat “Search” function instructions: 1. Press CONTROL + F 2. When the dialog box appears, type in the word or words you are looking for and press ENTER. 3. Depending on your version of Acrobat, it will either take you directly to the first instance found, or display a list of pages where the text can be found. In the latter, click on the link to the pages provided. Interconnect Solutions Short Form Catalog Complete Solutions for the Electronics Industry 3M Electronics offers a comprehensive range of Interconnect Solutions for the electronics industry with a product portfolio that includes connectors, cables, cable assemblies and assembly tooling for a wide variety of applications. 3M is dedicated to innovation, continually developing new products that become an important part of everyday life across many diverse markets. A number of 3M solution categories are based on custom-designed products for specialized applications. 3M Electronics can help you design, modify and customize your product as well as help you to seamlessly integrate our products into your manufacturing process on a global basis. RoHS Compliant Statement “RoHS compliant” means that the product or part does not contain any of the following substances in excess of the following maximum concentration values in any homogeneous material, unless the substance is in an application that is exempt under RoHS: (a) 0.1% (by weight) for lead, mercury, hexavalent chromium, polybrominated biphenyls or polybrominated diphenyl ethers; or (b) 0.01% (by weight) for cadmium.
    [Show full text]
  • Publication Title 1-1962
    publication_title print_identifier online_identifier publisher_name date_monograph_published_print 1-1962 - AIEE General Principles Upon Which Temperature 978-1-5044-0149-4 IEEE 1962 Limits Are Based in the rating of Electric Equipment 1-1969 - IEEE General Priniciples for Temperature Limits in the 978-1-5044-0150-0 IEEE 1968 Rating of Electric Equipment 1-1986 - IEEE Standard General Principles for Temperature Limits in the Rating of Electric Equipment and for the 978-0-7381-2985-3 IEEE 1986 Evaluation of Electrical Insulation 1-2000 - IEEE Recommended Practice - General Principles for Temperature Limits in the Rating of Electrical Equipment and 978-0-7381-2717-0 IEEE 2001 for the Evaluation of Electrical Insulation 100-2000 - The Authoritative Dictionary of IEEE Standards 978-0-7381-2601-2 IEEE 2000 Terms, Seventh Edition 1000-1987 - An American National Standard IEEE Standard for 0-7381-4593-9 IEEE 1988 Mechanical Core Specifications for Microcomputers 1000-1987 - IEEE Standard for an 8-Bit Backplane Interface: 978-0-7381-2756-9 IEEE 1988 STEbus 1001-1988 - IEEE Guide for Interfacing Dispersed Storage and 0-7381-4134-8 IEEE 1989 Generation Facilities With Electric Utility Systems 1002-1987 - IEEE Standard Taxonomy for Software Engineering 0-7381-0399-3 IEEE 1987 Standards 1003.0-1995 - Guide to the POSIX(R) Open System 978-0-7381-3138-2 IEEE 1994 Environment (OSE) 1003.1, 2004 Edition - IEEE Standard for Information Technology - Portable Operating System Interface (POSIX(R)) - 978-0-7381-4040-7 IEEE 2004 Base Definitions 1003.1, 2013
    [Show full text]
  • Opening Plenary March 2021
    Opening Plenary March 2021 Glenn Parsons – IEEE 802.1 WG Chair [email protected] 802.1 plenary agenda Monday, March 8th opening Tuesday, March 16th closing • Copyright Policy • Copyright Policy • Call for Patents • Call for Patents • Participant behavior • Participant behavior • Administrative • Membership status • Membership status • Future Sessions • Future Sessions • Sanity check – current projects • 802 EC report • TG reports • Sanity check – current projects • Outgoing Liaisons • Incoming Liaisons • Motions for EC • TG agendas • Motions for 802.1 • Any other business • Any other business 2 INSTRUCTIONS FOR CHAIRS OF STANDARDS DEVELOPMENT ACTIVITIES At the beginning of each standards development meeting the chair or a designee is to: .Show the following slides (or provide them beforehand) .Advise the standards development group participants that: .IEEE SA’s copyright policy is described in Clause 7 of the IEEE SA Standards Board Bylaws and Clause 6.1 of the IEEE SA Standards Board Operations Manual; .Any material submitted during standards development, whether verbal, recorded, or in written form, is a Contribution and shall comply with the IEEE SA Copyright Policy; .Instruct the Secretary to record in the minutes of the relevant meeting: .That the foregoing information was provided and that the copyright slides were shown (or provided beforehand). .Ask participants to register attendance in IMAT: https://imat.ieee.org 3 IEEE SA COPYRIGHT POLICY By participating in this activity, you agree to comply with the IEEE Code of Ethics, all applicable laws, and all IEEE policies and procedures including, but not limited to, the IEEE SA Copyright Policy. .Previously Published material (copyright assertion indicated) shall not be presented/submitted to the Working Group nor incorporated into a Working Group draft unless permission is granted.
    [Show full text]
  • Architectural Support for Real-Time Computing Using Generalized Rate Monotonic Theory
    744 計 測 と 制 御 シ テ ア Vol.31, No.7 集 ル タ イ ム 分 ス ム 特 リ 散 (1992年7月) ≪展 望 ≫ Architectural Support for Real-Time Computing using Generalized Rate Monotonic Theory Lui SHA* and Shirish S. SATHAYE** Abstract The rate monotonic theory and its generalizations have been adopted by national high tech- nology projects such as the Space Station and has recently been supported by major open stand- ards such as the IEEE Futurebus+ and POSIX. 4. In this paper, we focus on the architectural support necessary for scheduling activities using the generalized rate monotonic theory. We briefly review the theory and provide an application example. Finally we describe the architec- tural requirements for the use of the theory. Key Words: Real-Time Scheduling, Distributed Real-Time System, Rate Monotonic scheduling ● Stability under transient overload. When the 1. Introduction system is overloaded by events and it is im- Real-time computing systems are critical to an possible to meet all the deadlines, we must industrialized nation's technological infrastructure. still guarantee the deadines of selected critical Modern telecommunication systems, factories, de- tasks. fense systems, aircrafts and airports, space stations Generalized rate monotonic scheduling (GRMS) and high energy physics experiments cannot op- theory allows system developers to meet the erate .without them. In real-time applications, above requirements by managing system concur- the correctness of computation depends upon not rency and timing constraints at the level of task- only its results but also the time at which outputs ing and message passing. In essence, this theory are generated.
    [Show full text]
  • Bi-Directional Optical Backplane Bus for General Purpose Multi-Processor B Oard-To-B Oard Optoelectronic Interconnects
    JOURNAL OF LIGHTWAVE TECHNOLOGY, VOL. 13, NO. 6, JUNE 1995 1031 Bi-Directional Optical Backplane Bus for General Purpose Multi-Processor B oard-to-B oard Optoelectronic Interconnects Srikanth Natarajan, Chunhe Zhao, and Ray. T. Chen Absfract- We report for the first time a bidirectional opti- cal backplane bus for a high performance system containing nine multi-chip module (MCM) boards, operating at 632.8 and 1300 nm. The backplane bus reported here employs arrays of multiplexed polymer-based waveguide holograms in conjunction with a waveguiding plate, within which 16 substrate guided waves for 72 (8 x 9) cascaded fanouts, are generated. Data transfer of 1.2 GbUs at 1.3-pm wavelength is demonstrated for a single bus line with 72 cascaded fanouts. Packaging-related issues such as Waveguiding Plate transceiver size and misalignment are embarked upon to provide n a reliable system with a wide bandwidth coverage. Theoretical U hocessor/Memory Board treatment to minimize intensity fluctuations among the nine modules in both directions is further presented and an optimum I High-speed Optoelectronic Transceiver design rule is provided. The backplane bus demonstrated, is for general-purpose and therefore compatible with such IEEE stan- - Waveguide Hologram For Bi-Directional Coupling dardized buses as VMEbus, Futurebus and FASTBUS, and can Fig. 1. Optical equivalent of a section of a single bidirectional electronic function as a backplane bus in existing computing environments. bus line. I. INTRODUCTION needed to preserve the rising and falling edges of the signals HE LIMITATIONS of current computer backplane buses increases. This makes using bulky, expensive, terminated Tstem from their purely electronic interconnects.
    [Show full text]
  • BUS and CACHE MEMORY ORGANIZATIONS for MULTIPROCESSORS By
    BUS AND CACHE MEMORY ORGANIZATIONS FOR MULTIPROCESSORS by Donald Charles Winsor A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy (Electrical Engineering) in The University of Michigan 1989 Doctoral Committee: Associate Professor Trevor N. Mudge, Chairman Professor Daniel E. Atkins Professor John P. Hayes Professor James O. Wilkes ABSTRACT BUS AND CACHE MEMORY ORGANIZATIONS FOR MULTIPROCESSORS by Donald Charles Winsor Chairman: Trevor Mudge The single shared bus multiprocessor has been the most commercially successful multiprocessor system design up to this time, largely because it permits the implementation of efficient hardware mechanisms to enforce cache consistency. Electrical loading problems and restricted bandwidth of the shared bus have been the most limiting factors in these systems. This dissertation presents designs for logical buses constructed from a hierarchy of physical buses that will allow snooping cache protocols to be used without the electrical loading problems that result from attaching all processors to a single bus. A new bus bandwidth model is developed that considers the effects of electrical loading of the bus as a function of the number of processors, allowing optimal bus configurations to be determined. Trace driven simulations show that the performance estimates obtained from this bus model agree closely with the performance that can be expected when running a realistic multiprogramming workload in which each processor runs an independent task. The model is also used with a parallel program workload to investigate its accuracy when the processors do not operate independently. This is found to produce large errors in the mean service time estimate, but still gives reasonably accurate estimates for the bus utilization.
    [Show full text]
  • COVID-19 – Emergency Management Tips and Practices for Bus Transit Systems
    COVID-19 – Emergency Management Tips and Practices for Bus Transit Systems Prepared for: Florida Department of Transportation Office of Freight, Logistics, and Passenger Operations Prepared by: Center for Urban Transportation Research University of South Florida Published: April 1, 2020 Revision: March 10, 2021 Revision: March 10, 2021 Table of Contents List of Figures ..................................................................................................................................................... iii Emergency Management Tips and Practices for Transit .................................................................................... 1 Supplies ........................................................................................................................................................... 1 Hazard Risk Analysis ....................................................................................................................................... 1 Center for Disease Control (CDC) Guidance ................................................................................................... 2 U.S. Department of Transportation’s Federal Transit Administration (FTA) .................................................. 4 Federal Mask Requirement for Surface Transportation Providers ............................................................ 5 Additional Resources .................................................................................................................................. 5 Transportation Research
    [Show full text]
  • 3M Interconnect Short Form Catalog
    3M Interconnect Products Short Form Catalog Comprehensive Solutions for Reliable Connections Complete Solutions for the Electronics Industry 3M Interconnect Solutions offers a comprehensive range of Interconnect Solutions for the electronics industry, with a product portfolio that includes connectors, cables, cable assemblies and assembly tooling for a wide variety of applications. 3M is dedicated to innovation, continually developing new products that become an important part of everyday life across many diverse markets. A number of 3M solution categories are based on custom-designed products for specialized applications. 3M Interconnect Solutions can help you design, modify and customize your product as well as help you seamlessly integrate our products into your manufacturing process on a global basis. Design, modify and customize your product 2 Contents Backplane Connector Solutions 4 Boardmount Connector Solutions 6 Wiremount Connectors and Assembly Solutions 9 Device-to-Board Connector Solutions 13 Input/Output Connector Solutions 16 Cable Solutions 24 Cable Assembly Solutions — High Speed Assemblies 26 3 3M™ BACKPLANE CONNECTOR SOLUTIONS PRODUCT FEATURES MATERIALS PERFORMANCE • RoHS Compliant Version Available Contact Current Rating: 3.0 A @ 30°C DIN Connectors • DIN 41612 Material: Copper Alloy T-rise Above Ambient • C-Form Plug and Socket Insulation Temperature Rating: -55°C to +125°C • C-Form Wiremount Plugs Material: Engineering Thermoplastic and Sockets Flammability UL 94V-0 • R-Form Plug and Socket Plating • Q-Form Plug and
    [Show full text]
  • TIM Specification
    TIM-40 MODULE SPECIFICATION Including TMS320C44 Addendum VERSION 1.12. 1 Table of Contents TABLE OF CONTENTS ............................................................................................................................................................... 2 1.THE MODULE CONCEPT........................................................................................................................................................ 4 1.1 INTRODUCTION.................................................................................................................................................................... 4 1.2 ORIGINS OF THE MODULE..................................................................................................................................................... 4 1.3 MODULE CONCEPT............................................................................................................................................................... 4 1.4 MOTHERBOARD CONCEPT .................................................................................................................................................... 4 1.5 SUMMARY ........................................................................................................................................................................... 4 2. PHYSICAL SPECIFICATIONS ............................................................................................................................................. 5 2.1 INTRODUCTION...................................................................................................................................................................
    [Show full text]
  • NIM Standard E Of1.35Inches(=“Singlewidt (Looking Intobin) Bin Connectorfront Methods
    NIM Standard NIM is an acronym for Nuclear Instrumentation Methods. The NIM standard (DOE/ER-0457) was established in 1964 for the nuclear and high energy physics communities. The goal of NIM was to promote the interchangeability of instrumentation modules. Standard NIM modules are 8.75 inches G. Varner – PHYS476 tall and a width which is a multiple of 1.35 inches (= “single width”; “double width” = 2.7 inches) Example: extended pin set Bin connector front (looking into bin) Module Bin CAMAC Standard CAMAC is also an acronym: Computer Automated Measurement and Control. The CAMAC standard (IEEE 583) was established in 1975 and has been used in virtually every physics laboratory and many industrial applications. While other modular instrumentation standards have been widely adopted, the G. Varner – PHYS476 robust CAMAC module and enclosure construction and simple hardware level interface means CAMAC is still a viable choice for instrument applications today. Separate 24-bit Read/Write Busses Simple FNAD commanding: e.g. F = 0 (read) N = 1 (crate slot) A = card channel # D = data (either) Z = initialize (reset) X = I’m responding Q = zero suppress L = LAM (Look-At-Me) LeCroy 3377 500ps multi-hit TDC FASTBUS Standard FASTBUS is a standard largely exclusive to High Energy Physics. Detailed in ANSI/IEEE 960 and IEC-547, it is designed for VERY high power operation (many kW per crate – from the age of ECL) G. Varner – PHYS476 Hundreds of pins, Impossible to detail here and it is likely you will never need to know this. Mechanics are frightening: bed of nails in back.
    [Show full text]
  • Cache Attacks on ARM
    Moritz Lipp Cache Attacks on ARM Master thesis Graz, University Of Technology NOTICE If you want to cite this work based on last-level cache attacks on ARM, please cite the following research paper instead: •L ipp, Moritz ; Gruss, Daniel ; Spreitzer, Raphael ; Maurice, Clementine´ ;Mangard, Stefan: ARMageddon: Cache Attacks on Mobile De- vices. In: 25th USENIX Security Symposium (USENIX Security 16). Austin, TX : USENIX Association, August 2016. – ISBN 978–1–931971– 32–4, 549–564 If you want to cite this work in respect to rowhammer on ARM-based devices, please cite the following publication instead: •V een, Victor van d. ; Fratantonio, Yanick ; Lindorfer, Martina ; Gruss, Daniel ; Maurice, Clementine´ ; Vigna, Giovanni ; Bos, Her- bert ; Razavi, Kaveh ; Giuffrida, Christiano: Drammer : Determinis- tic Rowhammer Attacks on Commodity Mobile Platforms. In: ACM Conference on Computer and Communications Security – CCS, 2016 Contents 1 Introduction 1 1.1 Motivation . .3 1.2 Key Challenges and Results . .4 1.3 Contributions . .5 1.4 Test devices . .6 1.5 Outline . .7 2 Background 9 2.1 CPU caches . .9 2.2 Cache coherence . 18 2.3 Shared memory . 29 2.4 Cache Attacks . 31 2.5 DRAM . 39 3 Attack primitives 43 3.1 Timing Measurements . 43 3.2 Cache Eviction . 47 3.3 Defeating the Cache-Organization . 53 4 Attack Case Studies 57 4.1 High Performance Covert-Channels . 58 4.2 Spying on User input . 62 4.3 Attacks on Cryptographic Algorithms . 69 4.4 Rowhammer on ARM . 75 5 Countermeasures 79 6 Conclusion 81 i List of tables . 83 List of figures . 86 List of acronyms .
    [Show full text]
  • Firewire™ Reference Tutorial (An Informational Guide) January 22
    FireWire™ Reference Tutorial (An Informational Guide) January 22, 2010 Abstract This document is a FireWire tutorial aimed at engineers that have no previous exposure or understanding of FireWire. Its purpose is to familiarize the reader with FireWire and to build a sound and complete understanding of how the FireWire bus operates from the experiences of expert members of the 1394 Trade Association. 1394 Trade Association Specifications, Tutorials and Guides are developed within 1394 Trade Working Groups of the 1394 Trade Association, a non-profit industry association devoted Association to the promotion of and growth of the market for IEEE 1394-compliant products. Participants in Working Groups serve voluntarily and without compensation from the Specification Trade Association. Most participants represent member organizations of the 1394 Trade Association. The work product developed within these working groups represent a consensus of the expertise represented by the participants. Use of a 1394 Trade Association document is wholly voluntary. The existence of a 1394 Trade Association document is not meant to imply that there are not other ways to produce, test, measure, purchase, market or provide other goods and services related to the scope of the 1394 Trade Association Specification. Furthermore, the viewpoint expressed at the time a document is accepted and issued is subject to change brought about through developments in the state of the art and comments received from users of the specification. Users are cautioned to check to determine that they have the latest revision of any 1394 Trade Association document. Comments for revision of 1394 Trade Association Documents are welcome from any interested party, regardless of membership affiliation with the 1394 Trade Association.
    [Show full text]