Fully Buffered DIMM (FB-DIMM) Design Considerations

Total Page:16

File Type:pdf, Size:1020Kb

Fully Buffered DIMM (FB-DIMM) Design Considerations Fully Buffered DIMM (FB-DIMM) Design Considerations Howard David Memory Architect Michael McTague Staff Engineer Intel Corp. Feb 18, 2004 1 Agenda y FB-DIMM Architecture y FB-DIMM Channel Electrical Characteristics y FB-DIMM Physical Design y FB-DIMM System Design Considerations 2 Intel Platform Memory Technology Roadmap DDR3 DDR3 FB-DIMM DDR2 667/800 DDR2 DDR2 400/533 DDR DDR333/400 DDR266 RDRAM* PC1066 PC800 SDRAM PC133 2004 2005 20052006 y DDR2 400/533 support in all main IA segments in 2004 – with DDR flexibility y FB-DIMM new server interconnect in 2005 y Low Power SDRAM moving to low power DDR y RDRAM still used in specific applications *Other names and brands may be claimed as the property of others 3 FB-DIMM Architecture FB-DIMM Block Diagram y DRAM interface is entirely behind the buffer – DDR2 DRAM scales from 533 to 800 MT/s, with up to 8 DIMMs (288 devices) per channel Commodity DRAMDRAM DRAMDRAM DRAMDRAM DRAMs DRAMDRAM Differential Pairs DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM Up to 8 DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM 10 DIMMs Buffer Buffer Buffer ••• Buffer Memory 14 Controller DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM SMBus DRAMDRAM DRAMDRAM DRAMDRAM DRAMDRAM CLK Source 4 FB-DIMM Architecture FB-DIMM Module Block Diagram Pass-through Read Data & Merging Read Data (Primary Northbound) Logic (Secondary Northbound) Serializer AMB is the new Data Bus buffer Interface DRAM Data technology for 9 to 36 DRAM server memory De-serializer DRAM C/A Copy A & Decode Logic DRAM C/A Copy B Reference Clock Clocking DRAM Clocks Pass- Write Data and Command Write Data and Command through (Primary Southbound) (Secondary Southbound) Logic Advanced SPD Memory Buffer SMBus EEPROM (AMB) 5 FB-DIMM Channel Electrical Characteristics Signaling Characteristics y Differential point-to-point, 50 Ω termination y DC coupled, current mode driver y Derived clocking ( mesochronous clocking) y Transmitter de-emphasis to reduce ISI y 3.2 Gb/s, 4.0 Gb/s, and 4.8 Gb/s data rates Tx Rx De- emphasis Pre - Sync driver De-skew 50 Clock 50 Recovery PLL 50 50 PLL Reference CLK 6 FB-DIMM Channel Electrical Characteristics Differential Signaling Parameter Description Equation VDIFFp-p Differential Peak to 2*max[VD+ - VD-] Peak Voltage VCM-AC AC Common Mode (VD+ + VD-)/2 Voltage VCM-DC DC Common Mode AVG[(VD+ + VD-)/2] Voltage D- VD+ VCM-AC VD- D+ 7 FB-DIMM Channel Electrical Characteristics Tx Voltage Spec (Medium drive strength example) Parameter Description Min Max VTX-DIFF P-P-Min Differential Swing, 800 mv De-emphasis off VTX-DIFF P-P-Min, Differential swing, 505 mv 566 mv -3.5 Db De-emphasis on VTX-DIFF P-P-Min, Differential swing, 357 mv 450 mv -6.0 Db De-emphasis on VTX-DIFF P-P-Min, Differential swing, 239 mv 301 mv -9.0 Db De-emphasis on VTX-SE Single Ended Swing 700 mv 8 FB-DIMM Channel Electrical Characteristics Tx Timing Spec UI min spec D ata late Tx Total VTx diff min p-p min Data early Median to max Jitter Median Median Jitter Jitter UImin- Tx min- 2Txm edian to m ax jitter= 0 Transfer UI TTx-Total- TTX-Eye-Median-Max-Jitter Rate Nom min (ps) (Gb/s) (ps) (ps) 3.2 312.0 248.5 32.0 4.0 250.0 191.0 29.5 4.8 208.0 157.0 25.68 9 FB-DIMM Channel Electrical Characteristics Rx Spec D+,D- Crossing point D+,D- Crossing point 0.40 0.35 0.30 0.25 0.20 0.15 VRx-DIFFpp-Min 0.10 0.05 1 _ d2 n_ i _ c e 0.00 _r f f di _ d e e_r y -0.05 e -0.10 -0.15 -0.20 -0.25 TRx-Total-Min -0.30 -0.35 -0.40 0.0 40.0 80.0 120.0 160.0 200.0 240.0 280.0 312.5 time, psec •Measured •Simulated Data UI, Nom TRX-Total- TRX-EYE- VRX-DIFFp-p-MIN (mV) Rate (ps) Min (ps) Median-Max (Gb/s) Jitter (ps) 3.2 312.0 115 98.67 170 4.0 250.0 90 80.0 170 4.8 208.0 75 66.68 170 10 FB-DIMM Channel Electrical Characteristics Clock Spec y HCSL (High-Speed Current Steering Logic) clock y 0.7 v swing y 1/4th of the DDR2 DRAM frequency ( 133, 166, and 200 MHz) y Spread Spectrum Clock (SSC) with up to –0.5% down spread y Reference clock jitter specified to allow standard clock buffers. Z0d = 100 33 Ω Z = 100 Z = 100 < 0.2” 0d 0d < 0.2” 33 Ω < TBD” ~2.5” 0.5” DBxxx 50 Ω Clock 50 Ω AMB Driver FB-DIMM Connector 11 FB-DIMM Channel Electrical Characteristics Interconnect Modeling y Three (3) Topologies supported y Accurately models connection from Tx pin to Rx pin y Non-Interleaved routing with no bit to bit length matching required y Differential signaling design rules Tx Package Southbound Interconnect Network Rx Package (Memory Writes) + _ y Package Breakout +_ y Lossy Transmission Lines y Connectors y Vias y Sockets and test points Not Equivalent due to DIMM Routing and connector location 12 FB-DIMM Channel Electrical Characteristics FB-DIMM Topologies DIMMs on Motherboard FB-DIMM Memory Controller AMB Base Board FB-DIMM Connector 13 FB-DIMM Channel Electrical Characteristics FB-DIMM Topologies DIMMs on a Riser Card Memory Riser Board FB-DIMM Connector Memory Controller FB-DIMM Base Board AMB PCI Express* or pin and socket connector *Other names and brands may be claimed as the property of others. 14 FB-DIMM Channel Electrical Characteristics FB-DIMM Topologies DIMM to DIMM FB-DIMM FB-DIMM AMB AMB Base Board FB- DIMM connectors 15 FB-DIMM Channel Electrical Characteristics Interconnect Components y Package Models – Return Loss (<-10 Db) y PCB Model – Impedance, delay, and loss characteristics of traces – 3 pair, non-interleaved, cross talk model – VIA Models y Connector(s) – 3D Models validated using Measurements MC Vendor Model AMB Vendor Model W Element W Element +_ +_ +_ Package Network Package + Network _ +_ +_ 85 ohm Strip Line 6 x 50 Ohms 6 x 50 Ohms 85 ohm 85 ohm Strip Line FBD DDR2 Microstrip Breakout Vias Connector Model 16 FB-DIMM Channel Electrical Characteristics FB-DIMM Connector y Standard DDR2 Connector w/ FB-DIMM connector spec y Characterized up to 8 GHz y Q3D Spice Model based on FOXCONN* Connector Data • Less than 1 Db insertion loss. • Less than 3.2% Far end Cross talk • Less than 10 Db insertion loss *Other names and brands may be claimed as the property of others. 17 FB-DIMM Channel Electrical Characteristics Channel Design Rules y Channel impedance: 85 ohms , normal distribution y Microstrip trace geometry – w=6m, s= 4x h , tolerance = +/- 20%, etc. y Strip line trace geometry – w= 5 mil, s = 3x h, tolerance = +/-15%, etc y Asymmetric strip line geometry – w = 5 mil, s = 4x h , tolerance = +/-15%, etc, etc y Loss tangent 0.017 to 0.025, ε ~ 3.4 (FR4 materials) ef h2 s d εr2 t εr2 w h1 18 FB-DIMM Channel Electrical Characteristics Channel Validation Tx Package ...00110011... Tx Eye + 1. Tx Stand alone Test _ S pecification 2 x 50 ohms 2. Interconnect meets Rx Eye Spec Tx Package Worse Case Rx Eye ISI Pattern + Interconnect Network _ Specification 2 x 50 ohms 3. Rx Meets BER with Minimum Eye Tx Package Rx Package Worse Case Pattern Check ISI Pattern + Interconnect Network + _ _ or “On-die” Measurement FB-DIMM channel requires new memory design techniques 19 FB-DIMM Physical Design Mechanical Prototype 20 FB-DIMM Physical Design Physically Compatible Approach y FB-DIMM fits into existing system infrastructure 21 FB-DIMM Physical Design DIMM Mechanical Outline AMB on front Retention notches reduced to 2.5 mm Nominal height 8 DRAM on front, 10 DRAM on back New end notches Key moved 22 FB-DIMM Physical Design Advanced Memory Buffer (AMB) y New 0.8 mm ball pitch package – 24.5 mm x 19.5 mm x 2.15 mm – (29 columns x 23 rows) – 12 corner balls = 655 balls 1234567891011121314151617181920212223242526272829 TEST2 TEST4 A GND DQ26 DQ12 VDD DQS10 DQ13 VDD DQS01# DQ10 VDD VDD VDD VDD VDD DQ52 DQS15 VDD DQ49 DQS06# VDD DQ48 DQ38 VDD DDRCAL TEST3 TEST5 DDRCAL B VDD DQS03 DQS03# GND DQ14 DQS10# GND DQ11 DQS01 GND VDD GND VDD GND DQS15# DQ53 GND DQS06 DQ50 GND DQS13# DQS13 GND DDRCAL DDRCAL DDRCAL C GND DQS02 DQ18 GND DQ04 DQS09# GND DQ15 DQ09 GND DQ08 GND DQS17 DQS17# GND DQ54 GND DQ55 DQ51 GND DQS07 DQ56 GND DQ46 DQS14# VDD D DQ19 DQS02# GND DQ16 DQ24 GND DQS09 DQ07 GND DQ03 DQS00 GND DQS8# DQS8 VDD CB6 CB7 GND DQS16 DQ63 GND DQ59 DQS07# GND DQ36 DQ44 GND DQS14 DQ47 DRAM Mechanical E DQ21 GND DQ17 DQ29 GND DQ25 DQ06 GND DQ05 DQ01 GND DQ00 CB1 GND CB2 GND CB5 DQS16# GND DQ61 DQ57 GND DQ58 DQ39 GND DQ33 DQ45 GND DQ41 TEST0 TEST1 TEST6 TEST7 Data bus Balls F GND DQ20 DQ23 GND DQ31 DQ27 GND GND DQS00# DQ02 VDD CB0 CB3 CB4 VDD DQ62 DQ60 GND GND DQ37 DQ35 GND DQS05# DQ43 GND G DQS11# DQS11 mech mech mech GND DQS12 DQS12# mech mech mech TEST RFU3 RFU4 RFU5 TEST RFU6 RFU7 mech mech mech DQS04 DQS04# GND mech mech mech DQS05 DQ40 H DQ22 GND mech mech mech DQ28 DQ30 GND mech mech mech GND VDD GND VDD GND VDD GND mech mech mech GND DQ34 DQ32 mech mech mech GND DQ42 J GND CLK2 mech mech mech BA1A GND CKE1A mech mech mech VDD GND VDD GND VDD GND VDD mech mech mech RAS#B GND RFU1 mech mech mech CLK3# GND DRAM K CLK2# CLK0 mech mech mech GND WE#A RAS#A mech mech mech GND VCC GND VCC GND VCC GND mech mech mech ODT0B CS1#B GND mech mech mech CLK1# CLK3 C/A/Clk L CLK0# GND mech mech mech A0A CKE0A GND mech mech mech VCC GND VCC GND VCC GND VCC mech mech mech GND CAS#B WE#B mech mech mech GND CLK1 DRAM M ODT0A RFU0 mech mech mech CAS#A GND BA2A mech mech mech GND VCC GND VCC GND VCC GND mech mech mech CS0#B GND BA1B mech mech mech CKE0B GND copy “A” N CS1#A CS0#A mech mech mech GND BA0A A10A mech mech mech VCC GND VCC GND VCC GND VCC mech mech mech A0B A2B GND mech mech mech BA0B BA2B C/A/Clk P A6A GND mech mech mech A2A A1A A3A mech mech mech GND VCC GND VCC GND VCC GND mech mech mech GND A4B A1B mech mech mech
Recommended publications
  • Fully-Buffered DIMM Memory Architectures: Understanding Mechanisms, Overheads and Scaling
    Fully-Buffered DIMM Memory Architectures: Understanding Mechanisms, Overheads and Scaling Brinda Ganesh†, Aamer Jaleel‡, David Wang†, and Bruce Jacob† †University of Maryland, College Park, MD ‡VSSAD, Intel Corporation, Hudson, MA {brinda, blj}@eng.umd.edu Abstract Consequently the maximum number of DIMMs per channel in Performance gains in memory have traditionally been successive DRAM generations has been decreasing. SDRAM obtained by increasing memory bus widths and speeds. The channels have supported up to 8 DIMMs of memory, some diminishing returns of such techniques have led to the types of DDR channels support only 4 DIMMs, DDR2 channels proposal of an alternate architecture, the Fully-Buffered have but 2 DIMMs, and DDR3 channels are expected to support DIMM. This new standard replaces the conventional only a single DIMM. In addition the serpentine routing required for electrical path-length matching of the data wires becomes memory bus with a narrow, high-speed interface between the challenging as bus widths increase. For the bus widths of today, memory controller and the DIMMs. This paper examines the motherboard area occupied by a single channel is signifi- how traditional DDRx based memory controller policies for cant, complicating the task of adding capacity by increasing the scheduling and row buffer management perform on a Fully- number of channels. To address these scalability issues, an alter- Buffered DIMM memory architecture. The split-bus nate DRAM technology, the Fully Buffered Dual Inline Mem- architecture used by FBDIMM systems results in an average ory Module (FBDIMM) [2] has been introduced. improvement of 7% in latency and 10% in bandwidth at The FBDIMM memory architecture replaces the shared par- higher utilizations.
    [Show full text]
  • Architectural Techniques to Enhance DRAM Scaling
    Architectural Techniques to Enhance DRAM Scaling Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Electrical and Computer Engineering Yoongu Kim B.S., Electrical Engineering, Seoul National University Carnegie Mellon University Pittsburgh, PA June, 2015 Abstract For decades, main memory has enjoyed the continuous scaling of its physical substrate: DRAM (Dynamic Random Access Memory). But now, DRAM scaling has reached a thresh- old where DRAM cells cannot be made smaller without jeopardizing their robustness. This thesis identifies two specific challenges to DRAM scaling, and presents architectural tech- niques to overcome them. First, DRAM cells are becoming less reliable. As DRAM process technology scales down to smaller dimensions, it is more likely for DRAM cells to electrically interfere with each other’s operation. We confirm this by exposing the vulnerability of the latest DRAM chips to a reliability problem called disturbance errors. By reading repeatedly from the same cell in DRAM, we show that it is possible to corrupt the data stored in nearby cells. We demonstrate this phenomenon on Intel and AMD systems using a malicious program that generates many DRAM accesses. We provide an extensive characterization of the errors, as well as their behavior, using a custom-built testing platform. After examining various potential ways of addressing the problem, we propose a low-overhead solution that effec- tively prevents the errors through a collaborative effort between the DRAM chips and the DRAM controller. Second, DRAM cells are becoming slower due to worsening variation in DRAM process technology. To alleviate the latency bottleneck, we propose to unlock fine-grained paral- lelism within a DRAM chip so that many accesses can be served at the same time.
    [Show full text]
  • Fully Buffered Dimms for Server
    07137_Fully_Buffered_DIMM.qxd 22.03.2007 12:08 Uhr Seite 1 Product Information April 2007 Fully Buffered Complete in-house solution DIMM up to 8 GB DDR2 SDRAM DIMM With the ever increasing operation frequency the number of DIMM modules on a memory channel with the current parallel stub-bus interface has hit saturation * point. For server applications where lots of DRAM components are required, a new solution replacing the Registered DIMM modules for data rates of 533 Mb/s Advantages and above becomes necessary. s Maximum DRAM density per channel s World-class module assembly The FB-DIMM channel architecture introduces a new memory interconnect s Optimized heat sink solution s Power Saving Trench Technology – designed technology, which enables high memory capacity with high-speed DRAMs. The for optimized power consumption transition to serial point-to-point connections between the memory controller s Full JEDEC compliant and Intel validated and the first module on the channel, and between subsequent modules down the channel, makes the bus-loading independent from the DRAM input/output Availability (IO) speed. This enables high memory capacity with high-speed DRAMs. s DDR2-533, DDR2-667 and DDR2-800 s Densities from 512 MB up to 8 GB The FB-DIMM requires high-speed chip design and DRAM expertise. Qimonda Features is uniquely positioned to provide the entire skill set in-house, which clearly s Based on Qimonda’s 512 Mb and demonstrates Qimonda’s solution competence. Qimonda supplies FB-DIMMs in 1 Gb component two AMB sources with optimized heat sink design using power-saving DDR2 s Buffer with up to 6 Gb/s per pin s Complete data buffering on DIMM components.
    [Show full text]
  • SBA-7121M-T1 Blade Module User's Manual
    SBA-7121M-T1 Blade Module User’s Manual Revison 1.0c SBA-7121M-T1 Blade Module User’s Manual The information in this User’s Manual has been carefully reviewed and is believed to be accurate. The vendor assumes no responsibility for any inaccuracies that may be contained in this document, makes no commitment to update or to keep current the information in this manual, or to notify any person or organization of the updates. Please Note: For the most up-to-date version of this manual, please see our web site at www.supermicro.com. Super Micro Computer, Inc. ("Supermicro") reserves the right to make changes to the product described in this manual at any time and without notice. This product, including software and documentation, is the property of Supermicro and/or its licensors, and is supplied only under a license. Any use or reproduction of this product is not allowed, except as expressly permitted by the terms of said license. IN NO EVENT WILL SUPERMICRO BE LIABLE FOR DIRECT, INDIRECT, SPECIAL, INCIDENTAL, SPECULATIVE OR CONSEQUENTIAL DAMAGES ARISING FROM THE USE OR INABILITY TO USE THIS PRODUCT OR DOCUMENTATION, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. IN PARTICULAR, SUPERMICRO SHALL NOT HAVE LIABILITY FOR ANY HARDWARE, SOFTWARE, OR DATA STORED OR USED WITH THE PRODUCT, INCLUDING THE COSTS OF REPAIRING, REPLACING, INTEGRATING, INSTALLING OR RECOVERING SUCH HARDWARE, SOFTWARE, OR DATA. Any disputes arising between manufacturer and customer shall be governed by the laws of Santa Clara County in the State of California, USA. The State of California, County of Santa Clara shall be the exclusive venue for the resolution of any such disputes.
    [Show full text]
  • All-Inclusive ECC: Thorough End-To-End Protection for Reliable Computer Memory
    All-Inclusive ECC: Thorough End-to-End Protection for Reliable Computer Memory Jungrae Kim∗ Michael Sullivan∗∗ Sangkug Lym∗ Mattan Erez∗ ∗The University of Texas at Austin ∗∗NVIDIA fdale40, sklym, [email protected] [email protected] Abstract—Increasing transfer rates and decreasing I/O voltage DDR1 levels make signals more vulnerable to transmission errors. While DDR2 the data in computer memory are well-protected by modern DDR3 1.6 Gbps error checking and correcting (ECC) codes, the clock, control, DDR4 3.2 Gbps command, and address (CCCA) signals are weakly protected GDDR3 GDDR4 or even unprotected such that transmission errors leave serious 3.5 Gbps gaps in data-only protection. This paper presents All-Inclusive GDDR5 7 Gbps 14 Gbps ECC (AIECC), a memory protection scheme that leverages and GDDR5X augments data ECC to also thoroughly protect CCCA signals. 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 AIECC provides strong end-to-end protection of memory, cmd rate (per pin) addr rate (per pin) data rate (per pin) detecting nearly 100% of CCCA errors and also preventing (a) The increasing maximum DRAM transfer rates of command, address, transmission errors from causing latent memory data corruption. and data signals. AIECC provides these system-level benefits without requiring 3 2.5 8 extra storage and transfer overheads and without degrading the Core power IO power 2.5 effective level of data protection. 1.8 6 3.49 2 (46%) 2.88 1.5 1.35 2.88 1.5 1.2 (47%) (42%) 4 (52%) 1 4.13 1.48 VDD (V) 0.5 2 3.23 2.65 I.
    [Show full text]
  • Identifying Valueram Memory Modules
    Identifying ValueRAM Memory Modules This reference guide is designed to help you identify our ValueRAM® memory modules by specification. While this is a representation of a majority of our generic modules, naming conventions may vary as necessary. The back page is a summary of the terms used to describe these industry-standard modules. DDR3 for Desktops and Servers (PC3-8500, PC3-10666) DRAM Type 4: x4 DRAM chip K: Kit + Kingston ValueRAM DDR3 8: x8 DRAM chip CAS Latency Number of pieces KVR 1066 D3 D 4 R 7 S K2/4G Speed 1066/1333 Ranks R: Registered S: Thermal Sensor Capacity D: Dual Rank P: Parity (see chart) S: Single Rank DDR2 for Notebooks, Desktops, and Servers (PC2-3200, PC2-4200, PC2-5300, PC2-6400) DRAM Type 4: x4 DRAM chip K: Kit + Special Designators Kingston ValueRAM DDR2 8: x8 DRAM chip CAS Latency Number of pieces I: Intel Validated KVR 400 D2 D 8 N 3 H K2/512 I Speed 400/533/667/800 Ranks N: Non–ECC H: High Profile Capacity S: Single Rank E: ECC L: Very Low Profile (see chart) D: Dual Rank S: SODIMM Q: Quad Rank F: FB-DIMM M: MiniDIMM U: MicroDIMM P: Parity Registered (for AMD DDR2 Servers) DDR for Servers (PC2700, PC3200) Ranks S: Single Rank Capacity Kingston ValueRAM D: Dual Rank R: Registered (see chart) KVR 333 S 4 R 25 / 512 I Speed 333/400 DRAM Type CAS Latency Special Designators 4: x4 DRAM chip I: Intel Validated 8: x8 DRAM chip DDR for Notebooks, Desktops, and PC2100 Servers (PC2100, PC2700, PC3200) DRAM Type X72 ECC 4: x4 DRAM chip L: Low Profile Capacity Kingston ValueRAM X64 Non-ECC 8: x8 DRAM chip A: DDR400 3-3-3 (see chart) KVR 266 X72 R 4 C2 L K2 /512 D Speed 266/333/400 R: Registered CAS Latency K: Kit + Special Designators* S: SODIMM number of pieces S: Single Rank D: Dual Rank Q: Quad Rank * For server memory, if an S/D/Q designators is not shown, please check product list for ranks.
    [Show full text]
  • SDRAM Memory Systems: Architecture Overview and Design Verification SDRAM Memory Systems: Architecture Overview and Design Verification Primer
    Primer SDRAM Memory Systems: Architecture Overview and Design Verification SDRAM Memory Systems: Architecture Overview and Design Verification Primer Table of Contents Introduction . 3 - 4 DRAM Trends . .3 DRAM . 4 - 6 SDRAM . 6 - 9 DDR SDRAM . .6 DDR2 SDRAM . .7 DDR3 SDRAM . .8 DDR4 SDRAM . .9 GDDR and LPDDR . .9 DIMMs . 9 - 13 DIMM Physical Size . 9 DIMM Data Width . 9 DIMM Rank . .10 DIMM Memory Size & Speed . .10 DIMM Architecture . .10 Serial Presence Detect . .12 Memory System Design . .13 - 15 Design Simulation . .13 Design Verification . .13 Verification Strategy . .13 SDRAM Verification . .14 Glossary . .16 - 19 2 www.tektronix.com/memory SDRAM Memory Systems: Architecture Overview and Design Verification Primer Introduction Memory needs to be compatible with a wide variety of memory controller hubs used by the computer DRAM (Dynamic Random Access Memory) is attractive to manufacturers. designers because it provides a broad range of performance Memory needs to work when a mixture of different and is used in a wide variety of memory system designs for manufacturer’s memories is used in the same memory computers and embedded systems. This DRAM memory system of the computer. primer provides an overview of DRAM concepts, presents potential future DRAM developments and offers an overview Open memory standards are useful in helping to ensure for memory design improvement through verification. memory compatibility. DRAM Trends On the other hand, embedded systems typically use a fixed There is a continual demand for computer memories to be memory configuration, meaning the user does not modify larger, faster, lower powered and physically smaller. These the memory system after purchasing the product.
    [Show full text]
  • A Power and Temperature Aware DRAM Architecture
    A Power and Temperature Aware DRAM Architecture Song Liu, Seda Ogrenci Memik, Yu Zhang, and Gokhan Memik Department of Electrical Engineering and Computer Science Northwestern University, Evanston, IL 60208, USA {sli646, seda, yzh702, memik}@eecs.northwestern.edu ABSTRACT larger main memory capacitance with increasing Technological advances enable modern processors to utilize bandwidths. However, DRAM power consumption takes an increasingly larger DRAMs with rising access frequencies. increasingly larger share of the system power budget. This is leading to high power consumption and operating Memory power issue is becoming further critical in battery- temperature in DRAM chips. As a result, temperature powered systems such as laptops. management has become a real and pressing issue in high Existing DRAM power management approaches focus on performance DRAM systems. Traditional low power fully utilizing low power modes of the DRAM. However, techniques are not suitable for high performance DRAM DRAMs in high performance systems have fewer systems with high bandwidth. In this paper, we propose and opportunities to use low power modes. Simply switching evaluate a customized DRAM low power technique based between different modes may incur significant performance on Page Hit Aware Write Buffer (PHA-WB). Our proposed penalty on the DRAM bandwidth. A technique that relates approach reduces DRAM system power consumption and to DRAM’s inherent nature is needed. Our goal is to temperature without any performance penalty. Our develop a technique to explore DRAM power savings in the experiments show that a system with a 64-entry PHA-WB active mode. Specifically, our proposed technique reduces could reduce the total DRAM power consumption by up to DRAM power consumption by improving DRAM page hit 22.0% (9.6% on average).
    [Show full text]
  • Tuning IBM System X Servers for Performance
    Front cover Tuning IBM System x Servers for Performance Identify and eliminate performance bottlenecks in key subsystems Expert knowledge from inside the IBM performance labs Covers Windows, Linux, and VMware ESX David Watts Alexandre Chabrol Phillip Dundas Dustin Fredrickson Marius Kalmantas Mario Marroquin Rajeev Puri Jose Rodriguez Ruibal David Zheng ibm.com/redbooks International Technical Support Organization Tuning IBM System x Servers for Performance August 2009 SG24-5287-05 Note: Before using this information and the product it supports, read the information in “Notices” on page xvii. Sixth Edition (August 2009) This edition applies to IBM System x servers running Windows Server 2008, Windows Server 2003, Red Hat Enterprise Linux, SUSE Linux Enterprise Server, and VMware ESX. © Copyright International Business Machines Corporation 1998, 2000, 2002, 2004, 2007, 2009. All rights reserved. Note to U.S. Government Users Restricted Rights -- Use, duplication or disclosure restricted by GSA ADP Contents Notices . xvii Trademarks . xviii Foreword . xxi Preface . xxiii The team who wrote this book . xxiv Become a published author . xxix Comments welcome. xxix Part 1. Introduction . 1 Chapter 1. Introduction to this book . 3 1.1 Operating an efficient server - four phases . 4 1.2 Performance tuning guidelines . 5 1.3 The System x Performance Lab . 5 1.4 IBM Center for Microsoft Technologies . 7 1.5 Linux Technology Center . 7 1.6 IBM Client Benchmark Centers . 8 1.7 Understanding the organization of this book . 10 Chapter 2. Understanding server types . 13 2.1 Server scalability . 14 2.2 Authentication services . 15 2.2.1 Windows Server 2008 Active Directory domain controllers .
    [Show full text]
  • Technical Product Specification
    Intel® Compute Module MFS5000SI Technical Product Specification Intel order number: E15154-007 Revision 1.4 June 2009 Enterprise Platforms and Services Division Revision History Intel® Compute Module MFS5000SI TPS Revision History Date Revision Modifications Number July 2007 0.95 Initial release. August 2007 0.96 Updated September 2007 1.0 Updated February 2008 1.1 Updated November 2008 1.2 Updated May 2009 1.3 Updated June 2009 1.4 Updated supported memory configurations Disclaimers Information in this document is provided in connection with Intel® products. No license, express or implied, by estoppel or otherwise, to any intellectual property rights is granted by this document. Except as provided in Intel's Terms and Conditions of Sale for such products, Intel assumes no liability whatsoever, and Intel disclaims any express or implied warranty, relating to sale and/or use of Intel products including liability or warranties relating to fitness for a particular purpose, merchantability, or infringement of any patent, copyright or other intellectual property right. Intel products are not intended for use in medical, life saving, or life sustaining applications. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked "reserved" or "undefined." Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The Intel® Compute Module MFS5000SI may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request.
    [Show full text]
  • Xserve Technology Overview January 2008 Technology Overview  Xserve
    Xserve Technology Overview January 2008 Technology Overview Xserve Contents Page 4 Introduction Page 5 Product Overview Key Features Page 7 Performance Overview Java Server Performance Processor Performance Memory Performance Storage Performance File Server Performance Page 1 Quad-Core Intel Xeon Processors Enhanced Intel Core Microarchitecture 12MB On-Die L2 Cache Dual Independent System Buses Reduced Idle Power High-Performance Floating-Point Division Enhanced 128-Bit SSE4 SIMD Engine Page 14 High-Bandwidth Server Architecture Dual Independent 1600MHz Frontside Buses Advanced FB-DIMM Memory Technology High-Performance PCI Express Expansion Industry-Standard Connectivity Page 18 Flexible Storage Options SATA Drive Technology SAS Drive Technology Apple Drive Modules Software and Hardware RAID Options Fibre Channel Page 22 Integrated Lights-Out Remote Management Anywhere, Anytime Monitoring and Control Dedicated Monitoring Hardware Server Monitor Software Innovative Remote Management Apple Remote Desktop Technology Overview Xserve Page 6 Mac OS X Server v10.5 Leopard UNIX Certified Foundation Optimized for Multicore Intel Processors 64-Bit Computing Advanced Networking Architecture Comprehensive Built-in Services Powerful Workgroup Management Services Innovative Collaborative Services Software Development with Xcode 3 Page 1 Service, Support, and Training Options AppleCare Premium Service and Support Plan AppleCare Service Parts Kit Apple Maintenance Program Mac OS X Server Software Support Training and Certification Programs Page 33
    [Show full text]
  • FB-DIMM Spring IDF Architecture Presentation
    Fully Buffered DIMM (FB-DIMM) Server Memory Architecture: Capacity, Performance, Reliability, and Longevity Pete Vogt Principal Engineer Intel Corp February 18, 2004 1 Agenda y Server Capacity Problem and Solution y Performance y Reliability y Longevity 2 Problem & Solution Intel Platform Memory Technology Roadmap DDR3 DDR3 FB-DIMM DDR2 667/800 DDR2 DDR2 400/533 DDR DDR333/400 DDR266 RDRAM* PC1066 PC800 SDRAM PC133 2004 2005 20052006 y DDR2 400/533 support in all main IA segments in 2004 – with DDR flexibility y FB-DIMM new server interconnect in 2005 y Low Power SDRAM moving to low power DDR y RDRAM still used in specific applications *Other names and brands may be claimed as the property of others 3 Problem & Solution FB-DIMM New Server Interconnect in 2005 Why is FB-DIMM Needed in the Future? 4 Problem & Solution DRAM Device Trends DRAM Bit Density; Mb/dev y DRAM bit density follows 2500 Moore’s law and 2000 increases 2X every ~2 1500 years 1000 500 0 99 00 01 02 03 04 05 06 07 Rule of thumb - Source: Pete Vogt DRAM Data Rate; Mbps y DRAM data rates driven 1200 1000 by CPU performance 800 increase 2X every 600 400 generation (3.35 years) 200 0 99 00 01 02 03 04 05 06 07 Projected historical data - Source: Pete Vogt DRAM Devices Continue to Scale 5 Problem & Solution Devices per Channel Trend Stub-bus Topology y Existing “stub-bus” DRAM DRAM architecture has DRAM DRAM impedance discontinuities Mem that effect signal integrity Ctrl Impedance Discontinuities Stub-bus Devices/Channel 160 y As the data rate goes up 140 120 the number of
    [Show full text]