High-Bandwidth Memory Interface Design

Total Page:16

File Type:pdf, Size:1020Kb

High-Bandwidth Memory Interface Design High-Bandwidth Memory Interface Design Chulwoo Kim [email protected] Dept. of Electrical Engineering Korea University, Seoul, Korea February 17, 2013 Chulwoo Kim 1 of 86 Outline Introduction Clock Generation and Distribution Transceiver Design TSV Interface for DRAM Summary References Chulwoo Kim 2 of 86 Outline Introduction DRAM 101 Simplified DRAM Architecture and Operation Differences of DRAM (DDRx, GDDRx, LPDDRx) Trend Memory Interface: Differences and Issues Clock Generation and Distribution Transceiver Design TSV Interface for DRAM Summary References Chulwoo Kim 3 of 86 DRAM 101 SDR Single Data Rate Main Memory DDRx CLK SDRAM PC, Notebook, Server DQ D Graphics Memory Synchronous DDR GDDRx Dynamic Graphic Card, Console Random Double Data Rate Access Memory CLK Mobile Memory LPDDRx DQ D D Phone, Tablet PC CLK MCU Command C CAS* Latency CLK & Data Command CLK SDRAM DQ D D D D D D D D Burst Length *CAS : Column Address Strobe Chulwoo Kim Introduction 4 of 86 DRAM DDR4 Die Photo Bank Bank Bank Bank Bank Bank Bank Bank 0 1 2 3 8 9 10 11 Supply Voltage VDD=1.2V, VPP=2.5V Process 38nm CMOS /3-metal Banks 4-Bank Group, 16 Bank Bank BankData Bank Rate Bank 2400Bank Mbps Bank Bank Bank 4 5 Number6 of IO‟s 7 X4 12/ X8 13 14 15 [1] K. B. Koo et al., ISSCC 2012, pp. 40-41 Chulwoo Kim Introduction 5 of 86 Simplified DRAM Architecture Bank Bank Word Driver Line Word BLT BLB Fuse Repair Row Row Decoder Row WL Cell Array Column Decoder BLSA* Write Drv. / Read Amp. Column Repair Fuse Peripheral Circuit Generator Serial to Parallel DCLK ICLK CMD DLL parallel to serial Controller DQ RX DQ TX CLK/ADD/CMD Buffer Bank Bank * BLSA : Bit line sense amplifier Chulwoo Kim Introduction 6 of 86 Concept of DRAM operation Bank Bank WRITE Np×Ndq : Serial to parallel BLSABLSA (DQ GIO) READ *BLSA : Bit line sense amplifier : Parallel to serial *Np: Number of (GIO DQ) pre-fetch *Ndq: Number of DQ Peripheral Circuit Serial to Parallel GIO *GIO : Global I/O parallel to serial Ndq bits Ndq bits Np×Ndq bits DQ RX DQ TX Bank Bank Chulwoo Kim Introduction 7 of 86 Pre-fetch Timing(DDR1,BL*=2) tCCD*=1 CLK RD RD GIO GIO GIO After CL* BL*=2 DQS DQ 0 1 0 1 Number of GIO channel=Np×Ndq=2×8=16 (DDR1 x8) * tCCD : CAS to CAS delay * CL : CAS latency [2] JEDEC, JESD79F, pp. 24-29 * BL : Burst length Chulwoo Kim Introduction 8 of 86 Pre-fetch Diagram(DDR1) Bank Bank Bank Bank Num. of GIO channel = 2×Ndq Bank Bank Bank Bank Pre-fetch operation 2-bit pre-fetch [2×Ndq] data access (If the output data rate is 400Mbps, the internal data rate is 200Mbps) Chulwoo Kim Introduction 9 of 86 Pre-fetch Timing(DDR2,BL=4) tCCD=2 CLK RD RD GIO GIO GIO After RL* BL=4 DQS DQ 0 1 2 3 0 1 2 3 Number of GIO channel=Np×Ndq=4×8=32 (DDR2 x8) * RL : READ latency [3] JEDEC, JESD79-2F, pp. 35 Chulwoo Kim Introduction 10 of 86 Pre-fetch Diagram(DDR2) Bank Bank Bank Bank Num. of GIO channel = 4×Ndq Bank Bank Bank Bank Pre-fetch operation 4-bit pre-fetch [4×Ndq] data access (If the output data rate is 800Mbps, the internal data rate is 200Mbps, same as DDR1) Chulwoo Kim Introduction 11 of 86 Pre-fetch Timing(DDR3,BL=8) tCCD=4 CLK RD RD GIO GIO GIO After RL BL=8 DQS DQ 0 1 2 3 4 5 6 7 0 1 2 3 4 5 6 7 Number of GIO channel=Np×Ndq=8×8=64 (DDR3 x8) [4] JEDEC, JESD79-3F, pp. 62 Chulwoo Kim Introduction 12 of 86 Pre-fetch Diagram(DDR3) Bank Bank Bank Bank Num. of GIO channel = 8×Ndq Bank Bank Bank Bank Pre-fetch operation 8-bit pre-fetch [8×Ndq] data access (If the output data rate is 1.6Gbps, the internal data rate is 200Mbps, same as DDR1) Chulwoo Kim Introduction 13 of 86 Bank Grouping Timing(DDR4,BL=8) tCCD_S=4 tCCD_L=5 CLK RD RD RD G0 G1 G1 GIO_BG0 GIO_BG0 GIO_BG1 GIO_BG1 GIO_BG1 GIO_BG2 GIO_BG3 After RL BL=8 DQS DQ 0 1 2 3 4 5 6 7 0 1 2 3 4 5 6 7 0 1 2 3 4 5 6 7 Number of GIO channel=Np×Ndq×Ngroup=8×8×4 = 256(DDR4 x8) [5] JEDEC, JESD79-4, pp. 77-78 [6] T. Y. Oh et al., ISSCC 2010, pp. 434-435 Chulwoo Kim Introduction 14 of 86 Pre-fetch & Bank Grouping(DDR4) Bank Bank Bank Bank Group0 Group1 GIO MUX Num. of GIO channel = 8×Ndq Group2 Group3 Bank Bank Bank Bank Pre-fetch operation 8-bit pre-fetch Bank grouping [1] K. B. Koo et al., ISSCC 2012, pp. 40-41 Chulwoo Kim Introduction 15 of 86 Differences of DDRx,GDDRx,LPDDRx DDRx GDDRx LPDDRx Bank Bank Bank Bank PAD Bank Bank Architecture PAD PAD Bank Bank Bank Bank Bank Bank PAD Application PC/Server Graphic card Mobile/Consumer Socket DIMM On board MCP*/PoP*/SiP* IO ×4/×8 ×16/×32 ×16/×32 .Single uni-directional .No DLL Unique WDQS, RDQS .DPD* .VDDQ termination .PASR* Function .CRC, DBI .TCSR* .ABI * MCP: Multi chip package * DPD: Deep power down * PoP : Package on package * PASR : Partial array self refresh * SiP : System in package * TCSR : Temperature compensated self refresh Chulwoo Kim Introduction 16 of 86 DDR Comparison DDR1 DDR2 DDR3 DDR4 VDD [V] 2.5 1.8 1.5 1.2 Data Rate 200M~400M 400M~800M 800M~2.1G 1.6G~3.2G [bps/pin] Pre-Fetch 2 bit 4 bit 8 bit 8 bit STROBE Single DQS Differential DQS, DQSB Interface SSTL_2 SSTL_18 SSTL_15 POD_12 .OCD calibration .Dynamic ODT .CA parity .ODT .ZQ calibration .DBI*, CRC* New .Write leveling .Gear down Feature .CAL* ▪ PDA* .FGREF * ▪ TCAR* .Bank grouping * DBI: Data bus inversion * PDA: Per DRAM addressability * CRC: Cyclic redundancy check * FGREF: Fine granularity refresh * CAL: Command address latency * TCAR: Temperature controlled array refresh Chulwoo Kim Introduction 17 of 86 GDDR Comparison GDDR1 gDDR2 GDDR3 GDDR4 GDDR5 VDD [V] 2.5 1.8 1.5 1.5 1.5/1.35 Data Rate 300~900M 800M~1G 700M~2.6G 2.0G~3.0G 3.6G~7.0G [bps/pin] Pre-Fetch 2 bit 4 bit 4 bit 8 bit 8 bit Differential STROBE Single DQS Bi-direction Single Uni-direction WDQS, RDQS DQS*, DQSB Interface SSTL_2 SSTL_2 POD-18 POD-15 POD-15 .OCD* .ZQ .DBI .No DLL calibration .Parity(opt) .PLL(option) New .ODT* .WCK, WCKB Feature .CRC ▪ ABI* .RDQS(option) .Bank grouping * DQS: DQ strobe signal, DQ is dada I/O Pin * ODT: On die termination * OCD: Off chip driver * ABI: Address bus inversion Chulwoo Kim Introduction 18 of 86 LPDDR Comparison LPDDR1 LPDDR2 LPDDR3 VDD [V] 1.8 1.2 1.2 Data Rate 200M~400M 200M~1066M 333M~1600M [bps/pin] Pre-Fetch 2 bit 4 bit 8 bit STROBE DQS DQS_T, DQS_C DQS_T, DQS_C Interface SSTL_18* HSUL_12* HSUL_12* DLL X X X .CA pin .ODT New (High tapped termination) Feature * SSTL: Stub series terminated logic * HSUL: High speed un-terminated logic Chulwoo Kim Introduction 19 of 86 Trend DDR1 Although all types of DRAMs are 2.5 reaching their limits in supply voltage, GDDR1 the demand of high-bandwidth memory is keep increasing LPDDR1 gDDR2 1.8 DDR2 VDD [V] VDD GDDR3 1.5 GDDR4 GDDR5 DDR3 LPDDR2 DDR4 1.2 LPDDR3 0.2 0.4 0.8 1.2 1.6 2.0 2.4 2.8 3.2 3.6 … 7.0 Data Rate [Gbps] Chulwoo Kim Introduction 20 of 86 Memory Interface DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM DRAM GPU DRAM CPU System Feature Issue Single-ended/high speed Reflection Many channel Inter-symbol interference (weak for coupling effect) Simultaneous switching output DDR: multi-drop noise (multi rank, multi DIMM) Pin to pin skew GDDR: point to point Poor transistor performance Impedance discontinuities (stubs, connector, via, etc. ) Chulwoo Kim Introduction 21 of 86 Outline Introduction Clock Generation and Distribution Delay-locked loop (DLL) Duty cycle corrector (DCC) Clock distribution Transceiver Design TSV Conclusions References Chulwoo Kim 22 of 86 Basic DLL Architecture tD1 tDVDL tDREP I_CLK Variable Replica Clock Delay Line Delay PD Controller FB_CLK Data O_CLK DATA from memory core External DRAM tD2 tD1 Clock tCK ∙ N = tDVDL +tDREP I_CLK tDREP ≈ tD1 +tD2 FB_CLK tDREP tDVDL tCK ∙ N = tDVDL +tD1 +tD2 + γ O_CLK Data γ = tDREP – (tD1 +tD2) tD2 Chulwoo Kim Clock Generation and Distribution 23 of 86 Replica Delay Mismatch γ variation [ps] Supply Voltage [V] γ ≈0 HVDD Long Valid HVDD Valid Valid Data Data Data Window Window Window γ <0 tCK VDD VDD LVDD LVDD Short γ >0 tDQSCK* (or tAC) tDQSCK (or tAC) tDQSCK (or tAC) *tDQSCK (or tAC) – DQS output access time for CK/CKb Chulwoo Kim Clock Generation and Distribution 24 of 86 Locking Range Considerations I_CLK Long tDREQUIRED tDINIT+tDREP FB_CLK Bird’s beak tCK I_CLK tD +tD INIT REP tDREQUIRED FB_CLK Short tDQSCK (or tAC) tDINIT = tDVDL(0) + tDREP N×tCK > tDVDL(0) + tDREP tCK = tDVDL + tDREP + t∆ [7] H.-W. Lee et al., submitted to TVLSI Chulwoo Kim Clock Generation and Distribution 25 of 86 Synchronous Mirror Delay (SMD) tD1 tD1+tD2 tD3 Clock Clock Delay Measure Delay Line I_CLK I_CLK Replica Replicate Measure Delay Replicate Delay Line OUT tD2 OUT tD1 tD3 tD3 tD2 tD1+tD2 Basic Operation Measure and replicate the delay No feedback Match delay in two cycles [8] T.
Recommended publications
  • 2GB DDR3 SDRAM 72Bit SO-DIMM
    Apacer Memory Product Specification 2GB DDR3 SDRAM 72bit SO-DIMM Speed Max CAS Component Number of Part Number Bandwidth Density Organization Grade Frequency Latency Composition Rank 0C 78.A2GCB.AF10C 8.5GB/sec 1066Mbps 533MHz CL7 2GB 256Mx72 256Mx8 * 9 1 Specifications z Support ECC error detection and correction z On DIMM Thermal Sensor: YES z Density:2GB z Organization – 256 word x 72 bits, 1rank z Mounting 9 pieces of 2G bits DDR3 SDRAM sealed FBGA z Package: 204-pin socket type small outline dual in line memory module (SO-DIMM) --- PCB height: 30.0mm --- Lead pitch: 0.6mm (pin) --- Lead-free (RoHS compliant) z Power supply: VDD = 1.5V + 0.075V z Eight internal banks for concurrent operation ( components) z Interface: SSTL_15 z Burst lengths (BL): 8 and 4 with Burst Chop (BC) z /CAS Latency (CL): 6,7,8,9 z /CAS Write latency (CWL): 5,6,7 z Precharge: Auto precharge option for each burst access z Refresh: Auto-refresh, self-refresh z Refresh cycles --- Average refresh period 7.8㎲ at 0℃ < TC < +85℃ 3.9㎲ at +85℃ < TC < +95℃ z Operating case temperature range --- TC = 0℃ to +95℃ z Serial presence detect (SPD) z VDDSPD = 3.0V to 3.6V Apacer Memory Product Specification Features z Double-data-rate architecture; two data transfers per clock cycle. z The high-speed data transfer is realized by the 8 bits prefetch pipelined architecture. z Bi-directional differential data strobe (DQS and /DQS) is transmitted/received with data for capturing data at the receiver. z DQS is edge-aligned with data for READs; center aligned with data for WRITEs.
    [Show full text]
  • Memory & Devices
    Memory & Devices Memory • Random Access Memory (vs. Serial Access Memory) • Different flavors at different levels – Physical Makeup (CMOS, DRAM) – Low Level Architectures (FPM,EDO,BEDO,SDRAM, DDR) • Cache uses SRAM: Static Random Access Memory – No refresh (6 transistors/bit vs. 1 transistor • Main Memory is DRAM: Dynamic Random Access Memory – Dynamic since needs to be refreshed periodically (1% time) – Addresses divided into 2 halves (Memory as a 2D matrix): • RAS or Row Access Strobe • CAS or Column Access Strobe Random-Access Memory (RAM) Key features – RAM is packaged as a chip. – Basic storage unit is a cell (one bit per cell). – Multiple RAM chips form a memory. Static RAM (SRAM) – Each cell stores bit with a six-transistor circuit. – Retains value indefinitely, as long as it is kept powered. – Relatively insensitive to disturbances such as electrical noise. – Faster and more expensive than DRAM. Dynamic RAM (DRAM) – Each cell stores bit with a capacitor and transistor. – Value must be refreshed every 10-100 ms. – Sensitive to disturbances. – Slower and cheaper than SRAM. Semiconductor Memory Types Static RAM • Bits stored in transistor “latches” à no capacitors! – no charge leak, no refresh needed • Pro: no refresh circuits, faster • Con: more complex construction, larger per bit more expensive transistors “switch” faster than capacitors charge ! • Cache Static RAM Structure 1 “NOT ” 1 0 six transistors per bit 1 0 (“flip flop”) 0 1 0/1 = example 0 Static RAM Operation • Transistor arrangement (flip flop) has 2 stable logic states • Write 1. signal bit line: High à 1 Low à 0 2. address line active à “switch” flip flop to stable state matching bit line • Read no need 1.
    [Show full text]
  • Tesla K80 Gpu Accelerator
    TESLA K80 GPU ACCELERATOR BD-07317-001_v05 | January 2015 Board Specification DOCUMENT CHANGE HISTORY BD-07317-001_v05 Version Date Authors Description of Change 01 June 23, 2014 GG, SM Preliminary Information (Information contained within this board specification is subject to change) 02 October 8, 2014 GG, SM • Updated product name • Minor change to Table 2 03 October 31, 2014 GG, SM • Added “8-Pin CPU Power Connector” section • Updated Figure 2 04 November 14, 2014 GG, SM • Removed preliminary and NDA • Updated boost clocks • Minor edits throughout document 05 January 30, 2015 GG, SM Updated Table 2 with MTBF data Tesla K80 GPU Accelerator BD-07317-001_v05 | ii TABLE OF CONTENTS Overview ............................................................................................. 1 Key Features ...................................................................................... 2 NVIDIA GPU Boost on Tesla K80 ................................................................ 3 Environmental Conditions ....................................................................... 4 Configuration ..................................................................................... 5 Mechanical Specifications ........................................................................ 6 PCI Express System ............................................................................... 6 Tesla K80 Bracket ................................................................................ 7 8-Pin CPU Power Connector ...................................................................
    [Show full text]
  • 1Gb Gddr3 SDRAM E-Die
    K4W1G1646E 1Gb gDDR3 SDRAM 1Gb gDDR3 SDRAM E-die 96 FBGA with Lead-Free & Halogen-Free (RoHS Compliant) INFORMATION IN THIS DOCUMENT IS PROVIDED IN RELATION TO SAMSUNG PRODUCTS, AND IS SUBJECT TO CHANGE WITHOUT NOTICE. NOTHING IN THIS DOCUMENT SHALL BE CONSTRUED AS GRANTING ANY LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHER- WISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IN SAMSUNG PRODUCTS OR TECHNOL- OGY. ALL INFORMATION IN THIS DOCUMENT IS PROVIDED ON AS "AS IS" BASIS WITHOUT GUARANTEE OR WARRANTY OF ANY KIND. 1. For updates or additional information about Samsung products, contact your nearest Samsung office. 2. Samsung products are not intended for use in life support, critical care, medical, safety equipment, or similar applications where Product failure could result in loss of life or personal or physical harm, or any military or defense application, or any governmental procurement to which special terms or provisions may apply. * Samsung Electronics reserves the right to change products or specification without notice. 1 of 123 Rev. 1.2 July 2009 K4W1G1646E 1Gb gDDR3 SDRAM Revision History Revision Month Year History 0.0 July 2008 - First release - Correction ball configuration on page 4. 0.9 October 2008 - Changed ordering Infomation 0.9 November 2008 - Correction package size on page 5. - Added thermal characteristics table 0.91 December 2008 - Added speed bin 1066Mbps - Corrected tFAW value on page 44 0.92 February 2009 - Corrected Package Dimension on page - Corrected Package pinout on page 4 - Added thermal characteristics values (speed bin 1066/1333/1600Mbps ) on page 14. 1.0 February 2009 - Added IDD SPEC values (speed bin 1066/1333/1600Mbps) on page 36.
    [Show full text]
  • Performance Impact of Memory Channels on Sparse and Irregular Algorithms
    Performance Impact of Memory Channels on Sparse and Irregular Algorithms Oded Green1,2, James Fox2, Jeffrey Young2, Jun Shirako2, and David Bader3 1NVIDIA Corporation — 2Georgia Institute of Technology — 3New Jersey Institute of Technology Abstract— Graph processing is typically considered to be a Graph algorithms are typically latency-bound if there is memory-bound rather than compute-bound problem. One com- not enough parallelism to saturate the memory subsystem. mon line of thought is that more available memory bandwidth However, the growth in parallelism for shared-memory corresponds to better graph processing performance. However, in this work we demonstrate that the key factor in the utilization systems brings into question whether graph algorithms are of the memory system for graph algorithms is not necessarily still primarily latency-bound. Getting peak or near-peak the raw bandwidth or even the latency of memory requests. bandwidth of current memory subsystems requires highly Instead, we show that performance is proportional to the parallel applications as well as good spatial and temporal number of memory channels available to handle small data memory reuse. The lack of spatial locality in sparse and transfers with limited spatial locality. Using several widely used graph frameworks, including irregular algorithms means that prefetched cache lines have Gunrock (on the GPU) and GAPBS & Ligra (for CPUs), poor data reuse and that the effective bandwidth is fairly low. we evaluate key graph analytics kernels using two unique The introduction of high-bandwidth memories like HBM and memory hierarchies, DDR-based and HBM/MCDRAM. Our HMC have not yet closed this inefficiency gap for latency- results show that the differences in the peak bandwidths sensitive accesses [19].
    [Show full text]
  • Different Types of RAM RAM RAM Stands for Random Access Memory. It Is Place Where Computer Stores Its Operating System. Applicat
    Different types of RAM RAM RAM stands for Random Access Memory. It is place where computer stores its Operating System. Application Program and current data. when you refer to computer memory they mostly it mean RAM. The two main forms of modern RAM are Static RAM (SRAM) and Dynamic RAM (DRAM). DRAM memories (Dynamic Random Access Module), which are inexpensive . They are used essentially for the computer's main memory SRAM memories(Static Random Access Module), which are fast and costly. SRAM memories are used in particular for the processer's cache memory. Early memories existed in the form of chips called DIP (Dual Inline Package). Nowaday's memories generally exist in the form of modules, which are cards that can be plugged into connectors for this purpose. They are generally three types of RAM module they are 1. DIP 2. SIMM 3. DIMM 4. SDRAM 1. DIP(Dual In Line Package) Older computer systems used DIP memory directely, either soldering it to the motherboard or placing it in sockets that had been soldered to the motherboard. Most memory chips are packaged into small plastic or ceramic packages called dual inline packages or DIPs . A DIP is a rectangular package with rows of pins running along its two longer edges. These are the small black boxes you see on SIMMs, DIMMs or other larger packaging styles. However , this arrangment caused many problems. Chips inserted into sockets suffered reliability problems as the chips would (over time) tend to work their way out of the sockets. 2. SIMM A SIMM, or single in-line memory module, is a type of memory module containing random access memory used in computers from the early 1980s to the late 1990s .
    [Show full text]
  • Product Guide SAMSUNG ELECTRONICS RESERVES the RIGHT to CHANGE PRODUCTS, INFORMATION and SPECIFICATIONS WITHOUT NOTICE
    May. 2018 DDR4 SDRAM Memory Product Guide SAMSUNG ELECTRONICS RESERVES THE RIGHT TO CHANGE PRODUCTS, INFORMATION AND SPECIFICATIONS WITHOUT NOTICE. Products and specifications discussed herein are for reference purposes only. All information discussed herein is provided on an "AS IS" basis, without warranties of any kind. This document and all information discussed herein remain the sole and exclusive property of Samsung Electronics. No license of any patent, copyright, mask work, trademark or any other intellectual property right is granted by one party to the other party under this document, by implication, estoppel or other- wise. Samsung products are not intended for use in life support, critical care, medical, safety equipment, or similar applications where product failure could result in loss of life or personal or physical harm, or any military or defense application, or any governmental procurement to which special terms or provisions may apply. For updates or additional information about Samsung products, contact your nearest Samsung office. All brand names, trademarks and registered trademarks belong to their respective owners. © 2018 Samsung Electronics Co., Ltd. All rights reserved. - 1 - May. 2018 Product Guide DDR4 SDRAM Memory 1. DDR4 SDRAM MEMORY ORDERING INFORMATION 1 2 3 4 5 6 7 8 9 10 11 K 4 A X X X X X X X - X X X X SAMSUNG Memory Speed DRAM Temp & Power DRAM Type Package Type Density Revision Bit Organization Interface (VDD, VDDQ) # of Internal Banks 1. SAMSUNG Memory : K 8. Revision M: 1st Gen. A: 2nd Gen. 2. DRAM : 4 B: 3rd Gen. C: 4th Gen. D: 5th Gen.
    [Show full text]
  • NVIDIA Quadro CX Overview
    QuickSpecs NVIDIA Quadro CX Overview Models NVIDIA Quadro CX – The Accelerator for Creative Suite 4 Introduction The NVIDIA® Quadro® CX is the accelerator for Adobe® Creative Suite® 4-giving creative professionals the performance, tools, and reliability they need to maximize their creativity. Quadro CX enables H.264 video encoding at lightning-fast speeds with NVIDIA CUDA™ technology and accelerates rendering time for advanced effects. A Faster Way to Work Faster video encoding: Encode H.264 video at lightning-fast speeds with the NVIDIA CUDA™-enabled plug-in for Adobe Premiere® Pro CS4. Accelerated effects: Accelerate rendering time for advanced effects such as transformations, color correction, depth of field blur, turbulent noise, and more. A Better Way to Work High-quality preview: Accurately see what your deliverable will look like with 30-bit color or uncompressed 10-bit/12-bit SDI before final output.* Natural canvas: Experience fluid interaction with the Adobe Photoshop® canvas for smoother zooming and image rotation. Advanced multi-display management tools: Easily work across multiple displays with NVIDIA nView® advanced management tools for a smoother production pipeline. A More Reliable Way to Work Designed for CS4: NVIDIA Quadro CX is designed and optimized for Creative Suite 4. Built by NVIDIA: Quadro CX is engineered and optimized by NVIDIA to ensure your system works when you need it. Extended product lifecycle: Extended (24-36 month) product life cycle and availability for a stable platform environment. NOTE: As of August 1, 2009 Quadro CX is being replaced with Elemental Accelerator Software for NVIDIA Quadro. The same functionality is offered with drop in the box software, across multiple NVIDIA Quadro graphics cards.
    [Show full text]
  • GRA110 3U VPX High Performance Graphics Board
    GE Fanuc Embedded Systems GRA110 3U VPX High Performance Graphics Board Features The GRA110 is the first graphics board to be With a rich set of I/O, the GRA110 is designed • NVIDIA G73 GPU announced in the 3U VPX form factor. Bringing to serve many of the most common video - As used on NVIDIA® GeForce® 7600GT desktop performance to the rugged market, applications. Dual, independent channels the GRA110 represents a step change in mean that it is capable of driving RGB analog • Leading OpenGL performance capability for the embedded systems integrator. component video, digital DVI 1.0, and RS170, • 256 MBytes DDR SDRAM With outstanding functionality, together with NTSC or PAL standards. In addition, the GRA110’s • Two independent output channels PCI ExpressTM interconnect, even the most video input capability allows integration of sensor demanding applications can now be deployed data using RS170, NTSC or PAL video formats. • VESA output resolutions to 1600x1200 with incredible fidelity. • RS-170, NTSC & PAL video output • DVI 1.0 digital video output The VPX form factor allows for high speed PCI Express connections to single board computers in • RS170, NTSC & PAL Video output the system. The GRA110 supports the 16-lane PCI • Air- and rugged conduction- variants Express implementation, providing the maximum • 3U VPX Form Factors available communication bandwidth to a CPU such as GE Fanuc Embedded System’s SBC340. The PCI Express link will automatically adapt to the active number of lanes available, and so will work with single board computers
    [Show full text]
  • DDR and DDR2 SDRAM Controller Compiler User Guide
    DDR and DDR2 SDRAM Controller Compiler User Guide 101 Innovation Drive Software Version: 9.0 San Jose, CA 95134 Document Date: March 2009 www.altera.com Copyright © 2009 Altera Corporation. All rights reserved. Altera, The Programmable Solutions Company, the stylized Altera logo, specific device designations, and all other words and logos that are identified as trademarks and/or service marks are, unless noted otherwise, the trademarks and service marks of Altera Corporation in the U.S. and other countries. All other product or service names are the property of their respective holders. Altera products are protected under numerous U.S. and foreign patents and pending ap- plications, maskwork rights, and copyrights. Altera warrants performance of its semiconductor products to current specifications in accordance with Altera's standard warranty, but reserves the right to make changes to any products and services at any time without notice. Altera assumes no responsibility or liability arising out of the application or use of any information, product, or service described herein except as expressly agreed to in writing by Altera Corporation. Altera customers are advised to obtain the latest version of device specifications before relying on any published information and before placing orders for products or services. UG-DDRSDRAM-10.0 Contents Chapter 1. About This Compiler Release Information . 1–1 Device Family Support . 1–1 Features . 1–2 General Description . 1–2 Performance and Resource Utilization . 1–4 Installation and Licensing . 1–5 OpenCore Plus Evaluation . 1–6 Chapter 2. Getting Started Design Flow . 2–1 SOPC Builder Design Flow . 2–1 DDR & DDR2 SDRAM Controller Walkthrough .
    [Show full text]
  • AXP Internal 2-Apr-20 1
    2-Apr-20 AXP Internal 1 2-Apr-20 AXP Internal 2 2-Apr-20 AXP Internal 3 2-Apr-20 AXP Internal 4 2-Apr-20 AXP Internal 5 2-Apr-20 AXP Internal 6 Class 6 Subject: Computer Science Title of the Book: IT Planet Petabyte Chapter 2: Computer Memory GENERAL INSTRUCTIONS: • Exercises to be written in the book. • Assignment questions to be done in ruled sheets. • You Tube link is for the explanation of Primary and Secondary Memory. YouTube Link: ➢ https://youtu.be/aOgvgHiazQA INTRODUCTION: ➢ Computer can store a large amount of data safely in their memory for future use. ➢ A computer’s memory is measured either in Bits or Bytes. ➢ The memory of a computer is divided into two categories: Primary Memory, Secondary Memory. ➢ There are two types of Primary Memory: ROM and RAM. ➢ Cache Memory is used to store program and instructions that are frequently used. EXPLANATION: Computer Memory: Memory plays a very important role in a computer. It is the basic unit where data and instructions are stored temporarily. Memory usually consists of one or more chips on the mother board, or you can say it consists of electronic components that store instructions waiting to be executed by the processor, data needed by those instructions, and the results of processing the data. Memory Units: Computer memory is measured in bits and bytes. A bit is the smallest unit of information that a computer can process and store. A group of 4 bits is known as nibble, and a group of 8 bits is called byte.
    [Show full text]
  • You Need to Know About Ddr4
    Overcoming DDR Challenges in High-Performance Designs Mazyar Razzaz, Applications Engineering Jeff Steinheider, Product Marketing September 2018 | AMF-NET-T3267 Company Public – NXP, the NXP logo, and NXP secure connections for a smarter world are trademarks of NXP B.V. All other product or service names are the property of their respective owners. © 2018 NXP B.V. Agenda • Basic DDR SDRAM Structure • DDR3 vs. DDR4 SDRAM Differences • DDR Bring up Issues • Configurations and Validation via QCVS Tool COMPANY PUBLIC 1 BASIC DDR SDRAM STRUCTURE COMPANY PUBLIC 2 Single Transistor Memory Cell Access Transistor Column (bit) line Row (word) line G S D “1” => Vcc “0” => Gnd “precharged” to Vcc/2 Cbit Ccol Storage Parasitic Line Capacitor Vcc/2 Capacitance COMPANY PUBLIC 3 Memory Arrays B0 B1 B2 B3 B4 B5 B6 B7 ROW ADDRESS DECODER ADDRESS ROW W0 W1 W2 SENSE AMPS & WRITE DRIVERS COLUMN ADDRESS DECODER COMPANY PUBLIC 4 Internal Memory Banks • Multiple arrays organized into banks • Multiple banks per memory device − DDR3 – 8 banks, and 3 bank address (BA) bits − DDR4 – 16 banks with 4 banks in each of 4 sub bank groups − Can have one active row in each bank at any given time • Concurrency − Can be opening or closing a row in one bank while accessing another bank Bank 0 Bank 1 Bank 2 Bank 3 Row 0 Row 1 Row 2 Row 3 Row … Row Buffers COMPANY PUBLIC 5 Memory Access • A requested row is ACTIVATED and made accessible through the bank’s row buffers • READ and/or WRITE are issued to the active row in the row buffers • The row is PRECHARGED and is no longer
    [Show full text]