<<

MEMORIES

Digital Integrated Circuits Memory © Prentice Hall 1995 Chapter Overview

• Memory Classification

• Memory Architectures

• The Memory Core

• Periphery

• Reliability

Digital Integrated Circuits Memory © Prentice Hall 1995 Classification

RWM NVRWM ROM

Random Non-Random EPROM Mask-Programmed Access Access 2 E PROM Programmable (PROM)

SRAM FIFO FLASH

DRAM LIFO Shift Register CAM

Digital Integrated Circuits Memory © Prentice Hall 1995 Memory Architecture: Decoders

M M bits

S S0 0 Word 0 Word 0 S1 Word 1 A0 Word 1 S2 Storage Storage

s Word 2 Word 2

d Cell A1 Cell r r o e d W

o c N AK-1 e S D N-2 Word N-2 Word N-2 SN_1 Word N-1 Word N-1

Input-Output Input-Output (M bits) (M bits)

N words => N select signals Decoder reduces # of select signals Too many select signals K = log2N

Digital Integrated Circuits Memory © Prentice Hall 1995 Array-Structured Memory Architecture

Problem: ASPECT RATIO or HEIGHT >> WIDTH

2L-K Line Storage Cell

AK r e

d Word Line AK+1 o c e D

AL-1 w o R

M.2K

Sense Amplifiers / Drivers Amplify swing to rail-to-rail amplitude

A 0 Column Decoder Selects appropriate AK-1 word

Input-Output (M bits)

Digital Integrated Circuits Memory © Prentice Hall 1995 Hierarchical Memory Architecture

Row Address

Column Address

Block Address

Global Control Selector Global Circuitry Amplifier/Driver

I/O

Advantages: 1. Shorter wires within blocks 2. Block address activates only 1 block => power savings

Digital Integrated Circuits Memory © Prentice Hall 1995 Memory Timing: Definitions

Read Cycle

READ

Read Access Read Access Write Cycle

WRITE

Write Access Data Valid

DATA

Data Written

Digital Integrated Circuits Memory © Prentice Hall 1995 Memory Timing: Approaches

MSB LSB

Address Row Address Bus Column Address

RAS Address Address Bus

Address transition CAS initiates memory operation

RAS-CAS timing

DRAM Timing SRAM Timing Multiplexed Adressing Self-timed

Digital Integrated Circuits Memory © Prentice Hall 1995 MOS NOR ROM

VDD Pull-up devices

WL[0] GND WL[1]

WL[2] GND WL[3]

BL[0] BL[1] BL[2] BL[3]

Digital Integrated Circuits Memory © Prentice Hall 1995 MOS NOR ROM Layout

Metal1 on top of diffusion

WL[0] GND (diffusion) WL[1] Polysilicon Basic cell 10 l x 7 l Metal1

WL[2] 2 l

WL[3]

Only 1 layer (contact mask) is used to program memory array Programming of the memory can be delayed to one of last steps

Digital Integrated Circuits Memory © Prentice Hall 1995 MOS NOR ROM Layout

BL[0] BL[1] BL[2] BL[3] Threshold raising implant

WL[0] GND (diffusion) Basic Cell 8.5 l x 7 l Metal1 over diffusion WL[1]

Polysilicon WL[2]

WL[3]

Threshold raising implants disable

Digital Integrated Circuits Memory © Prentice Hall 1995 MOS NAND ROM

VDD

Pull-up devices

BL[0] BL[1] BL[2] BL[3]

WL[0]

WL[1]

WL[2]

WL[3]

All word lines high by default with exception of selected row

Digital Integrated Circuits Memory © Prentice Hall 1995 MOS NAND ROM Layout

Diffusion

Polysilicon

Basic cell

5 l x 6 l

Threshold

lowering

implant

No contact to VDD or GND necessary; drastically reduced cell size Loss in performance compared to NOR ROM

Digital Integrated Circuits Memory © Prentice Hall 1995 Equivalent Transient Model for MOS NOR ROM

VDD

BL rword Model for NOR ROM WL Cbit

cword

Word line parasitics Resistance/cell: (7/2) x 10 W/q = 35 W Wire capacitance/cell: (7l ´ 2l) (0.6)2 0.058 + 2 ´ (7l ´ 0.6) ´ 0.043 = 0.65 fF Gate Capacitance/cell: (4l ´ 2l) (0.6)2 1.76 = 5.1 fF. Bit line parasitics: Resistance/cell: (8.5/4) x 0.07 W/q = 0.15 W (which is negligible) Wire capacitance/cell: (8.5l ´ 4l) (0.6)2 0.031 + 2 ´ (8.5l ´ 0.6) ´ 0.044 = 0.83 fF Drain capacitance/cell: ((3l ´ 4l) (0.6)2 ´ 0.3 + 2 ´ 3l ´ 0.6 ´ 0.8) ´ 0.375 + 4l ´ 0.6 ´ 0.43 = 2.6 fF

Digital Integrated Circuits Memory © Prentice Hall 1995 Equivalent Transient Model for MOS NAND ROM

VDD

BL

CL rbit Model for NAND ROM rword c WL bit

cword

Word line parasitics: Resistance/cell: (6/2) x 10 W/q = 30 W Wire capacitance/cell: (6l ´ 2l) (0.6)2 0.058 + 2 ´ (6l ´ 0.6) ´ 0.043 = 0.56 fF Gate Capacitance/cell: (3l ´ 2l) (0.6)2 1.76 = 3.8 fF. Bit line parasitics: Resistance/cell: ~ 10 kW, the average resistance over the range of interest. Wire capacitance/cell: Included in diffusion capacitance Source/Drain capacitance/cell: ((3l ´ 3l) (0.6)2 ´ 0.3 + 2 ´ 3l ´ 0.6 ´ 0.8) ´ 0.375 + (3l ´ 2l) (0.6)2 ´ 1.76 = 5.2 fF

Digital Integrated Circuits Memory © Prentice Hall 1995 Propagation Delay of NOR ROM

Word line delay Consider the 512´512 case. The delay of the distributed rc-line containing M cells can be approximated using the expressions derived in Chapter 8.

2 2 tword = 0.38 (rword ´ cword) M = 0.38 (35 W ´ (0.65 + 5.1) fF) 512 = 20 nsec Bit line delay Assume a (2.4/1.2) pull-down device and a (8/1.2) pull-up transistor. The bit line between 5 V and 2.5 V.

Cbit = 512 ´ (2.6 + 0.8) fF = 1.7 pF -6 2 2 IavHL = 1/2 (2.4/0.9) (19.6 10 )((4.25) /2 + (4.25 ´ 3.75 - (3.75) /2)) - 1/2 (8/0.9) (5.3 10-6) (4.25 ´ 1.25 - (1.25)2/2) = 0.36 mA

tHL = (1.7 pF ´ 1.25 V) / 0.36 mA = 5.9 nsec The low-to-high response time can be computed using a similar approach.

tLH = (1.7 pF ´ 1.25 V) / 0.36 mA = 5.9 nsec

Digital Integrated Circuits Memory © Prentice Hall 1995 Decreasing Word Line Delay

Driver WL Polysilicon word line

Metal word line (a) Driving the word line from both sides

Metal bypass

WL K cells Polysilicon word line (b) Using a metal bypass

(c) Use silicides

Digital Integrated Circuits Memory © Prentice Hall 1995 Precharged MOS NOR ROM

VDD fpre Precharge devices

WL[0] GND WL[1]

WL[2] GND WL[3]

BL[0] BL[1] BL[2] BL[3]

PMOS precharge device can be made as large as necessary, but clock driver becomes harder to design.

Digital Integrated Circuits Memory © Prentice Hall 1995 Floating-gate transistor (FAMOS)

Floating gate Gate D Source Drain

tox G

tox S n+ p n+ Substrate

(a) Device cross-section (b) Schematic symbol

Digital Integrated Circuits Memory © Prentice Hall 1995 Floating-Gate Transistor Programming

20 V 0 V 5 V

20 V 0 V 5 V 10 V® 5 V -5 V -2.5 V

S D S D S D

Avalanche injection. Removing programming voltage Programming results in leaves charge trapped. higher VT.

Digital Integrated Circuits Memory © Prentice Hall 1995 FLOTOX EEPROM

Floating gate Gate I Source Drain

20-30 nm -10 V VGD 10 V + + n Substrate n p 10 nm (a) Flotox transistor (b) Fowler-Nordheim I-V characteristic

BL

WL

VDD

(c) EEPROM cell during a read operation

Digital Integrated Circuits Memory © Prentice Hall 1995 Flash EEPROM

Control gate

Floating gate

erasure Thin tunneling

n+ source n+ drain programming

p-substrate

Digital Integrated Circuits Memory © Prentice Hall 1995 Cross-sections of NVM cells

Flash EPROM Courtesy Digital Integrated Circuits Memory © Prentice Hall 1995 Characteristics of -of-the-art NVM

Digital Integrated Circuits Memory © Prentice Hall 1995 Read-Write Memories (RAM)

• STATIC (SRAM) Data stored as long as supply is applied Large (6 transistors/cell) Fast Differential

• DYNAMIC (DRAM) Periodic refresh required Small (1-3 transistors/cell) Slower Single Ended

Digital Integrated Circuits Memory © Prentice Hall 1995 6-transistor CMOS SRAM Cell

WL

VDD M2 M4

Q Q M6 M5

M1 M3

BL BL

Digital Integrated Circuits Memory © Prentice Hall 1995 CMOS SRAM Analysis (Write)

WL

VDD M4

Q = 0 M6 M5 Q = 1

M1 VDD BL = 1 BL = 0

2 2 VDD V VDD V k æ(V – V )------– ------DD ö = k æ(V – V )------– ------DD ö (W/L) ³ 0.33 (W/L) n, M6èDD Tn 2 8 ø p, M4 èDD Tp 2 8 ø n,M6 p,M4

k V 2 V 2 n, M5 VDD DD DD VDD ------æ------– V æ------öö = k æ(V – V )------– ------ö 2 è2 Tnè2 øø n, M1èDD Tn 2 8 ø (W/L)n,M5 ³ 10 (W/L)n,M1

Digital Integrated Circuits Memory © Prentice Hall 1995 CMOS SRAM Analysis (Read)

WL

VDD M4 BL BL

Q = 0 M6 M5 Q = 1

M1 VDD VDD VDD

Cbit Cbit

2 2 kn, M5 VDD VDD VDD VDD ------æ------– V æ------öö = k æ(V – V )------– ------ö 2 è2 Tnè2 øø n, M1èDD Tn 2 8 ø

(W/L)n,M5 £ 10 (W/L)n,M1 (supercedes read constraint)

Digital Integrated Circuits Memory © Prentice Hall 1995 6T-SRAM — Layout

VDD M2 M4

Q Q

M1 M3

GND

M5 M6 WL

BL BL

Digital Integrated Circuits Memory © Prentice Hall 1995 Resistance-load SRAM Cell

WL

VDD

RL RL

Q Q M3 M4

BL M1 M2 BL

Static power dissipation -- Want RL large Bit lines precharged to VDD to address tp problem

Digital Integrated Circuits Memory © Prentice Hall 1995 3-Transistor DRAM Cell

BL1 BL2

WWL WWL

RWL RWL

X X M3 VDD-VT M2 M1 V BL1 DD CS V BL2 VDD-VT D

No constraints on device ratios Reads are non-destructive Value stored at node X when writing a “1” = VWWL-VTn

Digital Integrated Circuits Memory © Prentice Hall 1995 3T-DRAM — Layout

BL2 BL1 GND

RWL M3

M2

WWL M1

Digital Integrated Circuits Memory © Prentice Hall 1995 1-Transistor DRAM Cell

BL

WL Write "1" Read "1" WL

M1 C X S GND VDD-VT

V BL DD

VDD/2 VDD/2 CBL sensing

Write: CS is charged or discharged by asserting WL and BL. Read: Charge redistribution takes places between bit line and storage capacitance

CS DV= VBL – VPRE = (VBIT – VPRE)------CS + CBL Voltage swing is small; typically around 250 mV.

Digital Integrated Circuits Memory © Prentice Hall 1995 DRAM Cell Observations

1T DRAM requires a sense amplifier for each bit line, due to charge redistribution read-out. DRAM memory cells are single ended in contrast to SRAM cells. The read-out of the 1T DRAM cell is destructive; read and refresh operations are necessary for correct operation. Unlike 3T cell, 1T cell requires presence of an extra capacitance that must be explicitly included in the design. When writing a “1” into a DRAM cell, a threshold voltage is lost. This charge loss can be circumvented by bootstrapping the

word lines to a higher value than VDD.

Digital Integrated Circuits Memory © Prentice Hall 1995 1-T DRAM Cell

Capacitor

Metal word line M1 word line SiO2 poly Field Oxide n+ n+ Inversion layer poly Diffused induced by bit line plate bias Polysilicon Polysilicon plate (a) Cross-section gate (b) Layout

Used Polysilicon-Diffusion Capacitance Expensive in Area

Digital Integrated Circuits Memory © Prentice Hall 1995 SEM of poly-diffusion 1T-DRAM

Digital Integrated Circuits Memory © Prentice Hall 1995 Advanced 1T DRAM Cells

Word line Capacitor dielectric layer Insulating Layer Cell plate

Cell Plate Si

Capacitor Insulator Transfer gate Isolation Refilling Poly Storage electrode

Storage Node Poly

Si Substrate 2nd Field Oxide

Trench Cell Stacked-capacitor Cell

Digital Integrated Circuits Memory © Prentice Hall 1995 Periphery

• Decoders • Sense Amplifiers

• Input/Output Buffers • Control / Timing Circuitry

Digital Integrated Circuits Memory © Prentice Hall 1995 Row Decoders

Collection of 2M complex logic gates Organized in regular and dense fashion

(N)AND Decoder

NOR Decoder

Digital Integrated Circuits Memory © Prentice Hall 1995 Dynamic Decoders

Precharge devices GND GND VDD

WL3 WL3 VDD

WL2 WL2 VDD

WL1 WL1 VDD

WL 0 WL0

VDD f A0 A0 A1 A1 A0 A0 A1 A1 f Dynamic 2-to-4 NOR decoder 2-to-4 MOS dynamic NAND Decoder

Propagation delay is primary concern

Digital Integrated Circuits Memory © Prentice Hall 1995 A NAND decoder using 2-input pre- decoders

WL1

WL0

A0A1 A0A1 A0A1 A0A1 A2A3 A2A3 A2A3 A2A3

A1 A0 A0 A1 A3 A2 A2 A3 Splitting decoder into two or more logic layers produces a faster and cheaper implementation

Digital Integrated Circuits Memory © Prentice Hall 1995 4 input pass-transistor based column decoder

BL0 BL1 BL2 BL3

S

r 0

A0 e d o c

e S1 d

R O

N S

2 t u p n

A i 1 S 2 3

D

Advantage: speed (tpd does not add to overall memory access time) only 1 extra transistor in signal path Disadvantage: large

Digital Integrated Circuits Memory © Prentice Hall 1995 4-to-1 tree based column decoder

BL0 BL1 BL2 BL3

A0

A0

A1

A1

D Number of devices drastically reduced Delay increases quadratically with # of sections; prohibitive for large decoders Solutions: buffers progressive sizing combination of tree and pass transistor approaches

Digital Integrated Circuits Memory © Prentice Hall 1995 Decoder for circular shift-register

VDD VDD VDD VDD VDD VDD

WL0 WL1 WL2 f f f f f f ... R f f R f f R f f

VDD

Digital Integrated Circuits Memory © Prentice Hall 1995 Sense Amplifiers

make DV as small C × DV t = ------as possible p Iav

large small Idea: Use Sense Amplifer

small transition s.a.

input output

Digital Integrated Circuits Memory © Prentice Hall 1995 Differential Sensing - SRAM

VDD VDD V PC V DD DD y M3 M4 y

x M1 M2 x x x BL BL EQ SE M5 SE

WLi (b) Doubled-ended Current Mirror Amplifier

VDD SRAM cell i y y Diff. x Sense x x x Amp y y D D SE

(a) SRAM sensing scheme. (c) Cross-Coupled Amplifier

Digital Integrated Circuits Memory © Prentice Hall 1995 Latch-Based Sense Amplifier

EQ BL BL

VDD

SE

SE

Initialized in its meta-stable point with EQ Once adequate voltage gap created, sense amp enabled with SE Positive feedback quickly forces output to a stable operating point.

Digital Integrated Circuits Memory © Prentice Hall 1995 Single-to-Differential Conversion

WL BL x Diff. x

+ S.A. _ cell Vref

y y

How to make good Vref?

Digital Integrated Circuits Memory © Prentice Hall 1995 Open bitline architecture

EQ

R L1 L0 R0 R1 L

VDD SE

BLL BLR

...... CS CS CS SE CS CS CS

dummy dummy cell cell

Digital Integrated Circuits Memory © Prentice Hall 1995 DRAM Read Process with Dummy Cell

6.0

) 4.0 t l o

V BL (

V 2.0 BL 5.0 0.0 4.0 WL

0 1 2 3 4 5 ) t t (nsec) l 3.0 SE o V

(a) reading a zero ( 2.0 EQ V 6.0 1.0 0.00 1 2 3 4 5 )

t 4.0

l (c) control signals

o BL V (

V 2.0 BL

0.00 1 2 3 4 5 t (nsec) (b) reading a one

Digital Integrated Circuits Memory © Prentice Hall 1995 Single-Ended Cascode Amplifier

VDD

Vcasc

WLC

WL

Digital Integrated Circuits Memory © Prentice Hall 1995 DRAM Timing

Digital Integrated Circuits Memory © Prentice Hall 1995 Address Transition Detection

VDD

DELAY A0 td ATD ATD

DELAY A1 td

... DELAY

AN-1 td

Digital Integrated Circuits Memory © Prentice Hall 1995 Reliability and

Digital Integrated Circuits Memory © Prentice Hall 1995 Open Bit-line Architecture —Cross Coupling

EQ

WL1 WL0 WLD WLD WL0 WL1 CWBL CWBL BL BL Sense CBL CBL Amplifier C C C C C C

Digital Integrated Circuits Memory © Prentice Hall 1995 Folded-Bitline Architecture

WL1 WL1 WL0 WL0 WLD WLD CWBL

BL CBL x y

... Sense C C C C C C EQ Amplifier

x y BL CBL

CWBL

Digital Integrated Circuits Memory © Prentice Hall 1995 Transposed-Bitline Architecture

BL’ Ccross BL SA BL BL"

(a) Straightforward bitline routing.

BL’ C BL cross SA BL BL"

(b) Transposed bitline architecture. Digital Integrated Circuits Memory © Prentice Hall 1995 Alpha-particles

a-particle

WL VDD

BL

SiO2 n+

1 particle ~ 1 million carriers

Digital Integrated Circuits Memory © Prentice Hall 1995 Yield

Yield curves at different stages of process maturity (from [Veendrick92])

Digital Integrated Circuits Memory © Prentice Hall 1995 Redundancy

Row Address Redundant rows Fuse : Bank Redundant columns r e

Memory d o c

Array e D

w o R

Column Decoder Column Address

Digital Integrated Circuits Memory © Prentice Hall 1995 Redundancy and Error Correction

Digital Integrated Circuits Memory © Prentice Hall 1995 Programmable Logic Array

Product Terms

x0x1 AND x2 OR PLANE PLANE

f0 f1

x0 x1 x2

Digital Integrated Circuits Memory © Prentice Hall 1995 Pseudo-Static PLA

V GND GND GND GND DD GND

GND

GND

f f VDD 0 1 x0 x0 x1 x1 x2 x2

AND-PLANE OR-PLANE

Digital Integrated Circuits Memory © Prentice Hall 1995 Dynamic PLA

fAND

GND VDD

fOR

fOR fAND f f VDD 0 1 GND x0 x0 x1 x1 x2 x2

AND-PLANE OR-PLANE

Digital Integrated Circuits Memory © Prentice Hall 1995 Clock Signal Generation for self-timed dynamic PLA

f f fAND

Dummy AND Row

fAND

fAND fOR Dummy AND Row

fOR

(a) Clock signals (b) Timing generation circuitry.

Digital Integrated Circuits Memory © Prentice Hall 1995 PLA Layout

And-Plane Or-Plane VDD f GND

x0 x0 x1 x1 x2 x2 f0 f1 Pull-up devices Pull-up devices Digital Integrated Circuits Memory © Prentice Hall 1995 PLA versus ROM

Programmable Logic Array structured approach to random logic “two level logic implementation” NOR-NOR (product of sums) NAND-NAND (sum of products)

IDENTICAL TO ROM!

Main difference ROM: fully populated PLA: one element per minterm

Note: Importance of PLA’s has drastically reduced 1. slow 2. better software techniques (mutli-level logic synthesis)

Digital Integrated Circuits Memory © Prentice Hall 1995 Semiconductor Memory Trends

Memory Size as a function of time: x 4 every three years

Digital Integrated Circuits Memory © Prentice Hall 1995 Semiconductor Memory Trends

Increasing die size factor 1.5 per generation Combined with reducing cell size factor 2.6 per generation

Digital Integrated Circuits Memory © Prentice Hall 1995 Semiconductor Memory Trends

Technology feature size for different SRAM generations

Digital Integrated Circuits Memory © Prentice Hall 1995