<<

A UMTS Baseband Receiver Chip for Infrastructure Applications

S. Sriram, K. Brown, P. Bertrand, F. Moerman, O. Paviot, C. Sengupta, V. Sundararajan, and A. Gatherer

Wireless Infrastructure Business Unit, Texas Instruments Inc. Outline 2

ƒ UMTS/CDMA Cellular System Overview ƒ CDMA Base Station Receiver Functions ƒ System Partitioning ƒ The TCI110 Receive Chip-rate Application Specific Signal Processor (ASSP) ™ Correlator architecture ™ Front-end buffer ™ Finger de-spreader ™ Path searcher ™ Preamble detector ™ Host Interface ƒ Summary Cellular System 3

Base Station (Node B)

Uplink (reverse link)

Downlink (fwd link)

User Equipment (UE) UMTS FDD 3G Standard 4

ƒ Frequency Division Duplex

ƒ Wideband CDMA

ƒ Variable data rates and associated services ™ 2 MBPS peak rate

ƒ Network backward compatible to GSM 3G Base Station: Key Care-abouts 5

Cost per channel

Flexibility ƒ Variable data rate and traffic ™ Mix of rates from 12.2Kbps (voice) up to 2Mbps (data) ƒ Flexible cell sizes ™ Macro/Micro/Pico/In-door ƒ Support of disparate environments ™ Vehicular, pedestrian, stationary ƒ Flexible resource allocation ™ Seamless processing/memory trade-off between various traffic scenarios ƒ Flexible implementation of base-band algorithms ™ Allow for field upgrades/enhancements UMTS/W-CDMA Deployment 6 Projections

Commercial

NA Experimental Prototype

Mass Deployment Commercial Experimental China Prototype Mass Deployment Region Commercial Experimental Japan Prototype Mass Deployment Commercial Experimental Europe Prototype

2001 2002 2003 2004 2005 Time 7

Pseudo-Noise ... (PN) Sequence

N “chips” “0” “0” “1” User Data X “001...”

Bandwidth w Bandwidth Nw CDMA 8

PN1 ... User 1 Data X

w w PN : PN sequences for 2 different users are User 2 X orthogonal Data + RF PN (k) PN (k) ≅ 0 w. w Σ i j . . k PN m User m Data X

w w ...

every N chips PN1 : “De-spread” with local User 1 Σ X RF PN sequence Data Base Station Receiver Functions 9

Antenna 0 Antenna 0 . . . RF Front End Antenna N Analog Baseband NETWORK BACKPLANE N

DIGITAL BASEBAND

MULTIPATH DESPREADERS COMBINING RAKE fingers PROCESSING

PARAMETER ESTIMATION •Channel decoding PREAMBLE SEARCH AND CONTROL •Rate matching •CRC •Channel estimation •De-interleaving •AFC, AGC, Time Tracking •Channel de-mux MULTI-PATH SEARCH •Measurements •Finger allocation •TFCI decoding NETWORK RADIO RESOURCE •RACH post-processing INTERFACE MANAGEMENT Chip-Rate Processing Front-End 10

De-spreader functions ƒ Implements Rake “fingers” ƒ Inner product function:

SF −1 y(k) = ∑ x(k.SF + n)* pn(k.SF + n) n=0 Search functions ƒ Path search and Preamble search ƒ Search for pilot signal within a time window of uncertainty ƒ Matched filter function: M −1 y(k) = ∑ x(k − n)* pn(n) n=0

Very high computation rates involved ƒ > 150 Billion Complex “Multiply Accumulates” per second ƒ Relatively low processing rate downstream of the correlator System Partition 11

TCI110 TCI100 TCI100

Receive Chip- Chip-Rate Symbol-Rate Rate ASSP Assist DSP DSP DIGITAL BASEBAND

DESPREADERS MULTIPATH SYMBOL RATE RAKE fingers COMBINING PROCESSING

•Channel decoding PARAMETER ESTIMATION PREAMBLE SEARCH •Rate matching AND CONTROL •CRC •De-interleaving •Channel estimation •Channel de-mux •AFC, AGC, Time Tracking MULTI-PATH SEARCH •Measurements •Finger allocation •TFCI decoding NETWORK RADIO RESOURCE •RACH post-processing INTERFACE MANAGEMENT TCI110 Architecture 12

TCI110

Power Estimator

ARMP

Preamble Detector Front- FE End Host TCI100 Interface Transfer Path Interface Monitortext

ARMF Finger De- spreader

Host Synchro Interrupt module Interface Correlator Architecture 13

Four task-based accelerators ƒ Finger de-spreader, Path monitor, Preamble detector, and Power estimator ƒ Tasks set-up through software running on a programmable DSP ƒ Results transferred to DSP periodically via DMA

Each accelerator employs a vector-correlator architecture ƒ Datapath and control customized for specific functions (FD, PM, PD, PE) ƒ Control includes two ARM micro-controllers ƒ All tasks mapped to a accelerator run on the same data path in a time-multiplexed manner Correlator Architecture 14

PNPN CodeCode GenGen

Input n buffer: Adder Coh. Non-coh. Output X Non- 8x n Trees ACC ACC Memory Oversampling chips N antennas Scratch Scratch

Front End Interface Mem. Mem.

From the Analog Front End

ControlControl Memory Memory

. . . DSP ARMARM Microcontroller Control Microcontroller . . . TCI API S/W

Task Buffer Front-End Interface 15

Function ƒ Distribute ™ Up to 24 sample streams (including 2 delay streams to FD) ƒ Interpolate for FD ™ From 8x, or 4x, or 2x samples/chip to 8x samples/chip ƒ Decimate for other modules ™ From 8x, 4x, 3x samples/chip to 2x for PD and PM and 1x for PE Typical configurations:

Oversampling Factor Bus Mode 2 4 8 16 bits 12 6 3 32 bits 24 12 6 48 bits 24 12 6 Finger De-Spreader 16

Performs de-spreading of received multi-path components in a CDMA RAKE receiver ƒ Data/Control channel despreading ƒ Includes Early/On-time/Late-time de-spreading with energy accumulation for time tracking ƒ Flexible allocation of a pool of correlation resources

Usage scenarios: Finger Despreader SF Fingers UEs 256 512 64 128 512 64 64 512 64 32 512 64 16 364 45 8 192 24 4 99 12 Finger De-Spreader Usage Examples 17

ƒ 2048 chip-rate de-spreaders running in parallel ƒ May be flexibly configured in a number of ways ™ 64 UE at 12.2Kbps, 8 Fingers/UE • DPDCH de-spreading • DPCCH despreading • Early/On-time/Late correlation results on DPCCH for time tracking OR ™ 128 UE at 12.2Kbps, 4 Fingers/UE • DPDCH de-spreading • DPCCH despreading • Early/On-time/Late correlation results on DPCCH for time tracking OR ™ 51 HSDPA UE, 8 Fingers/UE • ADPCH de-spreading • HS-DPCCH de-spreading • DPCCH despreading • Early/On-time/Late correlation results on DPCCH for time tracking OR ™ Combinations of the above within the 2048 de-spreader limit Path Monitor Performance 18

Performs multi-path search Other usage scenarios: for all received users ƒ Flexible time- of resources among users Path Monitor ƒ Includes flexible coherent and Activity Profile non-coherent (energy) Total accumulation Search Window UEs UEs Activity Chips Samples Typical usage: 8 1/2 128 2 ƒ 64 Users, Search over 2 antennae 44 4 1/1 128 2 in parallel 16 1/2 128 2 ƒ 128 chip window (at ½ chip 24 resolution), 1/8th activity factor 8 1/1 128 2 8 1/1 128 2 12 4 1/1 256 2 Preamble Detector Performance 19

Implements a sliding window Other usage scenarios: correlator for detection of Random Access Channel Preamble ƒ Flexible coherent and non- Preamble Detector coherent accumulation intervals Search Window ƒ Parallel search over all 16 RACH Scr. Signatures Activity signatures Codes Chips Samples

16 1/2 1024 2 2 16 1/1 512 2 Typical usage 4 16 1/1 512 1 ƒ Correlate over complete 8 16 1/2 256 2 preamble (4096 chips) ƒ 512 chip window, ½ chip resolution (20Km cell radius) ƒ Search over 2 antennae in parallel Host Interface 20

Host Interface sets up EDMA channels in PaRAM

TCI110 TCI100

Configuration/ Configuration/ CPUCPU StatusStatus RegistersRegisters & & Step1: TablesTables Host Host EMIF A InterfaceInterface EMIF A

OutputOutput buffers buffers Step2: (Symbols, EOL, EDMAEDMA ControllerController (Symbols, EOL, And SearchSearch data) data) L2L2 And PaRAMPaRAM

EDMA controller transfers data blocks to L2. HI re-groups the data to have one transfer per user or groups of users TCI110 Chip Metrics 21

Technology 0.13 µm CMOS, 7 Layer Metal High speed copper process ~ 75M Transistors Size ~ 50% die area is SRAM 35 x 35 388-pin BGA package

Clock ~ 125 MHz Datapath ~ 250 MHz ARM

Power ~ 2W 3.3 V I/Os, 1.2 V Internal TCI110 Summary 22

ƒ Lowest cost per channel enabled via ™ Time-multiplexed datapath architecture that allows memory sharing ™ Highly integrated SOC

ƒ Flexible / Programmable ™ “Pool of Resources” concept for flexible resource allocation in a multi-channel context ™ Inherently flexible design enhanced with the programmability of embedded cores ™ Yet optimized for chip rate applications: • ~ 200 Billion chip operations per second • Parallel datapaths for finger de-spreading, path monitoring, preamble detection • Dedicated interface and distribution of antenna data • Highly optimized transfer of results data to C64x

ƒ Enhanced time-to-market ™ Programmable approach allows bug fixing / feature enhancement in Software 23 Flexibility/Cost Combination of DSP + Custom ASIC

Channel Card Architecture Approach of Leading WI OEMs

All DSP DSP + Reconfigurable

High DSP + FPGA TCI100C64x +TM Customizable+ Custom ASIC ASSP

TI WI New Single Chipset Platform Chipset is Customizable Flexibility to Meet Each OEM’s Individual Needs

All ASIC Low

High Bill of Material Cost Low Customizable Chipset Maintains OEMs 24 Ability to Differentiate Channel gain TX

TCI120 Offset TCI100

TCI110 Number TCI100 of users

RX Fingers per user Search window sizes

Accumulation lengths

Flexible hardware configured via High-performance, registers and commands under DSP programmable DSP software control