Principles of Communications

Chapter 7: Optimal Receivers

Selected from Chapter 8.1-8.3, 8.4.6, 8.5.3 of Fundamentals of Communications Systems, Pearson Prentice Hall 2005, by Proakis & Salehi

Meixia Tao @ SJTU Topics to be Covered

A/D Source Channel Source Modulator converter encoder encoder

Absent if source is Noise Channel digital

Source Channel User D/A Detector converter decoder decoder

. Detection theory . Decision regions . Optimal receiver structure . Error probability analysis . Matched filter

Meixia Tao @ SJTU 2 Example

. Alice tells her Dad that she wants either strawberry or blueberry.

Alice Alice says Bob hears Dad “I want straw- “I want blaw- berry ” berry ”

Channel

Bob’s question: which berry does Alice want? Is strawberryWhy? more possible? Strawberry or Blueberry?

Meixia Tao @ SJTU 3 Statistical . Demodulation and decoding of signals in digital communications is Alice Bob directly related to Statistical“I want decision theory“I want . In the general setting, we” are given a finiteblawberry set of possible” hypotheses about an , along with observations related statistically to the various hypotheses . The theory provides rules for Channelmaking the best possible decision (accordingAlice to somecan ask performance either strawberry criterion) or blueberry about which from Bobhypothesis. is likely to be true

. In digital communications, hypotheses are the possible messages and observations are the output of a channel . Based on the observed values of the channel output, we are interested in the best decision making rule in the sense of minimizing the probability of error

Meixia Tao @ SJTU 4 Detection Theory

. Given M possible hypotheses Hi (signal mi) with probability ,

. Pi represents the prior knowledge concerning the probability of the signal mi – . The observation is some collection of N real values, denoted by with conditional pdf

-- conditional pdf of observation given the signal mi . Goal: Find the best decision-making algorithm in the sense of minimizing the probability of decision error.

Message Channel Decision

Meixia Tao @ SJTU 5 Observation Space

. In general, can be regarded as a point in some observation space

. Each hypothesis Hi is associated with a decision region Di:

. The decision will be in favor of Hi if is in Di . Error occurs when a decision is made in favor of another when the signals falls outside the decision region Di

D3 D2 D1

D 4 Decision Space M=4

Meixia Tao @ SJTU 6 MAP Decision Criterion

. Consider a decision rule based on the computation of the posterior probabilities defined as

. Known as a posterior since the decision is made after (or given) the observation . Different from the a prior where some information about the decision is known in advance of the observation

Meixia Tao @ SJTU 7 MAP Decision Criterion (cont’d)

. By Bayes’ Rule:

. Since our criterion is to minimize the probability of detection error given , we deduce that the optimum decision rule is to choose if and only if is maximum for . . Equivalently,

Choose if and only if

. This decision rule is known as maximum a posterior or MAP decision criterion

Meixia Tao @ SJTU 8 ML Decision Criterion

. If , i.e. the signals are equiprobable, finding the signal that maximizes is equivalent to finding the signal that maximizes . The conditional pdf is usually called the . The decision criterion based on the maximum of is called the Maximum-Likelihood (ML) criterion. . ML decision rule: Choose if and only if

. In any digital communication systems, the decision task ultimately reverts to one of these rules

Meixia Tao @ SJTU 9 Topics to be Covered

A/D Source Channel Source Modulator converter encoder encoder

Absent if source is Noise Channel digital

Source Channel User D/A Detector converter decoder decoder

. Detection theory . Decision regions . Optimal receiver structure . Error probability analysis . Matched filter

Meixia Tao @ SJTU 10 Optimal Receiver in AWGN Channel

. Transmitter transmits a sequence of symbols or messages from a set of M symbols with prior probabilities

. The symbols are represented by finite energy waveforms , defined in the interval . The channel is assumed to corrupt the signal by additive white Gaussian noise (AWGN)

Channel

Transmitter Σ Receiver

Meixia Tao @ SJTU 11 Signal Space Representation

. The signal space of {s1(t), s2(t), …, sM(t)} is assumed to be of dimension N (N ≤ M) . for k = 1, …, N will denote an orthonormal basis function . Then each transmitted signal waveform can be represented as

where

Meixia Tao @ SJTU 12 . Note that the noise nw(t) can be written as

where

Projection of nw(t) on the N-dim space orthogonal to the space, falls outside the signal space spanned by . The received signal can thus be represented as

where

Projection of r(t) on N-dim signal space Meixia Tao @ SJTU 13 Graphical Illustration

. In vector forms, we have

Received signal point

Noise vector Observation vector Message point Signal vector

Meixia Tao @ SJTU 14 Receiver Structure

. Subdivide the receiver into two parts . Signal demodulator: to convert the received waveform r(t) into an N-dim vector . Detector: to decide which of the M possible signal waveforms was transmitted based on observation of the vector

receiver r(t) Signal Detector demodulator

. Two realizations of the signal demodulator . Correlation-Type demodulator . Matched-Filter-Type demodulator Meixia Tao @ SJTU 15 Topics to be Covered

A/D Source Channel Source Modulator converter encoder encoder

Absent if source is Noise Channel digital

Source Channel User D/A Detector converter decoder decoder

. Detection theory . Decision regions . Optimal receiver structure . Error probability analysis . Matched filter

Meixia Tao @ SJTU 16 What is Matched Filter (匹配滤波器)?

. The matched filter (MF) is the optimal linear filter for maximizing the output SNR.

. Derivation of the MF

∞ . st ↔ A( f) = s( t) e− jtω dt Input signal component i ( ) ∫−∞ i . nt( ) Sf( ) = N/2 Input noise component i with PSD ni 0 ∞ . s( t) = st( −τ) h( ττ) d Output signal component oi∫−∞ ∞ . Sample at = A( f) H( f) ejtω df ∫−∞

Meixia Tao @ SJTU 17 Output SNR

∞ ω s( t) = A( f) H( f) ejt0 df . At the instance , o 0 ∫−∞ . Average power of the output noise is

N ∞ 2 N= E{} n2 ( t) = 0 H( f) df o 2 ∫−∞ . Output SNR

∞ 2 jtω 0 2 A( f) H( f) e df sto ( 0 ) ∫−∞ d = = 2 ∞ 2 En{} t N0 o ( ) H( f) df 2 ∫−∞

Find H( f ) that can maximize d

Meixia Tao @ SJTU 18 Maximum Output SNR

. Schwarz’s inequality: 2 ∞∞22 ∞ F( x) dx Q( x) dx≥ F* ( x) Q( x) dx ∫∫−∞ −∞ ∫−∞

equality holds when F( x) = CQ( x)

ω F* ( x) = Afe( ) jt0 . Let , then Qf( ) = Hf( ) E : signal energy

∞∞22 ∞ 2 A( f) df H( f) df A( f) df ∫∫−∞ −∞ ∫−∞ 2E d ≤== ∞ 2 NN00N H( f) df 0 22∫−∞

Meixia Tao @ SJTU 19 Solution of Matched Filter

. When the max output SNR is achieved, we have

∞ jtω * − jtω h( t) = H( f) e df 0 mm∫−∞ Hm ( f) = A( fe) ∞ −−jttω( ) = A* ( f) e0 df * ∫−∞ htm ( ) = sti ( 0 − t) ∫ * =sti ( 0 − t)

. Transfer function: complex conjugate of the input signal spectrum

. Impulse response: time-reversal and delayed version of the input signal s(t)

Meixia Tao @ SJTU 20 Properties of MF (1)

. t 因果性) Choice of 0 versus the causality ( 0

0 T 0

Not implementable

Not implementable

sti ( 00 − t) 0 ≤< t t htm ( ) =   0 otherwise

where tT0 ≥

Meixia Tao @ SJTU 21 Properties of MF (2)

. Equivalent form – Correlator

. Let st i ( ) be within yt() tT= y( t) = xt( )* h( t) = xt ( )* s( T − t) xt() yT() mi MF T =x()τ sTi ( −+ t ττ) d ∫0

. Observe at sampling time tT= TT y( T )= x ()τ sii( ττ) d= x( t) s( t) dt ∫∫00 xt() yT() Correlation

integration sti () correlator (相关积分) Meixia Tao @ SJTU 22 Correlation Integration

. Correlation function

∞∞ R(τ) = s( t) s( t += ττ) dt s( t −) s( t) dt =− R () τ 12 ∫∫−∞ 1 2 −∞ 1 2 21

∞ . R(ττ) = stst( ) ( + ) dt function ∫−∞ . RR(ττ) =( − )

. RR(0) ≥ (τ )

∞ . R(0) = s2 ( t) dt = E ∫−∞

2 ∞∞2 . Rτ ↔ Af R(0) = s2 ( t) dt = A( f) df ( ) ( ) ∫∫−∞ −∞

Meixia Tao @ SJTU 23 Properties of MF (3)

. MF output is the autocorrelation function of input signal

∞ ∞ s( t) = s( t − uh) ( udu) =−−st( ust) ( udu) o∫−∞ im ∫−∞ ii0 ∞ = s(µµ) s[ +− ttd] µ = Rtt() − ∫−∞ ii 00s0

. The peak value of st 0 () happens tt= 0

∞ s( t) = s2 (µ ) du= E oi0 ∫−∞

. st0 () is symmetric at tt = 0

2 − jtω 0 Aom( f) = AfH( ) ( f) = Af( ) e

Meixia Tao @ SJTU 24 Properties of MF (4) . MF output noise . The statistical autocorrelation of depends on the autocorrelation of ∞ N0 R(ττ) = Entnt{ ( ) ( + )} = huhumm( ) ( +τ ) du no oo 2 ∫−∞ N ∞ = 0 stst( ) ( −τ ) dt 2 ∫−∞ ii . Average power N ∞ 220 E{ non( t)} = R (0) = s i(µ ) du o 2 ∫−∞ NN∞∞22 = 00A( f) df= H( f) df domain 22∫∫−∞ −∞ m N = 0 E 2

Meixia Tao @ SJTU 25 Example: MF for a rectangular pulse

. Consider a rectangular pulse A

. The impulse response of a filter 0 T t matched to is also a rectangular pulse A

. The output of the matched filter 0 T t is

. The output SNR is A2T 2 T 2 2 2A T (SNR)o = s (t)dt = ∫0 N0 N0 0 T 2T t Meixia Tao @ SJTU 26 What if the noise is Colored?

. Basic idea: preprocess the combined signal and noise such that the non-white noise becomes white noise - Whitening Process

xt( ) = si ( t) + nt( ) where n(t) is colored noise with PSD Sfn ( ) xt′′′( ) = st( ) + nt( )

Choose H1( f ) so that is white, i.e. 2 Sfnn′ ( ) = Hf1 ( ) Sf( ) = C

Meixia Tao @ SJTU 27 H1( f ) , H2( f )

2 C . : Hf1 ( ) = Sfn ( ) ′ . should match with A( f) = H1 ( f) Af( )

∗−−j22ππ ft00∗∗ j ft H21( f) = A′ ( fe) = H( fA) ( fe) . Overall transfer function of the cascaded system:

∗∗− j2π ft0 Hf( ) =⋅= HfHf1( ) 2( ) HfHfAfe 11( ) ( ) ( ) 2 ∗ − j2π ft0 = H1 ( f) A( fe)

∗ MF for colored Af( ) − π Hf( ) = ej2 ft0 noise Sfn ( )

Meixia Tao @ SJTU 28 Update

. We have discussed what is matched filter . Let us now come back to the optimal receiver structure

receiver r(t) Signal Detector demodulator

. Two realizations of the signal demodulator . Correlation-Type demodulator . Matched-Filter-Type demodulator

Meixia Tao @ SJTU 29 Correlation Type Demodulator

. The received signal r(t) is passed through a parallel bank of N cross correlators which basically compute the projection of r(t) onto the N basis functions

To detector

Meixia Tao @ SJTU 30 Matched-Filter Type Demodulator

. Alternatively, we may apply the received signal r(t) to a bank of N matched filters and sample the output of filters at t = T. The impulse responses of the filters are

To detector

Sample at t=T

Meixia Tao @ SJTU 31 . We have demonstrated that Update . for a signal transmitted over an AWGN channel, either a correlation type demodulator or a matched filter type demodulator produces the vector which contains all the necessary information in r(t)

. Now, we will discuss . the design of a signal detector that makes a decision of the transmitted signal in each signal interval based on the observation of , such that the probability of making an error is minimized (or correct probability is maximized)

Meixia Tao @ SJTU 32 Decision Rules

Recall that . MAP decision rule: choose if and only if

. ML decision rule choose if and only if

In order to apply the MAP or ML rules, we need to evaluate the likelihood function

Meixia Tao @ SJTU 33 Distribution of the Noise Vector

. Since nw(t) is a Gaussian random process, . is a Gaussian (from definition)

. : ,

. Correlation between nj and nk

PSD of is

Meixia Tao @ SJTU 34 . Using the property of a delta function we have:

. Therefore, nj and nk ( ) are uncorrelated Gaussian random variables

. They are independent with zero-mean and N0/2

. The joint pdf of

Meixia Tao @ SJTU 35 Likelihood Function

. If mk is transmitted, with . Transmitted signal values in each dimension represent the mean values for each received signal . . Conditional pdf of the random variables

Meixia Tao @ SJTU 36 Log-Likelihood Function

. To simplify the computation, we take the natural logarithm of , which is a monotonic function. Thus

. Let

. is the Euclidean distance between and in the N- dim signal space. It is also called distance metrics

Meixia Tao @ SJTU 37 Optimum Detector . MAP rule:

. ML rule:

ML detector chooses iff received vector is closer to in terms of Euclidean distance than to any other for i ≠ k Minimum distance detection (will discuss more in decision region)

Meixia Tao @ SJTU 38 Optimal Receiver Structure

. From previous expression we can develop a receiver structure using the following derivation

in which

= signal energy

= correlation between the received signal vector and the transmitted signal vector = common to all M decisions and hence can be ignored

Meixia Tao @ SJTU 39 . The new decision function becomes

. Now we are ready draw the implementation diagram of MAP receiver (signal demodulator + detector)

Meixia Tao @ SJTU 40 MAP Receiver Structure Method 1 (Signal Demodulator + Detector)

+

Compute Comparator + (select the largest)

+

This part can also be implemented using matched filters

Meixia Tao @ SJTU 41 MAP Receiver Structure Method 2 (Integrated demodulator and detector)

+

Comparator + (select the largest)

+

This part can also be implemented using matched filters

Meixia Tao @ SJTU 42 Method 1 vs. Method 2

. Both receivers perform identically . Choice depends on circumstances . For instance, if N < M and are easier to generate than , then the choice is obvious

Meixia Tao @ SJTU 43 Example: optimal receiver design

. Consider the signal set

Meixia Tao @ SJTU 44 Example (cont’d)

. Suppose we use the following basis functions

. Since the energy is the same for all four signals, we can drop the energy term from

Meixia Tao @ SJTU 45 Example (cont’d)

. Method 1

Compute +

Choose + the Largest

+

Meixia Tao @ SJTU 46 Example (cont’d)

. Method 2

+

Chose the Largest

+

Meixia Tao @ SJTU 47 Exercise

In an additive white Gaussian noise channel with a

noise power-spectral density of N0/2, two equiprobable messages are transmitted by  At  0 ≤ t ≤ T s1(t) =  T  0 otherwise  At A − 0 ≤ t ≤ T s2 (t) =  T  0 otherwise

. Determine the structure of the optimal receiver.

Meixia Tao @ SJTU 48 Notes on Optimal Receiver Design

. The receiver is general for any signal forms

. Simplifications are possible under certain scenarios

Meixia Tao @ SJTU 49 Topics to be Covered

A/D Source Channel Source Modulator converter encoder encoder

Absent if source is Noise Channel digital

Source Channel User D/A Detector converter decoder decoder

. Detection theory . Decision regions . Optimal receiver structure . Error probability analysis . Matched filter

Meixia Tao @ SJTU 50 Graphical Interpretation of Decision Regions . Signal space can be divided into M disjoint decision

regions R1 R2, …, RM.

If decide mk was transmitted

Select decision regions so that Pe is minimized . Recall that the optimal receiver sets iff is minimized

. For simplicity, if one assumes pk = 1/M, for all k, then the optimal receiver sets iff is minimized

Meixia Tao @ SJTU 51 Decision Regions

. Geometrically, this . Take projection of r(t) in the signal space (i.e. ). Then, decision is made in favor of signal that is the closest to in the sense of minimum Euclidean distance

. And those observation vectors with

for all should be assigned to decision region Rk

Meixia Tao @ SJTU 52 Example: Binary Case

. Consider binary transmission over AWGN channel

with PSD Sn(f) = N0/2 using

. Assume P(m1) ≠ P(m2) . Determine the optimal receiver (and optimal decision regions)

Meixia Tao @ SJTU 53 Solution . Optimal decision making

Choose m1 < > Choose m2 . Let and . Equivalently, Choose m1

Choose m2 Constant c

R1: and R2:

Meixia Tao @ SJTU 54 Solution (cont’d) . Now consider the example with on the decision boundary

R2 R1

r 0 s2 s1

Meixia Tao @ SJTU 55 Determining the Optimum Decision Regions . In general, boundaries of decision regions are perpendicular bisectors of the lines joining the original transmitted signals . Example: three equiprobable 2-dim signals

R2

s1 s2 R1

s3 R3

Meixia Tao @ SJTU 56 Example: Decision Region for QPSK

. Assume all signals are equally likely . All 4 signals could be written as the linear combination of two basis functions . Constellations of 4 signals

R2 s1=(1,0) s2 s =(0,1) R3 2 s3 R1

s3=(-1,0) s1

s4 s4=(0,-1) R4

Meixia Tao @ SJTU 57 Exercise

Three equally probable messages m1, m2, and m3 are to be transmitted over an AWGN channel with noise power-spectral density N 0 / 2 . The messages are 1 0 ≤ t ≤ T s1(t) =  0 otherwise  T 1 0 ≤ t ≤  2  T s2 (t) = −s3 (t) = −1 ≤ t ≤ T  2 0 otherwise   1. What is the dimensionality of the signal space ? 2. Find an appropriate basis for the signal space (Hint: You can find the basis without using the Gram-Schmidt procedure ). 3. Draw the signal constellation for this problem. 4. Sketch the optimal decision regions R1, R2, and R3.

Meixia Tao @ SJTU 58 Notes on Decision Regions

. Boundaries are perpendicular to a line drawn between two signal points . If signals are equiprobable, decision boundaries lie exactly halfway in between signal points . If signal probabilities are unequal, the region of the less probable signal will shrink

Meixia Tao @ SJTU 59 Topics to be Covered

A/D Source Channel Source Modulator converter encoder encoder

Absent if source is Noise Channel digital

Source Channel User D/A Detector converter decoder decoder

. Detection theory . Decision regions . Optimal receiver structure . Error probability analysis . Matched filter

Meixia Tao @ SJTU 60 Probability of Error using Decision Regions . Suppose is transmitted and is received . Correct decision is made when with probability

. Averaging over all possible transmitted symbols, we obtain the average probability of making correct decision

. Average probability of error

Meixia Tao @ SJTU 61 Example: Pe analysis

. Now consider our example with binary data transmission

•Given m1 is transmitted, then

R2 R1

0 s2 s1 •Since n is Gaussian with zero mean and variance

Meixia Tao @ SJTU 62 . Likewise

. Thus,

where and

Meixia Tao @ SJTU 63 Example: Pe analysis (cont’d)

. Note that when

Meixia Tao @ SJTU 64 Example: Pe analysis (cont’d)

. This example demonstrates an interesting fact:

. When optimal receiver is used, Pe does not depend upon the specific waveform used

. Pe depends only on their geometrical representation in signal space

. In particular, Pe depends on signal waveforms only through their energies (distance)

Meixia Tao @ SJTU 65 Exercise Three equally probable messages m1, m2, and m3 are to be transmitted over an AWGN channel with noise power-spectral density N 0 / 2 . The messages are 1 0 ≤ t ≤ T s1(t) =  0 otherwise  T 1 0 ≤ t ≤  2  T s2 (t) = −s3 (t) = −1 ≤ t ≤ T  2 0 otherwise   1. What is the dimensionality of the signal space ? 2. Find an appropriate basis for the signal space (Hint: You can find the basis without using the Gram-Schmidt procedure ). 3. Draw the signal constellation for this problem. 4. Sketch the optimal decision regions R1, R2, and R3. 5. Which of the three messages is more vulnerable to errors and why ? In other words, which of

p(Error | mi transmitted), i = 1, 2, 3 is larger ? Meixia Tao @ SJTU 66 General Expression for Pe

. Average probability of symbol error

Likelihood function . Since

N-dim integration

. Thus we rewrite Pe in terms of likelihood functions, assuming that symbols are equally likely to be sent

Meixia Tao @ SJTU 67 Union Bound

. Multi-dimension integrals are quite difficult to evaluate . To overcome this difficulty, we resort to the use of bounds . Now we develop a simple and yet useful upper bound for Pe, called union bound, as an approximation to the average probability of symbol error

Meixia Tao @ SJTU 68 Key Approximation

. Let denote the event that is closer to than to in the signal space when ( ) is sent

. Conditional probability of symbol error when is sent

. But

Meixia Tao @ SJTU 69 Key Approximation Example

R2

s2 R3 s3 R1

s1

s4 R4

s2

s3

s1 s1 s1

s4

Meixia Tao @ SJTU 70 Pair-wise Error Probability

. Define the pair-wise (or component-wise) error probability as

. It is equivalent to the probability of deciding in favor of when was sent in a simplified binary system that involves the use of two equally likely messages and . Then

. is the Euclidean distance between and

Meixia Tao @ SJTU 71 Union Bound

. Conditional error probability

. Finally, with M equally likely messages, the average probability of symbol error is upper bounded by

The most general formulation of union bound

Meixia Tao @ SJTU 72 Union Bound (cont’d)

. Let denote the minimum distance, i.e.

. Since Q( .) is a monotone decreasing function

. Consequently, we may simplify the union bound as

Simplified form of union bound

Meixia Tao @ SJTU 73 Question

What is the design criterion of a good signal set?

Meixia Tao @ SJTU 74