Part 7 Time-Division Multiplexing and Delta-Sigma Regeneration

o Regenerative repeater for PCM system n It can completely remove the distortion if the decision making device makes the right decision (on 1 or 0).

Amplifier Degraded Decision- Regenerated Equalizer making PCM wave PCM wave device

Timing circuit

© Po-Ning [email protected] 7-2 Decoding & Filtering o After regenerating the received pulse at the last stage, the receiver decodes and regenerates the original message signal (with acceptable quantization error). o Finally, a lowpass reconstruction filter whose cutoff frequency is equal to the message bandwidth W is applied at the end (to remove the unnecessary high-frequency components due to “quantization”).

© Po-Ning [email protected] 7-3 Impact of Noise in PCM Systems o Two major noise sources in PCM systems n (Message-independent) Channel noise n (Message-dependent) Quantization noise o The quantization noise is often under designer’s control, and can be made negligible by taking adequate number of quantization levels.

© Po-Ning [email protected] 7-4 Impact of Noise in PCM Systems o The main effect of channel noise is to introduce bit errors. n Notably, the symbol error rate is quite different from the bit error rate. n A symbol error may be caused by one-bit error, or two- bit error, or three-bit error, or …; so, in general, one cannot derive the symbol error rate from the bit error rate (or vice versa) unless some special assumption is made. n Considering the reconstruction of original analog signal, a bit error in the most significant bit is more harmful than a bit error in the least significant bit.

© Po-Ning [email protected] 7-5 Error Threshold o Eb/N0

n Eb: Transmitted signal energy per bit o E.g., information bit is encoded using three-times repetition code, in which each code bit is transmitted using one symbol with symbol energy Ec.

o Then Eb = 3 Ec.

n N0: One-sided noise spectral density o The bit error rate (BER) is a function of Eb/N0 and transmission speed (and implicitly bandwidth, etc).

© Po-Ning [email protected] 7-6 Error Threshold o Error threshold

n The minimum Eb/N0 to achieve the required BER. o By knowing the error threshold, one can always add a

regenerative repeater when Eb/N0 is about to drop below the threshold; hence, long-distance transmission becomes feasible. n Unlike the analog transmission, of which the distortion will accumulate for long-distance transmission.

© Po-Ning [email protected] 7-7 Time-Division Multiplexing System o An important feature of sampling process is a conservation- of-time. n In principle, the communication link is used only at the sampling time instances. o Hence, it may be feasible to put other message’s samples between adjacent samples of this message on a time-shared basis. o This forms the time-division multiplex (TDM) system. n A joint utilization of a common communication link by a plurality of independent message sources.

© Po-Ning [email protected] 7-8 Time-Division Multiplexing System

(anti-aliasing) synchronized (reconstruction)

1 LPF LPF 1

2 LPF PCM Channel PCM LPF 2

N LPF commutator commutator LPF N o The commutator (1) takes a narrow sample of each of the N

input messages at a rate fs slightly higher than 2W, where W is the cutoff frequency of the anti-aliasing filter, and (2)

interleaves these N samples inside the sampling interval Ts.

© Po-Ning [email protected] 7-9 Time-Division Multiplexing System

(anti-aliasing) synchronized (reconstruction)

1 LPF LPF 1

2 LPF PCM Channel PCM LPF 2

N LPF commutator commutator LPF N o The price we pay for TDM is that N samples be squeezed in

a time slot of duration Ts.

© Po-Ning [email protected] 7-10 Time-Division Multiplexing System o Synchronization is essential for a satisfactory operation of the TDM system. n One possible procedure to synchronize the transmitter clock and the receiver clock is to set aside a code element or pulse at the end of a frame, and to transmit this pulse every other frame only.

© Po-Ning [email protected] 7-11 T1 System o T1 system n Carries 24 64kbps voice channels with regenerative repeaters spaced at approximately 2-km intervals. n Each voice signal is essentially limited to a band from 300 to 3100 Hz. o Anti-aliasing filter with W = 3.1 KHz o Sampling rate = 8 KHz (> 2W = 6.2 KHz) n ITU G.711 µ-law is used with µ = 255. n Each frame consists of 24 ´ 8 + 1 = 193 bits, where a single bit is added at the end of the frame for the purpose of synchronization.

© Po-Ning [email protected] 7-12 193 bit/frame 1 8000 sample/sec = 1.544 Megabits/sec ⇥ 1 sample (from each of 24 voice channels)/frame ⇥ T1 System

n In addition to the 193 bits per frame (i.e., 1.544 Megabits per second), a telephone system must also pass signaling information such as “dial pulses” and “on/off-hook.” o The least significant bit of each voice channel is deleted in every sixth frame, and a signaling bit is inserted in its place.

DS1 Frame (24 DS0) (DS=Digital Signal) F 1 2 3 24

Framing bit

© Po-Ning [email protected] 7-13 Digital Multiplexers

1 1 High-speed transmission line 2 Multiplexer Demultiplexer 2

N N o The introduction of digital multiplexer enables us to combine digital signals of various natures, such as computer data, digitized voice signals, digitized facsimile and television signals.

© Po-Ning [email protected] 7-14 Digital Multiplexers o The multiplexing of digital signals is accomplished by using a bit-by-bit interleaving procedure with a selector switch that sequentially takes a (or more) bit from each incoming line and then applies it to the high-speed common line. 64 kbps Digital Voices

PCM 1 frame

4k Hz Analog Voices PCM TDM

(3 channels/frame) PCM

© Po-Ning [email protected] 7-15 Digital Multiplexers o Digital multiplexers are categorized into two major groups. 1. 1st Group: Multiplex digital computer data for TDM transmission over public switched telephone network. n Require the use of modem technology. 2. 2nd Group: Multiplex low-bit-rate digital voice data into high-bit-rate voice stream. n Accommodate in the hierarchy that is varying from one country to another. n Usually, the hierarchy starts at 64 Kbps, named a digital signal zero (DS0).

© Po-Ning [email protected] 7-16 North American Digital TDM Hierarchy o The first level hierarchy n Combine 24 DS0 to obtain a primary rate DS1 at 1.544 Mb/s (T1 transmission) o The second-level multiplexer n Combine 4 DS1 to obtain a DS2 with rate 6.312 Mb/s o The third-level multiplexer n Combine 7 DS2 to obtain a DS3 at 44.736 Mb/s o The fourth-level multiplexer n Combine 6 DS3 to obtain a DS4 at 274.176 Mb/s o The fifth-level multiplexer n Combine 2 DS4 to obtain a DS5 at 560.160 Mb/s

© Po-Ning [email protected] 7-17 North American Digital TDM Hierarchy

n The combined is higher than the multiple of the incoming bit rates because of the addition of bit stuffing and control signals.

DS0 DS1 M01 DS2 M12 DS3 M23 DS4 M34

Vo i c e T1=24 DS0 T2=4 DS1 T3=7 DS2 T4=6 DS3 (64 kbps) (1.544 Mbps) (6.312 Mbps) (44.736 Mbps) (274.176 Mbps)

© Po-Ning [email protected] 7-18 North American Digital TDM Hierarchy o Basic problems involved in the design of multiplexing system n Synchronization should be maintained to properly recover the interleaved digital signals. n Framing should be designed so that individual can be identified at the receiver. n Variation in the bit rates of incoming signals should be considered in the design. o A 0.01% variation in the propagation delay produced by a 1℉ decrease in temperature will result in 100 fewer pulses in the cable of length 1000-km with each pulse occupying about 1 meter of the cable.

© Po-Ning [email protected] 7-19 Digital Multiplexers o Synchronization and rate variation problems are resolved by bit stuffing. o AT&T M12 (second-level multiplexer) n 24 control bits are stuffed, and separated by sequences of 48 data bits (12 from each DS1 input).

M0 48 CI 48 F0 48 CI 48 CI 48 F1 48

M1 48 CII 48 F0 48 CII 48 CII 48 F1 48

M1 48 CIII 48 F0 48 CIII 48 CIII 48 F1 48

M1 48 CIV 48 F0 48 CIV 48 CIV 48 F1 48 Subframe Stuffing Frame Stuffing Stuffing Frame markers indicators markers indicators indicators markers

© Po-Ning [email protected] 7-20 DS2 M-Subframe: 294 bits (6 blocks)

M 48 C 48 F 48 C 48 C 48 F 48

For each block:

DS1 #1

DS1 #2

DS1 #3 Bit interleave DS1 #4

© Po-Ning [email protected] 7-21 AT&T M12 Multiplexer o The control bits are labeled F, M, and C.

n Frame markers: In sequence of F0F1F0F1F0F1F0F1, where F0 = 0 and F1 = 1. n Subframe markers: In sequence of M0M1M1M1, where M0 = 0 and M1 = 1. n Stuffing indicators: In sequences of CI CI CI CII CII CII CIII CIII CIII CIV CIV CIV, where all three bits of Cj equal 1’s indicate that a stuffing bit is added in the position of the first information bit associated with the first DS1 bit stream that follows the F1-control bit in the same subframe, and three 0’s in CjCjCj imply no stuffing. o The receiver should use majority law to check whether a stuffing bit is added.

© Po-Ning [email protected] 7-22 AT&T M12 Multiplexer o These stuffed bits can be used to balance (or maintain) the nominal input bit rates and nominal output bit rates. n S = nominal bit stuffing rate o The rate at which stuffing bits are inserted when both the input and output bit rates are at their nominal values.

n fin = nominal input bit rate n fout = nominal output bit rate n M = number of bits in a frame n L = number of information bits (input bits) for one input stream in a frame

© Po-Ning [email protected] 7-23 L 1 M L = 185.88082902 µs = 186.31178707 µs = 186.52849741 µs fin  fout  fin AT&T M12 multiplexer o For M12 framing,

fin = 1.544 Mbps

fout = 6.312 Mbps M 4(L 1) 4L M = 288´ 4 + 24 = 1176 bits = S +(1 S) fout 4fin 4fin L = 288 bits M L -1 L Duration of a frame = = S + (1- S) f f f out #"!in in One bit is replaced by a stuffed bit. f 1.544 Þ S = L - in M = 288 - 1176 = 0.334601 fout 6.312

© Po-Ning [email protected] 7-24 AT&T M12 Multiplexer o Allowable tolerances to maintain nominal output bit rates n A sufficient condition for the existence of S such that the nominal output bit rate can be matched. é L -1 L ù M é L -1 L ù max S + (1- S) ³ ³ min S + (1- S) SÎ[0,1]ê ú SÎ[0,1]ê ú ë fin fin û fout ë fin fin û L M L -1 L L -1 Û ³ ³ Û fout ³ fin ³ fout fin fout fin M M

288 287 1.5458 = 6.312 ³ f ³ 6.312 = 1.54043 1176 in 1176

© Po-Ning [email protected] 7-25 AT&T M12 Multiplexer

n This results in an allowable tolerance range: 1.5458 -1.54043 = 6.312 /1176 = 5.36735 kbps

n In terms of ppm (pulse per million pulses), 106 - b 106 106 + a ppm = = ppm 1.54043 1.544 1.5458

Þ a ppm = 1164.8 and bppm = 2312.18 o This tolerance is already much larger than the expected change in the bit rate of the incoming DS1 bit stream.

© Po-Ning [email protected] 7-26 Summary of PCM o Virtues of PCM systems n Robustness to channel noise and interference n Efficient regeneration of coded signal along the transmission path n Efficient exchange of increased channel bandwidth for improved signal-to-noise ratio, obeying an exponential law. n Uniform format for different kinds of baseband signal transmission; hence, facilitate their integration in a common network. n Message sources are easily dropped or reinserted in a TDM system. n Secure communication through the use of encryption/decryption.

© Po-Ning [email protected] 7-27 Summary of PCM o Two limitations of PCM system (in the past) n Complexity n Bandwidth o Nowadays, with the advance of VLSI technology, and with the availability of wideband communication channels (such as fiber) and compression technique (to reduce the bandwidth demand), the above two limitations are greatly released.

© Po-Ning [email protected] 7-28 Delta Modulation o Delta Modulation (DM) n The message is oversampled (at a rate much higher than the ) to purposely increase the correlation between adjacent samples. n Then, the difference between adjacent samples is encoded instead of the sample value itself.

© Po-Ning [email protected] 7-29 1 0 1 0 1 1 0 1 0 Staircase 1 0 1 approximation 1 m(t) 0 0 1 1 mq(t) 1 Ts

© Po-Ning [email protected] 7-30 Math Analysis of Delta Modulation

Let m[n] = m(nTs ).

Let mq [n] be the DM approximation of m(t) at time nTs . Then n mq[n] = mq[n −1]+ eq[n] = ∑ eq[ j], j=−∞

where eq[n] = Δ ⋅sgn(m[n]− mq[n −1]).

¥ The transmitted code word is {[(eq [n]/ D) +1]/ 2}n=-¥ .

© Po-Ning [email protected] 7-31 n mq [n] = mq [n -1] + eq [n] = åeq [n], j=-¥

where eq [n] = D ×sgn(m[n] - mq [n -1]). Delta Modulation 1-bit Encoder + _ quantizer for DM o The principle virtue of delta modulation + is its simplicity. + n It only requires z-1 the use of Accumulator comparator, Decoder quantizer, and + + for DM accumulator. z-1 Accumulator Lowpass filter Bandwidth W of m(t)

(See Slide 7-53) 7-32 Delta Modulation o Distortions due to delta modulation n Slope overload distortion n Granular noise

Slope-overload distortion Ts

m (t) m(t) q Granular noise

© Po-Ning [email protected] 7-33 Delta Modulation o Slope overload distortion n To eliminate the slope overload distortion, it requires D dm(t) ³ max (slope overload condition) Ts dt n So, increasing step size D can reduce the slope-overload distortion. n An alternative solution is to use dynamic D. (Often, a delta modulation with fixed step size is referred to as a linear delta modulator due to its fixed slope, a basic function of linearity.)

© Po-Ning [email protected] 7-34 Delta Modulation o Granular noise

n mq[n] will hunt around a relatively flat segment of m(t). n A remedy is to reduce the step size. o A tradeoff in step size is therefore resulted for slope overload distortion and granular noise.

© Po-Ning [email protected] 7-35 Delta-Sigma Modulation o Delta-sigma modulation n In fact, the delta modulation distortion can be reduced by increasing the correlation between samples. n This can be achieved by integrating the message signal m(t) prior to delta modulation. n The “integration” process is equivalent to a pre- emphasis of the low-frequency content of the input signal.

© Po-Ning [email protected] 7-36 Delta-Sigma Modulation

1-bit Encoder n A side benefit of + _ quantizer for DM “integration-before- delta-modulation,” Move the ++ which is named accumulator to the z-1 delta-sigma transmitter. modulation, is that Accumulator the receiver design

is further simplified Decoder + + (at the expense of a for DM more complex z-1 Accumulator transmitter). Lowpass filter

© Po-Ning [email protected] 7-37 Lowpass Decoder Delta-Sigma Modulation filter for DM

1-bit Encoder + +_ + quantizer for DM A straightforward z-1 structure Accumulator ++

combine z-1 Since integration is into one Accumulator a linear operation, the two integrators before comparator 1-bit Encoder + _ + can be combined + quantizer for DM into one after z-1 comparator. Accumulator z-1

© Po-Ning [email protected] 7-38 Math Analysis of Delta-Sigma Modulation

Let iq[n] be the DM approximation of i[n].

¥ The transmitted code word is {[(s q [n]/ D) +1]/ 2}n=-¥ . Since we only need a lowpass filter to smooth out the received signal at the receiver end. (See the previous slide.)

© Po-Ning [email protected] 7-39 Delta-Sigma Modulation o Final notes n Delta(-sigma) modulation trades channel bandwidth (e.g., much higher sampling rate) for reduced system complexity (e.g., the receiver only demands a lowpass filter). n Can we trade increased system complexity for a reduced channel bandwidth? Yes, by means of prediction technique. n In Section 3.13, we will introduce the basics of prediction technique. Its application will be addressed in subsequent sections.

© Po-Ning [email protected] 7-40 Linear Prediction x[n] x[n-1] x[n-2] x[n-p+1] x[n-p] z-1 z-1 z-1

w1 w2 wp-1 wp

prediction o Consider a finite-duration impulse response (FIR) discrete- time filter, where p is the prediction order, with linear

prediction p ˆ x[n] = åwk x[n - k] k =1

© Po-Ning [email protected] 7-41 Linear Prediction o Design objective

n To find the filter coefficient w1, w2, …, wp so as to minimize index of performance J: J = E[e2[n]], where e[n] = x[n] - xˆ[n].

© Po-Ning [email protected] 7-42 2 éæ p ö ù J = Eêç x[n] - åwk x[n - k]÷ ú ëêè k =1 ø ûú p p p 2 = E[x [n]] - 2åwk E[x[n]x[n - k]] +ååwk w j E[x[n - k]x[n - j]] k =1 k =1 j=1 p æ p p p ö = R [0] - 2 w R [k] +ç2 w w R [k - j] + w2 R [0]÷ X å k X ç åå k j X å k X ÷ k =1 è k =1 j>k k =1 ø ¶ æ p i-1 ö J = -2R [i] + ç2 w R [i - j] + 2 w R [k - i] + 2w R [0]÷ X ç å j X å k X i X ÷ ¶wi è j=i+1 k =1 ø p

= -2RX [i] + 2åw j RX [i - j] = 0 j=1

© Po-Ning [email protected] 7-43 p

åw j RX [i - j] = RX [i] for 1 £ i £ p. j=1 The above optimality equations are called the Wiener-Hopf equations for linear prediction. It can be rewritten in matrix form as:

é RX [0] RX [1] $ RX [ p -1]ùéw1 ù é RX [1]ù ê úê ú ê ú R [1] R [0] $ R [ p - 2] w2 R [2] ê X X X úê ú = ê X ú ê ! ! # ! úê ! ú ê ! ú ê ú ê ú w ê ú ëRX [ p -1] RX [ p - 2] " RX [0] ûë p û ëRX [ p]û

−1 or RX w = rX ⇒ Optimal solution wo = RX rX

© Po-Ning [email protected] 7-44 Toeplitz (Square) Matrix o Any square matrix of the form

é a0 a1 $ a p-1 ù ê a a a ú ê 1 0 $ p-2 ú ê " " # " ú ê ú a a ! a ë p-1 p-2 0 û p´ p is said to be Toeplitz. o A Toeplitz matrix (such as RX) can be uniquely determined by p elements, [a0, a1, …, ap-1].

© Po-Ning [email protected] 7-45 Linear Adaptive Predictor o The optimal w0 can only be obtained with the knowledge of autocorrelation function. o Question: What if the autocorrelation function is unknown? o Answer: Use linear adaptive predictor.

© Po-Ning [email protected] 7-46 Idea Behind Linear Adaptive Predictor o To minimize J, we should update wi toward the bottom of the J-bowel. ¶J gi º ¶wi n So, when gi > 0, wi should be decreased.

n On the contrary, wi should be increased if gi < 0. n Hence, we may define the update rule as: 1 wˆ [n +1] = wˆ [n] - µ × g [n] i i 2 i where µ is a chosen constant step size, and ½ is included only for convenience of analysis.

© Po-Ning [email protected] 7-47 o gi[n] can be approximated by:

p wˆ [n + 1] =w ˆ [n]+µ x[n i] x[n] wˆ [n]x[n j] ) i i · 0 j 1 j=1 X =ˆw [n]+µ x[n i]e@[n] A i ·

© Po-Ning [email protected] 7-48 Structure of Linear Adaptive Predictor

x[n-1] Linear predictor x[n] z-1

Error e[n] _ +

© Po-Ning [email protected] 7-49 Least Mean Square Algorithm o The pairs in the form of the popular least-mean-square (LMS) algorithm for linear adaptive prediction

wˆj[n + 1] =w ˆj[n]+µ x[n j]e[n] p · 8e[n]=x[n] wˆ [n]x[n j] > j < j=1 X :>

© Po-Ning [email protected] 7-50 Differential Pulse-Code Modulation (DPCM) o Basic idea behind differential pulse-code modulation n Adjacent samples are often found to exhibit a high degree of correlation. n If we can remove this adjacent redundancy before encoding, a more efficient coded signal can be resulted. n A way to remove the redundancy is to use linear prediction.

© Po-Ning [email protected] 7-51 DPCM

Encoder + Quantizer o For DPCM, the _ for DPCM quantization error is on e[n], ++ rather on m[n] as Prediction for PCM. filter o So, the quantization Decoder + + error q[n] is for DPCM supposed to be Prediction filter smaller. Lowpass filter Bandwidth W of m(t)

© Po-Ning [email protected] 7-52 DPCM

Encoder + Quantizer o Derive: _ for DPCM

eq [n] = e[n] + q[n] ++ Þ m [n] = mˆ[n] + e [n] q q Prediction = mˆ[n] + e[n] + q[n] filter = m[n] + q[n] ˆ m[n] Decoder + + So, we have the same for DPCM relation between m [n] and Prediction q filter m[n] (as in Slide 7-32) but Lowpass with smaller q[n]. filter Bandwidth W of m(t)

© Po-Ning [email protected] 7-53 DPCM o Notes n DM system can be treated as a special case of DPCM.

1-bit Encoder + _ quantizer for DM

Prediction filter => single delay Quantizer => single-bit ++

z-1 Accumulator

© Po-Ning [email protected] 7-54 DPCM o Distortions due to DPCM n Slope overload distortion oThe input signal changes too rapidly for the prediction filter to track it. n Granular noise

© Po-Ning [email protected] 7-55 Processing Gain o The DPCM system can be described by:

mq [n] = m[n] + q[n] o So, the output signal-to-noise ratio is: E[m2[n]] SNR = O E[q2[n]] o We can re-write SNRO as: E[m2[n]] E[e2[n]] SNR = = G × SNR O E[e2[n]] E[q2[n]] p Q where e[n] = m[n] - mˆ[n] is the prediction error.

© Po-Ning [email protected] 7-56 Processing Gain o In terminologies, ì E[m2 [n]] G processing gain ï p = 2 ï E[e [n]] í 2 ï E[e [n]] SNRQ = 2 signal to quantization noise ratio îï E[q [n]]

Notably, SNRQ can be treated as the SNR for

system of eq[n] = e[n]+ q[n].

© Po-Ning [email protected] 7-57 Processing Gain o Usually, the contribution of SNRQ to SNRO is fixed. n One additional bit in quantization results in 6 dB improvement. o Gp is the processing gain due to a nice “prediction.”

n The better the prediction is, the larger Gp is.

© Po-Ning [email protected] 7-58 DPCM o Final notes on DPCM n Comparing DPCM with PCM in the case of voice signals, the improvement is around 4-11 dB, depending on the prediction order. n The greatest improvement occurs in going from no prediction to first-order prediction, with some additional gain, resulting from increasing the prediction order up to 4 or 5, after which little additional gain is obtained. n For the same sampling rate (8KHz) and signal quality, DPCM may provide a saving of about 8~16 Kbps compared to standard PCM (64 Kpbs).

© Po-Ning [email protected] 7-59 DPCM

Source: IEEE Communications Magazine, September 1997. Excellent ADPCM PCM G.711 G.723.1 G.729 G.728 G.726 Good IS-641 G.727 G.723.1 Speech Quality JDC2 GSM Fair IS96 (Mean Opinion Scores) MELP 2.4 FS-1016 IS54 JDC FS-1015 GSM/2 Poor

Unacceptable 2 4 8 16 32 64 Bit Rate (kb/s) IS© = Po Interim-Ning [email protected] Standard GSM = Global System for Mobile Communications JDC = Japanese Digital Cellular FS = Federal Standard MELP = Mixed-Excitation Linear Prediction Adaptive Differential Pulse-Code Modulation o Adaptive prediction is used in DPCM. o Can we also combine adaptive quantization into DPCM to yield a comparably voice quality to PCM with 32 Kbps bit rate? The answer is YES from the previous figure. n 32 Kbps: 4 bits for one sample, and 8 KHz sampling rate n 64 Kbps: 8 bits for one sample, and 8 KHz sampling rate o So, “adaptive” in ADPCM means being responsive to changing level and spectrum of the input speech signal.

© Po-Ning [email protected] 7-61 Adaptive Quantization o Adaptive quantization refers to a quantizer that operates with a time-varying step size D[n]. o D[n] is adjusted according to the power of input sample m[n]. n Power = variance, if m[n] is zero-mean.

D[n] = f × E[m2[n]] n In practice, we can only obtain an estimate of E[m2[n]].

© Po-Ning [email protected] 7-62 Adaptive Quantization o The estimate of E[m2[n]] can be done in two ways: n Adaptive quantization with forward estimation (AQF) o Estimate based on unquantized samples of the input signals. n Adaptive quantization with backward estimation (AQB) o Estimate based on quantized samples of the input signals.

© Po-Ning [email protected] 7-63 AQF o AQF is in principle a more accurate estimator. However, it requires n an additional buffer to store unquantized samples for the learning period. n explicit transmission of level information to the receiver (the receiver, even without noise, only has the quantized samples). n a processing delay (around 16 ms for speech) due to buffering and other operations for AQF. o The above requirements can be relaxed by using AQB.

© Po-Ning [email protected] 7-64 AQB

m[n] mq[n] De- Quantizer quantizer

Level Level estimator estimator

A possible drawback for a feedback system is its potential unstability. However, stability in this system can be guaranteed if mq[n] is bounded.

© Po-Ning [email protected] 7-65 APF and APB o Likewise, the prediction approach used in ADPCM can be classified into: n Adaptive prediction with forward estimation (APF) o Prediction based on unquantized samples of the input signals. n Adaptive prediction with backward estimation (APB) o Prediction based on quantized samples of the input signals. o The pro and con of APF/APB is the same as AQF/AQB. o APB/AQB are a preferred combination in practical applications.

© Po-Ning [email protected] 7-66 ADPCM

Encoder + Quantizer _ for DPCM

Adaptive prediction + + with backward Prediction filter estimation (APB).

Estimate

© Po-Ning [email protected] 7-67 MPEG Audio Coding Standard o The ADPCM and various voice coding techniques introduced above did not consider the human auditory perception. o In practice, a consideration on human auditory perception can further improve the system performance (from the human standpoint). o The MPEG-1 standard is capable of achieving transparent, “perceptually lossless” compression of stereophonic audio signals at high sampling rate. n A human subjective test shows that a 6-to-1 compression ratio are “perceptually indistinguishable” to human.

© Po-Ning [email protected] 7-68 Characteristics of Human Auditory System o Psychoacoustic characteristic of human auditory system n Critical band o The inner ear will scale the power spectra of incoming signals nonlinearly in the form of limited frequency bands called the critical bands. o Roughly, the inner ear can be modeled as 25 selective overlapping band-pass filters with bandwidth < 100Hz for the lowest audible frequencies, and up to 5kHz for the highest audible frequencies.

© Po-Ning [email protected] 7-69 Characteristics of Human Auditory System

n Auditory masking o When a low-(power-)level signal (i.e., the maskee) and a high-(power-)level signal (i.e., the masker) occur simultaneously in the same critical band, and are close to each other in frequency, the low- (power-)level signal will be made inaudible (i.e., masked) by the high-(power-)level signal, if the low-(power-)level one lies below a masking threshold.

© Po-Ning [email protected] 7-70 Characteristic of Human Auditory System

o Masking threshold is frequency-dependent. Signal level (masker) SMR = signal-to-mask ratio

Minimum masking threshold Masking at the critical band threshold SNR Quantization noise level

Neighboring Neighboring Critical band critical band Critical band Within a critical band, the quantization noise is inaudible as long as the© Po NMR-Ning [email protected] = SMR - SNR for the pertinent quantizer is negative. 7-71 MPEG Audio Coding Standard

PCM Output Analysis Quantization Frame audio bit filter-bank & coding packing signal stream

Psychoacoustic Bit model allocation

© Po-Ning [email protected] 7-72 MPEG Audio Coding Standard o Analysis filter-bank n Divide the audio signal into a proper number of subbands, which is a compromise design for computational efficiency and perceptual performance. o Psychoacoustic model n Analyze the spectral content of the input audio signal and thereby compute the signal-to-mask ratio (SMR). o Quantization & coding n Decide how to apportion the available number of bits for the quantization of the subband signals. o Frame packing n Assemble the quantized audio samples into a decodable bit stream.

© Po-Ning [email protected] 7-73 Summary o TDM (Time-Division Multiplexing) o PCM, DM, DPCM, ADPCM o MPEG audio coding

© Po-Ning [email protected] 7-74