The Application of Dither and Noise-Shaping to
Nyquist-Rate Digital Audio: an Intro duction.
Christopher Hicks,
Communications and Signal Pro cessing Group,
Cambridge University Engineering Department,
United Kingdom.
Septemb er 12, 1995
Abstract
This pap er discusses the theory of dithered and noise-shap ed quantisation, and in particular their ap-
plication to Nyquist rate digital audio. Dither is shown to b e essential if quantisation-related distortion
is to b e avoided. Careful application of noise-shaping is shown to b e of audible b ene t by increasing the
p erceived dynamic range of a signal. Various drawbacks of the technique are discussed. In particular the
`fragility' of signals pro cessed in this way is noted, and the ease with which the b ene ts may b e undone
is illustrated.
Copyright: This do cument is copyright of the author, Christopher Hicks, 1995. Permission is hereby granted
to copy and store it in magnetic or electronic form and to make a single hard copy, for private study, educational
use or other non-commercial purp ose, sub ject to the following conditions: the do cument shall remain complete
and unaltered, and no charge shall b e made for access to it. The author reserves all other rights.
highly correlated signals (suchasmusic) results in 1 Intro duction
tonal distortion comp onents b eing added to the sig-
nal.
The imp ortance of dithering in digital audio sys-
tems has b een recognised, at least on an academic
The distribution of d is critical; it must e ectively
level, for some time. Despite this, the sub ject re-
decorrelate the quantisation error from the input
mains p o orly understo o d in the audio profession at
signal x, while adding a minimum of noise p ower
large, and the unp opularityofmany early digital
to the output signal y .Itmust also linearise the
recordings can b e attributed to signal distortion at
quantiser transfer function, such that the exp ected
low levels, as a result of unsuitable or non-existent
output E [y (n)] = x(n), the input. If d is a white
dither.
noise source with a suitable distribution then the
error e(n)=y (n) x(n) is also white, regardless of
The technique of noise-shaping has b een applied to
the sp ectrum of x.
digital audio in a numb er of sp eci c areas. The b est
known of these are analogue to digital and digital
In practice a triangular distribution for d of p eak
to analogue conversion, where it is used to obtain
deviation q (the quantiser step size) is found to
high resolution, low bandwidth p erformance from
b e suitable for high-quality audio [1]; this is of-
1
alow resolution, high bandwidth converter [1].
ten referred to as TPDF dither. In this case
the exp ected error E [e(n)] is zero, the error p ower
A more recent application is CD mastering from
2 2
= q =4, and b oth are indep endent of the signal
e high-resolution recordings stored on digital tap e or
x.
hard disc [2]. Toconvert a recording of, say,20
or 24 bit resolution to the CD format requires a re-
Thus the action of the dithered quantiser maybe
quantisation to 16 bits. This inevitably intro duces
mo delled as a simple addition of a stationary white
noise into the signal, and in a straightforward quan-
noise source e(n). The SNR is degraded bythe
tisation this noise is approximately white.
TPDF dither by 4.8 dB, compared with the un-
dithered case, but this is far preferable to the dis-
Noise-shaping can b e used create a non-white noise
tortion that can arise from undithered quantisa-
sp ectrum. For example it is p ossible to lower the
tion.
noise p ower sp ectral density in the frequency bands
where the ear is most sensitive, at the exp ense of
The dither signal itself need not b e white provided
higher noise p ower in other bands (where the ear
that it is suciently random that it is not p erceived
is less sensitive). This pro cess lowers the p erceived
as a tonal comp onent in the output signal. How-
quantisation noise o or and increases the sub jective
ever, the noise added by the quantiser itself is still
dynamic range of the signal.
approximately white (if the dither is of suitable
power) and the output noise is the sum of these
two noise sources. Therefore, using this scheme the
minimum noise p ower densityachievable (at a par-
2 Dithered Quantisation
ticular frequency) is approximately that due to the
quantiser alone, this b eing
Consider the system shown in gure 1, whose out-
2
put y (n) is the sum of a high-resolution input x(n)
Tq