Lossless Audio Coding Using Code-Excited Linear Prediction with Embedded Entropy Coding 碼激勵線性預測與嵌入式熵編碼結合的無損壓縮音頻編碼

CITY UNIVERSITY OF HONG KONG 香港城市大學 Lossless Audio Coding Using Code-Excited Linear Prediction with Embedded Entropy Coding 碼激勵線性預測與嵌入式熵編碼結合的無損壓縮音頻編碼 Submitted to Department of Electronic Engineering 電子工程學系 in Partial Fulfillment of the Requirements for the Degree of Master of Philosophy 哲學碩士學位 by Li Yongmin 李永敏 May 2011 二零一一年五月 ii Abstract Audio coding algorithm is used to compress and decompress digital audio data. Generally, there are two classes of audio coding techniques; lossy coding and lossless coding. Lossy coders such as MP3, can achieve high compression ratio, however the reconstructed signal is not exactly the same as the original signal. Lossless coders can reconstruct a signal identical to the original signal but the compression performance is not as good as that of lossy coders. With increasing demand of high-fidelity audio, most efforts in audio coding research are directed to lossless compression of high quality audio. Recent research trends in lossless audio compression focus on a decorrelation-plus-entropy coding approach. The main disadvantage of this approach is that it is not scalable to cater for channels with different bandwidth. In addition, the order of the decorrelation filter is generally very high and to compute the filter coefficients on the fly in real time is quite a demanding task both in the encoder and decoder. In this thesis, a scalable coding algorithm for lossless compression of audio signals is presented. The proposed algorithm consists of a lossy coding part and a lossless coding part. The lossy coding part is based on code excitation approach which is similar to Code-Excited Linear Prediction (CELP) coding for coding speech signal. However, the proposed approach has fundamental differences from CELP coding in iii many aspects, in particular, the excitation gain and short term prediction coefficients are adapted in a sample-by-sample fashion to cope with rapid time-varying nature of audio signals. In addition, a block adaptive linear predictive filter is added in the proposed coder to further improve short term prediction. The excitation codebook is a fixed codebook constructed from a collection of stochastic codewords which are obtained by training from a large collection of real audio signals. A total of eleven stochastic codebooks for various bit rates were designed. During encoding the excitation codebook is searched by using an M-L tree search strategy with a joint optimization based on minimum error energy and minimum code length after entropy coding. Several codebook selection methods and M-L tree search strategies are discussed and compared in order to determine the best compression performance at various encoding complexities. In the lossless coding part, the error between the input and the synthetic signal has significantly lower entropy which can then be encoded by an arithmetic coder to achieve lossless compression. If the residual information is not sent to the decoder, the decoder can recover a reasonable good quality of audio signal from the received lossy coding parameters only. The proposed coding algorithm has very low decoding complexity due to its simple code excitation structure and achieves compression performance comparable to other advanced lossless coders for coding CD quality audio. v Table of Contents Abstract ................................................................................................................................................... ii Acknowledgement .................................................................................................................................. iv Table of Contents .................................................................................................................................... v List of Figures ....................................................................................................................................... vii List of Tables .......................................................................................................................................... ix Glossary of Symbols .............................................................................................................................. xi Chapter 1 Introduction .......................................................................................................................... 1 1.1 Lossy Audio Coding ....................................................................................................................... 2 1.2 Lossless Audio Coding ................................................................................................................... 6 1.3 Motivation of This Research ........................................................................................................ 10 1.4 Thesis Outline............................................................................................................................... 12 Chapter 2 Lossless Audio Coding Using A Two Stage Joint Lossy/Entropy Coding Approach ..... 13 2.1 Lossy Coding Part ........................................................................................................................ 19 2.1.1 Excitation Codebook ............................................................................................................. 22 2.1.2 Sample-by-Sample Gain Adaption ........................................................................................ 24 2.1.3 Sample-by-Sample Adaptive Short Term Predictor .............................................................. 27 2.1.4 Block Adaptive Linear Predictive Filter ................................................................................ 28 2.1.5 Codebook Search Algorithm ................................................................................................. 29 2.1.6 Selecting the Order of the Sample-by-Sample Adaptive Predictor ....................................... 31 2.1.7 Improvement of Adding Block Adaptive Filter ..................................................................... 36 2.1.8 Adding A Long-Term Adaptive Codebook ............................................................................ 37 2.1.9 Using A Nonlinear Predictor ................................................................................................. 42 2.2 Entropy Coding ............................................................................................................................ 48 2.2.1 Basic Arithmetic Coding ....................................................................................................... 49 2.2.2 Adaptive Arithmetic Coding .................................................................................................. 50 Chapter 3 Improved Coding Schemes ................................................................................................ 52 vi 3.1 Improving Codebook Search Algorithm ....................................................................................... 52 3.1.1 Codeword Search .................................................................................................................. 53 3.1.2 Adaptive Codebook Selection ............................................................................................... 55 3.2 Encoder for Joint Lossy/Entropy Coding Scheme ........................................................................ 58 3.3 Decoder for Joint Lossy/Entropy Coding Scheme ....................................................................... 59 3.4 Improvement by Nonscalable Lossless Coding Scheme .............................................................. 60 3.5 Extension to Multi-Channel Coding ............................................................................................. 66 Chapter 4 Simulation and Results ....................................................................................................... 71 4.1 Simulation Performance ............................................................................................................... 71 4.2 Performance Comparison ............................................................................................................. 74 4.2.1 Test of Mono Files ................................................................................................................. 74 4.2.2 Test of Stereo Files ................................................................................................................ 77 Chapter 5 Conclusions and Future Work ........................................................................................... 79 5.1 Conclusions .................................................................................................................................. 79 5.2 Future Work .................................................................................................................................. 82 Bibliography .......................................................................................................................................... 83 vii List of Figures Figure1.1.1 Classification of lossy audio coding .................................................................... 2 Figure1.1.2 Generic structure of coder by exploring psychoacoustics .................................... 3 Figure1.1.3 Block diagram of the subband coding system ...................................................... 6 Figure1.2.1 Two approaches to lossless audio coding ............................................................. 7

Load more

Lossless Audio Coding Using Code-Excited Linear Prediction with Embedded Entropy Coding 碼激勵線性預測與嵌入式熵編碼結合的 無損壓縮音頻編碼

Lossless Audio Coding Using Code-Excited Linear Prediction with Embedded Entropy Coding 碼激勵線性預測與嵌入式熵編碼結合的無損壓縮音頻編碼