Data Compression Using Adaptive Transform Coding

by

Martin C. Rost

A DISSERTATION Presented to the Faculty of

The Graduate College in the University of Nebraska

In Partial Fulfillment of Requirements

For the Degree of Doctor of Philosophy

Major: Engineering (Electrical Engineering)

Under the Supervision of Professor Khalid Sayood

Lincoln, Nebraska

October, 1988

This work was supported by the NASA Goddard Space Flight Center under grant NAG-916.

ACKNOWLEDGEMENTS

I thank all the members of my committee for their participation and helpful comments.

I am grateful to my readers, Dean Stanley Liberty and Dr. William Brogan, for their suggestions. Their help has improved the quality of this dissertation. I thank Dr. Jerry Gibson for his comments concerning the presentation of the text and for the time he took from his schedule to come and serve on my committee. Special thanks go to my advisor, Dr. Khalid Sayood. It was his patient guidance that kept this ship afloat.

Data Compression Using Adaptive Transform Coding

Martin Christopher Rost, Ph.D.

University of Nebraska, 1988

Adviser: Khalid Sayood

The data of natural images is not stationary, and the coding complexity of images varies from region to region. How well any particular source coding system works is dependent upon its design assumptions and how well these assumptions match the data. In this dissertation adaptive low-rate source coders are developed. These coders adapt by adjusting the complexity of the coder to match the local coding difficulty of the image. This is accomplished by using a threshold driven maximum distortion criterion to select the specific coder used. The different coders are built using variable blocksized transform techniques, and the threshold criterion selects small transform blocks to code the more difficult regions and larger blocks to code the less complex regions.

The different coders are interconnected using a quad tree structure. The algorithm that generates the tree and tests the thresholds is independent of the design of block coders. This allows the system designer to select different coders for the different types of images being coded without having to change the entire coding system.

A progressive transmission scheme based upon these coders is also developed. Progressive coding provides a recognizable image to the viewer in a very short time. As this image is viewed, more data can be received to update the image and improve its visual quality.

A set of example systems is constructed to test the feasibility of these systems of source coders. These systems use scalar quantized and vector quantized transform block coders.

Some of the systems are extended to include a new modified block truncation coding scheme.

They are used to code both monochromatic and color images with good results at rates as low as 0.3 bits/pel. A theoretical framework is constructed from which the study of these coders can be explored, and an algorithm for selecting the optimal bit allocation for the quantization of transform coefficients is developed. The bit allocation algorithm is more fully developed, and can be used to achieve more accurate bit assignments than the algorithms currently used in the literature. Some upper and lower bounds for the bit-allocation distortion-rate function are developed. An obtainable distortion-rate function is also presented for a particular scalar quantizer mixing method that can be used to code transform coefficients at any rate.

Data Compression Using Adaptive Transform Coding

Table of Contents

Abstract

List of Figures

List of Tables

List of Abbreviations

Chapter 1. Introduction

Chapter 2. The Coding and Perception of Images

2.1 Modelling overview

2.2 Digital images

2.3 Perceptual considerations

2.3a Spatial, spectral and luminance considerations

2.3b Low-bandwidth temporal considerations

Chapter 3. Image Compression Techniques

3.1 Measuring data compression and image quality

3.2 Quantization

3.2a Scalar quantization

3.2b Vector quantization

3.3 Transform coding

3.3a KLT and DCT coding

3.4 Block truncation coding

Chapter 4. Progressive Transmission of Images

4.1 Progressive image coding examples from the literature

4.1a Fixed blocksize methods

4.1b Variable blocksize methods

Chapter 5. Mixture Block Coding and Mixture Block Coding with Progressive Transmission

5.1 Design considerations

5.1a The largest blocksize

5.1b The smallest blocksize

5.1c Quad tree structure and the MBC coding rate

5.1d The MBC/PT coder and its coding rate

5.2 The MBC and MBC/PT simulators

5.2a The DCT block coders

5.2b Using block truncation coding with MBC

5.2c The distortion measures

5.2d The quantizers

5.3 MBC and MBC/PT computer simulations

Chapter 6. A Distortion-rate Function for Transform Coding

6.1 The distortion-rate problem

6.2 The transform coefficient distortion-rate problem

6.3 The simplest rate solutions

6.4 The non-negative rate solutions

6.5 Distortion function for constant performance factors

6.6 Distortion function for variable-rate performance factors

Chapter 7. A Vector Quantization Distortion-rate Function for MBC and MBC/PT

7.1 Block partitioning for vector quantization

7.2 Rate as a function of mixture and thresholds

7.3 Threshold driven distortion-rate function

7.4 Distortion function for MBC

7.5 Distortion function for MBC/PT

Chapter 8. Summary and Conclusions

Appendix 1. A Simple Bit Allocation Example

Appendix 2. Another Approach to the Bit Allocation Problem

References

List of Figures

2.1 Mannos and Sakrison visual perception model.

5.1 Example 16x16 block quad tree.

5.2 Example 16x16 block for MBC and default sub-block numbering for MBC.

5.3 MBC and MBC/PT DCT transform coefficients.

5.4 Design and use of the MBC and MBC/PT vector quantizer.

5.5 Mixture fractions versus blocksize-constant distortion thresholds.

5.6 PSNR versus rate as a function of distortion threshold.

5.7 Distortion versus rate as a function of distortion threshold.

5.8 MBC mixture fractions versus threshold for BW woman/hat.

5.9 MBC PSNR and distortion versus rate for BW woman/hat.

5.10 MBC mixture fractions versus threshold for BW F-16.

5.11 MBC PSNR and distortion versus rate for BW F-16.

5.12 MBC/PT mixture fractions versus threshold for BW woman/hat and F-16.

5.13 MBC/PT PSNR versus rate for BW woman/hat and F-16.

5.14 MBC/PT distortion versus rate for BW woman/hat and F-16.

5.15 MBC mixture fractions versus threshold for YIQ woman/hat and F-16.

5.16 MBC PSNR versus rate for YIQ woman/hat and F-16.

5.17 MBC distortion versus rate for YIQ woman/hat and F-16.

5.18 MBC/PT mixture fractions versus threshold for YIQ woman/hat and F-16.

5.19 MBC/PT PSNR versus rate for YIQ woman/hat and F-16.

5.20 MBC/PT distortion versus rate for YIQ woman/hat and F-16.

5.21 MBC mixture fractions versus threshold for RGB woman/hat and F-16.

5.22 MBC PSNR versus rate for RGB woman/hat and F-16.

5.23 MBC distortion versus rate for RGB woman/hat and F-16.

5.24 MBC/PT mixture fractions versus threshold for RGB woman/hat and F-16.

5.25 MBC/PT PSNR versus rate for RGB woman/hat and F-16.

5.26 MBC/PT distortion versus rate for RGB woman/hat and F-16.

5.27 256x256 central portion of 512x512 BW woman/hat.

5.28 256x256 central portion of 512x512 BW F-16.

5.29 256x256 BW USC girl training image.

5.30 256x256 BW small face training image.

5.31 256x256 BW photo face training image.

5.32 256x256 central portion of MBC woman/hat.

5.33 256x256 central portion MBC woman/hat block profile.

5.34 256x256 central portion MBC/PT woman/hat block profile.

5.35 256x256 central portion of MBC F-16.

5.36 256x256 central portion of MBC/BTC F-16.

5.37 256x256 central portion of MBC F-16 block profile.

5.38 256x256 central portion of MBC/BTC F-16 block profile.

5.39 256x256 central portion of first pass MBC/PT woman/hat coded with 16x16 blocks.

5.40 256x256 central portion of second pass MBC/PT woman/hat adding 8x8 blocks.

5.41 256x256 central portion of third pass MBC/PT woman/hat adding 4x4 blocks.

5.42 256x256 central portion of final pass MBC/PT woman/hat adding 2x2 blocks.

6.1 Scalar quantizer performance factors.

6.2 Coefficient rates using the Huang and Schultheiss solution.

6.3 Non-negatively constrained rates with constant performance factors.

6.4 Distortion for the rates of Figure 6.3.

6.5 Huang and Schultheiss distortion for the rates of Figure 6.2.

6.6 The log2A functions for the optimal laplacian scalar quantizer.

6.7 Upper and lower bound distortions for the rate solutions of Chapter 6.

6.8 Upper and lower bound distortions for the rate solutions of Chapter 6.

6.9 Distortion for the non-negatively constrained Shannon lower bound.

6.10 Computer simulation results using linearly weighted scalar quantizers.

List of Tables



5.1 Vector Quantizer Scale Factors for BW Images

5.2 Vector Quantizer Scale Factors for RGB Images

5.3 Vector Quantizer Scale Factors for YIQ Images

List of Abbreviations

BTC block truncation coding

BW black and white

CRT cathode-ray tube

DCT discrete cosine transform

KLT Karhunen-Loeve transform

LBG Linde, Buzo and Gray (a type of vector quantizer)

MSE (mse) mean square error

MBC mixture block coding

OLSQ optimal laplacian scalar quantizer

PSNR peak signal-to-noise ratio

PT progressive transmission

RGB red-green-blue

SNR signal-to-noise ratio

SQ scalar quantizer

USQ uniform scalar quantizer

YIQ luminance/chrominance

VQ vector quantizer

Chapter 1.

Introduction

Natural image data is not stationary. A low-rate source coder that does not take this into consideration is, in general, not optimal. One type of optimal coder is obtained when the best coding rates are achieved for the least coding cost. The meaning of the terms "best rate" and "least cost" changes from application to application. When the overall coding goal is to send single images through a channel whose capacity is substantially less than is required for perfect reconstruction, obtaining a low coding rate is very important. In this case, the image must be coded with some distortion. This distortion is an important coding cost and it must be taken in trade for the decrease in coding rate.

[Diagram: the original image passes through the source coder and the low-rate channel to the source decoder, which produces the received image; the difference between the two images is the coding distortion.]

Image data distortion is based upon how the eye sees the data. Visual distortion is difficult to define. Generally, the better the distortion model is, the more difficult it is to use. This, in turn, makes distortion-dependent optimality hard to define. Frequently, the system designer must accept a good source coder instead of an optimal source coder. Many times, it is better to take an imperfect source coder that "works" than to wait in the hope of finding the best of all possible coders. This is the reality of design engineering. The design of a good source coder for low-rate image transmission is the topic of this dissertation.

When designing a good source coder, many times, the first thing that is done is to use some a priori assumptions that simplify the design process. If an image is coded using the stationarity assumption, all regions of all images are coded in the same way. This is done without regard for how well, or how poorly, any particular region is coded. The quality of busy, or difficult to code, regions is sacrificed only to overcode the non-busy regions. This is undesirable when one is trying to use channel capacity as efficiently as possible.

In this dissertation, a low-rate adaptive-transform image source coder that takes into consideration the spatial non-stationarities of digitally-recorded images is developed. This adaptive technique allows one to expend more effort coding the difficult regions, and less effort coding the easy regions. This is done by adjusting the number of transform blocks used to code an image region to match its coding complexity. The number of blocks assigned to a particular region is selected through a maximum-distortion threshold criterion. The method is applied to both monochromatic and color images.
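To make the threshold criterion concrete, the sketch below shows how a maximum-distortion test can drive a quad tree subdivision of an image block. It is illustrative only: the stand-in "block coder" that reproduces a block by its mean and the mse test are hypothetical choices, where the actual systems of Chapter 5 use DCT, VQ and BTC block coders with their own distortion measures.

```python
import numpy as np

def quad_tree_code(block, threshold, min_size=2):
    """Threshold-driven quad tree coding of a square block.

    A block is accepted whole if the block coder reproduces it within
    the maximum-distortion threshold (or the block is already the
    smallest allowed size); otherwise it splits into four sub-blocks.
    Returns (size, row, col) leaves as a stand-in for the coded data.
    """
    def toy_coder(b):
        # Stand-in block coder: reproduce the block by its mean value.
        return np.full_like(b, b.mean())

    leaves = []

    def recurse(b, row, col):
        n = b.shape[0]
        dist = np.mean((b - toy_coder(b)) ** 2)   # mse of the coded block
        if n <= min_size or dist <= threshold:
            leaves.append((n, row, col))
            return
        h = n // 2
        for dr, dc in ((0, 0), (0, h), (h, 0), (h, h)):
            recurse(b[dr:dr + h, dc:dc + h], row + dr, col + dc)

    recurse(block.astype(float), 0, 0)
    return leaves

rng = np.random.default_rng(0)
flat = np.full((16, 16), 128.0)                       # easy region
busy = rng.integers(0, 256, (16, 16)).astype(float)   # busy region
print(len(quad_tree_code(flat, threshold=25.0)))      # 1 large block
print(len(quad_tree_code(busy, threshold=25.0)))      # 64 small blocks
```

The easy region is coded with a single large block while the busy region is driven down to the smallest blocksize, which is exactly the rate-versus-difficulty adaptation described above.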

The material of this dissertation is organized in two sections: background and new research. The background section, found in Chapters 2 through 4, contains material to introduce the relevant areas of image source coding. In Chapter 2, some of the important features of the human visual system, and the effects of cathode-ray tube display devices, are discussed. In Chapter 3, the basics of data compression, scalar quantization, vector quantization, transform coding and block truncation coding are introduced. In Chapter 4, progressive transmission is introduced and some of the methods available in the literature are discussed. The informed reader may wish to pass over the background material of these three chapters.

The major results of this dissertation are presented in Chapters 5 through 7. In Chapter 5, the source coding methods of this dissertation, mixture block coding and mixture block coding with progressive transmission, are developed. Computer simulations are given to demonstrate the feasibility of these coders. The details of the distortion measures and the various quantizers used in these examples are also given. A new block truncation coding method is developed and incorporated into the mixture block coding examples. Since these source coders are variable-rate coders, the average coding rate will vary from image to image. Expressions are developed for each type of coder to show how the average coding rate is computed. These calculations are a function of the number, size and bit allocations assigned to the different types of coding blocks.

In Chapter 6, distortion-rate functions are developed to model the performance of scalar quantized transform source coders. The standard optimal bit allocation method developed by Huang and Schultheiss [70] is used as a starting point for this chapter. Once their method has been discussed, it is modified to incorporate the realities of actual quantizers. Also, several upper and lower bounds for distortion performance are computed using the results of this chapter. An obtainable distortion-rate function is also developed. These functions are demonstrated with examples.

In Chapter 7, a set of distortion-rate functions is developed that can be used to study the optimal transform coefficient bit assignment problem as it applies to the adaptive transform coding systems found in Chapter 5. This is done by expanding the results of Chapter 6 to include distortion-rate models for vector quantizers, and zonally assigned quantization regions.

Finally, in Chapter 8 the significant results of this dissertation are summarized.

Chapter 2.

The Coding and Perception of Images

In this and the next two chapters background material is presented. As an aid to this material, a generic mapping diagram is used. This diagram is used to present the major processes that are involved in the coding of images when using transform methods. It represents the overall coding problem as it applies to the material of this dissertation. Each segment of the diagram that applies to image perception and low-bandwidth image coding is discussed in this chapter. Then, in Chapter 3, the details of the source coding problem, as it relates to the quantization and block coding methods that are important to work done in later chapters, are presented. In Chapter 4, background material concerned with the progressive transmission of images is discussed.

2.1 Modelling Overview

The more that is known about how distortion is perceived, the more the facts of perception can be brought into the source coder design to improve its performance. How distortion is measured by the human visual system is not easily modelled. For optimal performance, the compression scheme must include not only the nonlinear effects resulting from the physiology of the eye, but also must take into account the reason why the image is being viewed. The reason for viewing an image can override almost any other consideration and is dependent upon the application at hand. This, at least with the current modelling theories, makes a complete understanding of image data compression mathematically intractable. Luckily, some of the other important physical features of vision can be modelled without as much difficulty.

The goal of this material is to present some of the more pertinent factors concerning how distortion is perceived and how it affects the design of a low-rate source coder. Since the quality of the original data affects the quality of the coded image, a discussion of digital image recording formats is included. The requirements of NTSC television and the database images used to test the algorithms of this dissertation will be introduced. These two facts, how images are perceived and what formats are used to digitally record images, bracket the rest of the source coding system and impose limitations upon how well any given source coder can perform.

Since the type of images studied are for casual observation, such as is typical for television or teleconferencing, the aesthetic judgement of the human eye is the only reliable measure of coding distortion. The mathematics of data compression and rate-distortion theory is not a complete indicator of how the eye accepts any given data compression scheme. The human visual system is very difficult to model, and, as a result, much of the design of low-rate source coders of images is a mixture of theory and ad hoc methodology.

In this chapter the source coding of images using transform methods is introduced along with a brief description of what is meant by the term data compression. When designing an image compression scheme several factors must be considered:

• What are the characteristics of the original image data?

• How are images recorded and displayed?

• How is the image perceived?

• How might this be used in a compression algorithm?

• What are the overall data compression goals?

• How much and what kind of distortion, if any, can be tolerated?

The answers to these questions reflect upon the specifics of any source coding method, whether or not the coding involves transform methods. These answers affect the theoretical aspects of how the source is modelled and how the final design is implemented. How these questions apply to the work of this dissertation is introduced through the mapping diagram found below.

[Diagram: the overall coding chain. The eye mappings 𝓔 take world scenes W to the human vision domain H. The source coder records W into the digital image domain X (𝓡) and displays X back to W (𝓓); preprocessing 𝓟₁ and postprocessing 𝓟₂ map X to X; the transform 𝓣 and its inverse map X to the transform domain Y and back; the quantizer 𝓠 and inverse quantizer map Y to the index domain Z and back; the modulator 𝓜 and demodulator 𝓜⁻¹ map Z to the low-rate channel C and back, where the channel effects 𝓛 and noise 𝓝 act.]

As can be seen, the source coding problem can be divided into the following set of processes:

• the source data acquisition, display and visual processes (𝓡, 𝓓 and 𝓔),

• the source preprocessing and postprocessing processes (𝓟₁ and 𝓟₂),

• the compression and decompression processes (𝓣 and 𝓠) and

• the channel processes (𝓜, 𝓛 and 𝓝).

The script letters are names used to describe the collection of all possible transformations (mappings) that can be applied to perform the required functions of the particular coding process. In this diagram, there are six spatial-temporal domains:

H the human vision system,

W the "real" world, the image source,

X the digital image representations,

Y the transform representations domain of the digital images,

Z the quantizer and channel symbol indexes and

C the channel, the medium for data communication and storage.

Each domain is transformed to the next using a mapping from one of the collections mentioned above. Some of these mappings indicate how humans interpret the physical world. Some represent transformations that can be used by the compression algorithm designer to meet limitations set upon the designer by the physical world.

For example, the collection of eye mappings, 𝓔, is all maps used to take world scenes and present them to the observer, 𝓔: W → H. A map E ∈ 𝓔 represents the way a single person might perceive the world. A complication that is found in the mappings of 𝓔 is that every person sees the world differently from other people and the way a given person sees the world varies with the application under consideration.

𝓡: W → X and 𝓓: X → W are the collections of all methods for recording and displaying world scenes digitally. More is said about this later.

𝓟₁: X → X is the collection of preprocessing maps used to condition the recorded images.

𝓟₂: X → X are the postprocessing mappings used to display the final reconstructed images. These maps are helpful in the reduction of distortion perceived by the final observers by using methods such as dithering, spectrum shaping or coder postfiltering (e.g., [1-4]). A point of notation and nomenclature should be noted. Many times, preprocessing (or prefiltering [1]) is applied to the image before it is digitally recorded to avoid aliasing effects. To avoid confusion, it is assumed that this type of preprocessing is included in the 𝓡 mappings. 𝓟₁ is applied after the image has been digitally acquired.

The mappings 𝓣: X → Y transform the images into a new representation that, for one reason or another, is easier to code than the original image itself. Examples of such transforms are the Karhunen-Loeve transform, the Fourier transform and the discrete cosine transform. The quantizer maps, 𝓠: Y → Z, are used to determine the final rate and they induce most of the source-coding distortion found within a given implementation. The quantizer maps the source space (in this case, Y) back into itself and generates a set of indexes that are used to drive the channel modulator. Examples include the Max scalar quantizer [71] and Linde-Buzo-Gray (LBG) vector quantizers [35]. The transformation/quantizer pair are the parts of the coding process that are studied with greatest interest in this dissertation.

The collections of channel modulation and demodulation mappings, 𝓜: Z → C and 𝓜⁻¹: C → Z, the statistical channel noise mappings, 𝓝: C → C, and the channel media effects (e.g., loss of signal power, scattering, etc.), 𝓛: C → C, determine the integrity of the transmission medium. The nature of the channel itself is the reason why images need to be compression coded in the first place. Most channels have a bandwidth that is surpassed by the large bandwidth requirements of typical digital images. The image bandwidth must be reduced to allow the coding signal to "fit" within the available channel allotment. The presence of channel noise and other forms of signal degradation that are found in the channel complicate the situation. Most coding systems that severely compress data are also more susceptible to noise than those which do not compress the data as much. The "noise" that is of primary concern for the work of the dissertation is that which is incurred in the source coding process. Noise found in the channel will not be studied further, but it is important to notice that it does exist and can harm the quality of the reconstructed images generated by the source coder, sometimes to a greater degree than the source coder noise.

The following diagram, taken from the more complete diagram discussed above, presents the parts of the coding system of concern to the remainder of this chapter.

[Diagram: the upper portion of the mapping diagram. World scenes W are recorded (𝓡) into the digital image domain X and displayed (𝓓) back to W for the eyes 𝓔; preprocessing 𝓟₁ and postprocessing 𝓟₂ map X to X, which connects to the rest of the coding system.]

The images of concern are most likely to be transmitted through a digital medium and are converted for display upon a CRT (cathode-ray tube). The CRT represents the most common display device that is used for television and television-like (e.g., teleconferencing) display.

Digital communication systems are replacing the analog methods that were originally used for the transmission of television signals. Unless the CRT is replaced by some other display device, its use with these growing digital communication networks sets the stage for the future of image transmission. There is a need to better understand how these two ideas can be more efficiently used to transmit data.

For any such system, let R ∈ 𝓡 represent a specific digitization process used to record world images, and let D ∈ 𝓓 represent the final display process used to convert the digital representations of these images into a CRT display. These displayed images are then examined by any number of observers, Eᵢ ∈ 𝓔, i = 1, 2, 3, .... Each observer may perceive the images of W in a different manner.

The first fact to come from this presentation is that a CRT cannot perfectly represent the original image, say W ∈ W, no matter how perfect the rest of the coding system is. That is,

Eᵢ ∘ W ≠ Eᵢ ∘ D ∘ R ∘ W

because

Eᵢ ≠ Eᵢ ∘ D ∘ R

The recording and display systems are not perfect. A CRT can never render a perfect reconstruction of world scenes. Problems lie in its ability to reproduce color, scale, intensity range, etc. There is also the problem of displaying 3-dimensional data with a 2-dimensional CRT.

This, of course, is an overstatement of the severity of the problem, but it does point the way for further study. Specifically, how well does the light intensity of a CRT match that of the original scene it is displaying, and how does it play against the way the eye perceives light intensity? That is the subject of section 2.3. But, before moving into this material, a more detailed look is made at the digital recording processes (mappings) of 𝓡.

2.2 Digital images

The coding and resulting display quality of an image is dependent upon how the image is originally recorded. The digital recording of an image must be made with three considerations. They concern the resolution of the recorded image in the spatial, temporal and intensity domains. The real-time presentation of high-quality moving digital images represents a huge amount of data, and these data loads are compounded when the images are in color.

The picture elements of a monochromatic digital image should be represented with approximately 8 bits of intensity resolution to get adequate image fidelity [1]. If the number of bits is much less than, say, 5 bits, it is possible to see false contours in the image. The figures that are included in Chapter 5 are printed using a device that has only 5 bits of intensity resolution and the false contours are easily seen. Color data, for the images studied, is recorded with 3 color planes each of which is coded with 8 bits of resolution.

The spatial sampling is dependent upon the type of source data to be recorded and the intended presentation medium. For example, it is recommended in CCIR Rec. 601 [6] for digital coding of Y, R-Y, B-Y NTSC analog television that a rate of 216 Mbits/s be used. ISDN applications for digital telephone lines have specified channels with a rate of 144 Mbits/s for television transmission. As can be seen, images that require such large data rates can be expensive to transmit. High-quality low-rate data compression allows for more television signals to be transmitted within a given rate channel allocation. This reduces transmission costs and increases the number of video channels available to the consumer.
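As a check on the 216 Mbits/s figure, the rate follows directly from the CCIR Rec. 601 component sampling frequencies (13.5 MHz for luminance and 6.75 MHz for each color-difference channel, at 8 bits per sample); the short calculation below is illustrative only:

```python
# CCIR Rec. 601 component sampling rates (samples/s), 8 bits per sample.
f_y = 13.5e6      # luminance sampling frequency
f_c = 6.75e6      # each color-difference sampling frequency
bits = 8
rate = (f_y + 2 * f_c) * bits
print(rate / 1e6, "Mbits/s")   # 216.0 Mbits/s
```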

The spatial display of an image is measured in cycles per degree to take into account the final viewing distance from the CRT device. If the image is viewed too closely, then there are fewer pels per degree and the spatial quantization effects can be observed. If the observer viewing distance is far enough from the image, the pels fall into a continuum because their size is smaller than the eye's ability to resolve them.

Color in images can be represented with many different formats. The ones most commonly used are the standard tristimulus Red-Green-Blue (RGB) and the luminance/chrominance formats. Examples of the latter are the YIQ [7], Y, R-Y, B-Y [6], and YUV [8] formats.

The RGB color format is based upon the natural color perception of the human eye. When studying how color distortion is perceived, the RGB colors are useful, but the RGB colors are highly correlated and better data compression can be achieved using a color format whose components are more statistically independent. Such is the case for the luminance/chrominance formats.

The YIQ colors are nearly orthogonal and they have a distinct advantage over the RGB colors. The Y-component represents the image intensity (luminance), and can be used alone to give a monochrome representation of the original image. The I- and Q-components represent the color (chrominance) of the image. Most of the signal energy lies in the Y-component, indicating that it carries most of the picture information. To extract luminance data from an RGB coded image all three colors are needed.
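For reference, the RGB-to-YIQ conversion is a fixed linear transform; a minimal sketch using the standard NTSC coefficients (the coefficients are textbook values, not taken from this dissertation):

```python
import numpy as np

# Standard NTSC RGB -> YIQ matrix; the Y row alone gives the monochrome image.
RGB_TO_YIQ = np.array([[0.299,  0.587,  0.114],
                       [0.596, -0.274, -0.322],
                       [0.211, -0.523,  0.312]])

def rgb_to_yiq(rgb):
    """rgb: (..., 3) array of R, G, B values."""
    return rgb @ RGB_TO_YIQ.T

# White maps to full luminance with (nearly) zero chrominance.
print(rgb_to_yiq(np.array([255.0, 255.0, 255.0])))   # ~[255, 0, 0]
```

Note that the rows for I and Q sum to (approximately) zero, which is why a gray pel carries no chrominance, and why Y alone suffices for a monochrome display.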

New standards for high-definition television (HDTV) are currently under consideration. The HDTV standard will allow for the transmission of higher quality color television than is currently available [6]. Even though there are many suggested schemes for implementing the new standard, none have been adopted. Most of them have about twice the horizontal and vertical resolution of standard television. Some of them are compatible with current television technology and some are not. It is felt that the coding methods of this dissertation can be applied to the compression of HDTV signals without difficulty and no particular modification to the base algorithms is needed. Since the inter-pel correlations of HDTV signals are greater than those of current television signals, it may be possible to achieve a greater degree of compression for HDTV than is demonstrated in this work.

Not all images are represented using natural color intensities. Much of the early work in image compression was done to limit the amount of channel capacity needed to transmit the nine channels of image data recorded by the Landsat system [11,12]. Here the data was compressed for presentation by methods that are now known under the name of vector quantization. In these original works, data features were grouped by clustering techniques into categories to represent recognizable ground features of interest. Vector quantization techniques are used in the coding methods of this dissertation. These techniques are explained in Chapters 3 and 7.

2.3 Perceptual considerations

When images are used for entertainment, it is desirable to present the eye with something it can feel comfortable with. A data compression system that codes for good feel and comfort is doomed without a more formal definition of what these words mean. There is a large body of literature attempting to give these concepts a more concrete foundation (e.g., [1,14,15,16]). Several of these concepts are discussed below.

When coding images, the expected distortion, as seen by the eye, is based around two important ideas: the type of images coded and the distortion measure used to assess the difference between the original image and its coded counterpart. The algorithm designer wishes to minimize this difference. More mathematically, the expected distortion is

E{d(X, X̂)}    (2-1)

where E{·} is statistical expectation, X ∈ X is the original image, X̂ ∈ X is the coded image and d(·,·) is the distortion measure used. To minimize (1), the correct distortion measure must be known and used correctly, if the optimization process is to mean anything.

The description of the visual distortion in this chapter is divided into two sections. The first is concerned with spatial and spectral models for how the eye perceives the world, and the last with temporal models concerning how one might code images for low-bandwidth (low-rate) transmission. This last section is included to motivate the need for progressive transmission schemes, which are the subject of Chapters 4 and 5.

2.3a Spatial, spectral and luminance considerations

The eye responds to image intensity something like a spatial frequency bandpass filter. This is an advantage that can be used in the design of image compression systems. Among the earlier work done to quantify visual perception was that of Mannos and Sakrison [5]. Here a measure of just-perceptible intensities for monochromatic sinusoids was studied. It was shown that the eye sees these intensities through a filter (Figure 2.1) whose relative response is

A(f) = 2.6(α + βf)exp[-(βf)^1.1]    (2-2)

where α = 0.0192, β = 0.114 and f is the radial frequency in cycles per degree. Since the eyes have a high-frequency cutoff, small spatial features are not seen. The greater the viewing distance, the less distortion is seen.
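A quick numerical look at (2-2) shows the bandpass shape. This is a sketch: the 1.1 exponent, which the scan dropped from the equation, is the value reported by Mannos and Sakrison for this model, and the peak location is computed rather than quoted from the dissertation.

```python
import numpy as np

def A(f, alpha=0.0192, beta=0.114):
    # Mannos-Sakrison relative visual response; f in cycles/degree.
    return 2.6 * (alpha + beta * f) * np.exp(-(beta * f) ** 1.1)

f = np.linspace(0.1, 60.0, 600)
resp = A(f)
print("peak near", round(float(f[resp.argmax()]), 1), "cycles/degree")  # ~8
print("response at 1 and 40 cycles/degree:", A(1.0), A(40.0))
```

The response rises to a peak near 8 cycles/degree and falls off on both sides, which is the "bandpass" behavior exploited in the next paragraph.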

Since the eye acts as a bandpass filter, image features falling within the passband are seen with the greatest discrimination for perceived distortion. Any coding technique should take this into consideration. If the coding technique introduces only errors outside of the passband, the image is perceived with less distortion than is actually there. High-frequency noise and slow deviations in the background level that are caused by coding are not necessarily perceived, and these out-of-band frequency ranges can be coded with less care. Most of the coding effort can be spent processing the visible passband frequencies. The coding methods of this dissertation attempt to offer a solution to the problem of coding images that contain edges and fast gradients. Since edges are important to the aesthetic perception of an image, these features must be coded with care. The coding techniques of this dissertation are designed to help improve the coding of these features without overcoding other parts of the image.

The minimum fidelity with which a given object needs to be coded to guarantee subject recognition is strongly dependent upon the type of object being viewed [12]. For example, more distortion can be tolerated in the presentation of individual faces before the face is classified as unrecognizable than is the case for, say, a crowd scene. The change in perceived distortion as the recognition level of the subject matter changes is important to the overall evaluation of image data compression systems. To make use of subjectivity, the coder must have a very high level of object modelling and recognition, and to do so requires a large amount of computation and sophisticated application oriented algorithms. If the tools to perform such tasks are not available, as is usually the case, the images must be coded ahead of time so the coding distortion can be evaluated on a case-by-case basis with human intervention. The coder can be adjusted accordingly for best performance. Either method requires large amounts of pre-presentation processing. If one is designing a database for the random viewing of known sets of images, sometimes the overhead can be tolerated. The amount of time spent coding an image is small when divided among the many times it will be recalled for viewing. But these considerations lie outside the scope of the work done here. We are satisfied with distortion models that measure how easily the image data is coded over regions of varying local busyness.

Equal changes in luminance are not perceived equally by the eye. It has been found that "just visual" perception is nearly linear when perceived changes in luminance are normalized by the mean background luminance

ΔL/L = constant    (2-3)

This effect is called Weber's law [17]. By defining contrast as

C = ln L    (2-4)

small changes in constant contrast, ΔC = ΔL/L, are measured along a straight line.

There are several other ways to model contrast, among them are [13]

C = a ln(1 + bL)    (2-5)

where a = b/ln(1+b), L ∈ [0,1] and b ∈ [9,11]; and [5]

C = aL^b    (2-6)

where a = 1 and b = 1/3. Any of these contrast functions can be used to weight the distortion measure used in the coding process to reflect more correctly how the eye perceives distortion.
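The contrast models (2-5) and (2-6) behave similarly: both expand differences at low luminance, as Weber's law requires. A small numerical comparison (a sketch; b = 10 in (2-5) is an assumed value from the quoted [9,11] range, and the log model is normalized by its value at L = 1 for a like-for-like comparison):

```python
import math

def c_log(L, b=10.0):
    # (2-5): C = a*ln(1 + b*L) with a = b/ln(1 + b).
    a = b / math.log(1.0 + b)
    return a * math.log(1.0 + b * L)

def c_cube(L):
    # (2-6): C = a*L**b with a = 1, b = 1/3.
    return L ** (1.0 / 3.0)

# Both curves rise steeply near L = 0 and flatten toward L = 1.
for L in (0.01, 0.1, 0.5, 1.0):
    print(f"L={L:5.2f}  log model={c_log(L)/c_log(1.0):.3f}  cube root={c_cube(L):.3f}")
```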

Another effect of importance to the measure of visual perception is the way cathode-ray tubes present the luminance data. One model for CRT intensity shows that it is not linear,

L = kE^d    (2-7)

where E is the applied electric potential, k is a constant, and d ∈ [2, 2.5] is the illumination exponent [14]. The nonlinearity of luminance as a function of electric potential partially balances the nonlinearities of the Weber effect, since b of (6) and d of (7) tend to offset each other. When the luminance level is low, the situation is slightly more complex because the tube brightness gets washed into the ambient background, for example

L = kE^d + L_amb    (2-8)

So it is difficult, if not impossible, to attain a good low level luminance performance measure that is independent of the viewing environment.

Since the CRT is the preferred display device of this work, and the exponential factors of perceived contrast and CRT brightness tend to cancel each other, only absolute differences and mean square differences will henceforth be used to measure distortion.

The two models presented above do not take into consideration the effects of changing background luminance and the effects of high-contrast regions that are near those of normal or low luminance. These effects are very nonlinear and will not be considered here. Some of these effects are discussed in [14].

The correct distortion measure for color images is also perceptually nonuniform and is still an open research area [8]. Not only are the intensity considerations introduced above important, but it is known that small changes in color are perceived with very nonlinear effects. The authors of [8,16] suggest some complex nonlinear transformations of the camera RGB outputs. The review paper of Limb, Rubenstein and Thompson [16] and material found in Pratt [17] are good starting places for a more complete study of these problems.

As an example of the effectiveness of visual distortion weighting, consider a paper by Edgerton and Srinath [16]. They developed a scalar quantizer based upon the visual distortion model of Mannos and Sakrison [5]. They use the Mannos and Sakrison model to select the scalar quantizer bit assignments used to code the coefficients of 16x16 DCT coded image blocks. The quantizer bit assignments are made using an mse distortion criterion that is constrained to give a predetermined entropy coding rate. They use their method to code a single 512x512 image at 0.5 bit/pel. By using the visually weighted distortion criterion, they state that their system performed close to that of a 2.0 bit/pel DCT coder designed without the visual weightings.

Another important feature related to the above material is that transform source coders designed with the YIQ color system show about half the mse of equivalent RGB based systems [18]. These results are similar to those obtained by the computer simulations discussed in Chapter 5.

The material introduced above can be utilized in the compression of images. Brightness-to-contrast scaling can be implemented as both a pre- and post-processing function, while frequency-weighted compensation is best done as part of a transform data-compression step where the weighting factors can be directly implemented in the source coder design [19]. Some nonlinear postprocessing functions can be used to remove the undesirable characteristics of the source coder. An example is presented in [20].

2.3b Low-bandwidth temporal considerations

The nontemporal vision models considered in the last section have been extended to include temporal effects [21,22]. These models are complex, and even when simplified [23], do not readily lend themselves to the present application where still, or nearly still, images are of the most interest. If the distortion models are too complex, they can increase the computational burden of the final implementation to a point where it can be made useless for real-time data processing.

The following discussion is motivated by a need to understand progressive transmission as an alternative to slow scan image methods and to understand the basics of how distortion is perceived in the temporal framework of slow-moving images. Sometimes the images are updated as rarely as one image every several minutes. The mid- to high-frequency temporal response of the visual system presents some very specific problems that must be addressed. At these low framing rates, the manner in which the CRT image is updated is of primary interest. The temporal models used to describe these effects are more heuristic than the models mentioned above, but they are useful in the results that they show.

When temporal considerations are included, the eye perceives images in a manner which must include how the image is presented for viewing. If, due to bandwidth considerations, the image must be reconstructed slowly, the eye will notice the reconstruction process. These secondary considerations come into the forefront of image perception, and may dominate the entire coding process.

For example, when the image is scanned line-by-line, as is the case for ordinary television, the scanning process is not seen by the eye because the scanning is performed at a rate above the temporal-spatial frequency range of the eye. But, in the case where low-bandwidth transmission is used, it may take many seconds to scan a single image. Here the scanning process is clearly seen by the eye. The image is perceived as a still image except where the scan is updated. The update process attracts the eye, and causes the image to be viewed with most of the user's attention being expended watching the scanning process. Therefore, less time is spent studying the image itself. The eye fixation process of slow scan television is the driving force behind the development of progressive transmission schemes, where the image is reconstructed as a whole on a frame-by-frame basis instead of pel-by-pel. More is said about the specifics of progressive transmission in Chapters 4 and 5.

[Figure 2.1: Mannos and Sakrison visual perception model.]

Chapter 3.

Image Compression Techniques

In this chapter a review of the image compression techniques that will be used in later chapters is made. Image compression, or more generally, data compression can be thought of from two major points of view. In the first, it is desirable to reduce the amount of data needed to reproduce the original data without any coding losses. Such source coders are called noiseless, information preserving or lossless coders. They are primarily concerned with compressing the original data so that it is completely recoverable. Henceforth, this type of coder will be referred to as a lossless coder. These techniques are useful for minimizing the required channel capacity when there is a need for perfect reconstruction. The need for perfect recovery severely limits the amount of compression that can be achieved. To first order, the amount of compression that can be achieved is based upon the single-symbol entropy of the data. For typical natural images coded with eight bits of resolution, the entropy is usually between 5 and 7.5 bits. Since images have large inter-pel correlation, the number of required coding bits can be reduced by using a redundancy or correlation removal coding scheme. Then the best coding rate is predicted by the equivalent multiple-symbol entropy.

The second point of view is important when a higher degree of compression is required than can be achieved with lossless coding. Here the desired rate can be much less than the source entropy. If this is true, some loss in data integrity must be expected. These losses are a direct result of Shannon's coding theorem [41]. This type of source coder is called a lossy coder. Lossy coding relies strongly upon many-to-one mappings to attain high levels of compression. Because these mappings are many-to-one, they are not invertible, and perfect reconstruction of the source data is not possible. But, their losses can be controlled by causing the coded data to remain within some definable distance from the original data using a norm that is defined upon the source and coded data sets. The primary class of lossy mappings used for data compression are the various forms of quantization.

Quantizers can be used in conjunction with other one-to-one (invertible and lossless) mappings, such as transform coding, to decrease perceptual losses and increase compression as much as possible. The goal of this chapter is to introduce the quantization schemes and transform coding methods used in the image data compression coders of later chapters.

Consider the portion of the mapping diagram of Chapter 2 that applies to the material of this chapter:

[Diagram: the lower portion of the mapping diagram. Preprocessing 𝓟₁ and postprocessing 𝓟₂ map X to X; the transform 𝓣 and inverse transform map X to Y and back; the quantizer 𝓠 and inverse quantizer map Y to the index domain Z and back; the modulator 𝓜 and demodulator 𝓜⁻¹ map Z to the low-rate channel C and back, where the channel effects 𝓛 and noise 𝓝 act.]

In the next section the measurement of data compression and image quality is discussed. Then in the following sections quantization and transform coders are introduced. In the last section of this chapter, block truncation coding (BTC), a useful moment preserving coding system, is introduced. BTC and transform coding are used together to form one of the mappings of 𝓣 that are used in the computer simulations of this dissertation.

3.1 Measuring data compression and image quality

How well a data compression system works is measured using two important yardsticks. The first, the data-reduction ratio or data-compression ratio, is a measure of the reduction in channel capacity needed to transmit a given image. This is a ratio of average coding rates, and is defined to be

R_orig/R_coded    (3-1)

where R_orig and R_coded are the average coding rates of the original and coded images.

In image compression, R_orig usually represents the coding rate that is required for transmitting (or storing) the image data using a uniformly weighted symbol set. For the images considered here, this rate is 8 bits/pel (picture element) for monochrome images and 24 bits/pel (8 bits/color) for color images. Let it be assumed that R_orig represents the rate obtained for a uniformly coded source. If an entropy coded symbol set is used to code the image data, instead of a uniformly coded symbol set, the average rate can be reduced by some degree that is dependent upon the correlation of the original data.

An entropy coder selects a different coding weight for each source symbol. The coding weight of a symbol is equal to the average number of channel symbols that are used to transmit it. This weighting is inversely proportional to the symbol's probability of occurrence when using the Shannon entropy function. The average coding weight of the i-th source symbol occurring with probability p_i is

r_i = -log2 p_i    (3-2)

The average entropy rate for the monochrome image symbol set (or source alphabet), which has 256 (2^8) elements, is

R_ent = Σ_{i=1}^{256} p_i r_i = -Σ_{i=1}^{256} p_i log2 p_i    (3-3)

As it is defined here, R_ent is known as the single-symbol source entropy.
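Equation (3-3) is straightforward to evaluate from an image histogram. A minimal sketch (the data here is synthetic, not one of the dissertation's test images):

```python
import numpy as np

def single_symbol_entropy(image):
    """R_ent of (3-3): first-order entropy of the 8-bit pel values."""
    counts = np.bincount(image.ravel(), minlength=256)
    p = counts / counts.sum()
    p = p[p > 0]                       # 0*log(0) is taken as 0
    return float(-(p * np.log2(p)).sum())

rng = np.random.default_rng(1)
# Smooth, correlated data has entropy well below the 8-bit uniform rate.
smooth = np.clip(rng.normal(128, 10, (256, 256)).astype(int), 0, 255)
print(single_symbol_entropy(smooth))                             # ~5.4 bits/pel
print(single_symbol_entropy(rng.integers(0, 256, (256, 256))))   # ~8 bits/pel
```

The smooth image lands in the 5 to 7.5 bit range quoted above, while the uniform-noise image is essentially incompressible by a single-symbol code.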

In general, the individual symbol rates of R_ent are not integers and a single-symbol entropy code (such as the Huffman code [42]) does not achieve this rate. In reality, R_ent is a lower bound for any single-symbol entropy coder [74]:

R_ent ≤ R_coded < R_ent + 1    (3-4)

The last of these inequalities tells us that an entropy coder is, in itself, a data compression coder since

R_coded ≤ R_orig    (3-5)

Even though the coders designed for this dissertation do not use entropy coding, entropy coding is useful because it offers a theoretical lower bound against which any coder can be compared. Therefore, all lower bounds will be calculated assuming uniform codeword lengths.

Since most of the literature concerning image compression uses monochromatic source images that are coded with 8 bits per pel (8 bits per color for color images), the compression ratio is generally simplified to just the compression rate R_coded. So an image that is coded with an average rate of 0.5 bits/pel has a compression ratio of 16.

The second yardstick used to measure the performance of a data compression system is concerned with data integrity. Here, all of the compression coders of interest compress the source image to such a low coding rate that the original image cannot be recovered without some loss of information. For a given coding rate, it is desirable to limit the coding losses, in the hope that the lossy reconstructed image "looks" as much like the source image as is possible.

The quality of a coded image is measured in many ways, but mean square error (mse) is one of the more convenient measures of distortion and it is used to measure quantitative image quality in this dissertation.

With the mse measure, the ratio of error variance, σ_e², to source variance, σ_x², is

σ_e²/σ_x² = ε²γ² = G    (3-6)

where ε² is the performance factor of the quantization system, and γ², the spectral flatness measure (sfm), indicates how well the transformation mapping "whitens" the source. The flatter the source spectrum can be made, the more compression can be achieved. Sometimes these two factors are not separable, so they must be thought of as a single more general factor: the coder compression or gain factor, G. This is discussed in more detail in Chapters 3, 6 and 7.

It is often convenient to modify (6) using a log-ratio function

SNR = -10 log10(σ_e²/σ_x²) = -10 log10 ε²γ² = -10 log10 G    (3-7)

called the signal-to-noise ratio (SNR) of the source coder. SNR measures coding loss in decibels. The logarithm allows the coder performance to be measured as a sum of the quantizer performance and the transformation process

SNR = -10 log10 ε² - 10 log10 γ² = SNR_Q + SNR_T    (3-8)

While this formulation is a simplification for how any source coder might induce distortion, it does offer a distinct division of coding effort that matches the way source coders are usually designed. In this dissertation, the transformation is selected a priori and the quantization processes are designed with the transformation as a design constraint.
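The spectral flatness measure γ² is commonly defined as the ratio of the geometric mean to the arithmetic mean of the transform coefficient variances. The sketch below uses that common definition, which may differ in detail from the development in Chapter 6:

```python
import numpy as np

def spectral_flatness(variances):
    """gamma^2 as geometric/arithmetic mean ratio of coefficient variances.

    Equals 1 for a flat ("white") spectrum, where transform coding
    offers no gain, and falls toward 0 as the spectrum becomes peaked.
    """
    v = np.asarray(variances, dtype=float)
    geometric = np.exp(np.mean(np.log(v)))
    return geometric / v.mean()

print(spectral_flatness([1, 1, 1, 1]))        # 1.0: nothing to gain
print(spectral_flatness([100, 10, 1, 0.1]))   # ~0.11: transform coding helps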

In image coding, another important point is concerned with how the signal-to-noise ratio is measured. Reconsider equation (7); here the SNR is dependent upon the variance of the image coded. When comparing equal distortion levels for two different images, their signal variances and signal energies will, in general, be different. This causes their SNR values to be different. This can destroy the usefulness of SNR values for comparing different coders. To overcome this problem, the definition of SNR is modified to use a standard definition of signal variance:

PSNR = -10 log10(σ_e²/σ_white²) = SNR - 10 log10(σ_x²/σ_white²)    (3-9)

where σ_white² is the white reference energy level of the image. When using 8-bit pels, σ_white² = 255². The white-referenced SNR is called the peak SNR (PSNR) and it is a direct measure of coding distortion that is not affected by the image variance or dc background level. Henceforth, PSNR is the measure of coding distortion used.
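A direct implementation of the PSNR definition for 8-bit images is short; this is a sketch with synthetic data, using σ_white² = 255² as in the text:

```python
import numpy as np

def psnr(original, coded):
    """Peak SNR of (3-9) in dB for 8-bit pel data."""
    err = np.asarray(original, float) - np.asarray(coded, float)
    mse = np.mean(err ** 2)
    return -10.0 * np.log10(mse / 255.0 ** 2)

rng = np.random.default_rng(2)
x = rng.integers(0, 256, (64, 64)).astype(float)
print(psnr(x, x + rng.normal(0, 4, x.shape)))   # roughly 36 dB
```

Because the reference is the fixed white level rather than the image variance, two different images coded to the same mse report the same PSNR, which is the comparability property argued for above.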

3.2 Quantization

Quantization is a classification process whereby the elements of a large, possibly infinite, set of elements are identified with the elements of a smaller finite set. The larger set, specifically the collection of possible image representations, X, is known as the source and the smaller set of representative symbols used to code the images, Y, is known as the quantizer code book. The elements of Y are imbedded into the source space and the classification process is usually a nearest neighbor mapping. The source points are mapped to a nearest neighbor taken from the code book by using some preassigned distortion measure. That is, the source elements, x ∈ X, are matched to the elements of Y = {y_i : i ∈ J} by the many-to-one mapping

Q(x) = min_{y_i} d(x, y_i)    (3-9)

where d is the distortion measure. The quantizer index set, J ⊂ Z, is mapped into a set of channel symbols, Ψ, by some modulation technique, M ∈ 𝓜,

M(J) → Ψ    (3-10)

and the channel symbols are demodulated at the receiver

M⁻¹(M(J)) → J    (3-11)

The receiver decodes the indicated index representative, i ∈ J, from the received channel symbol to recover the original quantized source value, i → y_i. The set of quantizer values, {y_i}, must be known to the receiver a priori, or the receiver must be able to reconstruct them from side information provided by the transmitter.

An important point that should be noticed is that the size of the channel symbol set (let the channel index set be J_c ⊂ Z where the channel symbols are Ψ = {ψ_k : k ∈ J_c}, and let its size be |J_c| = |Ψ|) is not always the same as the size of the quantizer code book

|Ψ| ≠ |J|    (3-12)

The lack of equality in the size of these sets indicates that the modulation mapping is not necessarily one-to-one. For example, a block (or vector) of n quantizer indexes may be coded into a single channel symbol

The lack of equality in the size of these sets indicates that the modulation mapping is not necessarily one-to-one. For example, a block (or vector) of n quantizer indexes may be coded 26 into a single channel symbol

k = M(«tl,itl, . . . ,itn) where it;.eJ (3-13)

Therefore, the number of code book elements and the number of channel symbols are not the same. Henceforth, let the set of quantizer code book elements be known as the reproducing alphabet, and the set of channel symbols be known as the channel alphabet. Clearly, the modulation mapping determines the coding rate of the quantizer output in the channel.

The channel rate is based upon whether the elements of J (or J_c) are uniformly coded or are entropy coded (e.g., [42]). Entropy coding generally uses fewer coding bits than uniform coding; this is an advantage when designing a low-rate system. But, if there is noise in the symbol coding/decoding process, entropy coding usually suffers more signal degradation than uniform coding.

With entropy coding the code book symbol lengths are different, and noise will cause the decoder to lose synchronization with the source coder. Once the decoder has lost sync it may never resynchronize correctly (e.g., [80]). Since all the symbols of a uniform coder are of the same length, it is more difficult to lose synchronization. This ability to remain synchronized in the presence of noise is an inherent advantage of uniform coders over entropy coders.

Since most real data transmission systems have some noise, the tradeoffs between entropy and uniform coding must be weighed against one another when designing a data compression system. But, the tradeoffs between source and channel coding fall within the realm of joint source/channel coding theory and are outside the subject of this dissertation. Therefore, all of the computer simulations of this dissertation use uniform coding. The gains that can be incurred through the use of entropy coding are well known and the specifics of how they can be used for the source coding methods developed in later chapters is left for future study.

Since perceptual losses are hard to model, the actual distortion measures used to build image compression schemes are usually simplifications of these difficult models that are easier to work with (e.g., mean square error or maximum absolute error). The loss of accurate models is taken in trade for mathematical tractability. There is a large body of literature that has studied the problem of mean square error. The Max quantizers mentioned above are optimal quantizers for the mse distortion criterion.

All source points mapped into a single code book element are known collectively as a quantization region. The shape of the quantization regions is a function of the distortion measure used, and their size determines the average distortion of the quantization process

d_Q = Σ_{i∈J} ∫_{V_i} p(x) d(x, y_i) dx    (3-14)

where p(x) is the probability density function of the source and the V_i are the quantization regions assigned by the distortion measure d for a quantizer Q of size |J|. If mse is the distortion measure used, the average distortion is

d_Q = Σ_{i∈J} ∫_{V_i} p(x)(x - y_i)² dx    (3-15)

and the quantizer performance can be measured using decibels

SNR_Q = -10 log10(d_Q/σ_x²)    (3-16)

The quantization regions and the spatial placement of the code book elements are usually chosen

to minimize the average distortion incurred by a given quantizer. More is said about this later.

Since the code book can have substantially fewer elements than the source data set, compression can be achieved. The compression ratio obtained is related roughly to the logarithm of the ratio of the source size to code book size. (This is true because the number of bits needed to code such a set is related to the logarithm of the set size.) The inherent and universal ability of quantizers to compress data by a many-to-one mapping makes them a valuable tool to the compression algorithm designer.

Sometimes the quantization regions or the placement of code book elements are constrained in some manner. Usually, the constraints are added to speed up the quantization process by simplifying the quantizer hardware and computational algorithm. If enough constraints are placed upon the quantizer it is possible to completely predetermine the quantizer code book without using the underlying distortion measure or source characteristics. The more constraints placed upon the quantizer, the less optimal its performance can be. Therefore, there is a tradeoff to be made between the ease of using a quantizer and the coding quality it can attain. The computer simulations of Chapter 5 include results showing these effects. With this overview in mind, several types of quantizers are now discussed.

3.1a Scalar quantization

The code book elements of a scalar quantizer are points placed in a one-dimensional space, and their quantization regions are line intervals. The average distortion for a scalar quantizer is

$$d_Q = \sum_{i=1}^{n} \int_{a_i}^{a_{i+1}} p(x)\, d(x, y_i)\, dx \tag{3-17}$$

where $n$ is the number of code book elements and $(a_i, a_{i+1}]$ is the $i$-th quantization interval containing quantizer element $y_i$. Max [71] developed an iterative algorithm to optimize the placement of the interval end points and code book elements for a given code book size, source pdf and distortion measure. The algorithm alternates between calculating new end points given a fixed code book, and calculating new code book elements given fixed end points.
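A minimal sketch of this alternation for the mse criterion, assuming the source is given as empirical samples rather than a closed-form pdf (names are hypothetical, not from the dissertation):

```python
import numpy as np

def lloyd_max(samples, n_levels, iters=50):
    """Empirical design of an mse scalar quantizer in the style of Max [71]:
    alternate between (1) end points placed midway between adjacent code book
    elements and (2) code book elements set to the centroid (mean) of the
    samples falling in each interval."""
    x = np.sort(samples)
    # initialize the code book from evenly spaced sample quantiles
    y = np.quantile(x, (np.arange(n_levels) + 0.5) / n_levels)
    ends = (y[:-1] + y[1:]) / 2.0
    for _ in range(iters):
        ends = (y[:-1] + y[1:]) / 2.0        # new interval end points a_i
        idx = np.searchsorted(ends, x)       # assign samples to intervals
        for i in range(n_levels):            # centroid update of each y_i
            xi = x[idx == i]
            if xi.size:
                y[i] = xi.mean()
    return y, ends

# e.g., an 8-level quantizer designed for a zero-mean Laplacian source
y, ends = lloyd_max(np.random.default_rng(0).laplace(size=100_000), 8)
```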

If the code points and the interval end points are allowed to find their own placement without constraints, the quantizer is called optimal. If the code book elements or end points are uniformly spaced (e.g., $a_n = n a_0 + b$, where $n \in \mathbb{Z}$ and $a_0 \in \mathbb{R}$), the quantizer is called an optimal uniform quantizer. Max scalar quantizers have been published for several pdfs (Gaussian [71], Laplacian [73] and gamma [75]). The optimal Laplacian quantizer is used in the computer simulations of Chapter 5.

The entropy coding rate for a scalar quantizer is

$$R = -\sum_{i=1}^{n} P_i \log_2 P_i \le \log_2 n, \qquad P_i = \int_{a_i}^{a_{i+1}} p(x)\, dx \tag{3-18}$$

and $\log_2 n$ is the uniform coding rate. It is assumed the symbols are coded using binary representations.

3.1b Vector and lattice quantization

Vector quantization is a natural extension of scalar quantization into $m$-dimensional spaces. The quantization regions are $m$-dimensional nearest neighbor polytopes. The average distortion is found by integrating the source pdf over each of these polytopes, $V_i$,

$$d_Q = \sum_{i \in \mathcal{J}} \int_{V_i} p(\mathbf{x})\, d(\mathbf{x}, \mathbf{y}_i)\, d\mathbf{x} \tag{3-19}$$

where the $\mathbf{x}$ and $\mathbf{y}_i$ are $m$-dimensional points from the source space and, similarly, $p(\mathbf{x})$ and $d(\mathbf{x}, \mathbf{y}_i)$ are the $m$-dimensional source pdf and distortion functions. The entropy rate is

$$R = -\sum_{i \in \mathcal{J}} P_i \log_2 P_i \le \log_2 n, \qquad P_i = \int_{V_i} p(\mathbf{x})\, d\mathbf{x} \tag{3-20}$$

If vector quantizers did not offer performance improvements over their scalar counterparts there would be little cause to use them. Vector quantization can be computationally more intensive than scalar quantization, especially for optimal quantizers assigned to nonuniformly distributed sources. However, vector quantizers do give better distortion performance than the best scalar quantizers. If the source model and distortion measure are simple to model, it has been shown [30] that a vector quantizer can, with certain assumptions, asymptotically achieve the rate-distortion limit. The Shannon lower bound can be used to predict the best performance of any quantizing system for a given source [74]. The Shannon lower bound is used in Chapters 6 and 7 to compute lower bounds for different quantizers of transform coefficients.

There are many ways to design vector quantizers. Three methods are mentioned below. The first method is the (locally) optimal LBG method. The second method uses mathematically modelled pdfs, and the third uses lattice structures to place the quantizer points.

The LBG method builds a code book using a finite set of training vectors, and it can be implemented without any knowledge of the source statistics, since it minimizes distortion over the training set only. The final code book is not guaranteed to be unique or globally optimal. One of the drawbacks of the LBG method is that it may not perform well when used to code source points not found in the training set. This causes problems when it is difficult to build an adequate training set.

The design of an LBG quantizer is an iterative technique. It alternates between placing the code book elements assuming fixed quantization regions, and finding the quantization regions assuming a fixed code book. The quantization regions are disjoint partitions of the training set.

The algorithm is computationally intensive, especially when large training sets are used or large code books are being generated. To help overcome this problem, one can start with a small code book. Once the small code book is optimized, its size can be increased using a point splitting technique [35], and the LBG minimization algorithm is repeated. The code books are divided and reoptimized until the desired code book size is attained, as in the sketch below.
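A minimal sketch of this split-and-optimize design, assuming mse distortion, a training set given as an array of vectors, and code book sizes that are powers of two (names are hypothetical):

```python
import numpy as np

def lbg(training, size, iters=20, eps=0.01):
    """Split-and-optimize LBG design: start from the training-set centroid,
    perturb each code vector into a pair, and re-run the two-step
    minimization until the desired code book size is reached."""
    codebook = training.mean(axis=0, keepdims=True)
    while codebook.shape[0] < size:
        # point splitting [35]: each code vector becomes a perturbed pair
        codebook = np.concatenate([codebook + eps, codebook - eps])
        for _ in range(iters):
            # partition the training set around the fixed code book
            d = ((training[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)
            nearest = d.argmin(axis=1)
            # re-place each code vector at the centroid of its partition
            for j in range(codebook.shape[0]):
                members = training[nearest == j]
                if members.size:
                    codebook[j] = members.mean(axis=0)
    return codebook

# e.g., a 16-vector code book for 4-dimensional coefficient vectors
cb = lbg(np.random.default_rng(1).laplace(size=(5000, 4)), 16)
```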

The split-and-optimize method can be computationally tedious. It is expensive to design a full-blown code book for each and every step. It is pointed out by Hang and Haskell [44] that the computational load of LBG code book design can be eased by using the nearest neighbor (NN) algorithm of Equitz [36] to design the preliminary code books, and using the LBG algorithm for only the final stages of the design. Using their method, a computational load improvement of 20:1 was reported.

Other than the computational load problems of code book design, the LBG method has another problem. The code book is not guaranteed to be unique. A poor choice for the initial code book may result in a poorly constructed final code book. It is possible to arrive at significantly different final code books by using different starting sets.

Once the code book is designed, using it to code source vectors can also be computationally intensive, since LBG code books have no structural regularity that can be used for fast coding. Paliwal and Ramasubramanian [26] have suggested a method where the code book vectors are sorted by their expected hit probability to improve the coding time, and they also suggest an early exit method to stop unnecessary computing when calculating the distortion level for each code book vector. But, even with these improvements the coding process is still time consuming.

When considering an LBG design, its disadvantages must be weighed against the expected improvement in distortion performance that can be obtained. An alternative to the LBG vector quantizer is to use a more constrained design, such as can be obtained from the uniformly placed code book points of a lattice quantizer [27,32,33,34].

The code book elements of a lattice quantizer are made from integer multiples of a set of basis vectors $\{\mathbf{e}_j\}$. A lattice code point is defined to be

$$\mathbf{y}_i = \sum_j a_{ij} \mathbf{e}_j \tag{3-21}$$

where the $a_{ij}$ are integers. If the $a_{ij}$ and $\mathbf{e}_j$ are properly selected, source points can be coded more quickly than with an LBG code book. Some lattice quantizers can code at speeds proportional to the dimension of the source space. Such coding speeds are similar to that obtained when using a scalar quantizer.
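As a sketch of why lattice coding is fast, consider the simplest case, the scaled integer lattice: each code point is an integer combination of the basis $\{\Delta \mathbf{e}_j\}$, and the nearest code point is found by componentwise rounding, so the coding cost grows only linearly with the source dimension. (Practical lattice coders use richer bases; names here are hypothetical.)

```python
import numpy as np

def lattice_quantize(x, step=1.0):
    """Quantize x to the scaled integer lattice: rounding yields the integer
    weights a_ij of (3-21), and scaling returns the reconstructed point."""
    a = np.round(x / step)   # integer weights, one per dimension
    return a * step          # nearest lattice code point

y = lattice_quantize(np.array([0.4, -1.7, 2.2]), step=0.5)
```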

Another approach is only useful when the source has a simple pdf whose $n$-dimensional characteristics can be used to simplify code book design, such as with the uniform pdf [28,41]. Fischer [37] develops a quantizer using the properties of the Laplacian pdf. The quantizer code points are placed on the pyramidal surface that is defined by the mean of an independent identically distributed source. For large dimensions, Fischer's quantizer seems to approach the rate-distortion bound, and he offers a fast coding method to improve its coding speed. This quantizer may be useful as an alternative to the LBG quantizers used in Chapter 5, if it were desirable to improve the coding speed.

3.2 Transform coding

Distortion-rate theory leads one to the conclusion that if one can classify the ensemble of all images with a probability measure, then the ensemble can be coded in some way to meet the distortion-rate bound [74]. But two problems immediately come to mind when one tries to implement this theory. Firstly, it seems impossible to find a tractable model for the image ensemble. The possible number of, say, 512x512 images is incredibly large ($2^{8 \cdot 512 \cdot 512}$). It can be very difficult to adequately model ensembles of this size. Secondly, distortion-rate theory gives no useful information as to how one might design a coder that attains the bound it sets forth. Most systems that do approach the distortion-rate bound do so by requiring an immense amount of coding effort. Some simplifying scheme must be developed.

Block coding is one way to reduce the ensemble size, but even for small blocks, say 8x8, the ensemble is still very large ($2^{8 \cdot 8 \cdot 8}$). There is little conformity among the members of the ensemble, and most compression schemes require that the source have very definite and very amenable characteristics. Therefore, the problem is still hard to solve.

But it is known that the pels (picture elements) of small blocks are highly correlated for natural images. The high inter-pel correlation is where most spatial block coding techniques acquire their ability to compress image data. Also, as one would expect, this is where transform coding methods offer help. The correlation phenomenon tends to push most of the block energy into a small number of the transform coefficients, and these coefficients tend to be more statistically independent than their spatial domain counterparts. Most of the image can be coded using a few, specifically identifiable, transform coefficients. Since the transform coefficients are less dependent, more information per coded bit can be attained [25]. Additionally, some of the effects of perceptual errors are more easily modelled in the transform domain. In [16], as was mentioned in Chapter 2, a spectral model of human vision [5] is used to select more judicious DCT coefficient quantizer bit weightings.

An upper bound for the expected mse coding error of a transform source coder, $\sigma_r^2$, can be found by using the spectral flatness measure [1], which is used again in Chapters 6 and 7. The spectral flatness factor is defined to be

$$\gamma_{xy}^2 = \frac{\exp\left\{\dfrac{1}{4\pi^2} \displaystyle\int_{-\pi}^{\pi}\!\!\int_{-\pi}^{\pi} \ln S_{xy}(e^{j\omega_x}, e^{j\omega_y})\, d\omega_x\, d\omega_y\right\}}{\dfrac{1}{4\pi^2} \displaystyle\int_{-\pi}^{\pi}\!\!\int_{-\pi}^{\pi} S_{xy}(e^{j\omega_x}, e^{j\omega_y})\, d\omega_x\, d\omega_y} \tag{3-23}$$

$$0 < \gamma_{xy}^2 \le 1 \tag{3-24}$$

The flatter the psd, the closer $\gamma_{xy}^2$ is to 1, and the greater the resulting source coder distortion. The spectral flatness measure, being built from the fourier transform of the source autocorrelation function, $R_{xy}$, is generally formulated using the assumption of stationary statistics. Even though $R_{xy}$ can be formulated as a nonstationary function, it offers little immediate help in solving the source coder design problem. Instead, it can be simpler to model the mse problem using the gain factor representation

$$\sigma_r^2 = G\, 2^{-2R}\, \sigma_x^2 \tag{3-25}$$
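A numerical sketch of (3-23), assuming the psd is available as samples on a uniform frequency grid over $[-\pi,\pi]^2$ (names and the example correlation value are hypothetical):

```python
import numpy as np

def spectral_flatness(psd):
    """Two-dimensional spectral flatness: the ratio of the geometric mean to
    the arithmetic mean of psd samples. A perfectly flat psd gives 1."""
    s = np.asarray(psd, dtype=float)
    return np.exp(np.mean(np.log(s))) / np.mean(s)

# e.g., a separable psd of a first-order Markov source with correlation 0.95
w = np.linspace(-np.pi, np.pi, 128, endpoint=False)
s1 = (1 - 0.95**2) / (1 - 2 * 0.95 * np.cos(w) + 0.95**2)
gamma2 = spectral_flatness(np.outer(s1, s1))   # well below 1: easy to code
```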

3.2a KLT and DCT coding

A general two-dimensional unitary (orthogonal) transform is of the form

$$Y = A X A^T \tag{3-26}$$

$$X = A^T Y A \tag{3-27}$$

where $X$ is the source data matrix, $Y$ is the matrix of transform coefficients, and $A$ is the unitary matrix of eigenvectors.

When choosing the $A$ matrix to be used with a source coder some selection criterion must be used. One criterion selects a transform matrix that places as much energy as possible into a fixed number of the transform coefficients. If such a transform matrix is selected using an optimal mse fit [43], the Karhunen-Loeve transform (KLT) results. For the KLT the $A$ matrix is built from the eigenvectors of the source image covariance matrix.

If all the images to be coded are similarly stationary, then the $A$ matrix need only be constructed once and transmitted to the receiver. Then, by quantizing the transform coefficient matrix, an efficient low-rate representation of the original image can be coded for transmission. If not all of the coefficients are to be coded, due to bandwidth limitations, the coefficient-limited fit is still optimal.

Images are not stationary, and the inter-pel covariance matrices of any two images are hardly ever the same. What is usually done in practice is to design a new $A$ matrix for each image to be coded. Since the receiver does not know what the $A$ matrix is like for any given image, it must also be quantized and transmitted along with the transform coefficients. The coding overhead incurred in this process causes the KLT not to be a good method for designing low-rate source coders. If a standard transform matrix could be selected that would perform well for typical images, then the coding problems of the KLT method could be overcome, and all of the channel capacity could be used for the transmission of the transform coefficients alone.

The discrete cosine transform [19] (DCT) is such a transform: it uses the same transform matrix for all images and is known to code near the performance level of the KLT for almost all natural images [20]. Like the Fourier transform, its low-order transform coefficients represent low-order spatial frequency components in the original image. This makes the DCT desirable since most natural images tend to carry most of their signal energy in the low-order components. Often the higher frequency components can be ignored when coding for very low data rates. The DCT is used for all of the computer simulations of Chapter 5.

The elements of the $n \times n$ DCT transform matrix are

$$a_{ij} = c_i \cos\!\left(\frac{(2j+1)\, i\, \pi}{2n}\right), \qquad i, j = 0, 1, \ldots, n-1 \tag{3-28}$$

where the constants are

$$c_0 = \sqrt{1/n}, \qquad c_i = \sqrt{2/n} \ \text{ for } i > 0$$
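A short sketch (hypothetical names) that builds the transform matrix defined above and applies the forward and inverse transforms of (3-26) and (3-27), verifying that the matrix is unitary:

```python
import numpy as np

def dct_matrix(n):
    """The n x n DCT matrix: A[i, j] = c_i * cos((2j + 1) * i * pi / 2n)."""
    i = np.arange(n)[:, None]
    j = np.arange(n)[None, :]
    c = np.full((n, 1), np.sqrt(2.0 / n))
    c[0, 0] = np.sqrt(1.0 / n)
    return c * np.cos((2 * j + 1) * i * np.pi / (2 * n))

A = dct_matrix(8)
X = np.random.default_rng(2).uniform(0, 255, (8, 8))  # an 8x8 image block
Y = A @ X @ A.T          # forward transform, (3-26)
Xr = A.T @ Y @ A         # inverse transform, (3-27)
assert np.allclose(X, Xr) and np.allclose(A @ A.T, np.eye(8))
```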

Once the type of transform to be used is selected, several other considerations are important [21]:

• the transform blocksize,

• the coefficient truncation zone,

• the quantization types used to code the coefficients,

• the desired coding rate and the bit allocation in the coding zone and

• the final image quality (i.e., mse)

The specifics of how each of these items is used for the computer simulations of the dissertation are discussed in Chapter 5.

Once the transform is selected, the problem of choosing which coefficients of the transform block are to be coded, and the problem of assigning the number of bits used to code each of them, must be approached. The theory behind the bit assignment of scalar quantized coefficients is the subject of Chapter 6, and the theory of bit assignment for vector quantized coefficients is the subject of Chapter 7.

3.3 Block truncation coding

It is sometimes desirable to preserve some of the statistical properties of the original image in the reconstructed image. Delp and Mitchell [64, see also 45] developed a nonparametric image coding method, called block truncation coding (BTC), used to preserve the first (mean) and second moments of a block of monochromatic image data. Their method preserves the computed mean

$$\bar{X} = \frac{1}{m} \sum_{i=1}^{m} X_i \tag{3-29}$$

and the second moment

$$\overline{X^2} = \frac{1}{m} \sum_{i=1}^{m} X_i^2 \tag{3-30}$$

where $m$ is the number of pels in the block.

The quantization rule used by Delp and Mitchell sets the quantized value, $\hat{X}_i$, of $X_i$ by comparing it to the block mean:

$$\hat{X}_i = b \quad \text{if } X_i > \bar{X} \tag{3-31a}$$

$$\hat{X}_i = a \quad \text{if } X_i \le \bar{X} \tag{3-31b}$$

The values of $a$ and $b$ can be chosen to preserve $\bar{X}$ and $\sigma$ of the original data by solving two equations:

$$m\bar{X} = (m-q)a + qb \tag{3-32}$$

$$m\overline{X^2} = (m-q)a^2 + qb^2 \tag{3-33}$$

where $q$ is the number of pels whose value is greater than $\bar{X}$. The values of $a$ and $b$ that satisfy these equations are

$$a = \bar{X} - \sigma\sqrt{q/(m-q)} \tag{3-34a}$$

$$b = \bar{X} + \sigma\sqrt{(m-q)/q} \tag{3-34b}$$

Notice that $a < \bar{X}$ and $b > \bar{X}$, so the quantized data is distributed in a fashion that preserves the second moment. If the receiver is to be able to reconstruct the coded block of data, the values of $\bar{X}$ and $\sigma$ must also be transmitted along with the "ones and zeroes" of the quantizer. Delp and Mitchell used eight bit quantizers to code $\bar{X}$ and $\sigma$ for blocks of size 4x4. Their example accounts for a total coding rate of 32 (16+8+8) bits/block or 2 bits/pel.
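A sketch of one BTC block following (3-29) through (3-34); the names are hypothetical, and the flat-block case (every pel on one side of the mean) is handled separately as an assumption:

```python
import numpy as np

def btc_encode(block):
    """Transmit the block mean, the standard deviation, and a one-bit-per-pel
    map of which pels lie above the mean (the "ones and zeroes")."""
    mean = block.mean()
    sigma = block.std()          # population std: sqrt of preserved variance
    bits = block > mean          # the one-bit quantizer map of (3-31)
    return mean, sigma, bits

def btc_decode(mean, sigma, bits):
    m = bits.size
    q = int(bits.sum())          # pels above the mean
    if q in (0, m):              # degenerate flat block
        return np.full(bits.shape, mean)
    a = mean - sigma * np.sqrt(q / (m - q))        # (3-34a)
    b = mean + sigma * np.sqrt((m - q) / q)        # (3-34b)
    return np.where(bits, b, a)

block = np.random.default_rng(3).uniform(0, 255, (4, 4))
rec = btc_decode(*btc_encode(block))
assert np.isclose(rec.mean(), block.mean())   # the mean is preserved
```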

The least rate that can be attained using BTC is 1 bit/pel, the output rate of the one bit quantizer. Since the values of $\sigma$ and $\bar{X}$ vary from block to block, some overhead must be expected. The amount of overhead depends upon the blocksize and the number of bits used to code $\sigma$ and $\bar{X}$.

A modified BTC algorithm is used as part of one of the Mixture Block Coding (MBC) systems for monochromatic images in the computer simulations of Chapter 5. The algorithm uses a more sophisticated function to estimate the mean than has been shown above. For BTC the background estimate of the block is a flat surface whose value is set equal to the block mean. The modified algorithm allows the background estimate to be a smooth non-flat surface by using the MBC transform coefficients.

In review, several ideas important to image data compression as they apply to the material of this dissertation have been discussed in this chapter. If image data is to be compressed to a rate that is less than the entropy rate, some degradation in the image must be accepted. One way to induce this degradation in an orderly fashion is through quantization. When coding images, the performance of the quantizer may be enhanced by using transformation techniques. Block truncation coding can also be used to advantage in a lossy image source coder. Each of these techniques will be used in the mixture block coding and progressive transmission methods developed in Chapter 5. Before moving to the material of Chapter 5, the use of progressive transmission image coding is discussed in Chapter 4.

Chapter 4.

Progressive Transmission of Images

In this chapter the concepts of progressive transmission are introduced and motivated.

Some representative techniques found in the literature are discussed. Then, in Chapter 5, the progressive transmission scheme developed for this dissertation will be presented.

Unlike the more conventional coding methods that pass through the source image only once, a progressive coder passes through the image many times. In each pass more detail of the original image is sent to the receiver. The reconstructed image is updated as a whole during each pass. There is no need to wait for the entire image to be coded before a full-field approximation of it can be sent to the receiver. The first-pass image is sparsely coded and can be made available almost immediately. The delay required before this first field can be regenerated at the receiver is only dependent upon the time it takes to code and transmit the first pass. The ability to send a full-field image quickly, even if it is a crude image, is very useful.

It has been shown [61,62,63] that the progressive transmission of image data over low-bandwidth channels is esthetically more appealing than the conventional pel-by-pel or block-by-block transmission methods. In the latter cases, the image is coded by scanning each image segment only once. Then the coder moves on to the next segment, never to return again.

Usually, the "code and go" methods lead to an image reconstruction that is scanned from side-to-side going from top-to-bottom. When using a low-bandwidth channel, data transmission takes a long time, and the image is updated very slowly. In fact, the process can be slow enough to cause the observer's attention to fix itself upon the scanning process instead of staying with the image. The observer is found to be studying coding artifacts instead of the image, and the entire process can become unproductively tedious.

In contrast, when using progressive transmission the source data is coded quickly, and only the gross features of the data are transmitted. The receiver can reconstruct a crude approximation of the image field in a very short time. Thus, there is a complete image, albeit a poor one, available at the receiver ready for viewing at the earliest possible time.

In additional passes, the data treated superficially (or ignored completely) in the first pass is coded more carefully and sent to the receiver again. The receiver updates its previous version of the image, increasing the amount of visible detail. An update is done on the full image field for each pass, so the image seems to improve as a whole, without the undesirable side effects found with the single pass coding methods. This iterative process continues until some predetermined stopping criterion is met.

There are stopping criteria that divide progressive transmission source coders into two distinct categories. The first criterion requires that the source image be reconstructed without error. In this case, progressive transmission is used to implement a lossless source coder. The second criterion is not as strict: the transmission of data can be stopped at any time, and the reconstructed image does not need to be a perfect replica of the source image. This is a lossy coding scheme and is useful when low data rates are more important than perfect image reconstruction.

Most of the work found in the literature is concerned with lossless progressive transmission. But these methods can all be terminated early, making them useful as lossy coders.

Since PT coders pass through the same data repeatedly and transmit information about the same image region many times, they usually require more channel capacity to transmit an image than other coding schemes. Therefore, the positive features of progressive transmission must offset its channel inefficiency. Depending on the specific method used, the additional overhead can range from a few percent to 50 percent or more. As will be shown later, the PT coder of this dissertation requires little or no overhead when coding at low and medium coding rates.

The next section presents some progressive transmission examples found in the literature. They are selected to show the variety of methods available and are divided into two coding philosophies, according to whether the image is coded with fixed or variable blocksizes. They are used to introduce the progressive transmission adaptation of the mixture block coding methods given in Chapter 5.

4.1 Progressive image coding examples from the literature

Four examples of fixed blocksize progressive transmission are discussed below. They are based upon transmitting the lower-frequency data first. Then four variable-blocksize progressive transmission methods are discussed. They are all quasi-filtering methods where the blocksize determines the "frequency" range of transmission. Larger blocks code lower frequency ranges and smaller blocks code higher ones.

4.1a Fixed blocksize methods

Ngan [53] blocks the image, computes the transform coefficients for all the blocks, and successively passes them to the receiver from lowest to highest frequency order, thus creating a lowpass version of the image as early as possible. The higher frequency coefficients are transmitted progressively.

Wang and Goldberg [46] use a fixed blocksize transform coding [54,55,56] of the image as a starting place for a lossless progressive coder. In the first pass, the transform coefficients are LBG vector quantized and sent to the receiver. Then the quantized transform coefficients are subtracted from the original coefficients, and these residual coefficients are again vector quantized, with a new code book that is built to match the expected residual coefficient profile of the image. Their subtract-and-code process is continued, coding a new set of residual coefficients at each pass. The residuals at each stage are orthogonal to the previous stages, and the residuals become more and more decorrelated. In the final pass, the decorrelated residuals are entropy coded and sent to the receiver. They suggest that approximately eight passes offer the least overall coding rate. Since new LBG code books need to be built for each pass, their method would be expensive to implement.

Dubois and Moncet [57] develop a progressive transmission method for transform coding composite NTSC (color) still images by using a variable-rate transform coding method developed in another of their papers (e.g., [58]). In their method, all transform coefficients that fall below a given threshold are set to zero. The transform coefficients are coded by scanning through them and sending all the non-zeroed coefficients using a Huffman symbol set, and coding the zeroed coefficients using another 0-runlength Huffman symbol set. All coefficients are coded using scalar quantizers. In one experiment they use variable length coding [58], and in another they use fixed length coding but vary the quantizer step size [59].

Hoffman and Troxel [65] describe a method where progressive coding is terminated early based upon a want/don't-want user response from a telebrowsing-like system. As mentioned above, many progressive methods require extra bandwidth to complete the transmission of an image. This system relies on the fact that most images being browsed are terminated early, as don't-want images, and are not completely transmitted, thereby saving channel bandwidth. When an image is asked for, the first-pass approximation is transmitted as a lowpass representation that is constructed from the output of a subsampled 2-D Gaussian filter. The subsampling rate and filter standard deviation are chosen so that 60% of the image energy is coded within each pass. The subsampled filter coefficients are then DPCM coded for the first pass, and PCM coded for additional passes. The filter coefficients are transmitted using an entropy code. The main disadvantage of this method is that the filter characteristics are different for each image and must be designed by trial and error. Since their system is used for telebrowsing, this problem was not considered significant because most of the coding process can be done off-line.

4.1b Variable blocksize methods

Sloan and Tanimoto [47,48] use "pyramid" data structures to sequentially code the image using smaller blocks. Each block is coded to represent some function of the pels it contains (such as minimum value, maximum value, or mean value). Progressive transmission is achieved by using a finer pyramid structure at each pass that codes more detail of the image.

Their method uses approximately 33% more data than nonprogressive coding.

Adelson and Burt [50,52] define a pyramid quasi-filter structure that approximates a pseudo-lowpass filter in the first passes through the image data. They use filters with a higher "cutoff" frequency by using smaller pyramids in the subsequent passes. In each level the current pyramid outputs are subtracted from the previous pass data, thus decorrelating the passes. Kronander [51] offers a method for improving the coding efficiency of the pyramid method. The major disadvantage of these methods involves the need to save every pass of the image until it is completely coded, since the image is transmitted in reverse order. Thus, all filtering computations must be complete before the image transmission can begin. This means there is an inherent delay in the time required to receive the first coded symbols.

Knowlton [49] uses a block structure to progressively transmit images. He codes the block means in the first pass, and in subsequent passes divides the blocks into two pieces and codes them using a "differentiator" function. The differentiator value is found using a lookup table that is indexed using the data of the previous pass and the data of the original image.

Dreizen [60] also codes images by partitioning them into a quad tree. Each branch of the tree is subdivided using a busiest-information measurement that is based upon the difference between the maximum and minimum pel inside the considered block. If the measurement is greater than some predetermined threshold, the block is subdivided into four pieces. The blocks are coded uniformly using a representative pel value, chosen as a function of the pel value found in the upper righthand corner of the block. Data compression is enhanced by taking advantage of the expected correlation between representative values of a subdivided block, and by using Huffman coding for the block representatives. The process terminates if all the coded blocks meet their associated threshold, or if the smallest blocks coded consist of single pels. A modification is also mentioned that uses a BTC method based upon [64].

In review, progressive transmission has been introduced in this chapter through several example systems taken from the literature. Progressive transmission is an image coding technique that gives the user a sequence of coded versions of the original image that increase in perceptual quality as each new image is received. The coded image from one pass is updated in the next. This allows the user to receive a full-field version of the image much sooner than is possible with the more conventional "single-pass" source coding techniques. In the next three chapters the progressive transmission and mixture block coding techniques of this dissertation are developed. In Chapter 5, these methods are outlined and used to code several example images. Theoretical considerations that are important to the design of such source coding systems are developed in Chapters 6 and 7.

Chapter 5.

Mixture Block Coding and Mixture Block Coding with Progressive Transmission

In this chapter, the mixture block coding systems developed for this dissertation are presented. The MBC systems are used for both single-pass image reconstruction and for multi-pass image reconstruction. The multi-pass systems are modifications of the single-pass systems that allow for progressive transmission. The universal design assumptions that are used for all of these systems and the specific requirements of each different system are discussed. Examples are included to demonstrate their viability as low-rate source coders.

Mixture block coding is a variable-blocksized transform coding technique that codes each image region with different sized blocks. Low-complexity regions are coded with large blocks, and high-complexity regions are coded with small blocks. The number of blocks used to code an image is dependent upon the complexity of the original image, the complexity of the coder used for each blocksize, and the desired quality of the reconstructed image.

The advantage of using mixture block coding is that coding resources are used only as they are needed. The more complex image regions are coded with smaller blocks, so more bits are used to code each pel. Regions of low complexity are coded with large blocks and fewer bits per pel. This approach is a departure from other coding methods, such as DPCM (Differential Pulse Code Modulation) and standard transform coding [1], which use the same number of bits to code every region of the image without regard for its complexity.

Variable blocksize source coding can be found in the recent literature (e.g., Dreizen [60] or Vaisey and Gersho [66]). The method of this dissertation is similar to the method of Vaisey and Gersho, since both methods use a distortion-threshold quad tree structure, but the method developed here addresses progressive transmission while the Vaisey and Gersho method does not. The method of this dissertation also includes a modified block truncation extension, while Vaisey and Gersho's is strictly a transform coding method.

In Section 5.1, the selection of the largest and smallest blocksizes used to code the image, and how the different blocksizes affect the average coding rate, are discussed. In Section 5.2, the details of the different MBC and MBC/PT methods and the various quantizers used to code the computer examples are discussed. In Section 5.3, the computer results are given and several important considerations are discussed.

When using MBC methods, the image is first coded using the largest blocksize transform coder. The difference between the original and coded image is measured for each block, and if the residuals fall below a given threshold the block is not coded further. If the block fails the threshold test, it is divided into four smaller blocks and recoded. The code-and-test procedure is repeated for each of these smaller blocks. The blocks are recoded until they meet the threshold criterion set for that blocksize, or until all blocks that do not meet the threshold criterion are those of the smallest blocksize. The smallest blocks are transmitted as they are, with no further attempt made to improve their coded quality.

The complexity of the original image will determine the number of blocks used to code it. Large blocks will adequately code regions that have little complexity. Large blocks used to code complex regions that fail the threshold test are divided into smaller blocks. Therefore, more blocks are used in an attempt to improve the coded image representation in these more difficult regions. Difficult images are coded with more blocks than less difficult images. Since the number of bits used to code an image is dependent upon the number of blocks used, the more complex images will require more bits than the simpler images. A mixture block coder adapts the number of coding bits used to represent an image to match its complexity.

In general, there are several specific features of the MBC method that must be determined before it can be used to code an image. They include:

• the complexity of the coder used for each blocksize must be chosen,

• the minimum and maximum blocksizes must be selected,

• the block distortion measure must be chosen and

• the block threshold levels must be selected.

The consequences of each of these choices and the selection made for the simulations used to demonstrate the feasibility of MBC are discussed below. Before going into the details of the specifics of the example implementations, the general algorithm for implementing the MBC method will be discussed. Once this is done, how the base algorithm is implemented for simulations will be presented.

The details of MBC are presented through a simple coding example. It is worked through by hand to demonstrate the tree structure of MBC. Then the modifications needed to use progressive transmission with MBC and MBC/PT are introduced. Before the computer results are presented, the details of the simulation MBC method are discussed along with the various quantizing schemes used within these implementations.

5.1 Design considerations

A complex block coder is one that codes a large amount of information about the blocks it codes in a very accurate manner. For transform block coders, a larger number of transform coefficients would be coded for a complex coder, and a small number of coefficients for a simple coder. A low-complexity block coder transmits very few coefficients that can be coarsely quantized. The more coefficients that are coded, and the more bits that are assigned to code them, the less likely it is that the block will fail its distortion threshold and be recoded with smaller blocks. Since smaller blocks probably require more channel capacity on a per pel basis, it is desirable to use a complex coder for the larger blocks to save channel resources. But if the images to be coded contain large areas of small detail, then coding large blocks with too many coefficients wastes channel resources because these image segments are overcoded. It is apparent that there is a tradeoff in block complexity and the total number of blocks required. The previous chapter discusses some of the mixture block coding methods available in the literature.

5.1a The largest blocksize

The largest blocksize, and the complexity of its coder, determines the lowest coding rate for any image. If an image is of very low detail, for example, a black-and-white image that is monotonically gray, then a single block with one coefficient could be used to code the entire image. Generally, an image has some detail even in its regions of least complexity, and very large blocksizes are most likely not of much use since they will almost always be subdivided. The continual subdividing of blocks that are too large causes the coder to waste time that could have been more wisely spent if it had started with a smaller blocksize. Usually a blocksize of 16x16 is about as large as is needed when using more than a single transform coefficient to code the blocks.

The number of bits assigned to code the coefficients of the largest blocksize determines the minimum rate at which an image can be coded. For example, if 16x16 is the largest blocksize used to code a 512x512 image, and a total of 24 bits are used to code each of these blocks, then the entire image can be coded with

$$24 \cdot \frac{512^2}{16^2} = 24{,}576 \text{ bits, or } 0.0938 \text{ bits/pel} \tag{5-1}$$

There is a tradeoff between the lowest expected coding rate and performing unnecessary work recoding blocks that are continually too large.

5.1b The smallest blocksize

The smallest blocksize determines the fine-detail visual quality that is achievable around important image features. Since the coding blocks are square, it is possible to find staircase blockiness in the coded edges of spherical objects (e.g., eyes) if the minimum blocksize is too large. Usually, unless a large number of coefficients are coded per block in the large blocksizes, the smallest blocksize needed is either 4x4 or 2x2. Some authors use blocksizes as small as 1x1, but they are usually coding blocks with only a single value [60].

The smallest blocksize not only determines the quality of local detail that can be obtained, but it determines the maximum coding rate that can be expected when coding very complex images. For example, if 2x2 is the smallest blocksize used and 24 bits are used to code these blocks, then a 512x512 image can be coded with at most

$$24 \cdot \frac{512^2}{2^2} = 1{,}572{,}864 \text{ bits, or } 6.0 \text{ bits/pel} \tag{5-2}$$

This total (and that needed for the minimum coding rate) does not reflect the overhead required to tell the receiver of the actual block structure used to code any particular image. The details of the tree structure used to code the various levels of blocks is discussed next, and the overhead required to use such a structure is presented. The idea to be noticed here is the range of coding rates that can be achieved by an MBC system. If the effects of overhead are overlooked, an MBC system that uses the largest and smallest blocksize coders indicated above will code an image with a rate ranging from about 0.1 to 6 bits/pel. The exact rate depends upon the complexity of the image being coded and the desired quality required in its final coded form.

5.1c Quad tree structure and the MBC coding rate

The different blocksizes used to MBC or MBC/PT code an image are interconnected using a quad tree structure. The amount of overhead required to code a quad tree, and how the overhead affects the average coding rate, are discussed below. There is no overriding reason for using a quad tree structure, but it was found to be useful in the computer simulations.

The quad tree (see for example, Figure 5.1) algorithm used here divides a given block into four smaller blocks when the distortion threshold is exceeded. The operation of the tree structure is presented by stepping through the details of an example. Consider the coding of a 16x16 block with a minimum blocksize of 2x2.

The 16x16 block is coded and the distortion level of the block is measured. If the distortion is greater than the predetermined maximum level for 16x16 blocks, dmin(16x16), the block is divided into four 8x8 blocks for additional coding. After each of the 8x8 blocks is coded, their resulting distortion levels are compared with the 8x8 distortion level, dmin(8x8). If any fail to meet the distortion threshold, they are divided into four 4x4 blocks for additional coding. The process is continued until the only image blocks not meeting their given distortion threshold are those of size 2x2. Since 2x2 is the smallest allowed blocksize, these blocks are coded and transmitted de facto, making no further attempt to improve their distortion level.

Consider the example coding of a 16x16 image block shown in Figure 5.2a. For clarity, let the four sub-blocks of an arbitrary block be numbered as shown in Figure 5.2b. Let the block distortion thresholds be:

dmin(16x16) = 12,

dmin(8x8) = 12 and

dmin(4x4) = 10.

After coding the 16x16 block it is found that, say, d(16x16)=30, so it must be divided and recoded as four 8x8 blocks. Here, as shown in Figure 5.2, the four coded 8x8 blocks have distortion levels of 10, 15, 4, and 6. Now, since one of these 8x8 blocks fails to meet dmin(8x8), it must be divided and recoded as four 4x4 blocks. Since the 4x4 distortion levels for the divided 8x8 block are 10, 8, 15, and 6, only one of these 4x4 blocks fails to meet dmin(4x4). That block is recoded as four 2x2 blocks.

The above coding process is depicted as a quad tree structure in Figure 5.1. There is one 16x16 block, four 8x8 blocks, four 4x4 blocks and four 2x2 blocks, for a total of 13 blocks of coded information, which are connected together through the use of a relative pointer scheme. To guarantee the receiver reconstructs the image correctly, a bit map of side information is sent along with the block coefficient information. One bit of side information is needed for each block. If a block is to be divided, its bit value is set to 1; if not, its bit value is set to 0. To tell the receiver the ordering of these 13 coded blocks, 9 bits of side information are needed: 1,0100,0010. (The first bit shows the 16x16 block is divided into 8x8 blocks. The next four bits indicate that only the second 8x8 block is divided. The last four bits indicate the state of the 4x4 blocks obtained from the divided 8x8 block; they show only the third 4x4 block is divided into 2x2 blocks.) Notice the 2x2 blocks of this example are placed with the bit maps generated at the 4x4 level, so no side information is needed to code them.
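A compact sketch of this code-and-test recursion, under illustrative assumptions: the names are hypothetical, the stand-in "coder" keeps only the block mean (where the systems of this chapter code four DCT coefficients), and the distortion measure is the maximum absolute difference used later in Section 5.2c.

```python
import numpy as np

def mbc_code(block, code_fn, d_min, min_size, bits):
    """Code a block; if its maximum absolute error exceeds the threshold for
    this blocksize, append side-information bit 1 and recode the four
    quadrants, otherwise append bit 0 and keep the coded block. Smallest
    blocks are kept de facto and carry no side-information bit."""
    n = block.shape[0]
    rec = code_fn(block)
    if n == min_size:
        return rec
    if np.abs(block - rec).max() <= d_min[n]:
        bits.append(0)                     # block passes its threshold
        return rec
    bits.append(1)                         # fail: divide into four quadrants
    h = n // 2
    out = np.empty_like(rec)
    for r in (0, h):
        for c in (0, h):
            out[r:r+h, c:c+h] = mbc_code(
                block[r:r+h, c:c+h], code_fn, d_min, min_size, bits)
    return out

mean_coder = lambda b: np.full_like(b, b.mean())   # one-coefficient stand-in
bits = []
img = np.random.default_rng(4).uniform(0, 255, (16, 16))
rec = mbc_code(img, mean_coder, {16: 12, 8: 12, 4: 10}, 2, bits)
```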

It can be seen that there is one overhead bit per block in each pass, except for the blocks of the last pass, which require no overhead because they cannot be divided. Consider the case where an MBC system that has more than one pass is used.¹ Let the number of blocks tested in the $i$-th pass be $b_i$, let the number of blocks that fail the threshold criterion be $f_i$, and let the number that do not fail be $p_i$. Notice it is true that $b_i = f_i + p_i$. The number of bits needed to code the first pass is

$$p_1 r_1 + b_1 \tag{5-3}$$

where there are $b_1$ bits of overhead to tell the receiver which of the first pass blocks have failed, and each block is coded with $r_1$ bits. If a quad tree structure is used to divide the blocks that fail the first pass, then the number of blocks in the second pass is $b_2 = 4f_1 = f_2 + p_2$. The number of bits needed to code the second pass is

$$p_2 r_2 + b_2 = p_2 r_2 + 4f_1 \tag{5-4}$$

¹If the coder has only one pass, then the number of overhead bits is zero. In fact, such a system cannot be considered a mixture block method at all, except as a degenerate case.

The number of bits needed to code each subsequent pass is

$$p_i r_i + b_i = p_i r_i + 4f_{i-1} \tag{5-5}$$

(for the last pass the overhead term is dropped, since those blocks cannot be divided). The total number of bits needed to code the image is

$$\sum_{i=1}^{n} p_i r_i + b_o \ \text{ bits} \tag{5-6}$$

and the average coding rate is

$$R_{MBC} = \frac{1}{N}\left(\sum_{i=1}^{n} p_i r_i + b_o\right) \ \text{ bits/pel} \tag{5-7}$$

where $N$ is the number of pels in the image. The total number of overhead bits, $b_o$, is

$$b_o = b_1, \qquad n = 2 \tag{5-8}$$

$$b_o = b_1 + 4\sum_{i=2}^{n-1} f_{i-1}, \qquad n > 2 \tag{5-9}$$

Consider the minimum rate at which an image can be coded using MBC. It is obtained when the entire image is coded with a single pass, and occurs when the distortion threshold is set high enough to guarantee that all of the image is coded with the largest blocksize. When this is true, (5-6) reduces to (5-3) and the minimum rate for (5-7) is

$$\min R_{MBC} = \frac{p_1 r_1 + b_1}{N} \tag{5-10}$$

If it were known ahead of time that the image was to be coded in a single pass, the overhead bits of (5-10) would not be necessary. But for the general multipass coder this is not the case. For a multipass coder the overhead bits are all set to 0, and the receiver would know that no smaller blocks are to be transmitted. Using the example of Section 5.1a, the minimum rate when coding a 512x512 image is obtained by coding the image with 1024 16x16 blocks:

$$\min R_{MBC} = \frac{1024 \cdot 24 + 1024}{512^2} = 0.0977 \text{ bits/pel} \tag{5-11}$$

The maximum rate that can be used to code an image using MBC is obtained when the threshold is set to guarantee that all blocks are divided into the smallest allowed blocksize. (This can be guaranteed if $d_{min}$ is set to 0 for all blocksizes.) All of the overhead bits for all passes are set to 1, and (5-7) becomes

$$\max R_{MBC} = \frac{b_n' r_n + \sum_{i=1}^{n-1} b_i'}{N} \tag{5-12}$$

where the $b_i$ have been set to their maximum values, $b_i'$. The $b_i$ are maximal when the entire image is coded with the $i$-th blocksize. If the $i$-th blocksize is $n_i \times n_i$, then

$$b_i' = \max b_i = N/n_i^2 \tag{5-13}$$

Using 16x16 as the largest blocksize, the 2x2 smallest blocksize example of Section 5.1b gives a maximum coding rate of

$$\max R_{MBC} = \frac{1{,}572{,}864 + (1024 + 4096 + 16{,}384)}{512^2} = 6.0820 \text{ bits/pel} \tag{5-14}$$
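These two extreme rates can be checked directly; the short script below (assuming the block and bit counts of the examples above) reproduces (5-11) and (5-14):

```python
# 512x512 image, 24 bits/block, largest blocksize 16x16, smallest 2x2
N = 512 * 512
blocks_16 = N // 16**2                        # 1024 largest blocks
min_rate = (blocks_16 * 24 + blocks_16) / N   # coefficients + overhead map
blocks_2 = N // 2**2                          # 65,536 smallest blocks
overhead = N // 16**2 + N // 8**2 + N // 4**2 # maps for all but the last pass
max_rate = (blocks_2 * 24 + overhead) / N
print(min_rate, max_rate)                     # 0.0977 and 6.0820 bits/pel
```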

If the original image can be divided into $N_i$ blocks in the $i$-th pass, the coded image percentage for this pass is

$$P_i = \frac{p_i}{N_i} \tag{5-15}$$

Since the pels of the image are coded in only one of these $n$ passes, it must be true that

$$\sum_{i=1}^{n} P_i = 1 \tag{5-16}$$

In contrast to (5-16), notice that a progressive system codes complex regions of the image with more than one pass. This fact shows that (5-16) does not hold for MBC/PT. The coding rate and overhead for MBC/PT will be discussed in the next section.

5.1d The MBC/PT coder and its coding rate

Notice that the tree structure of the last section made no mention as to whether each 16x16 block was coded as a whole from the largest to smallest blocksize, or whether the entire image is coded with the larger blocksize before any of the blocks that fail the threshold test are coded with the next smaller blocksize. For MBC it does not matter which of these methods is used. But the latter method is particularly useful when designing a progressive transmission coder.

If the entire image is passed through for each blocksize, and the difference image is saved for additional coding in the next pass using a smaller blocksize, the MBC method can be used as a progressive transmission coder. If each pass is transmitted to the receiver as soon as it is available, the receiver can reconstruct a crude representation of the original image using the larger blocksizes while the MBC/PT coder is processing the smaller blocks of the next pass.

The information received from coding the difference image in each subsequent pass is reconstructed using smaller blocks. The received image is updated with these smaller blocks, and it acquires more clarity with each pass. There are no serious problems in updating MBC for use as a progressive transmission coder. If each MBC pass is successively transmitted, then progressive transmission is immediately available. Since the first pass is coded with very few bits, the receiver has an image, although a crude image, almost immediately. If the successive passes are made to code the difference image, instead of the original, the reconstructed image can be updated differentially at each pass. Thus, with this modification, progressive transmission can be achieved.

As is the case for MBC, MBC/PT updates only those image regions that have coded poorly in previous passes. Only those regions of the image which need additional coding continue to use coder resources. The regions of the image with low detail are coded quickly, and remain fixed, while the rest of the image continues to change as the information for each pass is received.

The high-detail regions of an image are coded more than once when using the MBC/PT method, so they require a greater channel capacity to transmit their coefficients. MBC/PT uses more channel resources to code high-detail regions because it must transmit data that codes the final pass and every previous pass. This means that if the MBC/PT coder uses the same number of blocks and the same number of bits to code each block as an MBC coder, the image will be coded with more total bits. This is offset by the fact that the original image can be converged to more quickly when using MBC/PT, because the busy regions are coded with more than a single pass of information. Therefore, it is possible for MBC/PT to use fewer blocks to code an image than MBC, and it is not obvious which system will code an image with fewer bits.

Let the same definitions for $b_i$, $f_i$ and $p_i$ as were used for MBC apply here. The number of bits needed to code the first pass is

$$b_1 r_1 + b_1 \tag{5-17}$$

Notice that all blocks are coded in the first pass, therefore $b_1 = b_1'$. Every MBC/PT block that failed in the previous pass is recoded as a subdivided block in the new pass. Using the quad tree structure, the number of blocks in the $i$-th pass ($i > 1$) is $b_i = 4f_{i-1}$, and the number of coding bits is

$$b_i r_i + b_i = 4(f_{i-1} r_i + f_{i-1}) \tag{5-18}$$

The total number of bits needed to code the entire image is found by summing over all coding passes,

$$\sum_{i=1}^{n} (b_i r_i + b_i) = (b_1 r_1 + b_1) + \sum_{i=2}^{n} 4(f_{i-1} r_i + f_{i-1}) \ \text{ bits} \tag{5-19}$$

where the second sum is zero when $n = 1$. The final coding rate is

$$R_{MBC/PT} = \frac{1}{N} \sum_{i=1}^{n} (b_i r_i + b_i) \tag{5-20}$$

where $N$ is the number of image pels. The total number of overhead bits for an MBC/PT system is

$$b_o = b_1' + \sum_{i=2}^{n} 4f_{i-1} \tag{5-20a}$$

The image percentage coded in the $i$-th pass is

$$P_i = \frac{b_i}{N_i} \tag{5-21}$$

and since every block that is coded in the $i$-th pass has been coded in every previous pass, it is true that

$$P_i \ge P_j \quad \text{for } i < j \tag{5-22a}$$

and

$$\sum_{i=1}^{n} P_i \ge 1 \tag{5-22b}$$

where equality occurs only when the image is coded in a single pass.

The minimum coding rate for MBC/PT is obtained when only the first pass is coded. In this case, (5-20) becomes

$$\min R_{MBC/PT} = \frac{b_1 r_1 + b_1}{N} \tag{5-23}$$

which is the same as the rate that was obtained for MBC in (5-10). For similar block coding types, it is true that the minimum MBC and MBC/PT rates are always equal:

$$\min R_{MBC} = \min R_{MBC/PT} \tag{5-24}$$

Using the example of Section 5.1a, the minimum rate can be computed to be the same as is found by (5-11):

$$\min R_{MBC/PT} = \min R_{MBC} = 0.0977 \text{ bits/pel} \tag{5-25}$$

The MBC/PT maximum rate is obtained when all passes are coded without any blocks passing the threshold test. Therefore, the maximum coding rate is

$$\max R_{MBC/PT} = \frac{\sum_{i=1}^{n} (b_i' r_i + b_i')}{N} \tag{5-26}$$

where the $b_i'$ are the maximal values of $b_i$ as found by (5-13). The maximum sum is computed recursively using the maximal $b_i$ for each pass. Noticing that for MBC/PT the $b_i$ attain their maximal values in every pass, comparison with the maximal MBC rate of (5-12) gives

$$\max R_{MBC/PT} \ge \max R_{MBC} \tag{5-28}$$

Equality is obtained only when the image is coded with only one pass. (This assumes we avoid the trivial case where $r_i = 0$ for all $i > 1$.) Using the same block coder types as were used for the maximum MBC rate example, the maximum MBC/PT rate is

$$\max R_{MBC/PT} = \frac{\sum_{i=1}^{4} 4^{i-1}(1024 \cdot 24 + 1024)}{512^2} = 8.3008 \text{ bits/pel} \tag{5-29}$$

Comparing the coding rates of this and the last section may lead one to believe MBC outperforms MBC/PT. In general, this is not true, since MBC codes any image pel only once, while MBC/PT can code any pel more than once. The original image may be converged to more quickly with MBC/PT than with MBC. As a result, MBC/PT may code the entire image with fewer blocks, and fewer bits, than can be achieved by MBC.

5.2 The MBC and MBC/PT simulators

The main advantage of MBC systems is that if a large block does not adequately code a given image region, it is divided into smaller blocks and recoded. There is no strict advantage in using a large number of coefficients to code any particular blocksize. In fact, there is a tradeoff between expending more effort coding the larger blocks so fewer smaller blocks are used, and coding the larger blocks minimally to let the threshold algorithm assign more smaller blocks for coding.

For the simulation examples of this dissertation, each block is coded with one of two different methods. The first method uses only four DCT transform coefficients, including the dc and three lowest order frequency coefficients (Figure 5.3). It is used for MBC and MBC/PT of both monochrome and color images. The second method uses these four DCT coefficients and a modified form of block truncation coding. The use of BTC limits the lowest rate that can be used to code an image to one bit/pel, so it is not useful at lower rates, but it does offer the ability to easily code sharp edges in the image without using an excessive number of small blocks. Each of these methods is discussed below.

5.2a The DCT block coders

As was mentioned above, it is not necessary to code a large number of transform coefficients when using MBC. The example MBC (and MBC/PT) systems used here were all designed to use only the four lowest order DCT transform coefficients of the coded blocks. This is done no matter what size block is coded; not so much to attain the best overall coding rate, but to strike a median between PT coders which code a minimum of information about a given block [49] and those which code a large amount of information per block [57]. This accomplishes two things.

First, it shows an image can be adequately coded in a relatively small number of passes (at most four for the examples here) using a small number of transform coefficients at each pass; and second, it shows that the coding can be done using a simple algorithm for each pass. In addition, when using the same transform coder for each pass it is also possible to share the same vector quantizer between all of the passes. Using this method saves quantizer design effort. The quantizer design process will be discussed in the next section.

When using a small number of coefficients to code large blocks of data (> 16x16) it was found that coding with so few coefficients induced large coding errors at pels near the block edges. Therefore, it is necessary to limit the largest blocksize accordingly. For the computer simulations the maximum blocksize was set at 16x16. Since four coefficients are coded per block this limits the smallest blocksize to 2x2.

Using 2x2 as the smallest blocksize can increase the average coding rate significantly without improving the SNR. But, it was found that the overall visual crispness of the image was improved by using 2x2 blocks. In the final analysis, the choice of the smallest blocksize depends upon the quality desired and the allowed capacity of the system being designed.

5.2b Using block truncation coding with MBC

Block truncation coding can be used to include information about the high frequency content of an image block. The one bit quantizer output can be useful in marking image edges and fast contours in a way that cannot be attained using only low frequency transform coefficients. The BTC method of Delp and Mitchell is modified here to allow it to be used with the four coefficient DCT coders used with the MBC systems of this chapter.

Instead of using the block mean as the threshold for the quantizer, an estimate constructed from the four lowest frequency DCT coefficients is used. In this case, the second moment is redefined

$$\bar{\sigma}^2 = \frac{1}{m} \sum_{i=1}^{m} (X_i - \tilde{X}_i)^2 \tag{5-30}$$

where the $\tilde{X}_i$ are the DCT-estimated pel representations in an $n \times n$ block of data. The new second moment measures the mean difference between the original block data and its estimate, instead of the block energy used in the algorithm developed by Delp and Mitchell. Using the new second moment in a modified version of the Delp and Mitchell preserving equations

$$m\bar{X} = (m-q)a + qb \tag{5-31a}$$

$$m\bar{\sigma}^2 = (m-q)a^2 + qb^2 \tag{5-31b}$$

the output of the one bit quantizer for each pel becomes dependent upon its DCT pel estimate,

$$\hat{X}_i = \tilde{X}_i + \bar{\sigma}\sqrt{(m-q)/q} \quad \text{if } X_i \ge \tilde{X}_i \tag{5-32a}$$

$$\hat{X}_i = \tilde{X}_i - \bar{\sigma}\sqrt{q/(m-q)} \quad \text{if } X_i < \tilde{X}_i \tag{5-32b}$$

The reconstructed data of (5-32) follows the contour of the DCT estimate instead of only being able to "float around the mean." The modified method includes more information about the original image data than is available when using the Delp and Mitchell algorithm. It codes low frequency information by using the low frequency DCT coefficients, and high frequency information is coded using $\bar{\sigma}$ and the output of the one bit quantizer.
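A sketch of the modified step, assuming the DCT background estimate is supplied as an array and using the offset form of (5-30) through (5-32) as reconstructed above (names are hypothetical):

```python
import numpy as np

def modified_btc(block, estimate):
    """Modified BTC: the DCT estimate replaces the flat block mean as the
    one-bit threshold, and sigma is redefined from the residual between the
    block and its estimate, so the reconstruction follows the estimate's
    contour instead of floating around the mean."""
    m = block.size
    resid = block - estimate
    sigma = np.sqrt((resid ** 2).mean())     # redefined second moment (5-30)
    bits = block >= estimate                 # one-bit map against the estimate
    q = int(bits.sum())
    if q in (0, m):
        return estimate                      # degenerate case: estimate suffices
    hi = sigma * np.sqrt((m - q) / q)        # offsets analogous to (3-34)
    lo = sigma * np.sqrt(q / (m - q))
    return np.where(bits, estimate + hi, estimate - lo)
```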

But the increase in block coding quality is not free. Its price comes with the cost of coding the four DCT coefficients, instead of coding the mean alone. The modified BTC algorithm is used in an MBC system later in the chapter.

5.2c The distortion measures

Any distortion measure can be used to drive an MBC system. The requirements of some of the different distortion measures are discussed in Chapter 2. It is possible to use different distortion measures for each different blocksize of an MBC system to adjust for the expected radial frequency coding sensitivity of the eye. Each different blocksize represents a different spatial frequency range that is to be coded, and details of distortion induced within each of these blocksizes will be seen differently by the eye. The details of such an implementation were outside of the hardware capabilities of the facilities used to test the quality of the final coded images.

Therefore, simple distortion measures were used to build the MBC system of this dissertation.

Two different distortion measures were used in the MBC systems studied here. They depend upon whether the images coded were monochrome or color. The monochrome distortion measure used for the examples is the maximum absolute difference:

d_{bw} = \max_i |x_i - y_i|    (5-33)

where the range of i is taken over the entire block being coded, and y_i is the coded value of the original image pel x_i. For color images the maximum mean square difference is used,

d_c = \max_i (\mathbf{x}_i - \mathbf{y}_i)^T (\mathbf{x}_i - \mathbf{y}_i)    (5-34)

where \mathbf{y}_i is the coded value of color pel \mathbf{x}_i.

The maximum absolute difference was found to be useful when coding monochrome images because it simultaneously allowed the coder to offer a low coding rate and a higher mse. When this distortion measure was used to code color images, all three colors had to be included,

d_{c,abs} = \max_i ( |x_{i1} - y_{i1}|, |x_{i2} - y_{i2}|, |x_{i3} - y_{i3}| )    (5-35)

where the 1, 2 and 3 subscripts indicate the first, second and third color components of the color format being used (RGB or YIQ). With this measure it was found that a single color component could dominate the distortion measured for a block. This could force the coding rate to be excessively high because the measure would cause too many blocks to be subdivided unnecessarily. The mean square difference distortion measure alleviated the problem. Since the Y-component of the YIQ format dominates the other two components it was felt that a component weighting scheme

d_v = \max_i (\mathbf{x}_i - \mathbf{y}_i)^T \mathbf{W} (\mathbf{x}_i - \mathbf{y}_i)    (5-36)

where \mathbf{W} is a 3x3 diagonal weighting matrix of the form

w_{ii} > 0,  i = 1, 2, 3

w_{ik} = 0,  i, k = 1, 2, 3 and i \neq k

might improve the performance of the coder, but the method was not adequately tested to verify

this conjecture.
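The measures above are simple to state in code. A minimal sketch, assuming pel arrays with the color components on the last axis:

import numpy as np

# x and y are the original and coded blocks.

def d_bw(x, y):                     # (5-33): maximum absolute difference
    return np.max(np.abs(x - y))

def d_color(x, y):                  # (5-34): maximum squared color difference
    return np.max(np.sum((x - y) ** 2, axis=-1))

def d_weighted(x, y, w):            # (5-36): w holds per-component weights
    return np.max(np.sum(w * (x - y) ** 2, axis=-1))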

5.2d The quantizers

Two general types of quantizers are used in the MBC and MBC/PT simulations of this

chapter: scalar quantizers (SQs) and LBG vector quantizers (VQs).

For the computer simulations, scalar quantizers are used in two ways. Firstly, they act

as a benchmark for comparison with their vector quantized counterparts. A VQ should code

with less rate than an SQ given the same distortion level. Secondly, since LBG quantizers are designed using a training set, the scalar quantized coders are used as an integral part of the source coder from which the VQ training set is taken. The quantizers used will be discussed in three sections each relating to the color type of data to be quantized (monochrome, RGB and YIQ).

The presentation begins with a discussion of the quantizers used to code the monochrome image transform coefficients. Then the quantizers used to code RGB coefficients are discussed and, finally, the YIQ color coefficient quantizers. Since there are many different categories within each color type, each is divided into three subcategories for the convenience of presentation:

• SQ MBC: scalar quantized MBC,

• VQ MBC: vector quantized MBC and

• SQ and VQ MBC/PT: SQ and VQ MBC with progressive transmission.

The last category combines the scalar and vector quantizers used to code MBC/PT because these systems are straightforward extensions of the two MBC categories. An important idea that should be kept in mind when studying these quantizers is that each source set must be matched to the quantizer used to code it. The following diagram indicates how this is done.

Y -> [ Scale: S ] -> [ Unit Variance Quantizer: Q ] -> [ Rescale: S^{-1} ] -> \hat{Y} = S^{-1} Q(S Y)

The scale factor, S, is used to match the variance of the source set, {Y}, to that of the quantizer. The scale factor is needed to ensure maximum quantizer performance. Every different source coder will, in general, have a different set of scale factors.
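A minimal sketch of this chain, with an illustrative stand-in level table (not the 5-OLSQ tables of the text):

import numpy as np

def quantize_scaled(y, S, levels):
    """Sketch of the scale/quantize/rescale chain diagrammed above:
    yhat = S^-1 Q(S y), with Q a unit-variance scalar quantizer given
    here as a plain array of output levels."""
    z = S * y                                 # match source variance to quantizer
    yhat = levels[np.argmin(np.abs(levels - z))]
    return yhat / S                           # rescale to the source range

levels = np.linspace(-3.0, 3.0, 32)           # stand-in unit-variance quantizer
print(quantize_scaled(15.0, S=1/20, levels=levels))

Reading the tabulated scale factors (Tables 5.1 through 5.3) as the coefficient spread for each blocksize, S is roughly the reciprocal of the tabulated value; this reading is an assumption of the sketch, not a statement from the tables themselves.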

Before delving into the details of the particular functions of each of these quantization

systems, an introduction to the three color types is made. The following material is meant to

complement the presentation of Chapter 3. The goal is to point out the differences and simi-

larities that are to be found between the different color types and show how these facts are

important to the design of the quantizers used in this chapter.

Monochromatic data is usually represented with non-negative data values where the pel

value is proportional to the source light intensity. All of the monochrome images used here are

represented with 8 bits of non-negative intensity ranging from 0 to 255. Since it is hard to

predict the source statistics of the dc coefficient of image blocks, no particular source dependencies are built into the quantizers that code them. To compensate for this fact, the quantizers

used to code the dc coefficient for the first time will be 8-bit uniform scalar quantizers

(8-USQs). After the dc coefficients have been coded once, their MBC/PT residuals are easy to

predict and more specialized, and lower rate, quantizers are used to code them. Typically, a

5-bit optimal laplacian scalar quantizer (5-OLSQ) is used. It was found that if a 5-bit optimal

gamma scalar quantizer was used instead of the 5-OLSQ, similar results were obtained.

Henceforth, all such quantizers will be called 5-OLSQ. In that which follows the abbreviation

BW (black and white) will be used to indicate that monochromatic images are being coded.

The three RGB colors are also non-negative intensity values that represent the light

intensity that falls within certain passbands of the human visual system. The three colors can

be mixed together to form a monochromatic image, but to get a good monochromatic represen-

tation all three colors must be included. All three components represent a color band as it is seen by the cones of the eye. If any one color is missing, the monochrome representation has the

appearance of being band-rejection filtered. Of course, the missing color band is the missing R,

G or B color component. The important feature of RGB coding is that all of the color compo-

nents are non-negative and have similar expected coefficient mean and variance values. This

causes the RGB quantizer systems to be symmetric with respect to the three colors. This is not

true for the YIQ coders. The RGB color components have nearly the same expected intensity

and variance values, and for the images used here they are coded with 8 bits of resolution (0-255). At least for their first coding, an 8-USQ is used.

The YIQ color components, as was mentioned in Chapter 3, are divided into the luminance (Y) and chrominance (I and Q) representations of the original image. The YIQ color system luminance component has the same properties as are found with monochrome images,

and the chrominance components have less variance, typically 1/3 to 1/2 of that of the lumi-

nance, and are typically zero mean. This indicates that luminance may need to be quantized differently than the chrominance, and the three color symmetry of the RGB color system is lost.

In that which follows the differences between luminance and chrominance will affect the quantizer designs. Typically, at least for the MBC systems, the luminance dc coefficients will be coded with 8-USQs and the chrominance dc coefficients with 5-OLSQs.

Monochrome image quantizers

BW SQ MBC

As has been already mentioned, all the blocksizes in the computer simulations are DCT coded with four coefficients, including the dc and three lowest frequency (ac) coefficients. When

MBC was used with monochrome images, the dc coefficients were coded with an 8-bit uniform scalar quantizer that codes the levels from 0 to 255. The ac coefficients were mapped into a 5-bit unit variance optimal laplacian scalar quantizer. Since the dc coefficient quantizer matches the default bit allocation of the source (8 bits with levels 0-255), the dc coefficients do not need to be scaled. The variance of the ac coefficients varies with blocksize and each is scaled appropriately.

Let the coefficients of the i-th MBC pass be Y_{kl}^{(i)}, where the double subscripts indicate the DCT coefficient index as is shown by (3-28). The dc coefficient is Y_{00}^{(i)}, and the ac coefficients are Y_{01}^{(i)}, Y_{10}^{(i)} and Y_{11}^{(i)}. The four quantizers of each coder pass can be represented by

\hat{Y}_{00}^{(i)} = Q_{dc}(Y_{00}^{(i)})    (5-37)

\hat{Y}_{kl}^{(i)} = S_i^{-1} Q_{ac}(S_i Y_{kl}^{(i)}),  kl = 01, 10 and 11    (5-38)

where the S_i are the ac quantizer scaling factors for the i-th pass. The dc subscript indicates the quantizer is an 8-USQ, and the ac indicates a 5-OLSQ. Even though each of the three ac coefficients of the same coding pass have different variances, they were found to be of close enough value that all could be coded with the same scaling factor. This was found to be true for all of the coders studied in this chapter, and it is used implicitly in the sequel. Table 5.1 shows the values of the S_i used for the BW MBC image simulations.

BW VQ MBC

The LBG vector quantizers used with BW MBC were designed using a training set that was taken from the unquantized transform coefficients of scalar quantized BW MBC images.

Three images, different from the images coded in the examples below, were coded to supply the training set. The process used to construct all of the LBG VQs of this chapter is shown in Figure 5.4. Notice that this process is independent of the coder being used and of the dimension of the code book being designed. Because of these properties, the same algorithm can be used to design all of the vector quantizers of this chapter. For each design the only things that need to be specified are the training set and the size of the code book to be built. Everything else remains unchanged. The vector quantizers are designed using the split-and-optimize algorithm mentioned in section 3.1b.
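For reference, a compact Python sketch of a split-and-optimize LBG design (standard generalized Lloyd iterations with codebook splitting; the details of the section 3.1b algorithm may differ from this sketch):

import numpy as np

def lbg(train, codebook_size, iters=20, eps=1e-3, seed=0):
    """Split-and-optimize LBG design sketch. train: (N, k) array of
    training vectors. Returns a (codebook_size, k) codebook."""
    rng = np.random.default_rng(seed)
    book = train.mean(axis=0, keepdims=True)         # start: 1-vector codebook
    while len(book) < codebook_size:
        # split: perturb each codevector into two, then re-optimize
        book = np.concatenate([book * (1 + eps), book * (1 - eps)])[:codebook_size]
        for _ in range(iters):                       # Lloyd optimization steps
            d = ((train[:, None, :] - book[None]) ** 2).sum(-1)
            near = d.argmin(axis=1)                  # nearest-neighbor partition
            for j in range(len(book)):               # centroid update
                members = train[near == j]
                book[j] = (members.mean(axis=0) if len(members)
                           else train[rng.integers(len(train))])  # reseed empty cell
    return book

A call such as lbg(training_vectors, 256) would produce a 256-vector code book like the one used for the 3-dimensional ac coefficient vectors below.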

The distortion thresholds used to acquire the SQ MBC training set were adjusted to

levels lower (tighter) than are necessary to code images for typical picture quality. Typical

picture quality desired for the computer simulations is defined by a de facto standard of 30 dB

PSNR. This PSNR level is commonly used in the literature for comparing image coding tech-

niques and usually gives an image that has minimal visual quality. A PSNR that is less than 30

usually indicates that the image is coded with clearly visible distortion. The tighter thresholds

were set to code the three test images with a typical PSNR value greater than about 34 dB

PSNR. Using these levels, approximately 20,000 training vectors were obtained. These rules of

thumb were used to design all of the LBG VQs discussed in this chapter. Every vector quantizer

code book was constructed using training sets that contained approximately 20,000 training

vectors.

The dc coefficient of the VQ MBC was coded separately from the ac coefficients. This is

because the dc coefficient tends to act independently of the ac coefficients, and it was felt that

tying the dc coefficient to the ac coefficients would not make good use of the vector quantizer.

The dc coefficients were coded using the same scalar quantizer (8-USQ) as mentioned above, and the three ac coefficients were coded as a single vector. The VQ code book size was set at 256.

Therefore, the VQ MBC system uses two quantizers to code each pass. By defining the 3-dimensional vector whose components are the i-th pass ac coefficients,

\mathbf{Y}^{(i)} = (Y_{01}^{(i)}, Y_{10}^{(i)}, Y_{11}^{(i)})    (5-39)

the quantizers used for the VQ MBC system are

\hat{Y}_{00}^{(i)} = Q_{dc}(Y_{00}^{(i)})    (5-40)

\hat{\mathbf{Y}}^{(i)} = S_i^{-1} Q_{3lbg}(S_i \mathbf{Y}^{(i)})    (5-41)

where the 3lbg subscript indicates a 3-dimensional LBG VQ. The scaling factors used here must map the coefficient vectors into the LBG code book that was designed using the scalar quantizer scaling factors of Table 5.1. The same scaling factors that mapped the coefficients of the SQ MBC system into unit variance scalar quantizers will correctly map the coefficient vectors into the LBG VQ code book. This fact is used in the design of the remaining LBG VQs of this chapter.

SQ and VQ BW MBC/PT

The BW MBC/PT quantizers are not much different from the quantizers used for MBC.

The only changes come from the fact that MBC/PT codes difference images after the first pass.

This means that the dc coefficients of all subsequent passes will be distributed near zero. It was found (by generating histograms) that the subsequent pass dc coefficients were approximately laplacian. Their variance was less than the variance of the associated ac coefficients and to overcome this problem an additional scaling factor was applied to the subsequent pass dc coefficient quantizer.

When using SQ MBC/PT, the dc coefficient quantizer for the first pass was made to be identical to that of (37). The subsequent pass dc coefficients were coded with a 5-OLSQ,

\hat{Y}_{00}^{(i)} = C_i^{-1} Q_{ac}(C_i Y_{00}^{(i)}),  i > 1    (5-42)

As a whole, the SQ MBC/PT quantizers are

\hat{Y}_{00}^{(1)} = Q_{dc}(Y_{00}^{(1)})    (5-43)

\hat{Y}_{00}^{(i)} = C_i^{-1} Q_{ac}(C_i Y_{00}^{(i)}),  i > 1    (5-44)

\hat{Y}_{kl}^{(i)} = S_i^{-1} Q_{ac}(S_i Y_{kl}^{(i)}),  kl = 01, 10 and 11    (5-45)

where the dc coefficient scaling factors are C_i = 4S_i. The ac coefficients of subsequent passes were not found to be of different enough character from their MBC counterparts to warrant any special allowances. The scaling factors used for the ac coefficients of the BW MBC/PT system were set to the same values as were used for BW MBC.

With the modification of (42), the LBG VQ code book for BW MBC/PT was designed in a similar fashion to the LBG VQ code book for BW MBC. Using the same notation as above, the BW MBC/PT quantizers are

\hat{Y}_{00}^{(1)} = Q_{dc}(Y_{00}^{(1)})    (5-46)

\hat{Y}_{00}^{(i)} = C_i^{-1} Q_{ac}(C_i Y_{00}^{(i)}),  i > 1    (5-47)

\hat{\mathbf{Y}}^{(i)} = S_i^{-1} Q_{3lbg}(S_i \mathbf{Y}^{(i)}),  for all i    (5-48)

where the dc coefficient scaling factors, C_i, are the same as are used with (42), and the ac coefficient scaling factors, S_i, are the same as shown in Table 5.1. The LBG VQ code book used for (48) has 64 vectors.

Color image quantizers

When coding color images the four DCT coefficients become twelve, four for each of the three color planes. The computer simulations presented at the end of this chapter code color images using both the RGB and YIQ schemes. As is the case for monochrome images, LBG vector quantizers were used in these simulations. The training sets used for the design of VQ code books were constructed from unquantized coefficients taken from SQ MBC and SQ MBC/PT coded training images. Since the RGB and YIQ quantization systems are different in several ways, each is discussed separately. The RGB systems are discussed first.

RGB color quantizers

RGB SQ MBC

The RGB SQ MBC system codes each of the three color components the same. The dc and ac coefficients of each color plane were coded with 8-USQ and 5-OLSQ, respectively.

Therefore, the quantizers of (37) and (38) represent the four SQs used for each color plane, and the full system has a total of twelve SQs. This system is represented by

\hat{Y}_{00}^{(i,j)} = Q_{dc}(Y_{00}^{(i,j)})    (5-49)

\hat{Y}_{kl}^{(i,j)} = S_i^{-1} Q_{ac}(S_i Y_{kl}^{(i,j)}),  kl = 01, 10 and 11    (5-50)

where i is the pass number, j in {R,G,B} indicates the color component of interest, and the S_i are the ac scaling factors shown in Table 5.2.

RGB VQ MBC

The RGB VQ MBC system was designed to code each of the three dc coefficients using an 8-USQ for each coefficient, and the nine ac coefficients were coded as a single 9-dimensional vector. The 9-dimensional LBG VQ code book was designed, as is discussed above, using the tighter SQ coder distortion thresholds. Since the final LBG code vectors are 9-dimensional (instead of 3-dimensional as was the case for monochrome images) a larger LBG code book containing 1024 vectors was constructed. If the nine ac color components of a block are grouped to define the 9-vector

\mathbf{W}^{(i)} = (Y_{01}^{(i,R)}, Y_{10}^{(i,R)}, Y_{11}^{(i,R)}, ..., Y_{01}^{(i,B)}, Y_{10}^{(i,B)}, Y_{11}^{(i,B)})    (5-51)

then the quantizers of the system are

\hat{Y}_{00}^{(i,j)} = Q_{dc}(Y_{00}^{(i,j)}),  j in {R,G,B}    (5-52)

\hat{\mathbf{W}}^{(i)} = S_i^{-1} Q_{9lbg}(S_i \mathbf{W}^{(i)})    (5-53)

where the ac coefficient scaling factors are shown in Table 5.2.

RGB SQ and VQ MBC/PT

The RGB MBC/PT systems must take into consideration the same difference image characteristics that were discussed for the BW MBC/PT coders. In this case, the difference images consist of three individual color planes. When using RGB SQ MBC/PT, the RGB color symmetry property causes the coder to use three identical SQ MBC/PT coders, one for each color. They are of a similar form to those found in (43), (44) and (45)

\hat{Y}_{00}^{(1,j)} = Q_{dc}(Y_{00}^{(1,j)})    (5-54)

\hat{Y}_{00}^{(i,j)} = C_i^{-1} Q_{ac}(C_i Y_{00}^{(i,j)}),  i > 1    (5-55)

\hat{Y}_{kl}^{(i,j)} = S_i^{-1} Q_{ac}(S_i Y_{kl}^{(i,j)}),  kl = 01, 10 and 11    (5-56)

where j in {R,G,B}, the dc coefficient scaling factors are C_i = 2S_i, and the ac scaling factors, S_i, are found in Table 5.2.

The RGB VQ MBC/PT system grouped the ac color coefficients into a 9-dimensional vector similar to the one used for RGB VQ MBC, with an LBG VQ code book of size 1024. The first pass dc color coefficients were coded as in (40) using 8-USQs, but in the subsequent passes the dc color coefficients were quantized as a 3-dimensional vector using an LBG code book of size 64. Letting this 3-dimensional vector be defined as a grouping of the dc coefficients

\mathbf{Y}_{dc}^{(i)} = (Y_{00}^{(i,R)}, Y_{00}^{(i,G)}, Y_{00}^{(i,B)})    (5-57)

the twelve quantizers of the first pass are

\hat{Y}_{00}^{(1,j)} = Q_{dc}(Y_{00}^{(1,j)})    (5-58)

\hat{Y}_{kl}^{(1,j)} = S_1^{-1} Q_{ac}(S_1 Y_{kl}^{(1,j)}),  kl = 01, 10 and 11    (5-59)

where j in {R,G,B}, and the subsequent passes (i > 1) are coded using two LBG quantizers

\hat{\mathbf{Y}}_{dc}^{(i)} = C_i^{-1} Q_{3lbg}(C_i \mathbf{Y}_{dc}^{(i)})    (5-60)

\hat{\mathbf{W}}^{(i)} = S_i^{-1} Q_{9lbg}(S_i \mathbf{W}^{(i)})    (5-61)

All of the dc scaling factors are C_i = 2S_i, and the ac scaling factors, S_i, are shown in Table 5.2.

YIQ color quantizers

YIQ SQ MBC

The scalar quantized YIQ MBC system codes the three color components of each blocksize with twelve quantizers. One 8-USQ was used to code the luminance dc coefficient and all of the remaining quantizers are 5-OLSQs using differing scaling factors:

\hat{Y}_{00}^{(i,Y)} = Q_{dc}(Y_{00}^{(i,Y)})    (5-62)

\hat{Y}_{00}^{(i,j)} = C_i^{-1} Q_{ac}(C_i Y_{00}^{(i,j)}),  j in {I,Q}    (5-63)

\hat{Y}_{kl}^{(i,j)} = S_i^{-1} Q_{ac}(S_i Y_{kl}^{(i,j)}),  kl = 01, 10 and 11,  j in {Y,I,Q}    (5-64)

The dc coefficient scaling factors are C_i = 4S_i and the ac coefficient scaling factors are shown in Table 5.3.

YIQ VQ MBC

The coded blocks of the vector quantized YIQ MBC system use a scalar quantizer for each dc coefficient and one 9-dimensional LBG VQ for the remaining ac coefficients. If the nine ac color components are grouped to define the vector

\mathbf{W}^{(i)} = (Y_{01}^{(i,Y)}, Y_{10}^{(i,Y)}, Y_{11}^{(i,Y)}, ..., Y_{01}^{(i,Q)}, Y_{10}^{(i,Q)}, Y_{11}^{(i,Q)})    (5-65)

then the four quantizers of the YIQ VQ MBC system are

\hat{Y}_{00}^{(i,Y)} = Q_{dc}(Y_{00}^{(i,Y)})    (5-66)

\hat{Y}_{00}^{(i,j)} = C_i^{-1} Q_{ac}(C_i Y_{00}^{(i,j)}),  j in {I,Q}    (5-67)

\hat{\mathbf{W}}^{(i)} = S_i^{-1} Q_{9lbg}(S_i \mathbf{W}^{(i)})    (5-68)

where the scaling factors are the same as used for YIQ SQ MBC. The LBG VQ code book used contained 1024 vectors.

YIQ SQ and VQ MBC/PT

As is the case for all of the progressive transmission coders, the YIQ SQ MBC/PT coder quantizes the first pass differently than the subsequent passes. The first pass quantizers are similar to those of (62), (63) and (64)

\hat{Y}_{00}^{(1,Y)} = Q_{dc}(Y_{00}^{(1,Y)})    (5-69)

\hat{Y}_{00}^{(1,j)} = C_1^{-1} Q_{ac}(C_1 Y_{00}^{(1,j)}),  j in {I,Q}    (5-70)

\hat{Y}_{kl}^{(1,j)} = S_1^{-1} Q_{ac}(S_1 Y_{kl}^{(1,j)}),  kl = 01, 10 and 11,  j in {Y,I,Q}    (5-71)

For all subsequent passes (i > 1) all twelve quantizers are 5-OLSQs

\hat{Y}_{00}^{(i,j)} = C_i^{-1} Q_{ac}(C_i Y_{00}^{(i,j)}),  j in {Y,I,Q}    (5-72)

\hat{Y}_{kl}^{(i,j)} = S_i^{-1} Q_{ac}(S_i Y_{kl}^{(i,j)}),  kl = 01, 10 and 11,  j in {Y,I,Q}    (5-73)

All of the scaling factors for YIQ SQ MBC/PT are the same as are used for YIQ SQ and VQ

MBC (Table 5.3).

The YIQ VQ MBC/PT codes the first pass, i = 1, using three scalar quantizers for the dc coefficients and a 9-dimensional LBG VQ for the ac coefficients

\hat{Y}_{00}^{(1,Y)} = Q_{dc}(Y_{00}^{(1,Y)})    (5-74)

\hat{Y}_{00}^{(1,j)} = C_1^{-1} Q_{ac}(C_1 Y_{00}^{(1,j)}),  j in {I,Q}    (5-75)

\hat{\mathbf{W}}^{(1)} = S_1^{-1} Q_{9lbg}(S_1 \mathbf{W}^{(1)})    (5-76)

All subsequent passes (i > 1) are coded with two LBG vector quantizers. Defining a 3-dimensional vector to be made from the dc coefficient of each of the color planes,

\mathbf{Y}_{dc}^{(i)} = (Y_{00}^{(i,Y)}, Y_{00}^{(i,I)}, Y_{00}^{(i,Q)})    (5-77)

and a 9-dimensional vector of the ac coefficients as is shown by (65), the subsequent passes are quantized

\hat{\mathbf{Y}}_{dc}^{(i)} = C_i^{-1} Q_{3lbg}(C_i \mathbf{Y}_{dc}^{(i)})    (5-78)

\hat{\mathbf{W}}^{(i)} = S_i^{-1} Q_{9lbg}(S_i \mathbf{W}^{(i)})    (5-79)

where the scaling factors are the same as are used for all of the other YIQ coders. The dc coefficient LBG VQ code book used was of size 64, and the ac coefficient code book was of size 1024.

5.3 The MBC and MBC/PT computer simulations

In this section the computer simulations that test the various MBC and MBC/PT systems of this dissertation are given. These studies include monochrome, RGB and YIQ images and are shown in Figures 5.8 through 5.26. Before moving into the specifics of the studies themselves, each is listed by coding and color type, and some general information that applies to the studies as a whole is given.

The BW MBC systems studied include four different cases:

• scalar quantized MBC only,

• scalar quantized MBC with block truncation coding,

• vector quantized MBC only and

• vector quantized MBC with block truncation coding,
and the results of these studies are shown in Figures 5.8 through 5.11. The BW MBC/PT systems studied include two different cases:

• scalar quantized MBC/PT and

• vector quantized MBC/PT,
and the results of these systems are shown in Figures 5.12 through 5.14.

The color MBC and MBC/PT systems include four cases for each color scheme:

• RGB and YIQ scalar quantized MBC

• RGB and YIQ vector quantized MBC

• RGB and YIQ scalar quantized MBC/PT

• RGB and YIQ vector quantized MBC/PT

The results of these systems are shown in the following figures:

YIQ MBC Figures 5.15 through 5.17

YIQ MBC/PT Figures 5.18 through 5.20

RGB MBC Figures 5.21 through 5.23

RGB MBC/PT Figures 5.24 through 5.26

All of the above systems were tested using two different images, the woman/hat (Figure 5.27) and the F-16 (Figure 5.28). Since all of the images used in these studies, and in the quantizer design computer programs, are taken from a USC database of RGB images, the BW images were taken from the G color plane of the corresponding color image, and the YIQ images were generated from the RGB images using the transformation in Pratt [31]. The final distortion levels of YIQ images were computed after transforming the coded image back into RGB colors.

All of these systems were tested using a largest blocksize of 16x16 and a smallest blocksize of 2x2. Therefore, each system can code images using a total of four different blocksizes (16x16, 8x8, 4x4, 2x2).

Some preliminary studies were made using 32x32 as the largest blocksize, but it was found that such blocks were too large when coded with only four coefficients. The severe undercoding of the 32x32 blocks caused distortion to migrate over their entire pel complement, and, as a result, more of the smaller blocks had to be coded to recover from the induced distortion. The smallest blocksize can be set to 4x4 to reduce the overall coding rate without losing much in the signal-to-noise ratio, but images coded with 4x4 as the smallest blocksize suffered in perceptual quality in the regions that contained sharp edges and fine detail.

General information concerning all coders

Each of these systems was tested by adjusting the distortion thresholds from 0 to ∞.

Plots were made for distortion and PSNR versus rate, and for mixture fractions versus distortion threshold. For simplicity of presentation, the thresholds were held to the same value for all the blocksizes. To adequately select the correct thresholds for the different blocksizes, it is necessary to include perceptual judgement in the selection process. With the display resources available it was decided that this would be an unjustified pursuit.

Showing the performance of the coders over such a wide range of thresholds reveals several interesting facts. It shows how the coders work when they are not operated within their expected nominal operating range. For the vector quantized coders, the nominal operating range includes a coding range from 0.5 to 1.5 bits/pel and an approximate coding fidelity of 30 dB PSNR. For the monochrome coders the nominal operating range occurs with a threshold setting of approximately 30, and for color coding with a threshold setting of approximately 10.

All of the plots are tied together through the distortion thresholds. The distortion and

PSNR plots listed above do not show the thresholds explicitly. This was done to keep the figures from becoming congested with threshold tics. To see how the distortion and PSNR vary as functions of threshold, consider the plots of Figures 5.5 through 5.7. These plots represent trends that are typical for all of the others.

An important point can be taken from these plots. As the threshold is tightened (set closer to 0) the rate increases (Figures 5.6 and 5.7) because the large blocks cannot pass the threshold test as easily when the threshold is tight. Therefore, the images must be coded with more small blocks. Small blocks require more bits to code, and, as a result, the rate increases.

This fact is supported by the mixture fraction plots (Figure 5.5).

These plots show that as the threshold tightens the distortion decreases (the PSNR increases). The smaller blocks used to overcome the more stringent thresholds code the image with better fidelity than their large counterparts, and the distortion decreases as the number of blocks increases.

When the threshold is set to 0 the entire image is coded with the smallest blocksize.

Each coded block fails the 0 distortion threshold and is partitioned into the next smallest blocksize. This continues until all of the blocks have been divided into the smallest allowed blocksize and the coder stops. For MBC only the smallest blocksizes are coded for transmission.

For MBC/PT the entire image is coded for transmission during every pass, and if four different blocksizes are used the image is transmitted four times and the image is coded at the maximum possible rate. Depending upon the coder and the quantizers used, this rate will vary. These rates are listed in Table 5.4.

When using the 0 threshold, the image is coded with the greatest number of blocks. For this case, images are coded with the best distortion level that can be obtained (see Figures 5.6 and 5.7).

On the other hand, if the threshold is set to ∞ the image is coded with only the largest blocksize. No matter how complex the original image, the ∞ threshold will always be satisfied and no blocks will be divided. Therefore, the ∞ threshold causes the image to be coded as sparingly as possible, so the rates will be low. The distortion obtained will represent the worst possible case.

Therefore, it can be seen that by coding an image at the 0 and ∞ thresholds the best and worst possible coding cases are obtained. Any threshold setting that falls between these two extremes will also fall between the coding rate and coding distortion extremes they represent.

Notice that the PT coded systems are consistent with the mixture fraction prediction of equations (22). The non-zero terms of (22b) start at 4 (n = 4 for the general coder) and decrease to 1 as the threshold increases. This is true of all of the PT coders of this chapter.

All of the vector quantizers used with these coders were designed using three training images taken from the USC database. These images are shown in Figures 5.29, 5.30 and 5.31.

Now, consider the features that can be extracted from each of the different coding systems. The

BW coders will be considered first.

The BW MBC results

The BW MBC systems include BTC as an option that can be used to code an image.

As has been mentioned, the lowest rate that a system using BTC can obtain is 1 bit/pel. This is evident in the results collected from the BW MBC and BW MBC/BTC simulations. Consider Figure 5.9. As the threshold goes to ∞ the distortion rises and the rates decrease for both systems, but the MBC/BTC rates are bounded below by 1 bit/pel while the MBC rates reach levels much lower. The MBC coders perform better at rates less than 2 to 3 bits/pel (depending upon the image coded) and the MBC/BTC coders perform better at rates that are greater.

The VQ systems consistently perform better than the SQ MBC system at low rates.

But, at higher rates the SQ systems may outperform the VQ systems. The VQs used were designed to work at low rates and as a result their performance falls off as the rate increases. This result is true for almost all of the VQ systems of this chapter. Notice that for the F-16 image, the VQ system gains over the SQ systems are more substantial than for the woman/hat coded image.

The mixture fractions of the smaller blocks of BTC coded MBC systems (either SQ or

VQ) fall more sharply as a function of threshold than do non-BTC coded systems. For a given threshold level, more 16x16 blocks pass the threshold test when BTC is used. This is consistent with the fact that the BTC systems use a one bit quantizer that is not available to the non-BTC systems.

Figure 5.32 shows (roughly) the 256x256 central portion of the 512x512 woman/hat image that has been coded using MBC. This figure is coded with:

0.549 bits/pel,

31.574 dB PSNR and

30 for the distortion threshold levels.

The mixture fractions for the different blocksizes are:

48.73% 16x16 blocks,

25.93% 8x8 blocks,

19.59% 4x4 blocks and

5.75% 2x2 blocks.

This image is coded with a compression ratio of ~15:1 (8/.549) and even though the PSNR is good for this compression ratio it is possible to see blocking in the image. The device used to print the figure has only 32 gray-scale intensities and false contouring can be seen. Some of the contours that are seen in the image are not visible when it is displayed with 8 bits of resolution.

Figure 5.33 shows the blocksize distribution that produced the woman/hat MBC image of Figure 5.32. The larger the block, the more darkly it is shaded in the figure. The 2x2 blocks indicate the image regions that are the most difficult to code. These occur at the edge of the hat, in the feathers and eyes. The 16x16 blocks show the image regions that are easy to code because of their lower busyness level: the cheek and forehead.

Figures 5.34 and 5.35 show the central portions of the F-16 image coded using

MBC and MBC/BTC, respectively. The images were coded to give the same distortion levels.

The MBC image is coded with:

0.735 bits/pel,

32.267 dB PSNR and

30 for the distortion threshold levels, and the mixture fractions for the different blocksizes are:

48.73% 16x16 blocks,

19.17% 8x8 blocks,

21.17% 4x4 blocks and

10.13% 2x2 blocks.

The MBC/BTC image is coded with:

1.452 bits/pel (1.118 bits/pel for BTC requirements),

32.387 dB PSNR and

33 for the distortion threshold levels, and the mixture fractions for the different blocksizes are:

57.91% 16x16 blocks,

23.97% 8x8 blocks,

16.91% 4x4 blocks and

1.20% 2x2 blocks.

Notice that even though the distortion levels are nearly the same for both images, the

MBC/BTC version requires significantly more bits to code. Since the MBC/BTC version uses the additional BTC quantizer it has data that is not available in the non-BTC MBC version. Therefore, it can code each block more accurately and needs fewer small blocks to attain the same distortion levels that are obtained when using the non-BTC version.

An important feature of the BTC image is that the background contouring (or smoothness as is the case for the 8 bit coded image) of the non-BTC image is not seen. This results from the one bit quantizer "jumping" from level to level as it codes the image.

The blocksize distributions of these images are shown in Figure 5.36 for MBC, and Figure 5.37 for MBC/BTC. As is indicated by the mixture fraction lists, these figures show that the BTC version of the image is coded with far fewer small blocks. The small blocks that are needed to code the high contrast regions when using the MBC system are not needed with the

MBC/BTC system.

Even though both coding systems produce equal results for image areas that contain high contrast, the BTC version shows small detail in the other image regions with greater perceptual clarity. Thus, SNR (PSNR for these images) is not a good indicator of overall image perceptual quality.

The BW MBC/PT results

The MBC/PT coders do not use BTC since progressive transmission requires that the same blocks be coded more than once, and using a 1 bit quantizer for each pel on each of these passes would increase the rate of such systems to an unusable level very quickly.

This point precluded BTC from being used with MBC/PT.

The main point to be taken from Figure 5.13 is that the SQ MBC/PT system used to code the F-16 image does not overtake the performance level of the VQ system at high rates.

This is different from the result obtained when these coders were used to code the woman/hat image.

Figures 5.38 through 5.42 show the four different transmission passes of the woman/hat coded using the MBC/PT system. Figure 5.38 shows the image as presented after the 16x16 coding pass, and Figure 5.41 shows the image after the final 2x2 coding pass. The final image is coded with:

0.558 bits/pel,

31.599 dB PSNR and

30 for the distortion threshold levels.

The mixture fractions and cumulative coding rate for each pass are:

100.00% .034 bits/pel 16x16 block pass,

51.27% .107 " 8x8 blocks,

24.24% .319 " 4x4 blocks and

5.14% .549 " 2x2 blocks.

This image is coded with a final compression ratio and PSNR that is nearly identical to that of the MBC coded version. If the 512x512 image were transmitted over a 9600 baud line, the times required to send each pass are:

0.93 seconds 16x16 blocks,

2.92 " 8x8 blocks,

8.71 " 4x4 blocks and

15.0 " 2x2 blocks.

The observer would have the first pass image in less than a second and the final image in 15 seconds.

If the image were presented through an interactive telebrowsing system the user would be able to decide whether the entire image was to be kept or rejected after about 1 second of transmission time. Once the image is recognized it can be rejected if it is not wanted, instead of waiting the entire 15 seconds for the final image to be constructed.

Figure 5.42 shows the blocksize distribution that produced the woman/hat MBC/PT image. It is similar to the block distribution of the MBC image.

The color coder results

The most important point that is brought out by the figures of data is that most of the systems will code a color image with good PSNR (>30 dB) for rates as low as 0.32 (10^-0.5 on the plots) bits/pel. The color images coded at this low rate have been coded with a compression ratio of 75:1.

Since the results obtained for the color MBC and MBC/PT systems parallel the results of the BW systems, nothing that has been said before will be repeated. The most important feature of concern is found with the RGB MBC/PT systems. Even though the VQ version outperforms the SQ version at very low rates, its performance falls off dramatically as the rate increases.

The PSNR never goes above 30 dB for either image. This may be caused by a poorly selected training set.

Final remarks concerning the computer simulations

It is possible to change the coding characteristics of MBC and MBC/PT systems by changing the block coders and the distortion thresholds to obtain good coding results for both low- and medium-rate source coders. The use of vector quantizers in these coders is of the most advantage at low coding rates, where they outperform their scalar quantized counterparts to the greatest degree.

Since the block coder of any different blocksize can be designed independently of the

MBC and MBC/PT algorithm flow, many different systems can be constructed. Each of these differing systems will have coding rates where they are of the most use, and by combining different systems it is possible to code images over a wide range of rates.

Examples to demonstrate the MBC and MBC/PT methods were given in this chapter.

The particular source coders tested used a four coefficient DCT coding scheme for each of the four blocksizes used. The coders were constructed to use 16x16 as the largest blocksize and 2x2 as the smallest. Both scalar and vector quantizers were used to code the DCT coefficients, and the special design problems associated with the design and use of the different quantizers were discussed. A new block truncation coding method was developed and used with the DCT MBC monochrome coders.

All of the coders tested were constructed with a quad tree structure and the number of blocks coded at each level of the tree was chosen with a distortion threshold. Two different distortion measures were used, one for monochrome images and another for color images. How the distortion and coding rate vary with the threshold level was also demonstrated. The examples of this chapter include coding rates from less than 0.1 bits/pel to about 8 bits/pel.

Figure 5.1 Example 16x16 block quad tree.

Figure 5.2 Example 16x16 block for MBC and default sub-block numbering for MBC.

Figure 5.3 MBC and MBC/PT DCT transform coefficients.

[Flowchart: training images -> scalar quantized MBC or MBC/PT source coder (distortion thresholds) -> unquantized transform coefficients (VQ training set) -> LBG vector quantizer design (code book design parameters: vector rates, partitions) -> VQ code book -> vector quantized MBC or MBC/PT source coder (images to code, distortion thresholds) -> low-rate channel (modulation and demodulation) -> vector quantized MBC or MBC/PT source decoder -> decoded images]

Figure 5.4 Design and use of the MBC and MBC/PT vector quantizer.

O -"* FM 00 -H

t/> •o 0 o 1 • i ! , (J S < n CQ TJL -u^ _l i E' 4i (0 o i cf i( Q « i ^0' & f- Oo -T ^to O o> i- = I i p' <^- [ s- QJ H « t > LJ i ^ 0' j^ • l/l * ! J3 6 -J (^ / / o ° - <£ 4J a 4 u i jnj ! *>•$ • \ ro \ s- O t 1 0 15 f i4_ ffl f j Q' i/ -1 U r i *J r^ | __ t ] tf ff + o ^ * * I •^ +-> f 1 H' 4 + x * S J • s: ' ^B' X ;* t, 1 J3" >?' ¥ ir> * * ^"^ ^ ^''^ / i m*' .A^ -*r' 0 i^ " J ^ -1-J „ •"•v ^- ^, w QJ >i r3' ' "' ^v ' ""^ ^- ^ s_ *) .--" . --** ^--r* 3 i&2"~2l~^---&~ — ^ •*- — *~"~ CD Kc^-^ZlTZ-itp-V^-*1-" ^— -*• * 1 j T 1 1 1 1 o o o o o o O CO,

a. uj a: u iij z 86

o •H

•o o ID • O c o

S- I o

•o M- O O 3 C - O o o LJ c

M O TO I in LJ o> 10 s_ Q. \ o O s_ o OJ CQ •H

1/5 c. vo ir> 2 o •H

LO O in in o Ki w

Q. CO Z 87

T3 O - (D .C I/) I 2 lL c o 6 (J 3 3 C 14- UJ

ID 10 10 10 o O o ro 1-1 T-l o o O o o •rl •H •H •H - D * ft X X X X *V « H o

' S HI o 5 s- a f o

in

•M to i/i s- S S 8 01

o - S o ifO-

- O * « < X X X X (U 10 0> * « ^s. z 3 < -U 0 X

I o CD CO u m h o CQ CO in• s QJ 3 cn 89

Figure 5.9 MBC PSNR and distortion versus rate for BW woman/hat.

[Figure 5.10: plot page]

Figure 5.11 MBC PSNR and distortion versus rate for BW F-16.

[Figure 5.12: plot page]

Figure 5.13 MBC/PT PSNR versus rate for BW woman/hat and F-16.

[Figure 5.14: plot page]

[Figure 5.15: plot page]

Figure 5.16 MBC PSNR versus rate for YIQ woman/hat and F-16.

[Figure 5.17: plot page]

[Figure 5.18: plot page]

Figure 5.19 MBC/PT PSNR versus rate for YIQ woman/hat and F-16.

[Figure 5.20: plot page]

[Figure 5.21: plot page]

Figure 5.22 MBC PSNR versus rate for RGB woman/hat and F-16.

[Figure 5.23: plot page]

[Figure 5.24: plot page]

Figure 5.25 MBC/PT PSNR versus rate for RGB woman/hat and F-16.

[Figure 5.26: plot page]

Figure 5.27 256x256 central portion of 512x512 BW woman/hat.


Figure 5.28 256x256 central portion of 512x512 BW F-16.


Figure 5.29 256x256 BW UCLA girl training image.


Figure 5.30 256x256 BW small face training image.

Figure 5.31 256x256 BW photo face training image.


Figure 5.32 256x256 central portion of MBC woman/hat.

Figure 5.33 256x256 central portion of MBC woman/hat block profile.

Figure 5.34 256x256 central portion of first pass MBC/PT woman/hat coded with 16x16 blocks.


Figure 5.35 256x256 central portion of MBC F-16.

Figure 5.36 256x256 central portion of MBC/BTC F-16.


Figure 5.37 256x256 central portion of MBC F-16 block profile.


Figure 5.38

Figure 5.39 256x256 central portion of MBC F-16.

Figure 5.40 256x256 central portion of MBC/BTC F-16.


Figure 5.41 256x256 central portion of MBC F-16 block profile.


Figure 5.42 256x256 central portion of MBC/BTC F-16 block profile.


Table 5.1. Vector Quantizer Scale Factors for BW Images

    Blocksize    Scale factor
    16x16        60
    8x8          35
    4x4          20
    2x2          10

Table 5.2. Vector Quantizer Scale Factors for RGB Images

    Blocksize    Scale factor
    16x16        60
    8x8          40
    4x4          20
    2x2          10

Table 5.3. Vector Quantizer Scale Factors for YIQ Images

    Blocksize    Scale factor
    16x16        80
    8x8          40
    4x4          20
    2x2          10

Chapter 6.

A Distortion-rate Function for Transform Coding

In this chapter, a distortion-rate function for transform source coders is developed. The assignment of the bit allocations for scalar quantized transform coefficients is the prime motivation for this material. Transform techniques can offer a significant coding advantage. This is

because of their ability to redistribute the source signal energy in a relatively small number of

easily identifiable coefficients (e.g., the low frequency coefficients). It is advantageous to code

these high energy coefficients with more bits than the low energy coefficients. This is the basis

for the approach taken for the bit assignment methods developed in this chapter.

To start with, the implementational problems that limit the use of the most commonly

found bit assignment solution from the literature are discussed. Then methods are developed to

overcome them. In the next chapter, the distortion-rate function developed is expanded to

include a formulation that can be used for vector quantizers and MBC systems.

This presentation uses a distortion function introduced by Huang and Schultheiss [70] in

1963, that relies upon an observation of Max [71] from 1960. (Their function and this obser-

vation are discussed below.) Huang and Schultheiss originally used their distortion function for

computing the bit allocations to select the scalar quantizer that is used to code transform coefficients. Their method has been used with very little modification as recently as this year [72].

Their method uses assumptions that make the bit allocation algorithm easy to understand. But,

these assumptions present some obvious implementation problems.

Their method allows for fractional and negative bit allocations for the quantized coefficients. The negatively assigned rate problem is especially troublesome when coding at very low

rates. These problems were mentioned by Huang and Schultheiss, but they did not attempt to resolve them. Instead, they used an iterative trial-and-error method to do the actual weighting selection. In 1979, Segall published some results that use the non-negative bit allocation constraint, but his work studies only the case for gaussian sources where it is possible to invert the dual optimization functional. (This functional results from the Lagrange multiplier. More will be said about this below.) A method developed by Shoham and Gersho [25] for computing optimal integer bit assignments for quantizers has appeared recently in the literature. Their method seems to require that rate be a monotonic function of the Lagrange multiplier that is associated with the average rate constraint.

Developing a more complete solution to these problems is the subject of this chapter.

The negative bit allocation problem is overcome by adding a set of constraints to the distortion-rate function. Then, it is possible to obtain an implementable source coder to overcome the fractional bit allocation problem by using a linear mixture of two different scalar quantizers.

A bonus obtained from the reformulation is a more realistic set of upper and lower bounds for the distortion-rate function. These functions are expanded in the next chapter to include the use of vector quantization methods for the source coding of transform coefficients.

6.1 The distortion-rate problem

The goal of a source coder is to maximize the information that can be obtained about the source when looking at the source coder output (the reproducing alphabet). The distortion-rate bound predicts the least amount of distortion one can expect when coding with a predetermined maximum rate. Consider the following definitions in preparation for the formulation of the distortion-rate function.

Let there be a discrete source of n symbols with probabilities P_j which are to be coded with m reproducing symbols. The probability of the k-th reproducing symbol is a function of the source probabilities,

p_k = \sum_{j=1}^{n} Q_{kj} P_j    (6-1)

where the Q_{kj} are the source coder transition probabilities. The probability of coding the j-th source symbol with the k-th reproducing symbol is Q_{kj}. Source coding distortion, d(Q), is a function of the transition probabilities and it is desirable to select the Q_{kj} to minimize its value. Letting \rho_{jk} be the distortion incurred when the j-th source symbol is coded with the k-th reproducing symbol, the total coding distortion is

d(Q) = \sum_{j=1}^{n} \sum_{k=1}^{m} Q_{kj} P_j \rho_{jk}    (6-2)

The average mutual information, I(Q), is a measure of the amount of information obtained about the source by looking at the reproducing alphabet. It is also a function of the transition probabilities,

I(Q) = \sum_{j=1}^{n} \sum_{k=1}^{m} Q_{kj} P_j \log_2 \frac{Q_{kj}}{p_k}    (6-3)

This form of the mutual information assumes the reproducing alphabet is coded with binary symbols. This assumption is used in the sequel.

The distortion-rate problem is to select the transition probabilities to minimize the distortion incurred when coding for a given maximum mutual information or rate, R. More formally,

D(R) = \min_{Q \in A_R} d(Q)    (6-4)

where A_R is the set of admissible transition probabilities determined by the rate,

A_R = \{ Q_{kj} \mid I(Q) \leq R \}    (6-5)

Since the distortion-rate function, D(R), is a one-to-one transformation [74], the distortion-rate problem will give the same solution for rate, given the distortion, as is indicated in (4),

R(D) = \min_{Q \in A_D} I(Q)    (6-6)

where A_D is the set of admissible distortions determined by the transition probabilities,

A_D = \{ Q_{kj} \mid d(Q) \leq D \}    (6-7)

The distortion-rate function and the rate-distortion function can be used interchangeably.

To find the optimal transition probabilities for the D(R) function one must know:

• the number of source symbols and their probabilities of occurrence, P_j, and

• the coding distortions, \rho_{jk}.

The problem seems simple on the surface, but there are several caveats concerning the uses of these functions when studying actual source coders that must be noted:

• The sources considered here are continuous, and not discrete as indicated above.

Allowances must be made to include this situation.

• The coding distortions are, of course, a function of the source coding scheme.

Here they must represent quantization losses.

• Typically, the transition probabilities are not constrained in any way (other than conforming to the rules of probability assignment) and they are used to develop bounds for the performance of generalized source coders. The results obtained say nothing about the complexity of implementing a coder to meet the predicted distortion levels. Since the goal is to study the performance of a specific type of coder, its particular functional needs must be understood and modified accordingly.

The work of Huang and Schultheiss and of Max leads one to a particularly useful model for distortion when using scalar quantization for a known continuous source. They show distortion is a function of the coefficient source variance (\sigma_i^2), the coding rate (through the factor 2^{-2B_i}, where B_i is the i-th coefficient rate), and its quantizer performance factor (\epsilon_i^2). For the work done here, the source is a set of transform coefficients whose pdf is assumed laplacian. (This is consistent with the characteristics of the data found in Chapter 5.) More is said about the characteristics of the laplacian performance factor later. Using these definitions, the distortion incurred when coding the i-th transform coefficient when using B_i bits is

d_i = \epsilon_i^2 \sigma_i^2 2^{-2B_i}    (6-8)
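A short numerical illustration of (6-8) in Python; the performance factor used below is only an illustrative constant, not a value fitted to the laplacian quantizer data discussed later:

# Evaluating the distortion model d_i = eps_i^2 * sigma_i^2 * 2^(-2 B_i).
def coeff_distortion(eps2, sigma2, bits):
    return eps2 * sigma2 * 2.0 ** (-2.0 * bits)

for b in range(6):
    print(b, coeff_distortion(eps2=4.5, sigma2=1.0, bits=b))
# each added bit divides the model distortion by four (about 6 dB per bit)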

The transition probabilities, as used in the distortion-rate bound, are assigned without concern for how they are to be used within a given design. They represent lower bounds for any realizable coding system, and may not be attainable in any particular application. When designing a usable source coder, several practical problems arise when using this approach. These problems are demonstrated below.

In the following discussion consider the following diagram.

Original data X -> [ Source Coder ] -> source index set, J -> [ Symbol map C ] -> reproducing alphabet and its rate, R_C

For a given source coder distortion level, the source coder mutual information (3) is the lower bound for the source coder rate, R, no matter how the source index symbols are coded. This fact is the result of the distortion-rate function, as is indicated by (4) and (5),

I(Q) \leq R    (6-9)

Now, let C be the map which chooses the actual symbols used to represent the source index set. The output of map C is called the source coder reproducing alphabet. The entropy of these symbols, H_C(J), is the lower bound for the actual rate, R_C, obtained. Since R is the lower bound for any C mapping,

I(Q) \leq R \leq H_C(J) \leq R_C    (6-10)

But, the source rate R used in the distortion-rate function D(R) is not necessarily attained when the realities of the C mappings are considered. To see how this is possible, consider the details of how the rate R_C is assigned.

H_C(J) is a function of how C maps the elements of the quantizer index into a reproducing alphabet. For example, define a reproducing alphabet by grouping the quantizer index symbols into k-vectors, or k-blocks,

\mathbf{v}_j = (i_{j1}, i_{j2}, \ldots, i_{jk}) \in Z^k    (6-11)

The per-symbol entropy of these blocked symbols is

H_k(J) = -\frac{1}{k} \sum_{j=1}^{m^k} p(\mathbf{v}_j) \log_2 p(\mathbf{v}_j)    (6-12)

H_k(J) is known to have the following property [74]

H_k(J) \leq \cdots \leq H_2(J) \leq H_1(J)    (6-13)

The entropy of symbols grouped in blocks decreases monotonically with the length of the block.
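A small numerical illustration of (6-12) and (6-13), using a two-state Markov source (an illustrative model, not one taken from this dissertation):

import numpy as np
from itertools import product

def blocked_entropy(p0, P, k):
    """Per-symbol entropy H_k of k-blocked symbols as in (6-12).
    p0[i]: stationary probability of state i; P[i][j]: transition
    probability from state i to state j."""
    h = 0.0
    for v in product(range(len(p0)), repeat=k):
        pv = p0[v[0]]
        for a, b in zip(v, v[1:]):
            pv *= P[a][b]
        h -= pv * np.log2(pv)
    return h / k

p0 = np.array([0.5, 0.5])
P = np.array([[0.9, 0.1], [0.1, 0.9]])   # "sticky" source: memory to exploit
print([round(blocked_entropy(p0, P, k), 3) for k in (1, 2, 3)])
# prints decreasing values, illustrating H_k <= ... <= H_2 <= H_1 of (6-13)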

Let k be the length of the vectors used to drive the actual symbol map, C. Since R is determined by the distortion-rate function, it is the lower bound (10) for a block coder of any length,

R \leq H_C(J) = H_k(J)    (6-14)

The symbol rate may never reach R, no matter how long its vectors become. The complexity of the symbol map used is determined by the length of these vectors, and if they are too long the

transmitter design can become overwhelming.

Using rate-distortion theory to solve problems that set bounds upon the set of all possible coders, like the one above, can be very useful. But, rate-distortion theory leaves something to be desired when studying a particular class of source coders where the transition probabilities are of little or no help. This is true because sometimes the mutual information is of marginal use in the formulation of the distortion-rate solution. It would be helpful if the mutual information could be replaced with another rate formulation that is of more use.

For the work of this dissertation, it is very helpful if the transition probabilities are replaced with the transform coefficient rates, {B,}. For the scalar quantized systems of this chapter, it is of interest to study the distortion-rate problem where each coefficient is coded individually. The distortion-rate function can be optimized over this set. The admissible set of 130 all possible coefficient rate allocations given an average coefficient coding rate, B, is

$A_B = \{\{B_i\} \mid \sum_i B_i = nB,\ B_i \ge 0\}$   (6-15)

where n is the number of coefficients. Vector quantization of transform coefficients is studied in the next chapter. There the $B_i$ represent the coding rate for the i-th block of coefficients.

With the above considerations in mind, the distortion-rate problem is ready to be presented and solved for the different variations that are important to this dissertation.

6.2 The transform coefficient distortion-rate problem

The problem is to minimize the total quantization distortion obtained when coding a set of n transform coefficients given an average rate constraint. The distortion-rate problem can be stated as a sum of distortion terms of the form (8), and it is to be solved with the two rate constraints of (15):

$\min_{\{B_i\}} D = \sum_{i=1}^n d_i = \sum_{i=1}^n \epsilon_i^2\sigma_i^2 2^{-2B_i}$   (6-16)

subject to

$\sum_{i=1}^n B_i = nB$   (6-17a)

$B_i \ge 0 \quad \forall i$   (6-17b)

where B is the average coefficient rate. The set of $B_i$ satisfying both (17a) and (17b) is not empty.

The problem as formulated by Huang and Schultheiss in 1963 does not include the non-negativity constraints. This facilitates a simpler solution because only the single summation constraint needs to be considered, instead of the n+1 constraints of the system indicated above.

Consider what this means.

If coefficient rates are allowed to go negative, not only is the system unrealizable but (16) shows that the coding of these coefficients returns a larger distortion summand than if no bits were used to code them:

$\epsilon_i^2\sigma_i^2 2^{-2B_i} > \sigma_i^2$ when $B_i < 0$

(For scalar quantizers, $\epsilon_i^2 \ge 1$, as is shown later.) This "greater-than-no-coding" loss is balanced by allocating larger rates to other coefficients that have positive allocations. Consider the simple example that is presented in Appendix 1.

When performing scalar quantization it is not possible to code a coefficient with less than zero bits. The usual practice is to round the negative allocations to zero and redistribute the total number of bits among the other coefficients in near proportion to their associated rate weightings. This amounts to reworking the mathematically predicted optimal coefficient weighting problem in an ad hoc fashion which, in effect, destroys the optimality of the solution.

The inability to correctly handle negative bit allocations is why it is necessary to include the non-negativity constraints in the distortion-rate function. But, at least to start, only the summation constraint is used. This is done to demonstrate the solution most commonly found in the literature (e.g., [72]). Once the sum-constrained solution is shown, we go back and add the non-negativity constraints and reformulate the solution to include them. These added constraints do not complicate the solution excessively when they are approached with care, and the result is a more realistic distortion-rate function because it is more feasible from an implementation point of view. Since this solution is more realistically constrained than the Huang and Schultheiss solution, these distortion-rate bounds are of greater value.

6.3 The simplest rate solutions

The sum-constrained solution of (16) and (17) can be found by setting the rate derivatives of the Lagrangian to zero,

$\frac{\partial}{\partial B_i}\left\{D - \lambda\left(nB - \sum_i B_i\right)\right\} = 0 \quad \forall i$   (6-18)

and solving for the $B_i$. Thus,

$\frac{\partial d_i}{\partial B_i} + \lambda = 0 \quad \forall i$   (6-19)

and since

$\frac{\partial d_i}{\partial B_i} = (-2\ln 2)\epsilon_i^2\sigma_i^2 2^{-2B_i} + \frac{\partial \epsilon_i^2}{\partial B_i}\sigma_i^2 2^{-2B_i} \quad \forall i$   (6-20)

the optimal rates are assigned through the solution of these differential equations:

$\sigma_i^2 2^{-2B_i}\left\{(-2\ln 2)\epsilon_i^2 + \frac{\partial\epsilon_i^2}{\partial B_i}\right\} + \lambda = 0 \quad \forall i$   (6-21)

The common assumption is to set the partial derivatives of $\epsilon_i^2$ to zero [70]. This means the performance factors are assumed to be constant. The performance factors can be found by fitting a curve of the form $\epsilon^2 2^{-2R}$ to the quantization data of the particular pdf desired. This fit matches the data very well for large rates. The gaussian, laplacian and gamma scalar optimal (nonuniform) quantizer performance factors are shown in Figure 6.1. These curves are constructed using the distortion data presented in [71,73,75]. Notice that they all start at one and approach an asymptotic limit as the rate goes toward infinity. The asymptotic limit is used in the literature to estimate the performance of these quantizers. Notice that the actual distortion is less than this limit. Functions that use this assumption give a solution that is an upper bound to the performance that is actually obtained. Rate-distortion theory states that the best performance factor is 1, and that it is obtainable only for the gaussian pdf. This bound is obtained using the Shannon lower bound; see Berger [74] for details.

The zero-derivative assumption is sometimes justified (e.g., [46]) since the performance factors of the common optimal scalar quantizers vary more slowly than the associated $2^{-2B_i}$ term of the distortion equation. For large data rates this assumption is justified but, for low rates, it does not always hold. If the assumption is used and all of the coefficient pdfs are the same, then all of the $\epsilon_i^2$ are equal. As a result they drop out of the final solution, as is shown next.

Using the $\partial\epsilon_i^2/\partial B_i = 0$ assumption, and setting $\epsilon_i^2$ to, say, $\epsilon^2$ for the case where all of the coefficients have the same pdf, (21) becomes

$(-2\ln 2)\epsilon^2\sigma_i^2 2^{-2B_i} + \lambda = 0 \quad \forall i$   (6-22)

These equations can be solved for the $B_i$,

$B_i = \tfrac{1}{2}\log_2(2\ln 2) + \tfrac{1}{2}\log_2\epsilon^2\sigma_i^2 - \tfrac{1}{2}\log_2\lambda \quad \forall i$   (6-23)

Using the summation constraint, $\lambda$ can be found:

$nB = \tfrac{n}{2}\log_2(2\ln 2) + \sum_{j=1}^n \tfrac{1}{2}\log_2\epsilon^2\sigma_j^2 - \tfrac{n}{2}\log_2\lambda$   (6-24)

$\tfrac{1}{2}\log_2\lambda = \tfrac{1}{2}\log_2(2\ln 2) + \tfrac{1}{n}\sum_{j=1}^n \tfrac{1}{2}\log_2\epsilon^2\sigma_j^2 - B$   (6-25)

Substitute (25) into (23) to find the i-th coefficient rate

$B_i = B + \tfrac{1}{2}\log_2\sigma_i^2 - \tfrac{1}{n}\sum_{j=1}^n \tfrac{1}{2}\log_2\sigma_j^2$   (6-26)

or, equivalently,

$B_i = B + \tfrac{1}{2}\log_2\dfrac{\sigma_i^2}{\left(\prod_{j=1}^n \sigma_j^2\right)^{1/n}}$   (6-27)

The bit rates are allocated using the ratio of the coefficient variance to the geometric mean of all the source coefficient variances. Notice the coefficient rates are linearly proportional to the average rate. A set of curves representing the coefficient rates of (27), using four coefficients whose variances are 1, 4, 16, and 64, is shown in Figure 6.2.
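A minimal Python sketch (not from the dissertation; the function name is illustrative) of the allocation rule (26)-(27), using the same example variances as Figure 6.2. Note that low average rates produce negative allocations, which is exactly the problem addressed below.

```python
# A minimal sketch (not from the dissertation) of the allocation rule of
# (26)-(27): each rate is the average rate plus half the log-ratio of the
# coefficient variance to the geometric mean of all the variances.
import numpy as np

def hs_rates(variances, B):
    """Huang-Schultheiss rates; can go negative at low average rates B."""
    v = np.asarray(variances, dtype=float)
    log_gm = np.mean(np.log2(v))              # log2 of the geometric mean
    return B + 0.5 * (np.log2(v) - log_gm)

# The four coefficient example of Figure 6.2:
print(hs_rates([1, 4, 16, 64], B=2.0))        # [0.5 1.5 2.5 3.5]
print(hs_rates([1, 4, 16, 64], B=0.5))        # [-1.  0.  1.  2.] -- negative!
```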

If the $\epsilon_i^2$ are still assumed to be constant, but different for each coefficient, as would be the case when the individual pdfs vary with i, then the coefficient rates are

$B_i = B + \tfrac{1}{2}\log_2\dfrac{\epsilon_i^2\sigma_i^2}{\left(\prod_{j=1}^n \epsilon_j^2\sigma_j^2\right)^{1/n}}$   (6-28)

In reality, the value of the various performance factors for the different types of scalar quantizers varies with rate, especially for low rates, and their derivatives need to be considered in the solution of (17). For scalar quantizers, the value of $\epsilon_i^2(R)$ is computable only for the rates $R = \log_2 L$, where L is an integer representing the number of code book elements. The value of the rate-dependent performance factor, $\epsilon_i^2(R)$, for rates not in this set can be found using linear interpolation. For simplicity, consider the case when only scalar quantizers of integer coding rate are considered (that is, L = 1, 2, 4, 8, . . ., for the coding rates of 0, 1, 2, 3, . . . bits/sample). To obtain a coding rate that is not integral, it is possible to use an $\alpha$-mixture of two integer quantizers. For example, if B is an integer, then for rates between B and B+1 the distortion performance of such an $\alpha$-mixture quantizer is

$D(B+\alpha) = (1-\alpha)D(B) + \alpha D(B+1), \quad 0 \le \alpha \le 1$   (6-29a)

The quantizer mixture of (29a) is easy to implement using the quantizer data that is available in the literature (e.g., [71,73,75]). The low-rate performance factors for these quantizers are shown in Figure 6.1. It is possible to derive a model for the performance factor predicted by (29a) using the distortion model

$D(R) = \epsilon^2(R)2^{-2R}$   (6-29b)

Equation (29a) becomes

$\epsilon^2(B+\alpha)2^{-2(B+\alpha)} = (1-\alpha)\epsilon^2(B)2^{-2B} + \alpha\epsilon^2(B+1)2^{-2(B+1)}$   (6-29c)

and the associated performance factor is

$\epsilon^2(B+\alpha) = (1-\alpha)\epsilon^2(B)2^{2\alpha} + \alpha\epsilon^2(B+1)2^{-2(1-\alpha)}$   (6-29d)
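The mixture model can be written directly; the sketch below (not from the dissertation) assumes the two integer-rate distortions or performance factors are available, e.g. from the tables in [71,73,75].

```python
# A minimal sketch (not from the dissertation) of the alpha-mixture model:
# (29a) mixes the two integer-rate distortions, and (29d) gives the implied
# performance factor of the mixed quantizer.
def mixture_distortion(D_B, D_B1, alpha):
    """(6-29a): D(B+alpha), 0 <= alpha <= 1, from the integer-rate values."""
    return (1.0 - alpha) * D_B + alpha * D_B1

def mixture_eps2(eps2_B, eps2_B1, alpha):
    """(6-29d): performance factor of the alpha-mixture quantizer."""
    return ((1.0 - alpha) * eps2_B * 2.0 ** (2.0 * alpha)
            + alpha * eps2_B1 * 2.0 ** (-2.0 * (1.0 - alpha)))
```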

For high coding rates the performance factors of optimal scalar quantizers approach constant values. (Let $\epsilon_\infty^2$ be the high-rate performance factor value: then gaussian $\epsilon_\infty^2 = 1.00$, laplacian $\epsilon_\infty^2 = 4.50$ and gamma $\epsilon_\infty^2 = 5.68$; Jayant and Noll [1, Table 4.8].) This can be used to develop a high-rate performance factor,

$\epsilon^2(B+\alpha) = \epsilon_\infty^2\left[(1-\alpha)2^{2\alpha} + \alpha 2^{-2(1-\alpha)}\right]$   (6-29e)

These facts will be used later to compute a distortion-rate function for transform source coders that are scalar quantized.

Since solutions using rate-variable performance factors are not known for negative rates, a solution for the distortion-rate function using this feature must wait until the non-negativity constraints are applied. This is the subject of the next section.

6.4 The non-negative rate solutions

Before finding the distortion-rate solution using the non-negativity constraints, consider what happens to the individual coefficient rates (28) as the average rate varies. Since (28) is linearly proportional to rate, it can be simplified to

$B_i = B + a_i$   (6-30)

where

$a_i = \tfrac{1}{2}\log_2\dfrac{\epsilon_i^2\sigma_i^2}{\left(\prod_{j=1}^n \epsilon_j^2\sigma_j^2\right)^{1/n}}$   (6-31)

Here $a_i$ is a function of the i-th coefficient variance, but not the rate. Notice that the i-th coefficient rate has unit slope, and is not positive for all B; only for $B > -a_i$ is the i-th coefficient rate positive. This fact is capitalized upon in the sequel. To simplify the following equations, let it be assumed, without loss of generality, that the coefficients are arranged in descending order of variance,

$\sigma_1^2 \ge \sigma_2^2 \ge \cdots \ge \sigma_n^2$   (6-32a)

Since the $a_i$ are monotonically increasing functions of the variance, they too are arranged in descending order,

$a_1 \ge a_2 \ge \cdots \ge a_n$   (6-32b)

Now, consider a few definitions concerning the solution space defined by the constraints

(17a) and (17b). For a given rate, the set of points satisfying both (17a) and (17b) represents the set of all possible solution points over which the minimization of (16) can be taken. Any point satisfying (17a) and (17b) is called a feasible point [76]. If, for some feasible point, one of the inequality constraints of (17b) is exactly satisfied (e.g., $\exists i$ such that $B_i = 0$), then the associated point lies on the boundary of the set of feasible points. This exactly-satisfied constraint is said to be active. If any constraint is not active (e.g., $B_i > 0$), then the associated feasible point does not lie on that particular boundary. Such a constraint is said to be inactive.

When all of the non-negativity inequalities are inactive, all of the coefficient rates are strictly positive, and the coefficient rates of (30) are trivially equal to those of (28). This is true for all $B > -a_n$. For consistency, let the definition of $a_i$ of (31) be extended to include notation that explicitly indicates the number of inequality constraints that are active. Let $a_i^{(n-j)}$ be the value of $a_i$ when j inequality constraints are active. Then the $a_i$ of (31) becomes

$a_i^{(m)} = \tfrac{1}{2}\log_2\dfrac{\epsilon_i^2\sigma_i^2}{\left(\prod_{j=1}^m \epsilon_j^2\sigma_j^2\right)^{1/m}}, \quad m = n-j$   (6-33)

As the rate is gradually reduced to values less than $-a_n$, the inequality constraints become active starting with the n-th constraint. As the rate becomes smaller each successive constraint becomes active (e.g., n-1, n-2, . . .) until the rate reaches zero, where all of the constraints are active. These facts allow the non-negatively constrained rates to be found within each of the different activity intervals.

Consider the case where only the n-th inequality constraint is active. In this case, $B_n = 0$ for some coding rate $B < -a_n$ and all of the other coefficient rates are greater than zero. For these coding rates, taking the more general case where the performance factors are constant but different for each coefficient, equations (24) and (25) become

$nB = \tfrac{n-1}{2}\log_2(2\ln 2) + \sum_{j=1}^{n-1}\tfrac{1}{2}\log_2\epsilon_j^2\sigma_j^2 - \tfrac{n-1}{2}\log_2\lambda$   (6-34)

and

$\tfrac{1}{2}\log_2\lambda = \tfrac{1}{2}\log_2(2\ln 2) + \tfrac{1}{n-1}\sum_{j=1}^{n-1}\tfrac{1}{2}\log_2\epsilon_j^2\sigma_j^2 - \tfrac{n}{n-1}B$   (6-35)

so the remaining n-1 nonzero coefficient rates are

$B_i = \tfrac{n}{n-1}B + \tfrac{1}{2}\log_2\dfrac{\epsilon_i^2\sigma_i^2}{\left(\prod_{j=1}^{n-1}\epsilon_j^2\sigma_j^2\right)^{1/(n-1)}} \quad \forall i < n$   (6-36)

To simplify the notation, let this be rewritten using the notation of (33),

$B_i = \tfrac{n}{n-1}B + a_i^{(n-1)} \quad \forall i < n$
$B_n = 0$   (6-37)

Notice that these solutions are straight-line segments of slope n/(n-1). They are valid in the interval from where the n-th inequality constraint became active to where the (n-1)-th inequality constraint becomes active. The (n-1)-th inequality constraint becomes active when $B_{n-1}$ of (37) goes to zero,

$B_{n-1} = \tfrac{n}{n-1}B + a_{n-1}^{(n-1)} = 0$   (6-38)

Solve (38) for the average rate for which this is true:

$B = -\tfrac{n-1}{n}a_{n-1}^{(n-1)}$   (6-39)

Therefore, the (n-1)-th activity interval is lower bounded by (39):

$-\tfrac{n-1}{n}a_{n-1}^{(n-1)} \le B < -a_n$   (6-40)

Now, for the case where only two inequality constraints are active, those for n and n-1, the coefficient rates become

$B_i = \tfrac{n}{n-2}B + a_i^{(n-2)} \quad \forall i < n-1$
$B_i = 0$ for $i = n, n-1$   (6-41)

Similar to the case for the (n-1)-th activity interval, the (n-2)-th activity interval end point can be found:

$B = -\tfrac{n-2}{n}a_{n-2}^{(n-2)}$   (6-42)

Within this interval the nonzero coefficient rates are straight-line segments with slope n/(n-2).

By proceeding into each new activity interval the coefficient rates are all found in a similar fashion. When j inequality constraints are active, the coefficient rates are

$B_i = \tfrac{n}{n-j}B + a_i^{(n-j)} \quad \forall i \le n-j$
$B_i = 0 \quad \forall i > n-j$   (6-43)

and these rates are valid over the interval

$-\tfrac{n-j}{n}a_{n-j}^{(n-j)} \le B < -\tfrac{n-j+1}{n}a_{n-j+1}^{(n-j+1)}$   (6-44)

The activity of the inequality constraints does not change within this interval. There are n+1 such intervals and they cover the entire set of all possible rates, $B \ge 0$.

Notice the slope of the coefficient rates increases as each inequality constraint becomes active; when all but one of the constraints are active, the coefficient slope is equal to n. This fact is illustrated in Figure 6.3, using the same four coefficient system of Figure 6.2. Before moving to the next subject, where the performance factors are allowed to vary with rate, the character of the distortion-rate function is explored for the rate solutions presented in this and the preceding section of the chapter.

6.5 Distortion function for constant performance factors

Consider the case where the distortion-rate function is only sum constrained and the rates can be assigned negative values (there are no non-negativity constraints). As was shown above, the coefficient rates are assigned by equation (30),

$B_i = B + a_i$   (6-45)

so the distortion defined by (16) can be expressed in a particularly simple form:

$D = \sum_{i=1}^n \epsilon_i^2\sigma_i^2 2^{-2(B+a_i)}$   (6-46)

The distortion is an exponential decay starting at $D(0) = \sum_i \epsilon_i^2\sigma_i^2 2^{-2a_i}$ for the coding rate B=0.

The distortion function of (46) has some interesting properties. Consider the four coefficient example system whose rates are depicted in Figure 6.2. The distortion of each of the four distortion summands and the total distortion of (46) for this system are shown in Figure 6.5.

Notice that all of these curves are straight lines on a semilog plot whose slope is -2. Define a set of individual distortion functions, one for each coefficient. The total distortion is

$D = \sum_{i=1}^n D_i = \Theta 2^{-2B}$   (6-47)

The semilog function of (47) is straight,

$\log_2 D = \log_2\Theta - 2B$   (6-48)

and so are those of each of the distortion summands:

$D_i = (\epsilon_i^2\sigma_i^2 2^{-2a_i})2^{-2B} = \Theta_i 2^{-2B}$   (6-49)

$\log_2 D_i = \log_2\Theta_i - 2B$   (6-50)

Now, consider the case where the non-negativity constraints are included. As they start to become active the distortion function of (47) must be modified to include the coefficient rates that become fixed at zero. For the case where none of these constraints are active ($B > -a_n$), the distortion function of (47) is correct. For consistency with what is to follow, let (47) be redefined to include notation to show the number of active constraints:

$D^{(n)} = \left(\sum_{i=1}^n \epsilon_i^2\sigma_i^2 2^{-2a_i^{(n)}}\right)2^{-2B} = \Theta^{(n)}2^{-2B}$   (6-51)

The distortion in the rate interval $B > -a_n$, where none of the inequality constraints are active, is trivially equal to (51).

For the case where j non-negativity constraints are active, it will be shown that the distortion function has a form similar to (51) within each interval of constant activity (44). Consider the case for which only the n-th inequality constraint is active. In that case the coefficient rates are defined by (37),

$B_i = \tfrac{n}{n-1}B + a_i^{(n-1)} \quad \forall i < n$
$B_n = 0$   (6-52)

and the distortion, as defined by (16), is

$D^{(n-1)} = \sum_{i=1}^{n-1}\epsilon_i^2\sigma_i^2 2^{-2B_i} + \epsilon_n^2\sigma_n^2$   (6-53)

By substituting (37) into (53), the distortion becomes an explicit function of the $a_i^{(n-1)}$. For the average rate B the distortion has two parts:

$D^{(n-1)} = \left(\sum_{i=1}^{n-1}\epsilon_i^2\sigma_i^2 2^{-2a_i^{(n-1)}}\right)2^{-\frac{2n}{n-1}B} + \epsilon_n^2\sigma_n^2$   (6-54)

The first part is an exponential decay whose magnitude is

$\Theta^{(n-1)} = \sum_{i=1}^{n-1}\epsilon_i^2\sigma_i^2 2^{-2a_i^{(n-1)}}$   (6-55)

The second part is a distortion "offset" based upon the unquantized n-th coefficient,

$\Phi^{(n-1)} = \epsilon_n^2\sigma_n^2$   (6-56)

To get the distortion function for the j-th rate interval (44), the n-1 of (55) and (56) is replaced by n-j, and the summations are regrouped,

$D^{(n-j)} = \sum_{i=1}^{n-j}\epsilon_i^2\sigma_i^2 2^{-2B_i} + \sum_{i=n-j+1}^{n}\epsilon_i^2\sigma_i^2$   (6-57)

$D^{(n-j)} = \Theta^{(n-j)}2^{-\frac{2n}{n-j}B} + \Phi^{(n-j)}$   (6-58)

where the constants $\Theta$ and $\Phi$ are

$\Theta^{(n-j)} = \sum_{i=1}^{n-j}\epsilon_i^2\sigma_i^2 2^{-2a_i^{(n-j)}}$   (6-59)

$\Phi^{(n-j)} = \sum_{i=n-j+1}^{n}\epsilon_i^2\sigma_i^2$   (6-60)

Equation (59) is assumed zero for j=n, and (60) is assumed zero for j=0. Within each of the corresponding rate intervals of (44), these curves decay exponentially and are biased by $\Phi^{(n-j)}$, the distortion value of the coefficients coded at zero rate.
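A minimal Python sketch (not from the dissertation) of the piecewise distortion (58); the number of active constraints j is supplied directly, so the caller is responsible for choosing the j whose activity interval (44) contains B.

```python
# A minimal sketch (not from the dissertation) of the piecewise distortion
# (58): with j active constraints the coded part decays as 2^(-2nB/(n-j))
# and the uncoded part Phi is a constant offset. The caller must supply the
# j matching the activity interval (44) that contains B.
import numpy as np

def piecewise_distortion(eps2, var, B, j):
    w = np.sort(np.asarray(eps2, float) * np.asarray(var, float))[::-1]
    n = len(w)
    m = n - j                                  # number of coded coefficients
    if m == 0:
        return float(np.sum(w))                # all coefficients uncoded
    a = 0.5 * (np.log2(w[:m]) - np.mean(np.log2(w[:m])))   # a_i^(n-j), (33)
    theta = np.sum(w[:m] * 2.0 ** (-2.0 * a))              # Theta, (59)
    phi = np.sum(w[m:])                                    # Phi, (60)
    return float(theta * 2.0 ** (-2.0 * n * B / m) + phi)  # (58)
```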

Figure 6.4 shows the distortion function of (58) for the four coefficient example of this chapter. (Compare this with Figure 6.5 where the Huang and Schultheiss distortion functions are depicted.) Notice that the semilog summand functions are piecewise continuous straight lines over each of the intervals of constant activity. The distortion of (58) consists of n summands,

$D^{(n-j)} = \sum_{i=1}^{n-j} D_i^{(n-j)} + \sum_{i=n-j+1}^{n}\epsilon_i^2\sigma_i^2$   (6-61)

where each coded summand,

$D_i^{(n-j)} = \Theta_i^{(n-j)}2^{-\frac{2n}{n-j}B}$   (6-62)

$\Theta_i^{(n-j)} = \epsilon_i^2\sigma_i^2 2^{-2a_i^{(n-j)}}$   (6-63)

is a semilog straight line,

$\log_2 D_i^{(n-j)} = \log_2\Theta_i^{(n-j)} - \tfrac{2n}{n-j}B$   (6-64)

whose slope is -2n/(n-j). The only rates for which the total distortion is a semilog straight line are those for which all of the non-negativity constraints are inactive ($B > -a_n$). For any other rate at least one of the non-negativity constraints is active and $\Phi^{(n-j)}$ of (58) is not zero. Therefore, the summands of (58) cannot be grouped to form a straight semilog function:

$D^{(n-j)} \ne K_1 2^{-K_2 B}$ ($K_1$, $K_2$ are constants)   (6-65)

In reality, equation (58) represents an upper bound for the distortion-rate function. This can be shown by considering its value at B=0. Here, all of the constraints are active, and (58) becomes

$D = D^{(0)} = \Phi^{(0)} = \sum_{i=1}^n \epsilon_i^2\sigma_i^2$   (6-66)

but the zero rate distortion is equal to the sum of the coefficient variances,

$D = \sum_{i=1}^n \sigma_i^2$   (6-67)

The distortion of (58) represents the zero-rate coding condition (that is, if the rate is zero the source signal is assumed to be zero) compatible with the work done within this dissertation. For $D^{(0)}$ to satisfy (67), the $\epsilon_i^2$ must be 1. This condition is only satisfied for gaussian statistics.

When using gaussian statistics, the rate-distortion bound shows that $\epsilon^2 = 1$ for all rates, so (58) satisfies (67) and represents the true distortion function for all possible source coding rates. Since the pdfs for the non-dc coefficients of natural images are more likely to be laplacian [77], where $\epsilon_{opt}^2 > 1$ [73], this condition is never met. Therefore, at low rates (58) can only be considered an upper bound for the actual distortion performance. Since the rate-constant $\epsilon^2$ predicts the actual quantizer performance for large rates, this bound does become tight for large rates. In the next section, solutions are formulated for the case where the performance factors are allowed to vary with rate.

6.6 Distortion function using variable performance factors

In this section the distortion-rate problem of (16) and (17) is considered for the case where the performance factors are allowed to vary with rate. For this case, the derivatives of the performance factors of (21) are not zero:

$\sigma_i^2 2^{-2B_i}\left\{(-2\ln 2)\epsilon_i^2 + \frac{\partial\epsilon_i^2}{\partial B_i}\right\} + \lambda = 0 \quad \forall i$   (6-21)

The Lagrange multiplier, which is the same for all n equations,

$\lambda = \sigma_i^2 2^{-2B_i}\left\{(2\ln 2)\epsilon_i^2 - \frac{\partial\epsilon_i^2}{\partial B_i}\right\} = g_i'(B_i) \quad \forall i$   (6-68)

must solve n first order differential equations for a given B. As was shown above, when the derivatives of (21) are zero, $\lambda$ can be found by using the rate constraint (17a). If the $g_i'(B_i)$ are strictly monotonic, Segall [59] uses the dual optimization method (Luenberger [78]) to solve for $\lambda$, and by using the invertibility of $g_i'(B_i)$ finds solutions for the coefficient rates, $B_i$. Segall only studied the case of gaussian sources, for which the $g_i'(B_i)$ are strictly monotonic and the same for all i. When solving (68) for nongaussian sources, $g_i'(B_i)$ does not necessarily retain the monotonic property. In the general case, not all of the $g_i'(B_i)$ are the same. Both of these facts limit the applicability of his method.

Figure 6.6 shows $\lambda$ as a function of coefficient rate for a laplacian pdf. Here each $g_i'(B_i)$ is invertible over only a limited range. Outside of these ranges the same value of $\lambda$ is mapped into at least two different coefficient rates, voiding the uniqueness of the inverse functions of (68).

Contrast this with the $\lambda$ curve of Figure 6.6 where a rate-constant performance factor is used (in this figure the asymptotic laplacian performance, $\epsilon^2 = 4.7$, is used). Here the Lagrange multiplier is

$\lambda = g_i'(B_i) = (2\ln 2)\epsilon^2\sigma_i^2 2^{-2B_i}$   (6-22a)

$\log_2\lambda = \log_2 g_i'(B_i) = \log_2\left((2\ln 2)\epsilon^2\sigma_i^2\right) - 2B_i \quad \forall i$   (6-22b)

Now the coefficient rates can be found by inverting the $g_i'(B_i)$ functions. This gives the same solutions as demonstrated by the rate equations of (43).

The noninvertibility of (68) can prevent one from finding a closed solution for the system of (16) and (17). Computer minimization has been utilized to find numerical solutions for the coefficient rates and distortion of this system using the laplacian pdf four coefficient example. The distortion curves for this system are shown in Figures 6.7 and 6.8. Notice that as the rate increases, the solutions using rate-variable performance factors approach those using the asymptotic high-rate performance factors. Recall that the solutions using the high-rate asymptotic performance factors are invertible. At high rates the rate-variable solutions are also invertible. This is true since the derivatives of the scalar laplacian pdf performance factors go to zero at high rates,

$\frac{\partial\epsilon_i^2}{\partial B_i} \to 0$ as $B_i \to \infty \quad \forall i$   (6-69)

so the rate decay, $2^{-2B_i}$, will dominate $g_i'(B_i)$. In fact, at high rates $g_i'(B_i)$ of (68) has the same form as (22) and becomes invertible. This is true within a portion of the rate interval,

$B > -a_n$, where all of the inequality constraints are inactive. Therefore, the coefficient rates for the system of (68) are of the form

$B_i \to B + a_i^{(n)}$ as $B \to \infty \quad \forall i$   (6-70)

where the $a_i^{(n)}$ are those defined in (33).
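Since no closed-form solution exists when the $\epsilon_i^2(B_i)$ vary with rate, the computer minimization mentioned above can be reproduced with a general constrained optimizer. The sketch below (not from the dissertation) assumes a hypothetical interpolated table eps2_of of laplacian performance factors and uses SciPy's SLSQP method to impose (17a) and (17b).

```python
# A minimal sketch (not from the dissertation) of the computer minimization:
# eps2_of is a hypothetical interpolated table of laplacian performance
# factors, and SciPy's SLSQP imposes the constraints (17a) and (17b).
import numpy as np
from scipy.optimize import minimize

def solve_rates(eps2_of, var, B):
    var = np.asarray(var, float)
    n = len(var)

    def total_distortion(rates):
        return float(np.sum(eps2_of(rates) * var * 2.0 ** (-2.0 * rates)))

    res = minimize(
        total_distortion,
        x0=np.full(n, B),                      # start at the average rate
        method="SLSQP",
        bounds=[(0.0, None)] * n,              # (17b): B_i >= 0
        constraints=[{"type": "eq",
                      "fun": lambda r: r.sum() - n * B}],  # (17a)
    )
    return res.x
```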

Another approach to finding the coefficient rates of this section is discussed in Appendix 2. In the appendix, the solution to the differential equations of (19) is approached using the method of exact differential equations. The solution found does not agree with those shown above. This is believed to be the result of solving a set of equations that are not guaranteed to have a unique solution. This problem is left as an open subject for future consideration.

In conclusion, consider four distortion-rate functions for the four coefficient laplacian pdf example system of this chapter. In Figures 6.7 and 6.8, these functions indicate the distortion-rate functions for this system:

1) the Huang and Schultheiss upper bound using scalar quantized rates with rate-constant laplacian performance factors, $\epsilon_\infty^2$ (Figures 6.2 and 6.5),

2) the distortion upper bound using non-negatively constrained rates coded with scalar quantization and rate-constant laplacian performance factors (Figures 6.3 and 6.4),

3) the distortion performance using non-negatively constrained rates coded with $\alpha$-mixture scalar quantization assuming rate-variable laplacian performance factors, and

4) the distortion lower bound using non-negatively constrained rates assuming the Shannon lower bound laplacian performance factors (Figure 6.9).

The first upper bound uses the coefficient rate solutions resulting from the method of Huang and Schultheiss (section 6.3), where the coefficient rates are allowed to take negative values and the quantizer performance factors use the high-rate asymptotic values. The second upper bound curve is similar to the first curve except the coefficient rates are constrained to be non-negative. The Huang and Schultheiss upper bound is less than the non-negative upper bound at lower rates because it reflects the unrealistic use of coefficient rate allocations that are negative.

The third curve uses the non-negativity constraints (section 6.4) and the rate-variable performance factor solution (section 6.6). This distortion-rate curve uses the realizable mix of laplacian scalar quantizers indicated by (29). Since the performance factors indicated by (29) are obtainable, this distortion-rate curve is also obtainable in practice. The first two upper bound curves are asymptotic to this curve, but are not tight upper bounds. This third curve substantially lowers the upper bound estimate that is obtained by the other two bounds at low rates.

To demonstrate that the third function is obtainable, a performance curve was obtained using computer simulation. The performance factors of (29) were obtained using a random number generator to select which quantizer is used to code a given source symbol for nonintegral coding rates. Let $x_i$ be the value of the i-th element of a uniformly distributed random sequence that takes values from 0 to 1. Then for rates $R \in (B, B+1)$, if $B + x_i > R$ the source point is quantized with rate B; otherwise it is quantized with rate B+1. This method will code a source with an $\alpha$-mixture of the end point quantizer rates. The final coding rate is $R = B+\alpha = (1-\alpha)B + \alpha(B+1)$.
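A minimal Python sketch (not from the dissertation) of this random-selection rule; quantize is a hypothetical stand-in for an optimal laplacian scalar quantizer at an integer rate.

```python
# A minimal sketch (not from the dissertation) of the random-selection rule:
# a uniform draw picks the rate-B or rate-(B+1) quantizer so the average
# coding rate is R = B + alpha. quantize() is a hypothetical stand-in for an
# optimal laplacian scalar quantizer at an integer rate.
import numpy as np

rng = np.random.default_rng(0)

def mixture_code(samples, R, quantize):
    samples = np.asarray(samples, float)
    B = int(np.floor(R))
    x = rng.uniform(0.0, 1.0, size=len(samples))
    out = np.empty_like(samples)
    use_B = B + x > R                          # true with probability 1-alpha
    out[use_B] = quantize(samples[use_B], B)
    out[~use_B] = quantize(samples[~use_B], B + 1)
    return out
```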

For each rate tested, 100,000 independently distributed laplacian source vectors were quantized using the example system used throughout this chapter. The quantizers were optimal laplacian quantizers. Source coding rates at 0.25 bits/sample steps were tested. The results of this simulation are shown in Figure 6.10. The theoretical curve predicts the simulation results accurately.

The lower bound distortion-rate curve uses the Shannon lower bound for the laplacian pdf performance factor (lower bound $\epsilon^2 = 0.865$ [81]). This curve is generated using the non-negatively constrained rate method of this chapter. Using these constraints, the associated rate-distortion function is a lower bound for the other three mentioned above. No other laplacian-sourced quantizer can outperform this bound.

In the next chapter the ideas of this chapter are extended to include vector quantization, and the emphasis is placed upon building a distortion-rate function that can be used to model the performance of MBC and MBC/PT.

[Figure 6.1: Scalar quantizer performance factors for the gaussian, laplacian and gamma pdfs.]

[Figure 6.2: Huang and Schultheiss coefficient rates for the four coefficient example (variances 1, 4, 16, 64).]

[Figure 6.3: Non-negatively constrained coefficient rates for the four coefficient example.]

[Figure 6.4: Distortion function using the non-negatively constrained rates.]

[Figure 6.5: Huang and Schultheiss distortion functions for the four coefficient example.]

[Figure 6.6: Lagrange multiplier versus coefficient rate for the laplacian pdf.]

[Figure 6.7: Distortion-rate functions for the four coefficient laplacian example.]

[Figure 6.8: Distortion-rate functions for the four coefficient laplacian example (continued).]

[Figure 6.9: Shannon lower bound distortion-rate function.]

[Figure 6.10: Simulated versus theoretical distortion for the α-mixture quantizer.]

Chapter 7.

A Vector Quantization Distortion-rate Function for MBC and MBC/PT

The transform source coder distortion-rate function, including the special needs of vector quantization and MBC, is the subject of this chapter. Particularly, the problem of subdividing a block of transform coefficients into sub-blocks of more manageable size is considered. A distortion-rate formulation is developed for the MBC and MBC/PT systems of Chapter 5.

Some comments are also made concerning their use as a tool to explore the obtainable lower bounds for the distortion of such coding schemes. Several parameter alternatives (those other than rate) for the distortion function are also developed. These distortion-rate alternates include distortion-threshold and distortion-mixture-fraction parameterizations.

7.1 Block partitioning for vector quantization

When using vector quantization techniques to code a block of transform coefficients, not much information can be found in the literature to help one partition the transform block into manageable sub-blocks. In general, the entire transform blocksize is larger than can be adequately handled with a single LBG vector quantizer, so the block is usually partitioned into a set of smaller sub-blocks for actual quantization. Vaisey and Gersho [66] use a system for assigning a distortion dependent sub-block partition for a variable blocksize DCT coder. But the partitioning and the rates allocated to the vector quantizer of each block were assigned ad hoc. The goal of this chapter is to explore some of the theoretical aspects to be taken into consideration when subdividing a transform block for use with vector quantization. A distortion-rate function is developed for use with this problem.

Block distortion-rate theory shows the average distortion-rate bound for an independently distributed gaussian source of blocksize n, coded with an average rate of R, is [74]

$D_G^{(ave)} = \langle\sigma^2\rangle 2^{-2R}$   (7-1)

where $\langle\sigma^2\rangle$ is the geometric mean of the coefficient variances of the block. The (ave) symbol is used to indicate the distortion measured is a per coefficient average. This shows that the distortion of a coded block is proportional to the number of quantizer code points and the geometric mean of the block variances. For a non-gaussian source this can be generalized using the Shannon lower bound [74], which incorporates a performance factor $\epsilon^2$:

$D^{(ave)} = \epsilon^2\langle\sigma^2\rangle 2^{-2R}$   (7-2)

This is a direct analog of the Huang and Schultheiss distortion formulation of the last chapter. If the performance factor is equal to one, $D^{(ave)} = D_G^{(ave)}$, the known solution for gaussian statistics. The total distortion of the whole block is

$D = n\epsilon^2\langle\sigma^2\rangle 2^{-2R} = nD^{(ave)}$   (7-3)

If the source is to be partitioned and coded with N disjoint sub-blocks, the total distortion of the source coder is

$D = \sum_{k=1}^N D_k$   (7-4)

If the number of elements in the k-th sub-block is $n_k$, then the average distortion obtained by this block partition is

$D^{(ave)} = \frac{1}{n}\sum_{k=1}^N D_k$   (7-5)

or, in terms of the sub-block average distortions,

$D^{(ave)} = \frac{1}{n}\sum_{k=1}^N n_k D_k^{(ave)}$   (7-6)

So far, this discussion has been based upon the partitioning of a single block in a known fashion. But when performing MBC, different sized blocks are mixed together, each of which may be partitioned differently. How to formulate a distortion function to include multiple transform blocksizes is considered next.

7.2 Rate as a function of the MBC mixture and thresholds

Consider the relationship existing between the coding rate and mixture fractions when using MBC. If the average coding rate of the different blocksizes is known, say $R_k$ for the k-th blocksize, then the average rate at which an image is coded is determined by the mixture fractions, $p_k$,

$R = R(p_1, p_2, \ldots, p_N) = \sum_{k=1}^N p_k R_k$   (7-7)

The average block rate is definable as

$R_k = \frac{E_k}{S_k}$   (7-8)

where $E_k$ is the number of bits used to code the k-th block type whose size is $S_k$. Since the entire image is to be coded, consistency demands

$\sum_{k=1}^N p_k = 1$

Since the sub-block mixture fractions and the total coding rate are related through (7), it is sometimes easier to compute the distortion in terms of the fractions instead of the individual block rates.
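As a small worked instance of (7) and the consistency condition (not from the dissertation; the numbers are hypothetical):

```python
# A small worked instance (not from the dissertation) of (7) and the
# consistency condition; the fractions and per-blocksize rates are made up.
def mbc_rate(fractions, block_rates):
    assert abs(sum(fractions) - 1.0) < 1e-9    # fractions must sum to one
    return sum(p * R for p, R in zip(fractions, block_rates))

# 60% of the image at 0.2 bits/pel, 30% at 0.8, 10% at 2.0:
print(mbc_rate([0.6, 0.3, 0.1], [0.2, 0.8, 2.0]))   # 0.56 bits/pel
```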

When using the MBC method, the mixture fractions are determined by the block thresholds. As was indicated in Chapter 5, the mixture fractions change with threshold (see Figure 5.5). Notice that the mixture versus threshold profile is different for different images (e.g., Figure 5.12). This indicates the coding rate will be different from image to image.

To formalize this last paragraph, let a set of n scalar mappings $\{p_k\}_{k=1}^n$ be defined from the scalar threshold space, $T = \mathbb{R}^+$, into the n-dimensional mixture-fraction space, $F = (\mathbb{R}^+)^n$. To start with, assume that the same threshold is used for each blocksize. Then

$p_k = p_k(t)$ for $t \in T$, so that $(p_1(t), p_2(t), \ldots, p_n(t)) \in F$   (7-9)

Once these mappings are known, the coding rate is a function of the MBC thresholds:

$R = R(p_1(t), p_2(t), \ldots, p_n(t)) = R(F)$   (7-10)

In general, the $p_k$ maps will vary with the image, say $p_k^i$ for the image $X_i \in \mathcal{X}$. Now the rate is a function of the image, as well as the threshold,

$R = R(p_1^i(t), p_2^i(t), \ldots, p_n^i(t)) = R(F; i)$   (7-10a)

With this formulation, all images that have similar mixture maps can be classified into a single subset of the set of all images, $\mathcal{X}$. All of these images will code at the same rate. This point is useful when studying the different classes of images, but for the remainder of this chapter it is assumed that only a single image (or image class) is to be studied. This allows us to use the simpler rate models of (10).

The distortion-rate function of mixture block coding is a mapping "from a multi-dimensional space" and it would be desirable to reduce the problem to one of a mapping "from a one-dimensional space." In reality, the function of (10),

$R = R(p_1(t), p_2(t), \ldots, p_n(t)) = R(F) = R(t)$   (7-10b)

is not multi-dimensional, since the mixture maps are all a function of a single variable, the threshold. But the formulation of (10) is an oversimplification of the problem. The reason for having a mixture coding system in the first place is to allow one to code regions of different difficulty with different block types. This must be done in such a way that allows one to partition the blocks of one blocksize independently of the way one partitions blocks of another blocksize. This can be done by allowing the thresholds of each blocksize to vary independently.

This can be done by allowing the thresholds of each blocksize to vary independently.

If the threshold is allowed to vary independently for each blocksize, the rate becomes a

true multi-dimensional mapping from the threshold space and the mixture fraction space. This

is shown by reformulating (9) and (10),

pt = Pt(tt) for ttcT, so (Pl(tj), P2(t2), . . . ,Pn(tn))eF (7-11)

R = Rtp^tj), p2(tj), . . . ,pn(tn)) = R(t1> tj, . . . ,tn) I (7-12) ItPjJfF The scalar mapping property is lost completely. Since this definition is more general (less con- strained) it will give a lower distortion-rate bound than will (10). 160

The mixture and rate maps of (10) are not continuous functions of the threshold, since the mixture fractions must be taken from a discrete set. For example, an image that contains 256x256 pels can be partially coded with 16x16 blocks for only the fractions 1/256, 2/256, 3/256, . . . , 256/256. For the mixture space to be continuous it would be necessary to use fractional blocks. It would be necessary to add rigor to the meanings of these fractional mapping definitions if a more complete analysis of MBC using distortion-rate theory were the goal, but this problem is not critical to the material discussed here and is left for future study.

Now that the rate is known as a function of the thresholds and mixture fractions, it is possible to use these formulations to derive a distortion-rate function that is not only a function of rate but also the thresholds and mixture fractions.

7.3 Threshold driven distortion-rate functions

In section 7.1 a method for coding a block using a known partition into vector quantized sub-blocks was discussed. This material is directly applicable to the material presented here. When taken with the threshold-rate determinations of the last section, a distortion-rate function, or more conveniently a threshold-distortion function, can be obtained for a given threshold value.

The average distortion of an MBC coder is defined by

$D^{(ave)} = \frac{\sum_k p_k D_T^{(k)}}{n}$   (7-13)

where $D_T^{(k)}$ and $n_k$ are the distortion and blocksize for the k-th MBC block type, and n is the number of pels in the image. The summation is taken over the number of blocksizes used by the MBC coder. The $D_T^{(k)}$ distortion values are assumed to be those of vector-quantized transform-coded blocks, as discussed above. They take the form indicated by (4):

$D_T^{(k)} = \sum_{i=1}^{N_k} D_{k,i}$   (7-14)

where $N_k$ is the number of sub-block partitions of the k-th block type, and the $D_{k,i}$ are the different distortion totals for each of the vector-quantized sub-blocks of the k-th MBC block type.

Notice the denominator of (13) is equal to the number of pels in the image. If the number is equal to N×N, (13) becomes

$D^{(ave)} = \frac{D_T}{N\times N}$   (7-15)

Since N×N is a constant, the distortion function can be redefined, without loss of generality, as

$D_T = \sum_k p_k D_T^{(k)}$   (7-16)

The $D_T^{(k)}$ distortions can be found using any source coding method. (This form of the equation assumes that the total distortion is the sum of "expected" distortion terms. This feature is not necessarily true, and will be addressed at the end of the chapter.) They do not have to be computed using the specialized vector-quantized transform-coded MBC methods emphasized in this dissertation. The final coding rate obtained is a function of these particular block coders, and it may be desirable to use a different coding method for each blocksize.

The MBC distortion is to be minimized given the rate or, equivalently, the mixture fractions. Also, the distortion can be minimized using scalar or vector threshold parameterization. So the performance of an MBC coder can be parameterized by one of four methods:

$\min_R D_T, \quad \min_{\{p_k\}} D_T, \quad \min_t D_T, \quad \min_{\{t_k\}} D_T$   (7-17a-d)

Not all of these functions give the same distortion function. The minimization (17a) represents the general formulation. Equations (17b) and (17d) are just reformulations identical to (17a). These formulations reside in the same multi-dimensional solution space no matter which parameterization is used. Only (17c) resides in a singly dimensioned space. Since (17c) is more severely constrained than the other formulations, it is an upper bound for them.

Once the forms of the $D_T^{(k)}$ are known, the minimizations of (14) can be done. This is dependent upon the type of block coders used and is the subject of the next section.

7.4 Distortion-rate function for MBC

The distortion function for the vector-quantized MBC systems described in Chapter 5 is developed here. The different blocksizes¹ for this method are all coded similarly using a DCT.

Only four coefficients are coded per block, and these coefficients are divided into two sub-blocks for quantization. This accounts for a three-way block partition. The first partition sub-block contains only the dc DCT coefficient. The second contains the next three lowest frequency DCT coefficients. The last contains the remaining DCT coefficients, all of which are not coded. The block distortion function (14) is the same for all blocks and is of the form

$D_T^{(k)} = \sum_{i=1}^{3} D_{k,i}$   (7-18)

The dc coefficient codes with a distortion that must take into consideration the possibility of a non-zero mean,

$D_{k,1} = \epsilon_{k,1}^2\left(\sigma_{k,1}^2 + \mu_{k,1}^2\right)2^{-2B_{k,1}}$   (7-19)

the vector-quantized second sub-block codes with distortion

$D_{k,2} = \epsilon_{k,2}^2\langle\sigma_{k,2}^2\rangle 2^{-2B_{k,2}}$   (7-20)

and the remaining uncoded coefficient distortions are

$D_{k,3} = \sum_{i\in\mathcal{J}_k}\sigma_i^2$   (7-21)

¹Up to this point in the text of this chapter, the coefficients of a block have been assumed to be scalar valued. In general, the coefficients can be vectors, as is the case when coding a color image. When this is the case, the block has three times as many coefficients, and problems can arise when one counts the number of entities being coded within a block. In the text it is assumed that the coefficients are scalar and all totals reflect this fact. If a need to study vectored coefficients is important, the numerical totals used to count how many coefficients occur within a block need to be updated accordingly.

The symbol $\langle\sigma_{k,2}^2\rangle$ represents the geometric mean of the variances of the three non-dc coefficients to be vector quantized in the second sub-block of the k-th MBC block type. The $B_{k,i}$ represent the average coding rate allocated for the i-th sub-block of the k-th MBC block type. In the last equation, $\mathcal{J}_k$ represents the index set for the uncoded coefficients of the k-th MBC block type. For the four coefficient DCT coding, $\mathcal{J}_k$ contains $n_k - 4$ coefficients. Notice if the blocksize is 2x2 then $\mathcal{J}_k$ is vacuous and $D_{k,3} = 0$.
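A minimal Python sketch (not from the dissertation) of the three-way partition distortion (18)-(21). The non-zero-mean dc form follows the reconstructed (19) above and should be treated as an assumption; all names are illustrative.

```python
# A minimal sketch (not from the dissertation) of the three-way partition
# distortion (18)-(21). The non-zero-mean dc form follows the reconstructed
# (19) above and is an assumption; all names are illustrative.
import numpy as np

def mbc_block_distortion(eps2_dc, var_dc, mean_dc, B_dc,
                         eps2_ac, var_ac3, B_ac, var_uncoded):
    d_dc = eps2_dc * (var_dc + mean_dc ** 2) * 2.0 ** (-2.0 * B_dc)   # (19)
    gm = np.exp(np.mean(np.log(np.asarray(var_ac3, float))))
    d_ac = 3 * eps2_ac * gm * 2.0 ** (-2.0 * B_ac)     # three VQ'd coefficients
    d_un = float(np.sum(var_uncoded))                  # uncoded remainder (21)
    return d_dc + d_ac + d_un
```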

All of the non-dc coefficients can be assumed to have laplacian distributions. Therefore, the vector quantizer performance factors can be modelled using the Shannon lower bound for laplacian sources. The dc coefficients are not as easily modelled. Some have suggested that various unimodal pdfs can be used to model transform-coded dc coefficients, but natural images are, at many times, multimodal and very difficult to model parametrically. This problem must be left open unless more information can be found for the particular images being coded.

The magnitudes of the uncoded variances of (21) are hard to model. Not much work has been done to study this problem, except to say these coefficients can be modelled with a laplacian pdf that is parameterized by the coefficient indexes. One method uses a sum of the coefficient indexes [14]:

$\rho(i_x, i_y) = \exp\{-(\alpha_x i_x + \alpha_y i_y)\}, \quad i_x, i_y = 0, 1, \ldots, N-1$   (7-22)

where $i_x$ and $i_y$ are the 2-dimensional coefficient indexes of the N×N block, and $\alpha_x$ and $\alpha_y$ are constants. Here the covariance model is separable. Other work has been done to study specific problems, but very little can be added to the study of the more general problem. Most models, including the one mentioned above, assume stationarity. Since the work done here is an attempt to overcome the problems associated with nonstationarity, these facts are of little use.

One way to overcome this problem is to classify the various types of images by their block percentage versus distortion threshold profiles, as was mentioned in conjunction with equation (10a). Of course this assumes that the block variances can be modelled more accurately using this profile classification system. It is reasonable to assume the block variance "profile" can be modelled similarly for all images having similar block percentage profiles. This assumption has not been validated. This is partially due to the fact that the available image database is not large enough to adequately pursue the subject.

With the last consideration held aside, the total MBC distortion as represented by (17) is now known, and once the type of parameterization is selected (e.g., coding rate, block percentages, or distortion thresholds) the minimization process can be pursued.

Since the distortion functions of this chapter are represented by sums of variance-weighted exponential decays, they differ from those used in the last chapter only in one significant point that is easily handled. The uncoded coefficients "trail along" from one pass to the next, acting as an offset to the overall distortion baseline, which is similar to the coefficients that are coded with zero rate in Chapter 6.

In review, the same approach can be used to solve for the distortion function and the associated coefficient rates for an MBC system as was used in Chapter 6. The only important difference comes with the inclusion of the more general nature of performance factors that model vector quantized blocks. But this problem is one of modelling performance factors, not one of defining rate-distortion functions. In the next section, the MBC distortion function developed above is modified to include the effects of progressive transmission.

7.5 Distortion-rate function for MBC/PT

The distortion function for an MBC/PT system can be modelled in nearly the same way as was developed in the last section. When using MBC with progressive transmission, the image is coded in a sequence of passes. The algorithm codes each pass with smaller blocks whose coefficients are generated by using the residuals of the previous pass. Interestingly, the residual non-dc coefficients for the DCT based MBC/PT computer simulations done for this dissertation showed that the residuals are nearly laplacian, as were those found for the MBC computer simulations. After the first pass, the laplacian performance factors can be used to estimate the coding performance for the three DCT ac coefficients coded in these simulations. (If more coefficients are coded, their performance factors would need to be determined.)

Since the coding of the first-pass dc coefficients is the same as has been discussed above, only the distortion summands that correspond to the dc coefficient residuals of the subsequent passes need to be reformulated. Through simulation it was found that these subsequent-pass dc-coefficient residuals are modellable as a zero-mean laplacian process. The variance of the dc residuals was found to be somewhat less than that of the ac residuals of the same blocksize. This fact was compensated for in the computer simulations of Chapter 5 by using an additional scaling factor.

To reflect these changes the MBC/PT distortion of (19) needs to be reformulated. The first pass distortion function remains the same,

$D_{1,1} = \epsilon_{1,1}^2\left(\sigma_{1,1}^2 + \mu_{1,1}^2\right)2^{-2B_{1,1}}$   (7-19a)

but the subsequent passes need to be modelled differently,

$D_{k,1} = \epsilon_{k,1}^2\sigma_{k,1}^2 2^{-2B_{k,1}}, \quad k > 1$   (7-19b)

where $\sigma_{k,1}^2$ is the variance of the zero-mean dc residuals of the k-th pass.

Other than this one change to (19), equations (17) through (21) represent the distortion function for any image coded using an MBC/PT system. Specifically, for the three block partition scheme used above, the k-th blocksize distortion function is

$D_T^{(k)} = \sum_{i=1}^{3} D_{k,i}$   (7-23)

where the dc coefficients (i=1) are defined by (19a) and (19b), the coded ac coefficients (i=2) are modelled with an exponentially weighted distortion function,

$D_{k,2} = \langle\sigma_{k,2}^2\rangle 2^{-2B_{k,2}}$   (7-24)

and the uncoded ac coefficients (i=3) are unchanged by the coding process,

$D_{k,3} = \sum_{i\in\mathcal{J}_k}\sigma_i^2$

The total distortion is found by summing (23) over the entire image, where the k used for any portion of the image is based upon how many passes were used to code it. More concretely, let

$\mathcal{G}_k = \{m \mid d(B_{m,k}, \hat{B}_{m,k}) \le d_{min,k}\}$   (7-25)

be the index set of the image blocks that pass the k-th blocksize distortion threshold test. In (25), the $B_{m,k}$ are image blocks of the k-th coding pass and the $\hat{B}_{m,k}$ are their coded representations.² Then the total image MBC/PT distortion is similar to (16),

$D_T = \sum_k \sum_{m\in\mathcal{G}_k} D_T^{(k)}(m)$   (7-26)

where the (m) is used to indicate the distortion obtained for each block as it is coded using (23).

The percentages, $p_k$, of (16) that weight the expected block distortion terms have been replaced by the image partition index sets.

As was mentioned above, the notation used in (16) forces the $D_T^{(k)}$ to be formulated using either statistical expectations or non-parametric "averages" of the block distortion functions. In general, this may not be correct. The coding distortion of any block (or subset of blocks) taken from an image may be very different from the average taken over all of the blocks of the image. This fact destroys the stationarity assumption used to formulate (16). Equation (26) overcomes this problem by generalizing (16), allowing the distortion for each image block to be

²Equation (25) includes an abbreviated form of the distortion threshold function introduced in Chapter 5. For example, if the same maximum and minimum blocksizes used in Chapter 5 are assumed, then

$d_{min,1} = d_{min}(16\times16)$,

$d_{min,2} = d_{min}(8\times8)$, . . . , and

$d_{min,4} = d_{min}(2\times2)$.

In conclusion, several distortion functions have been described in this chapter; one set can be used to describe the distortion obtained for MBC systems and the other for MBC/PT systems. Both of which can be described using the generalized equation (26). Each of these distortion functions can be explicitly parameterized using any of the four forms of (17). The specific form used depends upon which parameterization (rate R, mixture fractions {pt}, block thresholds {tfc}, or equivalent image threshold t) is of the most use to the particular distortion study to be made. Any of these formulations are general enough to include the image blocks that are coded using either scalar quantization or vector quantization. With the appropriate modifications needed to include the uncoded transform coefficients of the different blocksizes these functions are direct extensions of the distortion functions developed in Chapter 5. The results of this chapter can be similarly extended to included the formulation of optimal coefficients rate assignments by using the methods demonstrated in Chapter 5. 168

Chapter 8.

Summary and Conclusions.

Chapters 1 through 4 are included to provide background material in the fields of scalar and vector quantization, transform coding, visual and CRT effects, and progressive transmission over low-bandwidth channels. These areas of review are useful in the development of the mixture block coding techniques of Chapter 5, and the theoretical material of Chapters 6 and 7.

The major results of this dissertation are presented in Chapters 5, 6 and 7. In Chapter 5, the details of the mixture block coding algorithm developed for this dissertation, and of its modification for progressive transmission, were presented. Examples using MBC and MBC/PT were given for the coding of monochrome, RGB and YIQ images.

The examples of Chapter 5 used scalar quantized MBC and MBC/PT systems to code images of these three different color types. These scalar systems were used to select the training sets from which LBG vector quantizers were built. The vector quantizers were tested in MBC and MBC/PT coders using the same images as were coded using the purely scalar quantized systems. The different quantization systems that coded these examples were designed with the particular requirements and properties of each color type used: RGB and YIQ. The common factors known to be true concerning the expected distributions of the different color planes, and results taken from preliminary computer runs, were used in this design process. The preliminary computer work was helpful in selecting the quantizer scaling factors. The same scaling factors are useful over a wide range of images.

The results of this chapter include quantitative data that demonstrate the relationship between the mixture fractions, average rate and distortion as a function of the distortion thresholds and quantizer system used. The examples complemented the theoretical development of the early part of the chapter, which explored the relationship between the mixture fractions and the intermediate and final coding rates of MBC and MBC/PT.

The results of Chapter 5 demonstrate the viability of MBC and MBC/PT as low- and medium-rate image source coders. The example coders performed well and demonstrated the fact that MBC and MBC/PT can be used with a variety of block coding methods. Block coders, including a DCT method and a BTC modified DCT method, were used in the examples. The modified BTC method deserves further testing as an alternate source coding method to the basic BTC method of Delp and Mitchell.

A theory of selecting optimal rate allocations for quantized transform coefficients was developed in Chapter 6. The standard bit allocation method (Huang and Schultheiss) commonly used in the literature was introduced as a starting point and modified to incorporate the realities of actual scalar quantizers. The need for including non-negative coefficient rate constraints and quantizer performance factors that vary with rate was discussed. How these factors change the standard rate and distortion solutions was developed, and examples were given to demonstrate their effects. A distortion-rate function was developed for a mixture method of scalar quantization. This distortion-rate function was shown to be obtainable through computer simulation.

The standard distortion-rate function is an upper bound for the performance of actual source coders. It was compared with another upper bound that included the effects of the coefficient rate non-negativity constraints. When the rate-dependence of the quantizer performance factors is included, an obtainable distortion-rate function results. Comparisons of these various systems were made using a four coefficient system whose coefficients were laplacian distributed. The laplacian pdf was chosen to match the coefficient pdfs found in the MBC and MBC/PT systems of Chapter 5. Except at high data rates, where all three upper bounds showed similar distortion functions, the non-negatively constrained rate-variable performance factor system showed less overall distortion than either of the more simple distortion-rate formulations. A lower distortion-rate bound, using the Shannon lower bound, was included to help bracket the expected range of possible source coder distortion-rate functions.

The distortion-rate functions developed can be used in the study of systems whose coefficients come from differing source pdfs, because the formulations of Chapter 6 included generalized performance factors that could be different for each coefficient quantized. The final sections of Chapter 6 included comments about the difficulty of finding solutions whose dual (Lagrange multiplier) is not invertible. This loss of invertibility prevents the mappings from the coefficient rate space into the dual space from being one-to-one.

In Chapter 7, the distortion-rate functions introduced in Chapter 6 were expanded to include the effects of vector quantized blocks of transform coefficients. How these functions can be used to develop distortion-rate functions for MBC and MBC/PT was included. These distortion-rate functions were formulated using the MBC and MBC/PT mixture fractions and distortion thresholds as parameters. This introduced the possibility of studying MBC and MBC/PT systems using distortion-mixture and distortion-threshold functions instead of the standard distortion-rate functions.

The mixture versus threshold characterization of images shown in Chapter 5 offers a new method for classifying different image types. Images that have similar mixture-threshold functions should offer similar distortion functions.

In conclusion, the variable blocksize methods presented in this dissertation are viable image source coders. They overcome the inherent problems associated with the coding of natural images. The non-stationary characteristics of images are automatically compensated for by adaptively selecting the correct blocksize to code the different regions. The correct blocksize is selected using a distortion threshold algorithm. Difficult-to-code regions are coded with small blocks and easy regions with large blocks. By using this method, no more channel capacity is used to code any region than is necessary.

Appendix 1.

A Simple Bit Allocation Example

A coding example for assigning the optimal coding rates for a set of two coefficients is presented using the method of Huang and Schultheiss [70]. It is shown that their method can produce negative rates, and that when the rates are forced to be non-negative the distortion will increase. Results are given to demonstrate the effects of forcing the coefficient rates to be greater than zero.

Let the coefficients have variances

$\sigma_1^2 = 1 \quad\text{and}\quad \sigma_2^2 = 16$   (Al-1)

and let their performance factors be equal, $\epsilon_1^2 = \epsilon_2^2$. The Huang and Schultheiss method assigns the optimal coefficient rates via (6-26) (the geometric mean of the variances is $\langle\sigma^2\rangle = 4$):

$B_1 = B + \tfrac{1}{2}\log_2\tfrac{\sigma_1^2}{\langle\sigma^2\rangle} = B - 1$   (Al-2)

$B_2 = B + \tfrac{1}{2}\log_2\tfrac{\sigma_2^2}{\langle\sigma^2\rangle} = B + 1$   (Al-3)

where B is the average coefficient rate.

Consider what happens to the rate for the first coefficient when the average rate is less than 1 bit/coefficient:

$B_1 < 0$ for $B < 1$   (Al-4)

This is undesirable because coefficients can be coded only with non-negative rates. If we ignore this fact, the distortion using the rates of (2) and (3) can be found from (6-16):

$D(B) = \sigma_1^2 2^{-2B_1} + \sigma_2^2 2^{-2B_2} = 2^{-2(B-1)} + 16\cdot 2^{-2(B+1)} = 8\cdot 2^{-2B}$

The coefficient rates for an average rate of, say, $B = \tfrac{1}{2}$ are $B_1 = -\tfrac{1}{2}$ and $B_2 = \tfrac{3}{2}$. The distortion at this average rate is

$D(\tfrac{1}{2}) = \sigma_1^2 2^{1} + \sigma_2^2 2^{-3} = 2 + 2 = 4$   (Al-5)

As can be seen, the distortion for this rate assignment relies on the unrealizable negative rate $B_1$.

If the coefficient rates are constrained to be non-negative, the optimal rates are reassigned using the non-negatively constrained solution of (6-43):

$B_1 = 0$   (Al-6)

$B_2 = 2B + 0 = 1$   (Al-7)

The non-negativity constrained distortion, $D'(\tfrac{1}{2})$, is

$D'(\tfrac{1}{2}) = \sigma_1^2 + \sigma_2^2 2^{-2} = 1 + 4 = 5$   (Al-8)

By forcing the rates to be greater than 0, the distortion level suffers: the distortion increases from 4 to 5. But when adding additional constraints, as we have done here, the set of admissible rates is smaller and a change in the distortion value results.
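A few lines of Python (not from the dissertation) confirm the arithmetic of this appendix:

```python
# A few lines (not from the dissertation) checking the arithmetic above.
v1, v2 = 1.0, 16.0

def D(B1, B2):
    return v1 * 2.0 ** (-2.0 * B1) + v2 * 2.0 ** (-2.0 * B2)

print(D(-0.5, 1.5))   # unconstrained rates: 2 + 2 = 4
print(D(0.0, 1.0))    # non-negative rates:  1 + 4 = 5
```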

Appendix 2.

Another Approach to the Bit Allocation Problem

Reconsider the solution for the optimal coefficient rates assigned for the distortion function of (6-16) and (6-17). That is, minimize the distortion

D = \sum_{i=1}^{n} \epsilon_i^2 \sigma_i^2 2^{-2B_i}    (A2-1)

subject to

\sum_{i=1}^{n} B_i = nB \qquad \text{and} \qquad B_i \ge 0 \;\; \forall i.    (A2-2a,b)

As was indicated in Chapter 6, the optimal rates are assigned through the solution of n partial differential equations

\frac{\partial D}{\partial B_i} = \sigma_i^2 2^{-2B_i}\,\frac{\partial \epsilon_i^2}{\partial B_i} - (2\ln 2)\,\epsilon_i^2 \sigma_i^2 2^{-2B_i} = -\lambda    (A2-3)

Rewrite the i-th equation as an equation of differentials

\{\sigma_i^2 2^{-2B_i}\}\,d\epsilon_i^2 + \{\lambda - (2\ln 2)\,\epsilon_i^2 \sigma_i^2 2^{-2B_i}\}\,dB_i = 0    (A2-4)

This equation can be represented more simply as

M(\epsilon_i^2, B_i)\,d\epsilon_i^2 + N(\epsilon_i^2, B_i)\,dB_i = 0    (A2-5)

Since \partial M/\partial B_i = \partial N/\partial \epsilon_i^2, equation (5) is an "exact" partial differential equation [79]. Its solution, F(\epsilon_i^2, B_i), is known to have the property [79]

\frac{\partial F}{\partial \epsilon_i^2}\,d\epsilon_i^2 + \frac{\partial F}{\partial B_i}\,dB_i = 0    (A2-6)

This means that F is a constant function of both the performance factor and the coefficient rate:

F(\epsilon_i^2, B_i) = c    (A2-7)

The value of the constant, c, can be obtained from a set of boundary conditions on F. More is said about this later.

F(\epsilon_i^2, B_i) can be found by several methods. The "standard method" involves solving for F by integrating one of the partial derivatives of (6), say \partial F/\partial \epsilon_i^2; this leads to

F = \epsilon_i^2 \sigma_i^2 2^{-2B_i} + g(B_i)    (A2-8)

The integration factor, g(B_i), can be found by using the other partial derivative,

\frac{\partial F}{\partial B_i} = -(2\ln 2)\,\epsilon_i^2 \sigma_i^2 2^{-2B_i} + \frac{\partial g(B_i)}{\partial B_i} = \lambda - (2\ln 2)\,\epsilon_i^2 \sigma_i^2 2^{-2B_i}    (A2-9)

so that

\frac{\partial g(B_i)}{\partial B_i} = \lambda    (A2-10)

and, integrating,

g(B_i) = \lambda B_i + c_0    (A2-11)

where c_0 is another constant of integration. By combining this with (7), F can be found:

F = \epsilon_i^2 \sigma_i^2 2^{-2B_i} + \lambda B_i + c_0 = c    (A2-12)

Combining the two constants, as k = c_0 - c, F is described by a family of curves

F(\epsilon_i^2, B_i) = \epsilon_i^2 \sigma_i^2 2^{-2B_i} + \lambda B_i + k = 0    (A2-13)

The constant k can be found from the boundary condition \epsilon_i^2 = 1 at B_i = 0 (see Figure 6.2):

F(1, 0) = \sigma_i^2 + k = 0 \;\;\Rightarrow\;\; k = -\sigma_i^2    (A2-14)

Therefore, the solution to the i-th PDE is the solution of an algebraic equation

F_i = \epsilon_i^2 \sigma_i^2 2^{-2B_i} + \lambda B_i - \sigma_i^2 = 0    (A2-15)

The Lagrange constant, \lambda, can be found using its associated constraint (2a) in the sum

\sum_{i=1}^{n} F_i = \sum_{i=1}^{n} \epsilon_i^2 \sigma_i^2 2^{-2B_i} + \lambda nB - \sum_{i=1}^{n} \sigma_i^2 = 0    (A2-16)

Substituting this into (15) reveals that the n coefficient rates are the solutions of a set of n transcendental equations

B_i = nB \, \frac{\sigma_i^2 - \epsilon_i^2 \sigma_i^2 2^{-2B_i}}{\sum_{j=1}^{n} \sigma_j^2 - \sum_{j=1}^{n} \epsilon_j^2 \sigma_j^2 2^{-2B_j}}    (A2-17)

The notation of this equation can be simplified by noticing that the denominator is determined from the distortion function (1) evaluated at two different rates,

D(0) = \sum_{i=1}^{n} D_i(0) = \sum_{i=1}^{n} \sigma_i^2 \qquad \text{and} \qquad D(B) = \sum_{i=1}^{n} D_i(B_i) = \sum_{i=1}^{n} \epsilon_i^2 \sigma_i^2 2^{-2B_i}    (A2-18)

Thus, the coefficient rates become

B_i = nB \, \frac{\sigma_i^2 - \epsilon_i^2 \sigma_i^2 2^{-2B_i}}{D(0) - D(B)}    (A2-19)

Consider what happens as B \to \infty. The solution of this appendix goes to infinity,

B_i \to \infty \;\; \text{as} \;\; B \to \infty    (A2-20)

but it does so differently than the solutions found in Chapter 6. In Chapter 6 the coefficient rates approach infinity with unit slope; here each coefficient rate approaches infinity with a different slope. Indeed, since D(B) \to 0 as B \to \infty, equation (19) gives B_i \to nB\,\sigma_i^2/D(0), so the i-th rate grows with slope n\sigma_i^2 / \sum_j \sigma_j^2. Why the two formulations differ in this way is still open for further consideration. The difference between the two solutions may be a result of the non-guaranteed uniqueness of the relationship between \lambda and B_i in (15).

References

[1] N. S. Jayant and P. Noll, Digital Coding of Waveforms, Englewood Cliffs, NJ: Prentice-Hall, 1984.
[2] P. Heckbert, "Color image quantization for frame buffer display," ACM Computer Graphics, vol. 16, pp. 297-307, July 1982.
[3] B. R. Hunt, "Nonstationary statistical image models (and their application to image data compression)," Comput. Graphics Image Processing, pp. 173-186, 1980.
[4] B. Ramamurthi and A. Gersho, "Nonlinear space-variant postprocessing of block coded images," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, pp. 1258-1268, Oct. 1986.
[5] J. L. Mannos and D. L. Sakrison, "The effects of a visual fidelity criterion on the encoding of images," IEEE Trans. Inform. Theory, vol. IT-20, pp. 525-536, July 1974.
[6] CCIR Recommendation 601, "Encoding parameters of digital television for studios," CCIR Recomm. and Reports, vol. XI, ITU, Geneva, Switzerland, 1982.
[7] W. K. Pratt, "Spatial transform coding of color images," IEEE Trans. Commun. Technol., vol. COM-19, pp. 980-992, Dec. 1971.
[8] W. N. Sproson, Colour Science in Television and Display Systems, Philadelphia: Heyden and Sons, 1983.
[9] R. K. Jurgen, "High-definition television update," IEEE Spectrum, vol. 25, no. 4, pp. 56-62, April 1988.
[10] E. E. Hilbert, "Cluster compression algorithm: A joint clustering/data compression concept," Jet Propulsion Laboratory, Pasadena, CA, Publ. 77-43.
[11] E. E. Hilbert, "Joint pattern recognition/data compression concepts for ERTS multispectral data," SPIE, vol. 66, Efficient Transmission of Pictorial Information, Aug. 1975.
[12] W. D. Hoffman and D. E. Troxel, "Making progressive transmission adaptive," IEEE Trans. Commun., vol. COM-34, pp. 806-813, Aug. 1986.
[13] F. Kretz, "Subjectively optimal quantization of pictures," IEEE Trans. Commun., vol. COM-23, pp. 1288-1292, Nov. 1975.
[14] A. K. Jain, "Image data compression: A review," Proc. IEEE, vol. 69, no. 3, pp. 349-389, March 1981.
[15] A. N. Netravali and J. O. Limb, "Picture coding: A review," Proc. IEEE, vol. 68, no. 3, pp. 366-406, March 1980.
[16] J. O. Limb, C. B. Rubinstein and J. E. Thompson, "Digital coding of color video—A review," IEEE Trans. Commun., vol. COM-25, pp. 1349-1385, Nov. 1977.
[17] W. K. Pratt, "Spatial transform coding of color images," IEEE Trans. Commun. Technol., vol. COM-19, pp. 980-992, Dec. 1971.
[18] W. K. Pratt, "Spatial transform coding of color images," IEEE Trans. Commun. Technol., vol. COM-19, pp. 980-992, Dec. 1971.
[19] J. D. Eggerton and M. D. Srinath, "A visually weighted quantization scheme for image bandwidth compression at low data rates," IEEE Trans. Commun., vol. COM-34, pp. 840-847, Aug. 1986.
[20] B. Ramamurthi and A. Gersho, "Nonlinear space-variant postprocessing of block coded images," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, pp. 1258-1268, Oct. 1986.
[21] B. Girod, H. Almer, L. Bengtsson, B. Christensson and P. Weiss, "A subjective evaluation of noise-shaping quantization for adaptive intra-/interframe DPCM coding of color television signals," IEEE Trans. Commun., vol. COM-36, no. 3, pp. 332-346, March 1988.
[22] J. J. Koenderink, W. A. van de Grind and M. A. Bouman, "Foveal information processing at photopic luminances," Kybernetik, vol. 8, no. 4, pp. 128-144, 1971.
[23] F. X. J. Lukas and Z. L. Budrikis, "Picture quality prediction based on a visual model," IEEE Trans. Commun., vol. COM-30, pp. 1679-1692, July 1982.
[24] W. K. Pratt, Digital Image Processing, New York, NY: Wiley, 1978.
[25] Y. Shoham and A. Gersho, "Efficient bit allocation for an arbitrary set of quantizers," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-36, pp. 1445-1453, Sep. 1988.
[26] K. K. Paliwal and V. Ramasubramanian, "Effect of ordering the codebook on the efficiency of the partial distance search algorithm for vector quantization," to appear.
[27] J. D. Gibson and K. Sayood, "Lattice quantization," to appear in Advances in Electronics and Electron Physics.
[28] A. Gersho, "Asymptotically optimal block quantization," IEEE Trans. Inform. Theory, vol. IT-25, pp. 373-380, July 1979.
[29] Y. Yamada, S. Tazaki and R. M. Gray, "Asymptotic performance of a block quantizer with difference distortion measure," IEEE Trans. Inform. Theory, vol. IT-26, pp. 6-14, Jan. 1980.
[30] J. A. Bucklew and G. L. Wise, "Multidimensional asymptotic quantization theory with rth power distortion measures," IEEE Trans. Inform. Theory, vol. IT-28, pp. 238-247, Mar. 1982.
[31] J. A. Bucklew and G. L. Wise, "Companding and random quantization in several dimensions," IEEE Trans. Inform. Theory, vol. IT-27, pp. 207-211.
[32] J. H. Conway and N. J. A. Sloane, "Voronoi regions of lattices, second moments of polytopes, and quantization," IEEE Trans. Inform. Theory, vol. IT-28, pp. 211-226, Mar. 1982.
[33] J. H. Conway and N. J. A. Sloane, "Fast quantizing and decoding algorithms for lattice quantizers and codes," IEEE Trans. Inform. Theory, vol. IT-28, pp. 227-232, Mar. 1982.
[34] J. H. Conway and N. J. A. Sloane, "A fast encoding method for lattice codes and quantizers," IEEE Trans. Inform. Theory, vol. IT-29, pp. 820-824, Nov. 1983.
[35] Y. Linde, A. Buzo and R. M. Gray, "An algorithm for vector quantizer design," IEEE Trans. Commun., vol. COM-28, pp. 84-95, Jan. 1980.
[36] W. H. Equitz, "Fast algorithms for vector quantization picture coding," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, Apr. 1987, pp. 725-728.
[37] T. R. Fischer, "A pyramid vector quantizer," IEEE Trans. Inform. Theory, vol. IT-32, pp. 568-583, July 1986.
[38] S. Yuan and K.-B. Yu, "Zonal sampling and bit allocation of HT coefficients in image data compression," IEEE Trans. Commun., vol. COM-34, pp. 1246-1251, Dec. 1986.
[39] N. Ahmed, T. Natarajan and K. R. Rao, "Discrete cosine transform," IEEE Trans. Comput., vol. C-23, pp. 90-93, Jan. 1974.
[40] W. H. Chen and C. H. Smith, "Adaptive coding of monochrome and color images," IEEE Trans. Commun., vol. COM-25, pp. 1285-1292, Nov. 1977.
[41] C. E. Shannon, "A mathematical theory of communication," Bell Syst. Tech. J., vol. 27, pp. 379-423, July 1948.
[42] D. A. Huffman, "A method for the construction of minimum-redundancy codes," Proc. IRE, vol. 40, pp. 1098-1101, Sept. 1952.
[43] N. Ahmed and K. R. Rao, Orthogonal Transforms for Digital Signal Processing, New York: Springer-Verlag, 1975.
[44] H.-M. Hang and B. G. Haskell, "Interpolative vector quantization of color images," IEEE Trans. Commun., vol. COM-36, pp. 465-470, April 1988.
[45] D. H. Halverson, N. C. Griswold and G. L. Wise, "A generalized block truncation coding algorithm for image compression," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-32, June 1984.
[46] L. Wang and M. Goldberg, "Progressive image transmission by transform coefficient residual error quantization," IEEE Trans. Commun., vol. COM-36, no. 1, pp. 75-87, Jan. 1988.
[47] S. L. Tanimoto, "Image transmission with gross information first," Comput. Graphics Image Processing, vol. 9, pp. 72-76, Jan. 1979.
[48] K. R. Sloan, Jr. and S. L. Tanimoto, "Progressive refinement of raster images," IEEE Trans. Comput., vol. C-28, pp. 871-874, Nov. 1979.
[49] K. Knowlton, "Progressive transmission of grey-scale and binary pictures by simple, efficient, and lossless encoding schemes," Proc. IEEE, vol. 68, pp. 885-896, July 1980.
[50] P. J. Burt and E. H. Adelson, "The Laplacian pyramid as a compact image code," IEEE Trans. Commun., vol. COM-31, pp. 532-540, Apr. 1983.
[51] T. Kronander, "Sampling of bandpass pyramids," IEEE Trans. Commun., vol. COM-36, pp. 125-127, Jan. 1988.
[52] E. H. Adelson and P. J. Burt, "Image data compression with the Laplacian pyramid," Proc. IEEE Conf. Pattern Recognition Image Processing, pp. 218-223, 1981.
[53] K. N. Ngan, "Image display techniques using the cosine transform," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-32, pp. 173-177, Feb. 1984.
[54] N. S. Jayant and P. Noll, Digital Coding of Waveforms, Englewood Cliffs, NJ: Prentice-Hall, 1984.
[55] W. K. Pratt, Image Transmission Techniques, New York: Academic, 1979.
[56] W. K. Pratt, Digital Image Processing, New York: Academic, 1978.
[57] E. Dubois and J. L. Moncet, "Encoding and progressive transmission of still pictures in NTSC composite format using transform domain methods," IEEE Trans. Commun., vol. COM-34, pp. 310-319, Mar. 1986.
[58] W. Chen and W. K. Pratt, "Scene adaptive coder," IEEE Trans. Commun., vol. COM-32, pp. 225-232, Mar. 1984.
[59] A. Segall, "Bit allocation and encoding for vector sources," IEEE Trans. Inform. Theory, vol. IT-22, pp. 162-169, Mar. 1976.
[60] H. M. Dreizen, "Content-driven progressive transmission of grey-scale images," IEEE Trans. Commun., vol. COM-35, pp. 289-296, Mar. 1987.
[61] K. R. Sloan, Jr. and S. L. Tanimoto, "Progressive refinement of raster scan images," IEEE Trans. Comput., vol. C-28, pp. 871-874, Nov. 1979.
[62] A. J. Frank, J. D. Daniels and D. R. Unangst, "Progressive image transmission using a growth geometry coding," Proc. IEEE, vol. 68, pp. 897-909, July 1980.
[63] K. Knowlton, "Progressive transmission of grey-scale and binary images by simple, efficient, and lossless encoding schemes," Proc. IEEE, vol. 68, pp. 885-896, July 1980.
[64] E. J. Delp and O. J. Mitchell, "Image compression using block truncation coding," IEEE Trans. Commun., vol. COM-27, pp. 1335-1342, Sept. 1979.
[65] W. D. Hoffman and D. E. Troxel, "Making progressive transmission adaptive," IEEE Trans. Commun., vol. COM-34, pp. 806-813, Aug. 1986.
[66] D. J. Vaisey and A. Gersho, "Variable block-size image coding," Proc. ICASSP, pp. 1051-1054, Apr. 1987.
[67] A. J. Frank, J. D. Daniels and D. R. Unangst, "Progressive image transmission using a growth-geometry coding," Proc. IEEE, vol. 68, pp. 897-909, July 1980.
[68] H. Samet, "The quadtree and related hierarchical data structures," ACM Computing Surveys, vol. 16, no. 2, June 1984.
[69] M. Kunt et al., "Second generation image coding techniques," Proc. IEEE, vol. 73, pp. 549-574, Apr. 1985.
[70] J. J. Y. Huang and P. M. Schultheiss, "Block quantization of correlated Gaussian random variables," IEEE Trans. Commun. Syst., vol. CS-11, pp. 289-296, Sept. 1963.
[71] J. Max, "Quantizing for minimum distortion," IRE Trans. Inform. Theory, vol. IT-6, pp. 7-12, March 1960.
[72] L. Wang and M. Goldberg, "Progressive image transmission by transform coefficient residual error quantization," IEEE Trans. Commun., vol. COM-36, no. 1, pp. 75-87, Jan. 1988.
[73] W. C. Adams and C. E. Giesler, "Quantizing characteristics for signals having Laplacian amplitude probability density function," IEEE Trans. Commun., vol. COM-26, no. 8, Aug. 1978.
[74] T. Berger, Rate Distortion Theory, Englewood Cliffs, NJ: Prentice-Hall, 1971.
[75] M. D. Paez and T. H. Glisson, "Minimum mean squared error quantization in speech PCM and DPCM systems," IEEE Trans. Commun., vol. COM-20, pp. 225-230, April 1972.
[76] D. G. Luenberger, Linear and Nonlinear Programming, Reading, MA: Addison-Wesley, 1984.
[77] R. C. Reininger and J. D. Gibson, "Distributions of the two-dimensional DCT coefficients for images," IEEE Trans. Commun., vol. COM-31, no. 6, pp. 835-839, June 1983.
[78] D. G. Luenberger, Optimization by Vector Space Methods, New York, NY: Wiley, 1969.
[79] S. L. Ross, Introduction to Ordinary Differential Equations, Lexington, MA: Xerox College Publishing, 1974.
[80] E. N. Gilbert, "Synchronization of binary messages," IRE Trans. Inform. Theory, pp. 470-477, Sept. 1960.
[81] P. Noll and R. Zelinski, "Bounds on quantizer performance in the low bit-rate region," IEEE Trans. Commun., vol. COM-26, pp. 300-304, Feb. 1978.