Video Compression and Communications: from Basics to H.261, H.263, H.264, MPEG2, MPEG4 for DVB and HSDPA-Style Adaptive Turbo-Transceivers
Total Page:16
File Type:pdf, Size:1020Kb
Video Compression and Communications: From Basics to H.261, H.263, H.264, MPEG2, MPEG4 for DVB and HSDPA-Style Adaptive Turbo-Transceivers by c L. Hanzo, P.J. Cherriman, J. Streit Department of Electronics and Computer Science, University of Southampton, UK About the Authors Lajos Hanzo (http://www-mobile.ecs.soton.ac.uk) FREng, FIEEE, FIET, DSc received his degree in electronics in 1976 and his doctorate in 1983. During his 30-year career in telecommunications he has held various research and academic posts in Hungary, Germany and the UK. Since 1986 he has been with the School of Electronics and Computer Science, University of Southampton, UK, where he holds the chair in telecom- munications. He has co-authored 15 books on mobile radio communica- tions totalling in excess of 10 000, published about 700 research papers, acted as TPC Chair of IEEE conferences, presented keynote lectures and been awarded a number of distinctions. Currently he is directing an academic research team, working on a range of research projects in the field of wireless multimedia communications sponsored by industry, the Engineering and Physical Sciences Research Council (EPSRC) UK, the European IST Programme and the Mobile Virtual Centre of Excellence (VCE), UK. He is an enthusiastic supporter of industrial and academic liaison and he offers a range of industrial courses. He is also an IEEE Distinguished Lecturer of both the Communications Society and the Vehicular Technology Society (VTS). Since 2005 he has been a Governer of the VTS. For further information on research in progress and associated publications please refer to http://www-mobile.ecs.soton.ac.uk Peter J. Cherriman graduated in 1994 with an M.Eng. degree in In- formation Engineering from the University of Southampton, UK. Since 1994 he has been with the Department of Electronics and Computer Sci- ence at the University of Southampton, UK, working towards a Ph.D. in mobile video networking which was completed in 1999. Currently he is working on projects for the Mobile Virtual Centre of Excellence, UK. His current areas of research include robust video coding, microcellular radio systems, power control, dynamic channel allocation and multiple access protocols. He has published about two dozen conference and journal papers, and holds several patents. Jurgen¨ Streit received his Dipl.-Ing. Degree in electronic engineering from the Aachen University of Technology in 1993 and his Ph.D. in image coding from the Department of Electronics and Computer Sci- ence, University of Southampton, UK, in 1995. From 1992 to 1996 Dr. Streit had been with the Department of Electronics and Computer Sci- ence working in the Communications Research Group. His work led to numerous publications. Since then he has joined a management consul- tancy working as an information technology consultant. i Other Wiley and IEEE Press Books on Related Topics 1 • R. Steele, L. Hanzo (Ed): Mobile Radio Communications: Second and Third Gener- ation Cellular and WATM Systems, John Wiley and IEEE Press, 2nd edition, 1999, ISBN 07 273-1406-8, 1064 pages • L. Hanzo, F.C.A. Somerville, J.P. Woodard: Voice Compression and Communications: Principles and Applications for Fixed and Wireless Channels; IEEE Press and John Wiley, 2001, 642 pages • L. Hanzo, P. Cherriman, J. Streit: Wireless Video Communications: Second to Third Generation and Beyond, IEEE Press and John Wiley, 2001, 1093 pages • L. Hanzo, T.H. Liew, B.L. Yeap: Turbo Coding, Turbo Equalisation and Space-Time Coding, John Wiley and IEEE Press, 2002, 751 pages • J.S. Blogh, L. Hanzo: Third-Generation Systems and Intelligent Wireless Networking: Smart Antennas and Adaptive Modulation, John Wiley and IEEE Press, 2002, 408 pages • L. Hanzo, C.H. Wong, M.S. Yee: Adaptive Wireless Transceivers: Turbo-Coded, Turbo- Equalised and Space-Time Coded TDMA, CDMA and OFDM Systems, John Wiley and IEEE Press, 2002, 737 pages • L. Hanzo, L-L. Yang, E-L. Kuan, K. Yen: Single- and Multi-Carrier CDMA: Multi- User Detection, Space-Time Spreading, Synchronisation, Networking and Standards, John Wiley and IEEE Press, June 2003, 1060 pages • L. Hanzo, M. Munster,¨ T. Keller, B-J. Choi, OFDM and MC-CDMA for Broadband Multi-User Communications, WLANs and Broadcasting, John-Wiley and IEEE Press, 2003, 978 pages • L. Hanzo, S-X. Ng, T. Keller and W.T. Webb, Quadrature Amplitude Modulation: From Basics to Adaptive Trellis-Coded, Turbo-Equalised and Space-Time Coded OFDM, CDMA and MC-CDMA Systems, John Wiley and IEEE Press, 2004, 1105 pages 1For detailed contents and sample chapters please refer to http://www-mobile.ecs.soton.ac.uk iii iv • L. Hanzo, T. Keller: An OFDM and MC-CDMA Primer, John Wiley and IEEE Press, 2006, 430 pages • L. Hanzo, F.C.A. Somerville, J.P. Woodard: Voice and Audio Compression for Wireless Communications, John Wiley and IEEE Press, 2007, 858 pages • L. Hanzo, P.J. Cherriman, J. Streit: Video Compression and Communications: H.261, H.263, H.264, MPEG4 and HSDPA-Style Adaptive Turbo-Transceivers John Wiley and IEEE Press, 2007, 680 pages • L. Hanzo, J. Blogh and S. Ni: HSDPA-Style FDD Versus TDD Networking: Smart Antennas and Adaptive Modulation John Wiley and IEEE Press, 2007, 650 pages Contents About the Authors i Other Wiley and IEEE Press Books on Related Topics i Preface v Acknowledgments vii I Video Codecs for HSDPA-Style Adaptive Videophones 1 2 Fractal Image Codecs 23 2.1 FractalPrinciples ............................... 23 2.2 One-Dimensional Fractal Coding . 26 2.2.1 FractalCodecDesign. 29 2.2.2 Fractal Codec Performance . 31 2.3 ErrorSensitivityandComplexity . ..... 35 2.4 SummaryandConclusions . 36 3 Low Bit-Rate DCT Codecs and HSDPA-Style Videophones 39 3.1 VideoCodecOutline .............................. 39 3.2 The Principle of Motion Compensation . 41 3.2.1 DistanceMeasures .. ..... ..... ..... ..... ..... 44 3.2.2 MotionSearchAlgorithms . 46 3.2.2.1 Full or Exhaustive Motion Search . 46 3.2.2.2 Gradient-Based Motion Estimation . 47 3.2.2.3 Hierarchical or Tree Search . 48 3.2.2.4 Subsampling Search . 49 3.2.2.5 Post-Processing of Motion Vectors . 50 3.2.2.6 Gain-Cost-Controlled Motion Compensation . 51 3.2.3 Other Motion Estimation Techniques . 52 3.2.3.1 Pel-Recursive Displacement Estimation . 52 ix x CONTENTS 3.2.3.2 Grid Interpolation Techniques . 53 3.2.3.3 MC Using Higher Order Transformations . 54 3.2.3.4 MCintheTransformDomain . 54 3.2.4 Conclusion ............................... 55 3.3 TransformCoding................................ 55 3.3.1 One-Dimensional Transform Coding . 55 3.3.2 Two-Dimensional Transform Coding . 56 3.3.3 Quantizer Training for Single-Class DCT . 60 3.3.4 Quantizer Training for Multiclass DCT . 61 3.4 TheCodecOutline ............................... 64 3.5 InitialIntra-FrameCoding . 64 3.6 Gain-Controlled Motion Compensation . ..... 64 3.7 The MCER Active/Passive Concept . 66 3.8 Partial Forced Update of the Reconstructed Frame Buffers .......... 67 3.9 The Gain/Cost-Controlled Inter-Frame Codec . ........ 69 3.9.1 Complexity Considerations and Reduction Techniques ........ 70 3.10 TheBit-AllocationStrategy. ..... 71 3.11Results...................................... 73 3.12 DCT Codec Performance under Erroneous Conditions . ......... 74 3.12.1 BitSensitivity.............................. 75 3.12.2 BitSensitivityofCodecIandII . 78 3.13 DCT-Based Low-Rate Video Transceivers . ...... 79 3.13.1 ChoiceofModem ........................... 79 3.13.2 Source-Matched Transceiver . 79 3.13.2.1 System1........................... 79 3.13.2.1.1 System Concept . 79 3.13.2.1.2 Sensitivity-Matched Modulation . 80 3.13.2.1.3 Source Sensitivity . 80 3.13.2.1.4 Forward Error Correction . 81 3.13.2.1.5 Transmission Format . 82 3.13.2.2 System2........................... 84 3.13.2.2.1 Automatic Repeat Request . 84 3.13.2.3 Systems3–5 ......................... 85 3.14 SystemPerformance .............................. 86 3.14.1 PerformanceofSystem1. 86 3.14.2 PerformanceofSystem2. 89 3.14.2.1 FERPerformance . 89 3.14.2.2 Slot Occupancy Performance . 90 3.14.2.3 PSNR Performance . 92 3.14.3 Performance of Systems 3–5 . 93 3.15 SummaryandConclusions . 97 CONTENTS xi 4 Low Bit-Rate VQ Codecs and HSDPA-Style Videophones 99 4.1 Introduction................................... 99 4.2 TheCodebookDesign ............................. 99 4.3 TheVectorQuantizerDesign . 101 4.3.1 Mean and Shape Gain Vector Quantization . 106 4.3.2 Adaptive Vector Quantization . 107 4.3.3 Classified Vector Quantization . 109 4.3.4 AlgorithmicComplexity . 110 4.4 Performance under Erroneous Conditions . 112 4.4.1 Bit-AllocationStrategy. 112 4.4.2 BitSensitivity..............................113 4.5 VQ-Based Low-Rate Video Transceivers . 115 4.5.1 ChoiceofModulation . 115 4.5.2 ForwardErrorCorrection . 116 4.5.3 ArchitectureofSystem1 . 118 4.5.4 ArchitectureofSystem2 . 119 4.5.5 ArchitectureofSystems3–6 . 120 4.6 SystemPerformance ..............................120 4.6.1 SimulationEnvironment . 120 4.6.2 PerformanceofSystems1and3 . 121 4.6.3 PerformanceofSystems4and5 . 123 4.6.4 PerformanceofSystems2and6 . 125 4.7 Joint Iterative Decoding of Trellis-Based VQ-Video and TCM . 126 4.7.1 Introduction...............................126 4.7.2 SystemOverview. .127 4.7.3 Compression ..............................127 4.7.4 Vector quantization decomposition . 128 4.7.5 Serial concatenation and iterative decoding . .......128 4.7.6 TransmissionFrameStructure . 130 4.7.7 Frame difference decomposition . 130 4.7.8 VQcodebook..............................132 4.7.9 VQ-induced code constraints . 133 4.7.10 VQtrellisstructure . 134 4.7.11 VQEncoding..............................137 4.7.12 VQDecoding..............................138 4.7.13 Results .................................140 4.8 SummaryandConclusions . 143 5 Low Bit-Rate Quad-Tree-Based Codecs and HSDPA-Style Videophones 147 5.1 Introduction...................................147 5.2 Quad-TreeDecomposition . 147 5.3 Quad-TreeIntensityMatch . 150 5.3.1 Zero-OrderIntensityMatch . 150 5.3.2 First-OrderIntensityMatch . 152 5.3.3 Decomposition Algorithmic Issues . 153 5.4 Model-Based Parametric Enhancement . 156 xii CONTENTS 5.4.1 EyeandMouthDetection . 157 5.4.2 Parametric Codebook Training .