Cambridge University Press 978-0-521-11432-5 — Compression for Multimedia Irina Bocharova Index More Information

Total Page:16

File Type:pdf, Size:1020Kb

Cambridge University Press 978-0-521-11432-5 — Compression for Multimedia Irina Bocharova Index More Information Cambridge University Press 978-0-521-11432-5 — Compression for Multimedia Irina Bocharova Index More Information Index absolute threshold of hearing, 224 codebook, 26, 27 accuracy adaptive, 154 1/4th-pixel, 211 adaptive gain, 155 1/8th-pixel, 211 stochastic, 156 half-pixel, 211 stochastic gain, 156 algorithm codec Blahut, 49, 53 ACELP, 158 for lattice quantization, 34 code excited linear predictive (CELP), 79 Levinson–Durbin, 83, 85, 87, 154, 162 FS-1015 LPC-10E, 142 Linde–Buzo–Gray (LBG), 25, 26 FS-1016 CELP, 142, 153 Lloyd–Max, 22 G.722 Wideband, 141,142 LOCO, 186 G.723.1 MPC-MLQ, 142, 153 Viterbi, 36, 37, 38 G.728 LD-CELP, 141, 142, 153, 161 aliasing, 11, 13 G.729 CS-ACELP, 142, 153, 160 analysis GSM 06.10, 161 closed-loop, 154 IS-54 VSELP, 142, 153 open-loop, 153 IS-96 QCELP, 142, 153 approximating values, 15 mixed excitation linear prediction (MELP), 144, audio, 1, 63, 223 162 autoregressive model, 81 PDC PSI-CELP, 142 PDC VSELP, 142 basis RPE-LTP(GSM), 142, 153 for the n-dimensional lattice, 28 VSELP, 161 functions, 10 coding vectors, 93 adaptive, 258 binarization, 220, 257 adaptive delta modulation, 148 binary alpha blocks (BAB), 206 bit-rate, 141 adaptive differential pulse code modulation (ADPCM), 141, 142, 147, 150, centroid, 19, 25, 30 analysis-by-synthesis, 152 circular displacement, 102 arithmetic, 175, 243, 246 code audio, 223 convolutional, 36, 58 backward adaptive, 153 Elias, 256 bit-plane, 136, 189 Gallager-vanVoorhis, 255 CABAC, 217, 220 Golomb, 218, 254 CAVLC, 217 Golomb–Rice, 188 context, 175, 190, 249 Levenstein, 256 delta-modulation, 78, 147 monotonic, 254 differential, 183 prefix-free, 239 differential pulse code modulation (DPCM), 149 terminated convolutional, 37 embedded block with optimized truncation trellis, 38 (EBCOT), 189 unary, 254 embedded zerotree wavelet (EZW), 135, 136 zero-tail terminated convolutional, 37 entropy, 16, 217, 238 © in this web service Cambridge University Press www.cambridge.org Cambridge University Press 978-0-521-11432-5 — Compression for Multimedia Irina Bocharova Index More Information 266 Index coding (cont.) effective bandwidth, 14 forward adaptive, 153 encoder Huffman, 172, 183, 239 trellis, 38 linear predictive, 78, 147 entropy lossless, 238 conditional differential, 43 lossless block, 241 differential, 43 object-based, 206 relative, 50 perceptual, 223 equation predictive, 66 dilation, 125, 127 pulse code modulation (PCM), 63, 141 wavelet, 125, 127 QM, 175, 190, 220, 251 error run-length, 172, 183 mean squared (MSE), 18 scalable, 202 squared, 18 set partitioning in hierarchical trees (SPIHT), escape symbol, 218, 259 135, 136 Euler’s formula, 7 Shannon–Fano–Elias, 240 speech, 141 fidelity criterion, 16, 44 sprite, 207 filter subband, 227 autoregressive, 81 transform, 66, 91, 208, 227 deblocking, 212 video, 197 discrete-time, 67 coefficient excitation, 152 AC, 182 finite (im)pulse response (FIR), 68, 111 DC, 182 highpass, 114 companding, 145 infinite (im)pulse response (IIR), 69, 114 A-law, 145 linear phase, 112 µ-law, 145 lowpass prototype, 229 component lowpass, 112 chrominance, 178 moving average, 112 luminance, 178 moving difference, 114 compression nonrecursive, 68 image, 132 prediction, 87 lossless, 3, 238 recursive, 69 lossy, 3, time-invariant discrete-time, 67 ratio, 172, 177 conditional exchange, 252 wavelet, 110, 132 constraint length, 36 filter bank, 115 convolution, 111 analysis, 117 circular, 100, 111 cosine modulated, 229 discrete-time, 68 Haar, 115, 118 coset, 32, 34 hybrid, 232 critical bands, 224 perfect reconstruction, 127 cyclic extension, 120 polyphase, 228 synthesis, 117 decimation, 116 wavelet, 127 -in-frequency, 103 formant frequencies, 152 -in-time, 103 format discrete-time pulse response, 68 bitmap (BMP), 178 discrete-time system, 67 CIF, 64, 201 distortion HDTV, 64, 197 average, 42 NTSC, 64 just noticeable, 226 PAL, 64 measure, 15 QCIF, 64, 201 perceptual, 153 RGB, 178, 197 dot, 63 VGA, 63 downsampling, 14, 116 XVGA, 63 © in this web service Cambridge University Press www.cambridge.org Cambridge University Press 978-0-521-11432-5 — Compression for Multimedia Irina Bocharova Index More Information Index 267 YCbCr, 178 noise-masking-tone, 225 YUV, 178, 197 tone-masking-noise, 225 Fourier series expansion, 6 matrix frames covariance, 47 B-type, 211 Toeplitz’s 47, 83 I-type, 201 maximal asymptotical shaping gain, 57 inter-, 201 method intra-, 201 autocorrelation, 81, 86 P-type, 211 covariance, 81, 83 function motion compensation, 197 amplitude, 77 MIPS, 142 Dirac’s delta, 12, 68 model-templates block, 175 distortion-rate, 44 motion compensation 197 frequency, 75 block matching, 197 information rate-distortion, 42 logarithmic search, 199 Kronecker’s delta, 68, 159 motion vectors, 197, 211 match, 156 multilevel decomposition, 127 phase, 77 multimedia signals, 1, 63 rate-distortion, 15, 24, 42 multiresolution analysis, 123 sinc-, 8 mutual information, 42, 50 spreading, 226 transfer, 75 normalized frequency, 76 normalized second moment, 57 generator matrix, 28, 32 Nyquist sampling rate, 8, 11, 147 granular noise, 148 orthogonal decomposition, 10 high-resolution theory, 58 pass image, 1, 63, 171 cleanup, 190 coding, 132, 171 magnitude refinement, 190 fax, 172 significance propagation, 190 fields, 203 peak signal-to-noise ratio (PSNR), 177 grayscale, 63, 178 peakiness measure, 163 halftone, 177 pitch period, 153, 154 International Telecommunication Union (ITU), 141 pixel, 63 interval inversion, 252 power spectral density, 47 interval of quasi-stationarity, 79 pre-echo, 232 precinct, 289 Kullback Leibler distance, 50 prediction B-type, 211 lattice, 28 backward, 201, 203 n-dimensional, 30 backward adaptive, 151, 161 based on linear codes, 31 bi-, 211 cubic, 28 bidirectional, 201, 203, 211 hexagonal, 29, 31, 57 bottom-layer typical, 176 integer, 57 deterministic, 176 scaled integer, 31 differential-layer typical, 176 square, 28 filter, 87 levels, 204 forward, 201, 203 linear prediction coefficients (LPCs), 154 intra in spatial domain, 208 linear spectral parameters (LSPs), 154, 165 long-term, 154 P-type, 211 macroblock, 199, 203, 206 short-term, 153 mask-to-noise ratio, 234 weighted, 212 masking, 225 zero-value, 203 global threshold, 234 prediction coefficients, 78, 81 © in this web service Cambridge University Press www.cambridge.org Cambridge University Press 978-0-521-11432-5 — Compression for Multimedia Irina Bocharova Index More Information 268 Index prefiltering, 14 sequence process causal, 71 Gaussian, 47 detail, 126 Gaussian Markov, 49 reference, 126 profiles, 204 Shannon lower bound, 54 psychoacoustic model, 223, 226 shaping gain, 30 PAM-I, 232 signal PAM-II, 233 noncausal, 70 time-limited, 14 quantization, 6, 15 signal-to-mask ratio (SMR), 227 gain, 44 signal-to-noise ratio (SNR), 143, 177, 224 multi-pulse maximum-likelihood, 158 signals noise, 15, 46 analog, 5 rate, 16, 24, 42, 46, 60 digital, 5 step, 17 significance map, 135 quantizer significance state, 190 entropy-constrained scalar (ECSQ), sound attack, 232 20 sound pressure level, 224 fixed-rate, 16 source lattice, 30, 188 Gaussian memoryless, 45 lattice vector, 28, 29, 32, 35 generalized Gaussian, 59 n-dimensional lattice, 30 geometric, 187 nonuniform optimal scalar, 18 Laplacian, 46, 59 nonuniform scalar, 60 Markov, 234 optimal nonuniform scalar, 19 memoryless, 43, 44, 54 optimal uniform, 21 uniform, 46, 59 optimal with extended zero zone, 22 with memory, 66 scalar, 15 spatial orientation trees, 137 suboptimal with extended zero spectral band replication (SBR), 235 zone, 22 spectrum, 6 uniform fixed-rate scalar, 17 speech, 1, 63, 79, 141 uniform scalar, 17 coding, 141 variable-rate, 16 frame, 153 vector, 23, 60 production model, 143 with extended zero zone, 21 standard G.711, 141, 142 redundancy, 257 G.722, 141, 142 spatial, 183 G.726, 141, 142, 150 temporal, 183, 212 G.727, 142, 147 relative prediction error, 92 G.728, 141, 142, 153, 161 resolution, 68 G.729, 142, 153, 160 resolution reduction, 186 G3, 172 G4, 172 sampling, 6 H.261, 91, 200 dyadic, 122 H.263, 91, 200 frequency, 6 H.264, 200, 208 function, 10 HE-AAC, 227 period, 6 ISO-MPEG-4 Part 2, 200 rate, 11 ISO-MPEG-4 Part 10, 200 theorem, 6, 12, 14 JBIG-1, 173 scalability, 191, 204 JPEG, 91, 177 SNR, 191, 204 JPEG-2000, 138, 188 spatial, 191 JPEG-LS, 186 scaling function, 124 MPEG Audio, 227 scaling subspace, 124 MPEG-1, 200, 202 second moment, 56 MPEG-2, 200, 202 © in this web service Cambridge University Press www.cambridge.org Cambridge University Press 978-0-521-11432-5 — Compression for Multimedia Irina Bocharova Index More Information Index 269 MPEG-4, 135, 200, 202, 205 Karhunen–Loeve, 95, 105 MPEG-AAC, 227 Laplace, 6 MPEG-1-audio, 107 linear, 93 MPEG-1 layer III, 228 matrix, 91 stereo modified DCT (MDCT), 107, 227, 230 intensity, 232 one-sided z-, 70 middle-side (MS), 232 orthogonal, 117 parametric, 235 orthonormal, 92–94, 105 subband, 116 overlapped, 107, 227 symbol separable, 92, 105 less probable (LPS), 175, 220, 251 window, 229 more probable (MPS), 175, 220, 251 system upsampling, 117 causal, 67 stable, 67 video, 1, 63 interlaced, 202 theorem progressively scanned, 203 Cauchy’s integral, 72 sequentially scanned, 203 of residues, 74 video object plane (VOP), 206 Parseval’s, 93 Voronoi rate-distortion, 45 cell, 29 sampling, 6, 14 partition, 26 tiles, 189 region, 25, 30 transform 2D-DFT, 99 wavelet z-, 70 coefficients, 133 2D DCT, 106, 179 discrete, 122 coding, 66, 91, 208, 227 equation, 125, 127 coefficients, 93 father, 124 continuous wavelet (CWT), 121 filter bank, 127 DCT-II, 107, 214 filtering, 110, 132 DCT-IV, 107, 230 mother, 124 discrete cosine (DCT), 91, 104, 179 series decomposition, 123 discrete Fourier (DFT), 98 subbands, 133 discrete wavelet (DWT), 121, 123, 188 subspace, 124 fast Fourier (FFT), 103, 230 Fourier,6,14 Yule–Walker equations, 80, 83, 86 Hadamard, 216 IDFT, 102 zerotree, 135, 136 inverse z-, 72 zigzag ordering, 184, 217 © in this web service Cambridge University Press www.cambridge.org.
Recommended publications
  • Kulkarni Uta 2502M 11649.Pdf
    IMPLEMENTATION OF A FAST INTER-PREDICTION MODE DECISION IN H.264/AVC VIDEO ENCODER by AMRUTA KIRAN KULKARNI Presented to the Faculty of the Graduate School of The University of Texas at Arlington in Partial Fulfillment of the Requirements for the Degree of MASTER OF SCIENCE IN ELECTRICAL ENGINEERING THE UNIVERSITY OF TEXAS AT ARLINGTON May 2012 ACKNOWLEDGEMENTS First and foremost, I would like to take this opportunity to offer my gratitude to my supervisor, Dr. K.R. Rao, who invested his precious time in me and has been a constant support throughout my thesis with his patience and profound knowledge. His motivation and enthusiasm helped me in all the time of research and writing of this thesis. His advising and mentoring have helped me complete my thesis. Besides my advisor, I would like to thank the rest of my thesis committee. I am also very grateful to Dr. Dongil Han for his continuous technical advice and financial support. I would like to acknowledge my research group partner, Santosh Kumar Muniyappa, for all the valuable discussions that we had together. It helped me in building confidence and motivated towards completing the thesis. Also, I thank all other lab mates and friends who helped me get through two years of graduate school. Finally, my sincere gratitude and love go to my family. They have been my role model and have always showed me right way. Last but not the least; I would like to thank my husband Amey Mahajan for his emotional and moral support. April 20, 2012 ii ABSTRACT IMPLEMENTATION OF A FAST INTER-PREDICTION MODE DECISION IN H.264/AVC VIDEO ENCODER Amruta Kulkarni, M.S The University of Texas at Arlington, 2011 Supervising Professor: K.R.
    [Show full text]
  • 1. in the New Document Dialog Box, on the General Tab, for Type, Choose Flash Document, Then Click______
    1. In the New Document dialog box, on the General tab, for type, choose Flash Document, then click____________. 1. Tab 2. Ok 3. Delete 4. Save 2. Specify the export ________ for classes in the movie. 1. Frame 2. Class 3. Loading 4. Main 3. To help manage the files in a large application, flash MX professional 2004 supports the concept of _________. 1. Files 2. Projects 3. Flash 4. Player 4. In the AppName directory, create a subdirectory named_______. 1. Source 2. Flash documents 3. Source code 4. Classes 5. In Flash, most applications are _________ and include __________ user interfaces. 1. Visual, Graphical 2. Visual, Flash 3. Graphical, Flash 4. Visual, AppName 6. Test locally by opening ________ in your web browser. 1. AppName.fla 2. AppName.html 3. AppName.swf 4. AppName 7. The AppName directory will contain everything in our project, including _________ and _____________ . 1. Source code, Final input 2. Input, Output 3. Source code, Final output 4. Source code, everything 8. In the AppName directory, create a subdirectory named_______. 1. Compiled application 2. Deploy 3. Final output 4. Source code 9. Every Flash application must include at least one ______________. 1. Flash document 2. AppName 3. Deploy 4. Source 10. In the AppName/Source directory, create a subdirectory named __________. 1. Source 2. Com 3. Some domain 4. AppName 11. In the AppName/Source/Com directory, create a sub directory named ______ 1. Some domain 2. Com 3. AppName 4. Source 12. A project is group of related _________ that can be managed via the project panel in the flash.
    [Show full text]
  • Discrete Cosine Transform for 8X8 Blocks with CUDA
    Discrete Cosine Transform for 8x8 Blocks with CUDA Anton Obukhov [email protected] Alexander Kharlamov [email protected] October 2008 Document Change History Version Date Responsible Reason for Change 0.8 24.03.2008 Alexander Kharlamov Initial release 0.9 25.03.2008 Anton Obukhov Added algorithm-specific parts, fixed some issues 1.0 17.10.2008 Anton Obukhov Revised document structure October 2008 2 Abstract In this whitepaper the Discrete Cosine Transform (DCT) is discussed. The two-dimensional variation of the transform that operates on 8x8 blocks (DCT8x8) is widely used in image and video coding because it exhibits high signal decorrelation rates and can be easily implemented on the majority of contemporary computing architectures. The key feature of the DCT8x8 is that any pair of 8x8 blocks can be processed independently. This makes possible fully parallel implementation of DCT8x8 by definition. Most of CPU-based implementations of DCT8x8 are firmly adjusted for operating using fixed point arithmetic but still appear to be rather costly as soon as blocks are processed in the sequential order by the single ALU. Performing DCT8x8 computation on GPU using NVIDIA CUDA technology gives significant performance boost even compared to a modern CPU. The proposed approach is accompanied with the sample code “DCT8x8” in the NVIDIA CUDA SDK. October 2008 3 1. Introduction The Discrete Cosine Transform (DCT) is a Fourier-like transform, which was first proposed by Ahmed et al . (1974). While the Fourier Transform represents a signal as the mixture of sines and cosines, the Cosine Transform performs only the cosine-series expansion.
    [Show full text]
  • Analysis Application for H.264 Video Encoding
    IT 10 061 Examensarbete 30 hp November 2010 Analysis Application for H.264 Video Encoding Ying Wang Institutionen för informationsteknologi Department of Information Technology Abstract Analysis Application for H.264 Video Encoding Ying Wang Teknisk- naturvetenskaplig fakultet UTH-enheten A video analysis application ERANA264(Ericsson Research h.264 video Besöksadress: ANalysis Application) is developed in Ångströmlaboratoriet Lägerhyddsvägen 1 this project. Erana264 is a tool that Hus 4, Plan 0 analyzes H.264 encoded video bitstreams, extracts the encoding information and Postadress: parameters, analyzes them in different Box 536 751 21 Uppsala stages and displays the results in a user friendly way. The intention is that Telefon: such an application would be used during 018 – 471 30 03 development and testing of video codecs. Telefax: The work is implemented on top of 018 – 471 30 00 existing H.264 encoder/decoder source code (C/C++) developed at Ericsson Hemsida: Research. http://www.teknat.uu.se/student Erana264 consists of three layers. The first layer is the H.264 decoder previously developed in Ericsson Research. By using the decoder APIs, the information is extracted from the bitstream and is sent to the higher layers. The second layer visualizes the different decoding stages, uses overlay to display some macro block and picture level information and provides a set of play back functions. The third layer analyzes and presents the statistics of prominent parameters in video compression process, such as video quality measurements, motion vector distribution, picture bit distribution etc. Key words: H.264, Video compression, Bitstream analysis, Video encoding Handledare: Zhuangfei Wu and Clinton Priddle Ämnesgranskare: Cris Luengo Examinator: Anders Jansson IT10061 Tryckt av: Reprocentralen ITC Acknowledgements Fist of all, I am heartily thankful to my supervisors, Fred Wu and Clinton Priddle, whose encouragement, supervision and support from the preliminary to the concluding level enabled me to develop an understanding of the subject.
    [Show full text]
  • CALIFORNIA STATE UNIVERSITY, NORTHRIDGE Optimized AV1 Inter
    CALIFORNIA STATE UNIVERSITY, NORTHRIDGE Optimized AV1 Inter Prediction using Binary classification techniques A graduate project submitted in partial fulfillment of the requirements for the degree of Master of Science in Software Engineering by Alex Kit Romero May 2020 The graduate project of Alex Kit Romero is approved: ____________________________________ ____________ Dr. Katya Mkrtchyan Date ____________________________________ ____________ Dr. Kyle Dewey Date ____________________________________ ____________ Dr. John J. Noga, Chair Date California State University, Northridge ii Dedication This project is dedicated to all of the Computer Science professors that I have come in contact with other the years who have inspired and encouraged me to pursue a career in computer science. The words and wisdom of these professors are what pushed me to try harder and accomplish more than I ever thought possible. I would like to give a big thanks to the open source community and my fellow cohort of computer science co-workers for always being there with answers to my numerous questions and inquiries. Without their guidance and expertise, I could not have been successful. Lastly, I would like to thank my friends and family who have supported and uplifted me throughout the years. Thank you for believing in me and always telling me to never give up. iii Table of Contents Signature Page ................................................................................................................................ ii Dedication .....................................................................................................................................
    [Show full text]
  • Video Coding Standards
    Module 8 Video Coding Standards Version 2 ECE IIT, Kharagpur Lesson 23 MPEG-1 standards Version 2 ECE IIT, Kharagpur Lesson objectives At the end of this lesson, the students should be able to : 1. Enlist the major video coding standards 2. State the basic objectives of MPEG-1 standard. 3. Enlist the set of constrained parameters in MPEG-1 4. Define the I- P- and B-pictures 5. Present the hierarchical data structure of MPEG-1 6. Define the macroblock modes supported by MPEG-1 23.0 Introduction In lesson 21 and lesson 22, we studied how to perform motion estimation and thereby temporally predict the video frames to exploit significant temporal redundancies present in the video sequence. The error in temporal prediction is encoded by standard transform domain techniques like the DCT, followed by quantization and entropy coding to exploit the spatial and statistical redundancies and achieve significant video compression. The video codecs therefore follow a hybrid coding structure in which DPCM is adopted in temporal domain and DCT or other transform domain techniques in spatial domain. Efforts to standardize video data exchange via storage media or via communication networks are actively in progress since early 1980s. A number of international video and audio standardization activities started within the International Telephone Consultative Committee (CCITT), followed by the International Radio Consultative Committee (CCIR), and the International Standards Organization / International Electrotechnical Commission (ISO/IEC). An experts group, known as the Motion Pictures Expects Group (MPEG) was established in 1988 in the framework of the Joint ISO/IEC Technical Committee with an objective to develop standards for coded representation of moving pictures, associated audio, and their combination for storage and retrieval of digital media.
    [Show full text]
  • AVC to the Max: How to Configure Encoder
    Contents Company overview …. ………………………………………………………………… 3 Introduction…………………………………………………………………………… 4 What is AVC….………………………………………………………………………… 6 Making sense of profiles, levels, and bitrate………………………………………... 7 Group of pictures and its structure..………………………………………………… 11 Macroblocks: partitioning and prediction modes….………………………………. 14 Eliminating spatial redundancy……………………………………………………… 15 Eliminating temporal redundancy……...……………………………………………. 17 Adaptive quantization……...………………………………………………………… 24 Deblocking filtering….….…………………………………………………………….. 26 Entropy encoding…………………………………….……………………………….. 2 8 Conclusion…………………………………………………………………………….. 29 Contact details..………………………………………………………………………. 30 2 www.elecard.com Company overview Elecard company, founded in 1988, is a leading provider of software products for encoding, decoding, processing, monitoring and analysis of video and audio data in 9700 companies various formats. Elecard is a vendor of professional software products and software development kits (SDKs); products for in - depth high - quality analysis and monitoring of the media content; countries 1 50 solutions for IPTV and OTT projects, digital TV broadcasting and video streaming; transcoding servers. Elecard is based in the United States, Russia, and China with 20M users headquarters located in Tomsk, Russia. Elecard products are highly appreciated and widely used by the leaders of IT industry such as Intel, Cisco, Netflix, Huawei, Blackmagic Design, etc. For more information, please visit www.elecard.com. 3 www.elecard.com Introduction Video compression is the key step in video processing. Compression allows broadcasters and premium TV providers to deliver their content to their audience. Many video compression standards currently exist in TV broadcasting. Each standard has different properties, some of which are better suited to traditional live TV while others are more suited to video on demand (VoD). Two basic standards can be identified in the history of video compression: • MPEG-2, a legacy codec used for SD video and early digital broadcasting.
    [Show full text]
  • Nvidia Video Codec Sdk - Encoder
    NVIDIA VIDEO CODEC SDK - ENCODER Programming Guide vNVENCODEAPI_PG-06155-001_v11 | July 2021 Table of Contents Chapter 1. Introduction........................................................................................................ 1 Chapter 2. Basic Encoding Flow.......................................................................................... 2 Chapter 3. Setting Up Hardware for Encoding....................................................................3 3.1. Opening an Encode Session.....................................................................................................3 3.1.1. Initializing encode device................................................................................................... 3 3.1.1.1. DirectX 9.......................................................................................................................3 3.1.1.2. DirectX 10.....................................................................................................................3 3.1.1.3. DirectX 11.....................................................................................................................4 3.1.1.4. DirectX 12.....................................................................................................................4 3.1.1.5. CUDA............................................................................................................................ 4 3.1.1.6. OpenGL........................................................................................................................
    [Show full text]
  • An Analysis of VP8, a New Video Codec for the Web Sean Cassidy
    Rochester Institute of Technology RIT Scholar Works Theses Thesis/Dissertation Collections 11-1-2011 An Analysis of VP8, a new video codec for the web Sean Cassidy Follow this and additional works at: http://scholarworks.rit.edu/theses Recommended Citation Cassidy, Sean, "An Analysis of VP8, a new video codec for the web" (2011). Thesis. Rochester Institute of Technology. Accessed from This Thesis is brought to you for free and open access by the Thesis/Dissertation Collections at RIT Scholar Works. It has been accepted for inclusion in Theses by an authorized administrator of RIT Scholar Works. For more information, please contact [email protected]. An Analysis of VP8, a New Video Codec for the Web by Sean A. Cassidy A Thesis Submitted in Partial Fulfillment of the Requirements for the Degree of Master of Science in Computer Engineering Supervised by Professor Dr. Andres Kwasinski Department of Computer Engineering Rochester Institute of Technology Rochester, NY November, 2011 Approved by: Dr. Andres Kwasinski R.I.T. Dept. of Computer Engineering Dr. Marcin Łukowiak R.I.T. Dept. of Computer Engineering Dr. Muhammad Shaaban R.I.T. Dept. of Computer Engineering Thesis Release Permission Form Rochester Institute of Technology Kate Gleason College of Engineering Title: An Analysis of VP8, a New Video Codec for the Web I, Sean A. Cassidy, hereby grant permission to the Wallace Memorial Library to repro- duce my thesis in whole or part. Sean A. Cassidy Date i Acknowledgements I would like to thank my thesis advisors, Dr. Andres Kwasinski, Dr. Marcin Łukowiak, and Dr. Muhammad Shaaban for their suggestions, advice, and support.
    [Show full text]
  • University of Texas at Arlington Dissertation Template
    IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.264 FOR BASELINE PROFILE by SHREYANKA SUBBARAYAPPA Presented to the Faculty of the Graduate School of The University of Texas at Arlington in Partial Fulfillment of the Requirements for the Degree of MASTER OF SCIENCE IN ELECTRICAL ENGINEERING THE UNIVERSITY OF TEXAS AT ARLINGTON May 2012 Copyright © by Shreyanka Subbarayappa 2012 All Rights Reserved ACKNOWLEDGEMENTS Firstly, I would thank my advisor Prof. K. R. Rao for his valuable guidance and support, and his tireless guidance, dedication to his students and maintaining new trend in the research areas has inspired me a lot without which this thesis would not have been possible. I also like to thank the other members of my advisory committee Prof. W. Alan Davis and Prof. Kambiz Alavi for reviewing the thesis document and offering insightful comments. I appreciate all members of Multimedia Processing Lab, Tejas Sathe, Priyadarshini and Darshan Alagud for their support during my research work. I would also like to thank my friends Adarsh Keshavamurthy, Babu Hemanth Kumar, Raksha, Kirthi, Spoorthi, Premalatha, Sadaf, Tanaya, Karthik, Pooja, and my Intel manager Sumeet Kaur who kept me going through the trying times of my Masters. Finally, I am grateful to my family; my father Prof. H Subbarayappa, my mother Ms. Anasuya Subbarayappa, my sister Dr. Priyanka Subbarayappa, my brother-in-law Dr. Manju Jayram and my sweet little nephew Dishanth for their support, patience, and encouragement during my graduate journey. April 16, 2012 iii ABSTRACT IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.264 FOR BASELINE PROFILE Shreyanka Subbarayappa, M.S The University of Texas at Arlington, 2012 Supervising Professor: K.R.Rao H.264/AVC [1] is a video coding standard that has a wide range of applications ranging from high-end professional camera and editing systems to low-end mobile applications.
    [Show full text]
  • The Basics of Video Compression
    The Basics of Video Compression Marko Slyz February 18, 2003 (Sourcecoders talk) 1/18 Outline 1. Non-technical Survey of Video Compressors 2. Basic Description of MPEG 1 3. Discussion of Other Compressors 4. Numerical Comparisons 2/18 Some Applications of Video Compressors MPEG-1 Video-CD MPEG-2 HDTV, DVD (usually), some camcorders, current digital cable, TiVo/ReplayTV MPEG-4 (a.k.a. divx) internet video, future digital cable, at least one camcorder, cell phones. H.261, H.263 video conferencing (e.g. Polycom) H.26L (a.k.a. MPEG-4 part 10, JVT H.264?) (Not yet done?) Intended for both low-delay and high-delay applications. 3/18 Another Classification of Video Compressors MPEG-1 (?) Corona (MS) requires MPEG-2 royalties RealVideo ISO/IEC Standard MPEG-4 Sorenson ITU-T Standard doesn’t H.261 vp3 (on2) require H.263 Tarkin (ogg) royalties H.26L (base) WaveVideo (?) formally not formally standardized standardized 4/18 Basic Idea of MPEG 1 or 2 (and :::) An uncompressed video file consists of a sequence of frames. MPEG (often) uses DPCM between frames, and uses a block-based DCT within a frame. MPEG has lots of forward-adaptive parts, lots of static parts, and no (?), except for the DPCM just mentioned, backward-adaptive parts. 5/18 MPEG 1 or 2 Beginnings To code a particular frame, start by 1. transforming its colors into Y,Cb,Cr space, 2. then partitioning it into macroblocks, each of which consists of a 16x16 block of pixels from Y, and 8x8 blocks of the subsampled versions of Cb and Cr (4:2:0).
    [Show full text]
  • Redalyc.Performance Evaluation of Macroblock-Level Parallelization Of
    Revista Avances en Sistemas e Informática ISSN: 1657-7663 [email protected] Universidad Nacional de Colombia Colombia Alvarez, Mauricio; Ramirez, Alex; Valero, Mateo; Azevedo, Arnaldo; Meenderinck, Cor; Juurlink, Ben Performance evaluation of macroblock-level parallelization of H.264 decoding on a cc-NUMA multiprocessor architecture Revista Avances en Sistemas e Informática, vol. 6, núm. 1, junio, 2009, pp. 219-228 Universidad Nacional de Colombia Medellín, Colombia Available in: http://www.redalyc.org/articulo.oa?id=133112608024 How to cite Complete issue Scientific Information System More information about this article Network of Scientific Journals from Latin America, the Caribbean, Spain and Portugal Journal's homepage in redalyc.org Non-profit academic project, developed under the open access initiative Performance evaluation of macroblock­level parallelization of H.264 decoding on a cc­NUMA multiprocessor architecture Evaluación del rendimiento paralelo en el nivel macro bloque del decodificador H.264 en una arquitectura multiprocesador cc­NUMA Mauricio Alvarez, Ing. 1, Alex Ramirez, PhD. 1;2, Mateo Valero, PhD. 1;2 Arnaldo Azevedo, MSc. 3, Cor Meenderinck, MSc. 3, Ben Juurlink, PhD 3 1. Universitat Politècnica de Catalunya, Barcelona, Spain. 2. Barcelona Supercomputing Center, Barcelona, Spain. 3. Delft University of Technology, Delft, The Netherlands [email protected], [email protected], cor, [email protected], [email protected], [email protected] Recibido para revisión 5 de junio de 2009, aceptado 5 de junio de 2009, versión final 5 de junio de 2009 Abstract—This paper presents a study of the performance scalability mejor escalabilidad. Este artículo incluye una evaluación del impacto en of a macroblock­level parallelization of the H.264 decoder for High sincronización en bloque APIs como hilos POSIX y extensiones de Definition (HD) applications on a multiprocessor architecture.
    [Show full text]