Overview: Image and Video Coding Standards JPEG: Background
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
CT8021 H.32X G.723.1/G.728 Truespeech Co-Processor
CT8021 H.32x G.723.1/G.728 TrueSpeech Co-Processor Introduction Features The CT8021 is a speech co-processor which · TrueSpeechâ G.723.1 at 6.3, 5.3, 4.8 and performs full duplex speech compression and de- 4.1 kbps at 8KHz sampling rate (including compression functions. It provides speech G.723.1 Annex A VAD/CNG) compression for H.320, H.323 and H.324 · G.728 16 Kbps LD-CELP Multimedia Visual Telephony / Video · Download of additional speech compression Conferencing products and DSVD Modems. The software modules into external sram for CT8021 has built-in TrueSpeechâ G.723.1 (for TrueSpeechâ 8.5, G.722 & G.729-A/B H.323 and H.324) as well as G.728 LD-CELP · Real-time Full duplex or Half duplex speech speech compression (for H.320). This combination compression and decompression of ITU speech compression standards within a · Acoustic Echo Cancellation concurrent with single device enables the creation of a single full-duplex speech compression multimedia terminal which can operate in all · Full Duplex standalone Speakerphone types of Video Conferencing systems including · Host-to-Host (codec-less) and Host-CODEC H.320 ISDN-based, H.324 POTS-based, and modes of operation H.323 LAN/Internet-based. TrueSpeechâ G.723.1 · Parallel 8-bit host interface provides simple provides compressed data rates of 6.3 and 5.3 memory-mapped I/O host connection. Kbps and includes G.723.1 Annex A VAD/CNG · 1 or 2-channel DMA support (Single Cycle “silence” compression which can supply an even and Burst Modes) lower average bit rate. -
TR 101 329-7 V1.1.1 (2000-11) Technical Report
ETSI TR 101 329-7 V1.1.1 (2000-11) Technical Report TIPHON; Design Guide; Part 7: Design Guide for Elements of a TIPHON connection from an end-to-end speech transmission performance point of view 2 ETSI TR 101 329-7 V1.1.1 (2000-11) Reference DTR/TIPHON-05011 Keywords internet, IP, network, performance, protocol, quality, speech, voice ETSI 650 Route des Lucioles F-06921 Sophia Antipolis Cedex - FRANCE Tel.:+33492944200 Fax:+33493654716 Siret N° 348 623 562 00017 - NAF 742 C Association à but non lucratif enregistrée à la Sous-Préfecture de Grasse (06) N° 7803/88 Important notice Individual copies of the present document can be downloaded from: http://www.etsi.org The present document may be made available in more than one electronic version or in print. In any case of existing or perceived difference in contents between such versions, the reference version is the Portable Document Format (PDF). In case of dispute, the reference shall be the printing on ETSI printers of the PDF version kept on a specific network drive within ETSI Secretariat. Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at http://www.etsi.org/tb/status/ If you find errors in the present document, send your comment to: [email protected] Copyright Notification No part may be reproduced except as authorized by written permission. The copyright and the foregoing restriction extend to reproduction in all media. -
LOW COMPLEXITY H.264 to VC-1 TRANSCODER by VIDHYA
LOW COMPLEXITY H.264 TO VC-1 TRANSCODER by VIDHYA VIJAYAKUMAR Presented to the Faculty of the Graduate School of The University of Texas at Arlington in Partial Fulfillment of the Requirements for the Degree of MASTER OF SCIENCE IN ELECTRICAL ENGINEERING THE UNIVERSITY OF TEXAS AT ARLINGTON AUGUST 2010 Copyright © by Vidhya Vijayakumar 2010 All Rights Reserved ACKNOWLEDGEMENTS As true as it would be with any research effort, this endeavor would not have been possible without the guidance and support of a number of people whom I stand to thank at this juncture. First and foremost, I express my sincere gratitude to my advisor and mentor, Dr. K.R. Rao, who has been the backbone of this whole exercise. I am greatly indebted for all the things that I have learnt from him, academically and otherwise. I thank Dr. Ishfaq Ahmad for being my co-advisor and mentor and for his invaluable guidance and support. I was fortunate to work with Dr. Ahmad as his research assistant on the latest trends in video compression and it has been an invaluable experience. I thank my mentor, Mr. Vishy Swaminathan, and my team members at Adobe Systems for giving me an opportunity to work in the industry and guide me during my internship. I would like to thank the other members of my advisory committee Dr. W. Alan Davis and Dr. William E Dillon for reviewing the thesis document and offering insightful comments. I express my gratitude Dr. Jonathan Bredow and the Electrical Engineering department for purchasing the software required for this thesis and giving me the chance to work on cutting edge technologies. -
Video Coding Standards
Module 8 Video Coding Standards Version 2 ECE IIT, Kharagpur Lesson 23 MPEG-1 standards Version 2 ECE IIT, Kharagpur Lesson objectives At the end of this lesson, the students should be able to : 1. Enlist the major video coding standards 2. State the basic objectives of MPEG-1 standard. 3. Enlist the set of constrained parameters in MPEG-1 4. Define the I- P- and B-pictures 5. Present the hierarchical data structure of MPEG-1 6. Define the macroblock modes supported by MPEG-1 23.0 Introduction In lesson 21 and lesson 22, we studied how to perform motion estimation and thereby temporally predict the video frames to exploit significant temporal redundancies present in the video sequence. The error in temporal prediction is encoded by standard transform domain techniques like the DCT, followed by quantization and entropy coding to exploit the spatial and statistical redundancies and achieve significant video compression. The video codecs therefore follow a hybrid coding structure in which DPCM is adopted in temporal domain and DCT or other transform domain techniques in spatial domain. Efforts to standardize video data exchange via storage media or via communication networks are actively in progress since early 1980s. A number of international video and audio standardization activities started within the International Telephone Consultative Committee (CCITT), followed by the International Radio Consultative Committee (CCIR), and the International Standards Organization / International Electrotechnical Commission (ISO/IEC). An experts group, known as the Motion Pictures Expects Group (MPEG) was established in 1988 in the framework of the Joint ISO/IEC Technical Committee with an objective to develop standards for coded representation of moving pictures, associated audio, and their combination for storage and retrieval of digital media. -
Multiple Description Coding Using Time Domain Division for MP3 Coded Sound Signal
Journal of Information Hiding and Multimedia Signal Processing ©2010 ISSN 2073-4212 Ubiquitous International Volume 1, Number 4, October 2010 Multiple Description Coding Using Time Domain Division for MP3 coded Sound Signal Ho-seok Wey1;2, Akinori Ito3, Takuma Okamoto1;3, and Yoiti Suzuki1;3 1Research Institute of Electrical Communication,Tohoku University 2-1-1, Katahira, Aoba-ku, Sendai, Miyagi 980-8577, JAPAN 2Graduate School of Information Sciences, Tohoku University, JAPAN Tel.: +81-22-217-5461, Fax.: +81-22-217-5535, E-mail: fhswey, okamoto, yoh @ais.riec.tohoku.ac.jp 3Graduate School of Engineering,f Tohoku University, g 6-6-05 Aramaki aza Aoba, Aoba-ku, Sendai, Miyagi 980-8579, JAPAN Tel.,Fax.: +81-22-795-7084, E-mail: [email protected] Received April 2010; revised May 2010 Abstract. In audio communications over a lossy packet network, packet loss conceal- ment techniques are needed to mitigate a user's frustration when perceiving the deterio- ration of the quality of the decoded signal. Multiple description coding (MDC) is a useful solution to this problem. In this paper, we describe an MDC method for concealing packet losses for wideband sound signal streaming based on the sample splitting method in the time domain and encoding by an MPEG-1 audio layer III (MP3) encoder. To enhance the quality of the restored signal, we applied a Wiener filter to the higher frequency part of the restored signal. Experiments were conducted to compare the proposed method with several conventional methods, conrming that the proposed method showed higher quality results than the conventional methods for a range of bit rates from 128 to 192 kbps when there were heavy packet losses. -
A Real-Time Audio Compression Technique Based on Fast Wavelet Filtering and Encoding
Proceedings of the Federated Conference on Computer Science DOI: 10.15439/2016F296 and Information Systems pp. 497–502 ACSIS, Vol. 8. ISSN 2300-5963 A Real-Time Audio Compression Technique Based on Fast Wavelet Filtering and Encoding Nella Romano∗, Antony Scivoletto∗, Dawid Połap† ∗Department of Electrical and Informatics Engineering, University of Catania, Viale A. Doria 6, 95125 Catania, Italy Email: [email protected], [email protected] †Institute of Mathematics, Silesian University of Technology, Kaszubska 23, 44-100 Gliwice, Poland Email: [email protected] Abstract—With the development of telecommunication tech- transmitted using less data than would be required for the nology over the last decades, the request for digital information original signal [2]. compression has increased dramatically. In many applications, Compression techniques can be classified into one of two such as high quality audio transmission and storage, the target is to achieve audio and speech signal codings at the lowest main categories: lossless and lossy. Lossless compression possible data rates, in order to offer cheaper costs in terms works by removing the redundant information present in an of transmission and storage. Recently, compression techniques audio signal, but preserving its quality and the complete in- using wavelet transform have received great attention because tegrity of the data. However, it offers small compression ratios, of their promising compression ratio, signal to noise ratio, hence it can be used if we have no stringent requirements; and flexibility in representing speech signals. In this paper we examine a new technique for analysing and compressing furthermore, it does not guarantee a constant output data rate, speech signals using biorthogonal wavelet filters. -
AVC to the Max: How to Configure Encoder
Contents Company overview …. ………………………………………………………………… 3 Introduction…………………………………………………………………………… 4 What is AVC….………………………………………………………………………… 6 Making sense of profiles, levels, and bitrate………………………………………... 7 Group of pictures and its structure..………………………………………………… 11 Macroblocks: partitioning and prediction modes….………………………………. 14 Eliminating spatial redundancy……………………………………………………… 15 Eliminating temporal redundancy……...……………………………………………. 17 Adaptive quantization……...………………………………………………………… 24 Deblocking filtering….….…………………………………………………………….. 26 Entropy encoding…………………………………….……………………………….. 2 8 Conclusion…………………………………………………………………………….. 29 Contact details..………………………………………………………………………. 30 2 www.elecard.com Company overview Elecard company, founded in 1988, is a leading provider of software products for encoding, decoding, processing, monitoring and analysis of video and audio data in 9700 companies various formats. Elecard is a vendor of professional software products and software development kits (SDKs); products for in - depth high - quality analysis and monitoring of the media content; countries 1 50 solutions for IPTV and OTT projects, digital TV broadcasting and video streaming; transcoding servers. Elecard is based in the United States, Russia, and China with 20M users headquarters located in Tomsk, Russia. Elecard products are highly appreciated and widely used by the leaders of IT industry such as Intel, Cisco, Netflix, Huawei, Blackmagic Design, etc. For more information, please visit www.elecard.com. 3 www.elecard.com Introduction Video compression is the key step in video processing. Compression allows broadcasters and premium TV providers to deliver their content to their audience. Many video compression standards currently exist in TV broadcasting. Each standard has different properties, some of which are better suited to traditional live TV while others are more suited to video on demand (VoD). Two basic standards can be identified in the history of video compression: • MPEG-2, a legacy codec used for SD video and early digital broadcasting. -
Video Coding Standards 1 Videovideo Codingcoding Standardsstandards
VideoVideo CodingCoding StandardsStandards • H.120 • H.261 • MPEG-1 and MPEG-2/H.262 • H.263 • MPEG-4 Thomas Wiegand: Digital Image Communication Video Coding Standards 1 VideoVideo CodingCoding StandardsStandards MPEG-2 digital TV 2 -6 Mbps ITU-R 601 166 Mbit/s H.261 ISDN 64 kbps Picture phone H.263 PSTN < 28.8 kbps picture phone Thomas Wiegand: Digital Image Communication Video Coding Standards 2 H.120:H.120: TheThe FirstFirst DigitalDigital VideoVideo CodingCoding StandardStandard • ITU-T (ex-CCITT) Rec. H.120: The first digital video coding standard (1984) • v1 (1984) had conditional replenishment, DPCM, scalar quantization, variable-length coding, switch for quincunx sampling • v2 (1988) added motion compensation and background prediction • Operated at 1544 (NTSC) and 2048 (PAL) kbps • Few units made, essentially not in use today Thomas Wiegand: Digital Image Communication Video Coding Standards 3 H.261:H.261: TheThe BasisBasis ofof ModernModern VideoVideo CompressionCompression • ITU-T (ex-CCITT) Rec. H.261: The first widespread practical success • First design (late ’80s) embodying typical structure that dominates today: 16x16 macroblock motion compensation, 8x8 DCT, scalar quantization, and variable-length coding • Other key aspects: loop filter, integer-pel motion compensation accuracy, 2-D VLC for coefficients • Operated at 64-2048 kbps • Still in use, although mostly as a backward- compatibility feature – overtaken by H.263 Thomas Wiegand: Digital Image Communication Video Coding Standards 4 H.261&3H.261&3 MacroblockMacroblock -
A Cross-Layer Design for Scalable Mobile Video
A Cross-Layer Design for Scalable Mobile Video The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation Jakubczak, Szymon, and Dina Katabi. “A Cross-layer Design for Scalable Mobile Video.” ACM Press, 2011. 289. As Published http://dx.doi.org/10.1145/2030613.2030646 Publisher Association for Computing Machinery (ACM) Version Author's final manuscript Citable link http://hdl.handle.net/1721.1/72994 Terms of Use Creative Commons Attribution-Noncommercial-Share Alike 3.0 Detailed Terms http://creativecommons.org/licenses/by-nc-sa/3.0/ A Cross-Layer Design for Scalable Mobile Video Szymon Jakubczak Dina Katabi CSAIL MIT CSAIL MIT 32 Vassar St. 32 Vassar St. Cambridge, Mass. 02139 Cambridge, Mass. 02139 [email protected] [email protected] ABSTRACT Cisco visual index, mobile video traffic will grow 66 fold over Today’s mobile video suffers from two limitations: 1) it can- a period of five years [1]. Such predictions lead to a natu- not reduce bandwidth consumption by leveraging wireless ral question: can existing wireless technologies, e.g., WiFi, broadcast to multicast popular content to interested re- WiMax, or LTE, support this impending demand and pro- ceivers, and 2) it lacks robustness to wireless interference vide scalable and robust mobile video? and errors. This paper presents SoftCast, a cross-layer de- (a) Scalability. As demands for mobile video increase sign for mobile video that addresses both limitations. To congestion will also increase. The problem becomes particu- do so, SoftCast changes the network stack to act like a lin- larly severe when many users try to watch a popular realtime ear transform. -
Motion Compensation on DCT Domain
EURASIP Journal on Applied Signal Processing 2001:3, 147–162 © 2001 Hindawi Publishing Corporation Motion Compensation on DCT Domain Ut-Va Koc Lucent Technologies Bell Labs, 600 Mountain Avenue, Murray Hill, NJ 07974, USA Email: [email protected] K. J. Ray Liu Department of Electrical and Computer Engineering and Institute for Systems Research, University of Maryland, College Park, MD 20742, USA Email: [email protected] Received 21 May 2001 and in revised form 21 September 2001 Alternative fully DCT-based video codec architectures have been proposed in the past to address the shortcomings of the conven- tional hybrid motion compensated DCT video codec structures traditionally chosen as the basis of implementation of standard- compliant codecs. However, no prior effort has been made to ensure interoperability of these two drastically different architectures so that fully DCT-based video codecs are fully compatible with the existing video coding standards. In this paper, we establish the criteria for matching conventional codecs with fully DCT-based codecs. We find that the key to this interoperability lies in the heart of the implementation of motion compensation modules performed in the spatial and transform domains at both the encoder and the decoder. Specifically,if the spatial-domain motion compensation is compatible with the transform-domain motion compensation, then the states in both the coder and the decoder will keep track of each other even after a long series of P-frames. Otherwise, the states will diverge in proportion to the number of P-frames between two I-frames. This sets an important criterion for the development of any DCT-based motion compensation schemes. -
AMR Wideband Speech Codec; Feasibility Study Report (3GPP TR 26.901 Version 4.0.1 Release 4)
ETSI TR 126 901 V4.0.1 (2001-04) Technical Report Universal Mobile Telecommunications System (UMTS); AMR wideband speech codec; Feasibility study report (3GPP TR 26.901 version 4.0.1 Release 4) 3GPP TR 26.901 version 4.0.1 Release 4 1 ETSI TR 126 901 V4.0.1 (2001-04) Reference DTR/TSGS-0426901Uv4 Keywords UMTS ETSI 650 Route des Lucioles F-06921 Sophia Antipolis Cedex - FRANCE Tel.:+33492944200 Fax:+33493654716 Siret N° 348 623 562 00017 - NAF 742 C Association à but non lucratif enregistrée à la Sous-Préfecture de Grasse (06) N° 7803/88 Important notice Individual copies of the present document can be downloaded from: http://www.etsi.org The present document may be made available in more than one electronic version or in print. In any case of existing or perceived difference in contents between such versions, the reference version is the Portable Document Format (PDF). In case of dispute, the reference shall be the printing on ETSI printers of the PDF version kept on a specific network drive within ETSI Secretariat. Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at http://www.etsi.org/tb/status/ If you find errors in the present document, send your comment to: [email protected] Copyright Notification No part may be reproduced except as authorized by written permission. The copyright and the foregoing restriction extend to reproduction in all media. © European Telecommunications Standards Institute 2001. -
White Paper Version of July 2015
White paper Digital Video File Recommendation Version of July 2015 edited by Whitepaper 1 DIGITAL VIDEO FILE RECOMMENDATIONS ............................ 4 1.1 Preamble .........................................................................................................................4 1.2 Scope ...............................................................................................................................5 1.3 File Formats ....................................................................................................................5 1.4 Codecs ............................................................................................................................5 1.4.1 Browsing .................................................................................................... 5 1.4.2 Acquisition ................................................................................................. 6 1.4.3 Programme Contribution ........................................................................... 7 1.4.4 Postproduction .......................................................................................... 7 1.4.5 Broadcast ................................................................................................... 7 1.4.6 News & Magazines & Sports ..................................................................... 8 1.4.7 High Definition ........................................................................................... 8 1.5 General Requirements ..................................................................................................9