7MPEG Advanced Audio Coding

Total Page:16

File Type:pdf, Size:1020Kb

7MPEG Advanced Audio Coding 7 MPEG advanced audio coding MPEG advanced audio coding (AAC) and its successors are currently the most powerful members in the MPEG family for high-quality, multichannel, digital au- dio compression. MPEG AAC provides a various selection of operating modes and configurations, and therefore supports a wide range of applications from low bit- rate Internet audio to multichannel broadcasting services. In this chapter, we will introduce the main features of the AAC multichannel coding system. 7.1. Introduction to advanced audio coding As part of the MPEG-2 standard (ISO/IEC 13818-7) [60], MPEG-2 AAC was fi- nalized in 1997. Initially the MPEG-2 AAC was called the MPEG-2 non-backward compatible (NBC) coding [15]. The standardization of MPEG-2 NBC started after the completion of the MPEG-2 multichannel audio standard in 1994 [59]. At that time, the MPEG audio subgroup wished to develop a new multichannel coding standard that allows an even higher quality other than the first MPEG-2 multi- channel audio standard, which is backward compatible with MPEG-1. The devel- opment goal of this new multichannel audio coding algorithm is to achieve indis- tinguishable quality according to the EBU definition [11] at the bit rate of 384 kbps or lower for five full-bandwidth channel signals, without any backward compati- bility constrain. MPEG-4 AAC, which is also referred to as MPEG-4 general audio (T/F) coder [58], added several new audio tools on top of MPEG-2 AAC. Some of the new tools borrow ideas from speech coding to even lower the encoding bit rate while maintaining the similar sound quality. Others modified certain existing MPEG-2 AAC coding blocks to enable a few desirable features and provide more function- alities. With a typical bit rate of 64 kbps per channel, MPEG-4 AAC can achieve near-transparent coding for CD-quality original input sound. Even at very low bit rates down to 16 kbps, MPEG-4 AAC still exhibits excellent performance. A scalable audio coding standard is also included in MPEG-4 audio. MPEG-4 version 1 provides large-step scalability, which integrated several existing MPEG-4 audio coding schemes to deliver a scalable audio bitstream. AAC is adopted as part of this large-step scalable audio coding architecture to provide enhancement layer bitstream. A fine-grain scalable (FGS) audio framework was developed in MPEG-4 92 MPEG advanced audio coding Scalable sampling rate Main Low complexity 6kHz 12 kHz 18 kHz 20 kHz Figure 7.1. Three MPEG-2 AAC profiles [60]. audio version 2 [65]. The new FGS audio codec is called bit-sliced arithmetic cod- ing (BSAC). It replaces the AAC’s quantization and entropy coding module to en- able the decoder to reconstruct the audio signal more and more accurately. MPEG-4 high-efficiency AAC (HE AAC) is the next generation of AAC stan- dard, which utilizes AAC as the core coder and extends the higher-frequency com- ponent and stereo information with the help of a set of limited parameters. Since this new algorithm is a model-based parametric extension to AAC, it further reduces the bit rate and is capable to reproduce a CD-quality stereo sound at 24 kbps, which is about 64 times smaller than the original file. In this chapter, audio tools in all AAC-related standards including MPEG-2 AAC, MPEG-4 AAC, MPEG-4 BSAC, and MPEG-4 HEAAC will be discussed in detail. 7.2. MPEG-2 AAC 7.2.1. Overview of MPEG-2 AAC 7.2.1.1. Profiles MPEG-2 AAC provides three profiles that can be selected by end-users according to their desired complexity and quality requirements. These three profiles are main profile, low-complexity (LC) profile, and scalable sampling rate (SSR) profile, and their relationships are shown in Figure 7.1. (1) Main profile. AAC main profile, abbreviated as AAC main, provides the best audio quality at any given bit rate among all three profiles. All coding tools except the gain control are enabled in AAC main. Thus both the memory require- ment and the computational complexity of this configuration are higher than the other two profiles. However, the main profile AAC decoder is more powerful and is capable to decode a bitstream generated by a low-complexity AAC encoder. (2) Low-complexity profile. AAC low-complexity profile, abbreviated as AAC LC, does not use the gain control and prediction tools. Moreover, the temporal MPEG-2 AAC 93 Psychoacoustic Threshold model Input signal Block type Quantization Pre- Gain Spectral Transform & processing control processing entropy coding Multiplex Compressed bitstream Figure 7.2. The block diagram of the AAC encoder. noise shaping (TNS) filter has a lower order than that of the AAC main codec. With this configuration, the AAC codec consumes considerably lower memory and processing power requirements. However, degradation of the reconstructed sound quality is not obvious. In fact, among all three profiles, AAC LC is the most popular profile adopted in industry. (3) Scalable sampling rate profile. AAC scalable sampling rate profile, abbrevi- ated as AAC SSR, is capable to provide a frequency scalable signal. The gain control is activated when the AAC codec is operating in this profile. Except the gain con- trol module, other audio tools are operating at the same configuration as the AAC LC profile. Therefore, the memory requirement and the computational complex- ity of the AAC SSR codec are also significantly lower than that of the AAC main codec. 7.2.1.2. High-level block diagrams Figures 7.2 and 7.3 are high-level block diagrams of AAC encoder and decoder, respectively. At the encoder side, input signals are first processed by the preprocessing module if it is necessary. Depending on the required bit rate and the sampling frequency of the input signal, the preprocessing module downsamples the original signal so that better quality can be achieved with the limited bit budget. If the in- put signal has out-of-range amplitude, it will also be processed in this module. If the encoder is operating in SSR profile, signals are then processed by the gain con- trol. Otherwise, they directly go to the transform block to map the time-domain signal to frequency domain and obtain spectral data. A serial of spectral process- ing is followed to remove irrelevant and unimportant information. Finally, the processed spectral data is quantized and noiseless coded. Throughout the entire encoding process, the output from the psychoacoustic model, such as block type 94 MPEG advanced audio coding Compressed bitstream Demultiplex Block type Entropy decoding Inverse Inverse Inverse Post- spectral & transform gain processing inverse quantization processing control Output signal Figure 7.3. The block diagram of the AAC decoder. and masking thresholds, is used as a guide line to control each encoding module. The compressed spectral data as well as that side information generated in each coding module are all multiplexed in the bitstream. When the decoder receives a compressed bitstream, it first demultiplexes it. Compressed spectral data is then decoded by entropy decoder and inverse quan- tizer. Side information extracted from the bitstream is used to control the inverse spectral processing, inverse transform, and inverse gain control if activated. If the original sound input to the encoder has out-of-range amplitude, then the post- processing module restores the reconstructed signal back to its original amplitude range. 7.2.2. Psychoacoustic model Psychoacoustic model plays the most important role in the perceptual audio cod- ing. The output of this model controls almost every major coding block in the AAC encoder. The main task of this psychoacoustic model is to calculate masking thresholds. Any quantization noise added in the later quantization block is imper- ceptible if its energy is smaller than the masking thresholds. Ideally, if all encoding errors are masked by the signal itself, then the reconstructed sound is perceptu- ally indistinguishable from the original input. However, if the available bit budget does not allow all spectral data to meet masking thresholds, psychoacoustic model needs to guide the quantizer so that the minimum perceptual noise will be added by loosening the psychoacoustic requirements. 7.2.3. Gain control The gain control block, which is only activated in the SSR profile, includes a polyp- hase quadrature filter (PQF), gain detectors, and gain modifiers. The PQF filter- bank separates the input signal to four equal-width frequency bands. For example, if input sound has 48 kHz sampling rate, then four-band output from the PQF MPEG-2 AAC 95 filter contains signals of 0–6 kHz, 6–12 kHz, 12–18 kHz, and 18–24 kHz. Hence, the bandwidth scalability can be achieved by discarding one or more upper band signals. The distinct feature of this structure allows the decoder to reconstruct a reduced bandwidth signal with lower computational complexity. The gain detec- tor gathers information from the output of the PQF filter and let the gain modi- fier know which one of these four bands needs amplitude adjustment and how it should be done. The inverse gain control module in the decoder consists of gain compensators and inverse polyphase quadrature filter (IPQF). Four gain compensators corre- sponding to four equal-width frequency bands are required if the full-bandwidth signal needs to be reconstructed. Each gain compensator recovers gain control data then restores the original signal dynamics. The resulting outputs after the gain compensation are combined and synthesized by the IPQF to generate final PCM data. 7.2.4. Transform 7.2.4.1. Modified discrete cosine transform The conversion between time-domain signals as the input of the encoder or the output of the decoder and their corresponding frequency representations is a fun- damental component of the MPEG AAC audio coder.
Recommended publications
  • PD7040/98 Philips Portable DVD Player
    Philips Portable DVD Player 18cm/ 7" LCD 5-hr playtime PD7040 Longer movies enjoyment on the go with USB for digital media playback Enjoy your movies anytime, anyplace! The PD7040 portable DVD player features 7”/ 18cm LCD swivel screen for your great viewing experience. You can indulge in up to 5 hours of DVD/DivX®/MPEG movies, MP3-CD/CD music and JPEG photos on the go. Play your movies, music and photos on the go • DVD, DVD+/-R, DVD+/-RW, (S)VCD, CD compatible • DivX Certified for standard DivX video playback • MP3-CD, CD and CD-RW playback • View JPEG images from picture disc Enrich your AV entertainment experience • 7" swivel color LCD panel for improved viewing flexibility • Enjoy movies in 16:9 widescreen format • Built-in stereo speakers Extra touches for your convenience • Up to 5-hour playback with a built-in battery* • USB Direct for music and photo playback • Car mount pouch included for easy in-car use • Full Resume on Power Loss • AC adaptor, car adaptor and AV cable included Portable DVD Player PD7040/98 18cm/ 7" LCD 5-hr playtime Highlights MP3-CD, CD and CD-RW playback USB Direct where you have stopped the movie last time just by reloading the disc. Making your life a lot easier! MP3 is a revolutionary compression Simply plug in your USB device on the system technology by which large digital music files can and share your stored digital music and photos be made up to 10 times smaller without with your family and friends. radically degrading their audio quality.
    [Show full text]
  • A History of Video Game Consoles Introduction the First Generation
    A History of Video Game Consoles By Terry Amick – Gerald Long – James Schell – Gregory Shehan Introduction Today video games are a multibillion dollar industry. They are in practically all American households. They are a major driving force in electronic innovation and development. Though, you would hardly guess this from their modest beginning. The first video games were played on mainframe computers in the 1950s through the 1960s (Winter, n.d.). Arcade games would be the first glimpse for the general public of video games. Magnavox would produce the first home video game console featuring the popular arcade game Pong for the 1972 Christmas Season, released as Tele-Games Pong (Ellis, n.d.). The First Generation Magnavox Odyssey Rushed into production the original game did not even have a microprocessor. Games were selected by using toggle switches. At first sales were poor because people mistakenly believed you needed a Magnavox TV to play the game (GameSpy, n.d., para. 11). By 1975 annual sales had reached 300,000 units (Gamester81, 2012). Other manufacturers copied Pong and began producing their own game consoles, which promptly got them sued for copyright infringement (Barton, & Loguidice, n.d.). The Second Generation Atari 2600 Atari released the 2600 in 1977. Although not the first, the Atari 2600 popularized the use of a microprocessor and game cartridges in video game consoles. The original device had an 8-bit 1.19MHz 6507 microprocessor (“The Atari”, n.d.), two joy sticks, a paddle controller, and two game cartridges. Combat and Pac Man were included with the console. In 2007 the Atari 2600 was inducted into the National Toy Hall of Fame (“National Toy”, n.d.).
    [Show full text]
  • Music 422 Project Report
    Music 422 Project Report Jatin Chowdhury, Arda Sahiner, and Abhipray Sahoo October 10, 2020 1 Introduction We present an audio coder that seeks to improve upon traditional coding methods by implementing block switching, spectral band replication, and gain-shape quantization. Block switching improves on coding of transient sounds. Spectral band replication exploits low sensitivity of human hearing towards high frequencies. It fills in any missing coded high frequency content with information from the low frequencies. We chose to do gain-shape quantization in order to explicitly code the energy of each band, preserving the perceptually important spectral envelope more accurately. 2 Implementation 2.1 Block Switching Block switching, as introduced by Edler [1], allows the audio coder to adaptively change the size of the block being coded. Modern audio coders typically encode frequency lines obtained using the Modified Discrete Cosine Transform (MDCT). Using longer blocks for the MDCT allows forbet- ter spectral resolution, while shorter blocks have better time resolution, due to the time-frequency trade-off known as the Fourier uncertainty principle [2]. Due to their poor time resolution, using long MDCT blocks for an audio coder can result in “pre-echo” when the coder encodes transient sounds. Block switching allows the coder to use long MDCT blocks for normal signals being encoded and use shorter blocks for transient parts of the signal. In our coder, we implement the block switching algorithm introduced in [1]. This algorithm consists of four types of block windows: long, short, start, and stop. Long and short block windows are simple sine windows, as defined in [3]: (( ) ) 1 π w[n] = sin n + (1) 2 N where N is the length of the block.
    [Show full text]
  • Forensic Analysis of the Nintendo 3DS NAND
    Edith Cowan University Research Online ECU Publications Post 2013 2019 Forensic Analysis of the Nintendo 3DS NAND Gus Pessolano Huw O. L. Read Iain Sutherland Edith Cowan University Konstantinos Xynos Follow this and additional works at: https://ro.ecu.edu.au/ecuworkspost2013 Part of the Physical Sciences and Mathematics Commons 10.1016/j.diin.2019.04.015 Pessolano, G., Read, H. O., Sutherland, I., & Xynos, K. (2019). Forensic analysis of the Nintendo 3DS NAND. Digital Investigation, 29, S61-S70. Available here This Journal Article is posted at Research Online. https://ro.ecu.edu.au/ecuworkspost2013/6459 Digital Investigation 29 (2019) S61eS70 Contents lists available at ScienceDirect Digital Investigation journal homepage: www.elsevier.com/locate/diin DFRWS 2019 USA e Proceedings of the Nineteenth Annual DFRWS USA Forensic Analysis of the Nintendo 3DS NAND * Gus Pessolano a, Huw O.L. Read a, b, , Iain Sutherland b, c, Konstantinos Xynos b, d a Norwich University, Northfield, VT, USA b Noroff University College, 4608 Kristiansand S., Vest Agder, Norway c Security Research Institute, Edith Cowan University, Perth, Australia d Mycenx Consultancy Services, Germany article info abstract Article history: Games consoles present a particular challenge to the forensics investigator due to the nature of the hardware and the inaccessibility of the file system. Many protection measures are put in place to make it deliberately difficult to access raw data in order to protect intellectual property, enhance digital rights Keywords: management of software and, ultimately, to protect against piracy. History has shown that many such Nintendo 3DS protections on game consoles are circumvented with exploits leading to jailbreaking/rooting and Games console allowing unauthorized software to be launched on the games system.
    [Show full text]
  • W4: OBJECTIVE QUALITY METRICS 2D/3D Jan Ozer [email protected] 276-235-8542 @Janozer Course Overview
    W4: OBJECTIVE QUALITY METRICS 2D/3D Jan Ozer www.streaminglearningcenter.com [email protected] 276-235-8542 @janozer Course Overview • Section 1: Validating metrics • Section 2: Comparing metrics • Section 3: Computing metrics • Section 4: Applying metrics • Section 5: Using metrics • Section 6: 3D metrics Section 1: Validating Objective Quality Metrics • What are objective quality metrics? • How accurate are they? • How are they used? • What are the subjective alternatives? What Are Objective Quality Metrics • Mathematical formulas that (attempt to) predict how human eyes would rate the videos • Faster and less expensive than subjective tests • Automatable • Examples • Video Multimethod Assessment Fusion (VMAF) • SSIMPLUS • Peak Signal to Noise Ratio (PSNR) • Structural Similarity Index (SSIM) Measure of Quality Metric • Role of objective metrics is to predict subjective scores • Correlation with Human MOS (mean opinion score) • Perfect score - objective MOS matched actual subjective tests • Perfect diagonal line Correlation with Subjective - VMAF VMAF PSNR Correlation with Subjective - SSIMPLUS PSNR SSIMPLUS SSIMPLUS How Are They Used • Netflix • Per-title encoding • Choosing optimal data rate/rez combination • Facebook • Comparing AV1, x265, and VP9 • Researchers • BBC comparing AV1, VVC, HEVC • My practice • Compare codecs and encoders • Build encoding ladders • Make critical configuration decisions Day to Day Uses • Optimize encoding parameters for cost and quality • Configure encoding ladder • Compare codecs and encoders • Evaluate
    [Show full text]
  • Audio Coding for Digital Broadcasting
    Recommendation ITU-R BS.1196-7 (01/2019) Audio coding for digital broadcasting BS Series Broadcasting service (sound) ii Rec. ITU-R BS.1196-7 Foreword The role of the Radiocommunication Sector is to ensure the rational, equitable, efficient and economical use of the radio- frequency spectrum by all radiocommunication services, including satellite services, and carry out studies without limit of frequency range on the basis of which Recommendations are adopted. The regulatory and policy functions of the Radiocommunication Sector are performed by World and Regional Radiocommunication Conferences and Radiocommunication Assemblies supported by Study Groups. Policy on Intellectual Property Right (IPR) ITU-R policy on IPR is described in the Common Patent Policy for ITU-T/ITU-R/ISO/IEC referenced in Resolution ITU-R 1. Forms to be used for the submission of patent statements and licensing declarations by patent holders are available from http://www.itu.int/ITU-R/go/patents/en where the Guidelines for Implementation of the Common Patent Policy for ITU-T/ITU-R/ISO/IEC and the ITU-R patent information database can also be found. Series of ITU-R Recommendations (Also available online at http://www.itu.int/publ/R-REC/en) Series Title BO Satellite delivery BR Recording for production, archival and play-out; film for television BS Broadcasting service (sound) BT Broadcasting service (television) F Fixed service M Mobile, radiodetermination, amateur and related satellite services P Radiowave propagation RA Radio astronomy RS Remote sensing systems S Fixed-satellite service SA Space applications and meteorology SF Frequency sharing and coordination between fixed-satellite and fixed service systems SM Spectrum management SNG Satellite news gathering TF Time signals and frequency standards emissions V Vocabulary and related subjects Note: This ITU-R Recommendation was approved in English under the procedure detailed in Resolution ITU-R 1.
    [Show full text]
  • (A/V Codecs) REDCODE RAW (.R3D) ARRIRAW
    What is a Codec? Codec is a portmanteau of either "Compressor-Decompressor" or "Coder-Decoder," which describes a device or program capable of performing transformations on a data stream or signal. Codecs encode a stream or signal for transmission, storage or encryption and decode it for viewing or editing. Codecs are often used in videoconferencing and streaming media solutions. A video codec converts analog video signals from a video camera into digital signals for transmission. It then converts the digital signals back to analog for display. An audio codec converts analog audio signals from a microphone into digital signals for transmission. It then converts the digital signals back to analog for playing. The raw encoded form of audio and video data is often called essence, to distinguish it from the metadata information that together make up the information content of the stream and any "wrapper" data that is then added to aid access to or improve the robustness of the stream. Most codecs are lossy, in order to get a reasonably small file size. There are lossless codecs as well, but for most purposes the almost imperceptible increase in quality is not worth the considerable increase in data size. The main exception is if the data will undergo more processing in the future, in which case the repeated lossy encoding would damage the eventual quality too much. Many multimedia data streams need to contain both audio and video data, and often some form of metadata that permits synchronization of the audio and video. Each of these three streams may be handled by different programs, processes, or hardware; but for the multimedia data stream to be useful in stored or transmitted form, they must be encapsulated together in a container format.
    [Show full text]
  • Preview - Click Here to Buy the Full Publication
    This is a preview - click here to buy the full publication IEC 62481-2 ® Edition 2.0 2013-09 INTERNATIONAL STANDARD colour inside Digital living network alliance (DLNA) home networked device interoperability guidelines – Part 2: DLNA media formats INTERNATIONAL ELECTROTECHNICAL COMMISSION PRICE CODE XH ICS 35.100.05; 35.110; 33.160 ISBN 978-2-8322-0937-0 Warning! Make sure that you obtained this publication from an authorized distributor. ® Registered trademark of the International Electrotechnical Commission This is a preview - click here to buy the full publication – 2 – 62481-2 © IEC:2013(E) CONTENTS FOREWORD ......................................................................................................................... 20 INTRODUCTION ................................................................................................................... 22 1 Scope ............................................................................................................................. 23 2 Normative references ..................................................................................................... 23 3 Terms, definitions and abbreviated terms ....................................................................... 30 3.1 Terms and definitions ............................................................................................ 30 3.2 Abbreviated terms ................................................................................................. 34 3.4 Conventions .........................................................................................................
    [Show full text]
  • PD9000/37 Philips Portable DVD Player
    Philips Portable DVD Player 22.9 cm (9") LCD 5-hr playtime PD9000 Enjoy movies longer, while on the go Enjoy your movies anytime, anyplace! The PD9000 portable DVD player features 9” TFT LCD screen for your great viewing experience. You can indulge in up to 5 hours of DVD/ DivX®/MPEG movies, MP3-CD/CD music and JPEG photos on the go. Play your movies, music and photos on the go • DVD, DVD+/-R, DVD+/-RW, (S)VCD, CD compatible • DivX Certified for standard DivX video playback • MP3-CD, CD and CD-RW playback • View JPEG images from picture disc Enrich your AV entertainment experience • 22.9 cm (9") TFT color widescreen LCD display • Built-in stereo speakers Extra touches for your convenience • Up to 5-hour playback with a built-in battery* • AC adaptor, car adaptor and AV cable included • Car mount pouch included for easy in-car use • Full Resume on Power Loss Portable DVD Player PD9000/37 22.9 cm (9") LCD 5-hr playtime Highlights Specifications DVD, DVD+/-RW, (S)VCD, CD DivX Certified Picture/Display With DivX® support, you are able to enjoy DivX • Diagonal screen size (inch): 9 inch encoded videos and movies from the Internet, • Resolution: 640(w)x220(H)x3(RGB) including purchased Hollywood films. The DivX media •Display screen type: LCD TFT format is an MPEG-4 based video compression technology that enables you to save large files like Sound movies, trailers and music videos on media like CD-R/ • Output Power: 250mW RMS(built-in speakers) RW and DVD recordable discs, USB storage and • Output power (RMS): 5mW RMS(earphone) other memory cards for playback on your DivX • Signal to noise ratio: >80dB(earphone), >62dB(built- Certified® Philips device.
    [Show full text]
  • International Organisation for Standardisation Organisation Internationale De Normalisation Iso/Iec Jtc1/Sc29/Wg11 Coding of Moving Pictures and Audio
    INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29/WG11 MPEG98/N2425 Atlantic City - October 1998 Source: Audio and Test subgroup Status: Approved Title: MPEG-4 Audio verification test results: Audio on Internet Authors: Eric Scheirer, Sang-Wook Kim, Martin Dietz Table of Contents 1. Introduction.....................................................................................................................................2 2. Test motivation................................................................................................................................2 3. Codecs under Test ...........................................................................................................................3 4. Test Material...................................................................................................................................5 5. Test methodology ............................................................................................................................7 6. Test stimuli .....................................................................................................................................7 7. Test sessions ...................................................................................................................................8 8. Data Analysis..................................................................................................................................8
    [Show full text]
  • Lossy Audio Compression Identification
    2018 26th European Signal Processing Conference (EUSIPCO) Lossy Audio Compression Identification Bongjun Kim Zafar Rafii Northwestern University Gracenote Evanston, USA Emeryville, USA [email protected] zafar.rafi[email protected] Abstract—We propose a system which can estimate from an compression parameters from an audio signal, based on AAC, audio recording that has previously undergone lossy compression was presented in [3]. The first implementation of that work, the parameters used for the encoding, and therefore identify the based on MP3, was then proposed in [4]. The idea was to corresponding lossy coding format. The system analyzes the audio signal and searches for the compression parameters and framing search for the compression parameters and framing conditions conditions which match those used for the encoding. In particular, which match those used for the encoding, by measuring traces we propose a new metric for measuring traces of compression of compression in the audio signal, which typically correspond which is robust to variations in the audio content and a new to time-frequency coefficients quantized to zero. method for combining the estimates from multiple audio blocks The first work to investigate alterations, such as deletion, in- which can refine the results. We evaluated this system with audio excerpts from songs and movies, compressed into various coding sertion, or substitution, in audio signals which have undergone formats, using different bit rates, and captured digitally as well lossy compression, namely MP3, was presented in [5]. The as through analog transfer. Results showed that our system can idea was to measure traces of compression in the signal along identify the correct format in almost all cases, even at high bit time and detect discontinuities in the estimated framing.
    [Show full text]
  • What Is Ogg Vorbis?
    Ogg Vorbis Audio Compression Format Norat Rossello Castilla 5/18/2005 Ogg Vorbis 1 What is Ogg Vorbis? z Audio compression format z Comparable to MP3, VQF, AAC, TwinVQ z Free, open and unpatented z Broadcasting, radio station and television by internet ( = Streaming) 5/18/2005 Ogg Vorbis 2 1 About the name… z Ogg = name of Xiph.org container format for audio, video and metadata z Vorbis = name of specific audio compression scheme designed to be contained in Ogg FOR MORE INFO... https://www.xiph.org 5/18/2005 Ogg Vorbis 3 Some comercial characteristics z The official mime type was approved in February 2003 z Posible to encode all music or audio content in Vorbis z Designed to not be proprietary or patented audio format z Patent and licensed-free z Specification in public domain 5/18/2005 Ogg Vorbis 4 2 Audio Compression z Two classes of compression algorithms: - Lossless - Lossy FOR MORE INFO... http://www.firstpr.com.au/audiocomp 5/18/2005 Ogg Vorbis 5 Lossless algorithms z Produce compressed data that can be decoded to output that is identical to the original. z Zip, FLAC for audio 5/18/2005 Ogg Vorbis 6 3 Lossy algorithms z Discard data in order to compress it better than would normally be possible z VORBIS, MP3, JPEG z Throw away parts of the audio waveform that are irrelevant. 5/18/2005 Ogg Vorbis 7 Ogg Vorbis - Compression Factors z Vorbis is an audio codec that generates 16 bit samples at 16KHz to 48KHz, providing variable bit rates from 16 to 128 Kbps per channel FOR MORE INFO..
    [Show full text]