MPEG-4 Speech Coding

Total Page:16

File Type:pdf, Size:1020Kb

MPEG-4 Speech Coding MPEG-4 Speech coding Masayuki Nishiguchi Audio and Speech Group, HomeNet Processing Lab, HomeNet Laboratories, Sony Corporation 1 OutlineOutline Background MPEG-4 Speech coding - features MPEG-4 CELP Algorithm Performance Demonstration MPEG-4 HVXC Algorithm Performance Demonstration Summary 2 BackBackggroundround Most of the existing speech coding standards support only a single “compression” functionality. Speech coding algorithms with high coding efficiency and multiple functionalities play important role for efficient use of bandwidth and emerging new applications of multimedia systems. 3 MPEG-4MPEG-4 SpeechSpeech CodinCodingg -- featuresfeatures • Two basic algorithms - HVXC (Harmonic Vector eXcitation Coding) - CELP (Code Excited Linear Prediction) • Multi bit-rates - 1.5 ~ 24 kbps • Narrow-band and wide-band - CELP • Lowest bit-rate as an international standard coding - HVXC - 2.0 kbps (fixed) ave 1.5 kbps (var) • New Functionalities - Speed / Pitch change - HVXC - Bit-rate scalability - HVXC, CELP - Bandwidth scalability - CELP 4 Quality MPEG-4 version-1 Natural Audio GA (AAC, TwinVQ) CD FM AM Telephone HVXC Cellular CELP (NB-CELP, WB-CELP) phone 2 4 8 16 32 64 Bit-rate (kbps) 5 MPEG-4MPEG-4 CELPCELP Narrow band 3.85-12.2 kbps 10-40 ms frame Wide band 10.9-23.8 kbps 10-20 ms frame Multi-rate 200 - 800 bps step Bit-rate scalability - 2.0kbps(NB), 4.0kbps(WB) step Bandwidth scalability Fine rate control Regular pulse - WB: Low complexity Multi pulse - WB, NB: High coding efficiency 6 BlockdiaBlockdiaggramram ofof thethe CELPCELP encoderencoder Speech Input LPC parameters LPC LSP Coefficient Bit-rate Interpolation Analysis VQ Excitation Input parameters Codebook MPE/RPE Long tern LPC Control Codebook Syn.Filter Syn.Filter - Weighted Error Calc. 7 StructureStructure ofof thethe bit-ratebit-rate scalablescalable codincodingg Basic 6kbps 6kbps speech Decoder-1 2kbps Speech Input 2kbps Encoder 8kbps High Quality 2kbps speech Decoder-2 10kbps 12kbps High Quality speech Decoder-3 22kbps Wideband speech Decoder-4 8 PerformancePerformance Speech quality of MPEG-4 speech codecs were evaluated in the official MPEG-4 verification tests in Aug 98 at 2 European labs and 1 Japanese lab*. 15 Japanese items were evaluated by 16 Japanese listeners in the Japanese Lab. 15 English/German/Swedish items were evaluated by 18 German and 16 Finnish listeners in the European Labs. * ISO/IEC JTC1/SC29/WG11 MPEG98/N2424 “Report on the MPEG-4 speech codec verification tests,” Oct. 1998 9 PerformancePerformance MOS 95%CI Narrow band CELP - Japanes e 5 GSM- EFR MNRU CELP Scalable CELP 12.2 kbps 40 dB 4 G.729 MNRU 8.0 kbps 30 dB 12.0 kbps 12.0 kbps 3 8.3 kbps 8.0 kbps 6.0 kbps G.723.1 MNRU 20 dB 6.3 kbps 2 MNRU 10 dB 1 ISO/IEC JTC1/SC29/WG11 MPEG98/N2424 “Report on the MPEG-4 speech codec verification tests,” Oct. 1998 10 MOS 95%CI Wideband CELP - Japanes e 5 MP E G.722 MNRU 17.9 kbps 56.0 kbps Layer III 40 dB 4 G.722 24 kbps 48.0 kbps MNRU BW Scalable 30 db 3 RPE 16.0 kbps 18.1 kbps MNRU 20 dB 2 MNRU 10 dB 1 ISO/IEC JTC1/SC29/WG11 MPEG98/N2424 “Report on the MPEG-4 speech codec verification tests,” Oct. 1998 11 MPEG-4MPEG-4 CELPCELP demonstrationdemonstration 6 kbps NB CELP 12 kbps NB CELP 22 kbps WB CELP (BW-scalable) CELP Demo samples are generated by NEC 12 MPEG-4MPEG-4 HVXCHVXC Low bit-rate / good quality - 2.0 / 4.0kbps (fixed) , 1.5 / 3.0kbps ave. (variable) - HVXC at 2.0kbps has higher quality than FS1016 CELP at 4.8kbps Bit-rate scalability - 2.0kbps decoding is possible using 4.0kbps bit-stream Speed change & Pitch change - Attractive for fast speech database search & browsing 13 ApproachApproach Two different types of coding schemes are combined. One is suitable for voiced segments and the other for unvoiced segments. Voiced: Phase information is thrown away by harmonic representation of power spectrum of LPC residual. Frequency domain analysis / synthesis. Unvoiced: Crisp consonant is obtained by CELP coding. Time domain analysis / synthesis. 14 EncoderEncoder LSP V / UV V / UV /MV / MV LPCAna. - Voiced - Input LSP VQ LPC Harmonic Pitch Inv. Filter FFT magnitudes detection estimation Pitch Dimension Weighted VQ Spectral conversion shape&gain CELP coding Stochastc codebook shape & gain - Unvoiced - 15 EncoderEncoder LSP V / UV V / UV /MV / MV LPCAna. - Voiced - Input LSP VQ LPC Harmonic Pitch Inv. Filter FFT magnitudes detection estimation Pitch Dimension Weighted VQ Spectral conversion shape&gain CELP coding Stochastc codebook shape & gain - Unvoiced - 16 EncoderEncoder LSP V / UV V / UV /MV / MV LPCAna. - Voiced - Input LSP VQ LPC Harmonic Pitch Inv. Filter FFT magnitudes detection estimation Pitch Dimension Weighted VQ Spectral conversion shape&gain CELP coding Stochastc codebook shape & gain - Unvoiced - 17 EncoderEncoder LSP V / UV V / UV /MV / MV LPCAna. - Voiced - Input LSP VQ LPC Harmonic Pitch Inv. Filter FFT magnitudes detection estimation Pitch Dimension Weighted VQ Spectral conversion shape&gain CELP coding Stochastc codebook shape & gain - Unvoiced - 18 EncoderEncoder LSP V / UV V / UV /MV / MV LPCAna. - Voiced - Input LSP VQ LPC Harmonic Pitch Inv. Filter FFT magnitudes detection estimation Pitch Dimension Weighted VQ Spectral conversion shape&gain CELP coding Stochastc codebook shape & gain - Unvoiced - 19 HarmonicHarmonic spectralspectral mamaggnitudesnitudes andand finefine pitchpitch estimationestimation Magnitude Pitch frequency Harmonic spectral envelope Freq 20 EncoderEncoder LSP V / UV V / UV /MV / MV LPCAna. - Voiced - Input LSP VQ LPC Harmonic Pitch Inv. Filter FFT magnitudes detection estimation Pitch Dimension Weighted VQ Spectral conversion shape&gain CELP coding Stochastc codebook shape & gain - Unvoiced - 21 DimensionDimension conversionconversion ofof HarmonicHarmonic spectralspectral mamaggnitudesnitudes Magnitude Magnitude Frequency Magnitude Frequency Frequency 22 EncoderEncoder LSP V / UV V / UV /MV / MV LPCAna. - Voiced - Input LSP VQ LPC Harmonic Pitch Inv. Filter FFT magnitudes detection estimation Pitch Dimension Weighted VQ Spectral conversion shape&gain CELP coding Stochastc codebook shape & gain - Unvoiced - 23 VectorVector quantizationquantization ofof harmonicharmonic spectralspectral envelopeenvelope -- basebase layerlayer -- Shape Codebook Fixed dimension harmonic spectrum -0 Gain Shape Weighting Codebook -1 Energy estimation 24 Scalable vector quantization of spectral envelope - base & enhancement layer - Index VQ of SE Shape0 SE Gain + VQ of SE Shape1 Index - Weighted + distortion Index DimensionDimension Dimension ConversionConversion Conversion Input - + VQ of VQ of VQ of SE SE . SE Shape2 Shape3 Shape5 Index Index Index 25 EncoderEncoder LSP V / UV V / UV /MV / MV LPCAna. - Voiced - Input LSP VQ LPC Harmonic Pitch Inv. Filter FFT magnitudes detection estimation Pitch Dimension Weighted VQ Spectral conversion shape&gain CELP coding Stochastc codebook shape & gain - Unvoiced - 26 Scalable CELP encoder for unvoiced segments - base and enhancement layer - Input LPC Speech Analysis Perceptual Weighting Filter VQ of LSP W(z) and Subtraction of Stochastic Codebook zero- Input response of H(z) + 6bits H(z) - + Perceptually GainCodebook Weighted Calculation LPC syn. Filter of Error 4bits Quantization Error Stochastic Codebook + 5bits H(z) - + Perceptually Weighted GainCodebook LPC syn. Filter Calculation of 3bits Error 27 DecoderDecoder LSP LSP Inv. VQ V / UV / MV Noise Pitch generation Output Harmonic synthesis LPC syn. Spectral Dimension Inv. VQ shape&gain conversion filter Postfilter Stochastic Stochastic Windowing shape codebook Parameter interpolation for speed control Stochastic gain 28 DecoderDecoder LSP LSP Inv. VQ V / UV / MV Noise Pitch generation Output Harmonic synthesis LPC syn. Spectral Dimension Inv. VQ shape&gain conversion filter Postfilter Stochastic Stochastic Windowing shape codebook Parameter interpolation for speed control Stochastic gain 29 DecoderDecoder LSP LSP Inv. VQ V / UV / MV Noise Pitch generation Output Harmonic synthesis LPC syn. Spectral Dimension Inv. VQ shape&gain conversion filter Postfilter Stochastic Stochastic Windowing shape codebook Parameter interpolation for speed control Stochastic gain 30 DecoderDecoder LSP LSP Inv. VQ V / UV / MV Noise Pitch generation Output Harmonic synthesis LPC syn. Spectral Dimension Inv. VQ shape&gain conversion filter Postfilter Stochastic Stochastic Windowing shape codebook Parameter interpolation for speed control Stochastic gain 31 DecoderDecoder LSP LSP Inv. VQ V / UV / MV Noise Pitch generation Output Harmonic synthesis LPC syn. Spectral Dimension Inv. VQ shape&gain conversion filter Postfilter Stochastic Stochastic Windowing shape codebook Parameter interpolation for speed control Stochastic gain 32 HarmonicHarmonic synthesissynthesis forfor voicedvoiced excitationexcitation t f f ()= ()()θ () ft∑ Amm tcos t m θωττ()=+t ( ) φ mmtd∫ 0 0 33 DecoderDecoder LSP LSP Inv. VQ V / UV / MV Noise Pitch generation Output Harmonic synthesis LPC syn. Spectral Dimension Inv. VQ shape&gain conversion filter Postfilter Stochastic Stochastic Windowing shape codebook Parameter interpolation for speed control Stochastic gain 34 ParameterParameter interpolationinterpolation forfor speedspeed controlcontrol arrays of original parameters : param[ n] arrays of interpolated parameters :mdf _ param[ m] time index before the time scale modification : n time index after the time scale modification : m ratio of speed change : spd spd > 1 speed up spd < 1 speed down define: define: = − fr= m * spd − 1 l m * spd fr0 0 = + r = − fr10 fr 1 fr1
Recommended publications
  • Speech Coding and Compression
    Introduction to voice coding/compression PCM coding Speech vocoders based on speech production LPC based vocoders Speech over packet networks Speech coding and compression Corso di Networked Multimedia Systems Master Universitario di Primo Livello in Progettazione e Gestione di Sistemi di Rete Carlo Drioli Università degli Studi di Verona Facoltà di Scienze Matematiche, Dipartimento di Informatica Fisiche e Naturali Speech coding and compression Introduction to voice coding/compression PCM coding Speech vocoders based on speech production LPC based vocoders Speech over packet networks Speech coding and compression: OUTLINE Introduction to voice coding/compression PCM coding Speech vocoders based on speech production LPC based vocoders Speech over packet networks Speech coding and compression Introduction to voice coding/compression PCM coding Speech vocoders based on speech production LPC based vocoders Speech over packet networks Introduction Approaches to voice coding/compression I Waveform coders (PCM) I Voice coders (vocoders) Quality assessment I Intelligibility I Naturalness (involves speaker identity preservation, emotion) I Subjective assessment: Listening test, Mean Opinion Score (MOS), Diagnostic acceptability measure (DAM), Diagnostic Rhyme Test (DRT) I Objective assessment: Signal to Noise Ratio (SNR), spectral distance measures, acoustic cues comparison Speech coding and compression Introduction to voice coding/compression PCM coding Speech vocoders based on speech production LPC based vocoders Speech over packet networks
    [Show full text]
  • Digital Speech Processing— Lecture 17
    Digital Speech Processing— Lecture 17 Speech Coding Methods Based on Speech Models 1 Waveform Coding versus Block Processing • Waveform coding – sample-by-sample matching of waveforms – coding quality measured using SNR • Source modeling (block processing) – block processing of signal => vector of outputs every block – overlapped blocks Block 1 Block 2 Block 3 2 Model-Based Speech Coding • we’ve carried waveform coding based on optimizing and maximizing SNR about as far as possible – achieved bit rate reductions on the order of 4:1 (i.e., from 128 Kbps PCM to 32 Kbps ADPCM) at the same time achieving toll quality SNR for telephone-bandwidth speech • to lower bit rate further without reducing speech quality, we need to exploit features of the speech production model, including: – source modeling – spectrum modeling – use of codebook methods for coding efficiency • we also need a new way of comparing performance of different waveform and model-based coding methods – an objective measure, like SNR, isn’t an appropriate measure for model- based coders since they operate on blocks of speech and don’t follow the waveform on a sample-by-sample basis – new subjective measures need to be used that measure user-perceived quality, intelligibility, and robustness to multiple factors 3 Topics Covered in this Lecture • Enhancements for ADPCM Coders – pitch prediction – noise shaping • Analysis-by-Synthesis Speech Coders – multipulse linear prediction coder (MPLPC) – code-excited linear prediction (CELP) • Open-Loop Speech Coders – two-state excitation
    [Show full text]
  • Speech Compression Using Discrete Wavelet Transform and Discrete Cosine Transform
    International Journal of Engineering Research & Technology (IJERT) ISSN: 2278-0181 Vol. 1 Issue 5, July - 2012 Speech Compression Using Discrete Wavelet Transform and Discrete Cosine Transform Smita Vatsa, Dr. O. P. Sahu M. Tech (ECE Student ) Professor Department of Electronicsand Department of Electronics and Communication Engineering Communication Engineering NIT Kurukshetra, India NIT Kurukshetra, India Abstract reproduced with desired level of quality. Main approaches of speech compression used today are Aim of this paper is to explain and implement waveform coding, transform coding and parametric transform based speech compression techniques. coding. Waveform coding attempts to reproduce Transform coding is based on compressing signal input signal waveform at the output. In transform by removing redundancies present in it. Speech coding at the beginning of procedure signal is compression (coding) is a technique to transform transformed into frequency domain, afterwards speech signal into compact format such that speech only dominant spectral features of signal are signal can be transmitted and stored with reduced maintained. In parametric coding signals are bandwidth and storage space respectively represented through a small set of parameters that .Objective of speech compression is to enhance can describe it accurately. Parametric coders transmission and storage capacity. In this paper attempts to produce a signal that sounds like Discrete wavelet transform and Discrete cosine original speechwhether or not time waveform transform
    [Show full text]
  • Speech Coding Using Code Excited Linear Prediction
    ISSN : 0976-8491 (Online) | ISSN : 2229-4333 (Print) IJCST VOL . 5, Iss UE SPL - 2, JAN - MAR C H 2014 Speech Coding Using Code Excited Linear Prediction 1Hemanta Kumar Palo, 2Kailash Rout 1ITER, Siksha ‘O’ Anusandhan University, Bhubaneswar, Odisha, India 2Gandhi Institute For Technology, Bhubaneswar, Odisha, India Abstract The main problem with the speech coding system is the optimum utilization of channel bandwidth. Due to this the speech signal is coded by using as few bits as possible to get low bit-rate speech coders. As the bit rate of the coder goes low, the intelligibility, SNR and overall quality of the speech signal decreases. Hence a comparative analysis is done of two different types of speech coders in this paper for understanding the utility of these coders in various applications so as to reduce the bandwidth and by different speech coding techniques and by reducing the number of bits without any appreciable compromise on the quality of speech. Hindi language has different number of stops than English , hence the performance of the coders must be checked on different languages. The main objective of this paper is to develop speech coders capable of producing high quality speech at low data rates. The focus of this paper is the development and testing of voice coding systems which cater for the above needs. Keywords PCM, DPCM, ADPCM, LPC, CELP Fig. 1: Speech Production Process I. Introduction Speech coding or speech compression is the compact digital The two types of speech sounds are voiced and unvoiced [1]. representations of voice signals [1-3] for the purpose of efficient They produce different sounds and spectra due to their differences storage and transmission.
    [Show full text]
  • An Ultra Low-Power Miniature Speech CODEC at 8 Kb/S and 16 Kb/S Robert Brennan, David Coode, Dustin Griesdorf, Todd Schneider Dspfactory Ltd
    To appear in ICSPAT 2000 Proceedings, October 16-19, 2000, Dallas, TX. An Ultra Low-power Miniature Speech CODEC at 8 kb/s and 16 kb/s Robert Brennan, David Coode, Dustin Griesdorf, Todd Schneider Dspfactory Ltd. 611 Kumpf Drive, Unit 200 Waterloo, Ontario, N2V 1K8 Canada Abstract SmartCODEC platform consists of an effi- cient, block-floating point, oversampled This paper describes a CODEC implementa- Weighted OverLap-Add (WOLA) filterbank, a tion on an ultra low-power miniature applica- software-programmable dual-Harvard 16-bit tion specific signal processor (ASSP) designed DSP core, two high fidelity 14-bit A/D con- for mobile audio signal processing applica- verters, a 14-bit D/A converter and a flexible tions. The CODEC records speech to and set of peripherals [1]. The system hardware plays back from a serial flash memory at data architecture (Figure 1) was designed to enable rates of 16 and 8 kb/s, with a bandwidth of 4 memory upgrades. Removable memory cards kHz. This CODEC consumes only 1 mW in a or more power-efficient memory could be package small enough for use in a range of substituted for the serial flash memory. demanding portable applications. Results, improvements and applications are also dis- The CODEC communicates with the flash cussed. memory over an integrated SPI port that can transfer data at rates up to 80 kb/s. For this application, the port is configured to block 1. Introduction transfer frame packets every 14 ms. e Speech coding is ubiquitous. Increasing de- c i WOLA Filterbank v mands for portability and fidelity coupled with A/D e and D the desire for reduced storage and bandwidth o i l Programmable d o r u utilization have increased the demand for and t D/A DSP Core A deployment of speech CODECs.
    [Show full text]
  • Cognitive Speech Coding Milos Cernak, Senior Member, IEEE, Afsaneh Asaei, Senior Member, IEEE, Alexandre Hyafil
    1 Cognitive Speech Coding Milos Cernak, Senior Member, IEEE, Afsaneh Asaei, Senior Member, IEEE, Alexandre Hyafil Abstract—Speech coding is a field where compression ear and undergoes a highly complex transformation paradigms have not changed in the last 30 years. The before it is encoded efficiently by spikes at the auditory speech signals are most commonly encoded with com- nerve. This great efficiency in information representation pression methods that have roots in Linear Predictive has inspired speech engineers to incorporate aspects of theory dating back to the early 1940s. This paper tries to cognitive processing in when developing efficient speech bridge this influential theory with recent cognitive studies applicable in speech communication engineering. technologies. This tutorial article reviews the mechanisms of speech Speech coding is a field where research has slowed perception that lead to perceptual speech coding. Then considerably in recent years. This has occurred not it focuses on human speech communication and machine because it has achieved the ultimate in minimizing bit learning, and application of cognitive speech processing in rate for transparent speech quality, but because recent speech compression that presents a paradigm shift from improvements have been small and commercial applica- perceptual (auditory) speech processing towards cognitive tions (e.g., cell phones) have been mostly satisfactory for (auditory plus cortical) speech processing. The objective the general public, and the growth of available bandwidth of this tutorial is to provide an overview of the impact has reduced requirements to compress speech even fur- of cognitive speech processing on speech compression and discuss challenges faced in this interdisciplinary speech ther.
    [Show full text]
  • International Organisation for Standardisation Organisation Internationale De Normalisation Iso/Iec Jtc1/Sc29/Wg11 Coding of Moving Pictures and Audio
    INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO ISO/IEC JTC1/SC29/WG11 MPEG98/N2425 Atlantic City - October 1998 Source: Audio and Test subgroup Status: Approved Title: MPEG-4 Audio verification test results: Audio on Internet Authors: Eric Scheirer, Sang-Wook Kim, Martin Dietz Table of Contents 1. Introduction.....................................................................................................................................2 2. Test motivation................................................................................................................................2 3. Codecs under Test ...........................................................................................................................3 4. Test Material...................................................................................................................................5 5. Test methodology ............................................................................................................................7 6. Test stimuli .....................................................................................................................................7 7. Test sessions ...................................................................................................................................8 8. Data Analysis..................................................................................................................................8
    [Show full text]
  • A Novel Speech Enhancement Approach Based on Modified Dct and Improved Pitch Synchronous Analysis
    American Journal of Applied Sciences 11 (1): 24-37, 2014 ISSN: 1546-9239 ©2014 Science Publication doi:10.3844/ajassp.2014.24.37 Published Online 11 (1) 2014 (http://www.thescipub.com/ajas.toc) A NOVEL SPEECH ENHANCEMENT APPROACH BASED ON MODIFIED DCT AND IMPROVED PITCH SYNCHRONOUS ANALYSIS 1Balaji, V.R. and 2S. Subramanian 1Department of ECE, Sri Krishna College of Engineering and Technology, Coimbatore, India 2Department of CSE, Coimbatore Institute of Engineering and Technology, Coimbatore, India Received 2013-06-03, Revised 2013-07-17; Accepted 2013-11-21 ABSTRACT Speech enhancement has become an essential issue within the field of speech and signal processing, because of the necessity to enhance the performance of voice communication systems in noisy environment. There has been a number of research works being carried out in speech processing but still there is always room for improvement. The main aim is to enhance the apparent quality of the speech and to improve the intelligibility. Signal representation and enhancement in cosine transformation is observed to provide significant results. Discrete Cosine Transformation has been widely used for speech enhancement. In this research work, instead of DCT, Advanced DCT (ADCT) which simultaneous offers energy compaction along with critical sampling and flexible window switching. In order to deal with the issue of frame to frame deviations of the Cosine Transformations, ADCT is integrated with Pitch Synchronous Analysis (PSA). Moreover, in order to improve the noise minimization performance of the system, Improved Iterative Wiener Filtering approach called Constrained Iterative Wiener Filtering (CIWF) is used in this approach. Thus, a novel ADCT based speech enhancement using improved iterative filtering algorithm integrated with PSA is used in this approach.
    [Show full text]
  • Speech Compression
    information Review Speech Compression Jerry D. Gibson Department of Electrical and Computer Engineering, University of California, Santa Barbara, CA 93118, USA; [email protected]; Tel.: +1-805-893-6187 Academic Editor: Khalid Sayood Received: 22 April 2016; Accepted: 30 May 2016; Published: 3 June 2016 Abstract: Speech compression is a key technology underlying digital cellular communications, VoIP, voicemail, and voice response systems. We trace the evolution of speech coding based on the linear prediction model, highlight the key milestones in speech coding, and outline the structures of the most important speech coding standards. Current challenges, future research directions, fundamental limits on performance, and the critical open problem of speech coding for emergency first responders are all discussed. Keywords: speech coding; voice coding; speech coding standards; speech coding performance; linear prediction of speech 1. Introduction Speech coding is a critical technology for digital cellular communications, voice over Internet protocol (VoIP), voice response applications, and videoconferencing systems. In this paper, we present an abridged history of speech compression, a development of the dominant speech compression techniques, and a discussion of selected speech coding standards and their performance. We also discuss the future evolution of speech compression and speech compression research. We specifically develop the connection between rate distortion theory and speech compression, including rate distortion bounds for speech codecs. We use the terms speech compression, speech coding, and voice coding interchangeably in this paper. The voice signal contains not only what is said but also the vocal and aural characteristics of the speaker. As a consequence, it is usually desired to reproduce the voice signal, since we are interested in not only knowing what was said, but also in being able to identify the speaker.
    [Show full text]
  • What Is Ogg Vorbis?
    Ogg Vorbis Audio Compression Format Norat Rossello Castilla 5/18/2005 Ogg Vorbis 1 What is Ogg Vorbis? z Audio compression format z Comparable to MP3, VQF, AAC, TwinVQ z Free, open and unpatented z Broadcasting, radio station and television by internet ( = Streaming) 5/18/2005 Ogg Vorbis 2 1 About the name… z Ogg = name of Xiph.org container format for audio, video and metadata z Vorbis = name of specific audio compression scheme designed to be contained in Ogg FOR MORE INFO... https://www.xiph.org 5/18/2005 Ogg Vorbis 3 Some comercial characteristics z The official mime type was approved in February 2003 z Posible to encode all music or audio content in Vorbis z Designed to not be proprietary or patented audio format z Patent and licensed-free z Specification in public domain 5/18/2005 Ogg Vorbis 4 2 Audio Compression z Two classes of compression algorithms: - Lossless - Lossy FOR MORE INFO... http://www.firstpr.com.au/audiocomp 5/18/2005 Ogg Vorbis 5 Lossless algorithms z Produce compressed data that can be decoded to output that is identical to the original. z Zip, FLAC for audio 5/18/2005 Ogg Vorbis 6 3 Lossy algorithms z Discard data in order to compress it better than would normally be possible z VORBIS, MP3, JPEG z Throw away parts of the audio waveform that are irrelevant. 5/18/2005 Ogg Vorbis 7 Ogg Vorbis - Compression Factors z Vorbis is an audio codec that generates 16 bit samples at 16KHz to 48KHz, providing variable bit rates from 16 to 128 Kbps per channel FOR MORE INFO..
    [Show full text]
  • User Manual Contents
    Register your product and get support at 8209 www.philips.com/welcome 48PFS8209 48PFS8209 55PFS8209 55PFS8209 User Manual Contents 6.7 Batteries 28 1 TV Tour 4 6.8 Cleaning 28 1.1 Android TV 4 1.2 Apps 4 7 Gesture Control 30 1.3 Movies and Missed Shows 4 7.1 About Gesture Control 30 1.4 Social Networks 4 7.2 Camera 30 1.5 Pause TV and Recordings 4 7.3 Hand Gestures 30 1.6 Gaming 4 7.4 Gesture Overview 30 1.7 Skype 4 7.5 Tips 30 1.8 3D 5 1.9 Smartphones and Tablets 5 8 Home Menu 32 8.1 Open the Home Menu 32 2 Setting Up 6 8.2 Overview 32 2.1 Read Safety 6 8.3 Notifications 32 2.2 TV Stand and Wall Mounting 6 8.4 Search 32 2.3 Tips on Placement 6 2.4 Power Cable 6 9 Now on TV 33 2.5 Antenna Cable 6 9.1 About Now on TV 33 2.6 Satellite Dish 7 9.2 What You Need 33 9.3 Using Now on TV 33 3 Network 8 3.1 Connect to Network 8 10 Apps 34 3.2 Network Settings 9 10.1 About Apps 34 3.3 Network Devices 10 10.2 Install an App 34 3.4 File Sharing 10 10.3 Start an App 34 10.4 Chrome™ 34 4 Connections 11 10.5 App Lock 34 4.1 Tips on Connections 11 10.6 Widgets 35 4.2 EasyLink HDMI CEC 12 10.7 Remove Apps and Widgets 35 4.3 CI+ CAM with Smart Card 13 10.8 Clear Internet Memory 35 4.4 Set-Top Box - STB 14 10.9 Android Settings 35 4.5 Satellite Receiver 14 10.10 Terms of Use - Apps 36 4.6 Home Theatre System - HTS 15 4.7 Blu-ray Disc Player 16 11 Video on Demand 37 4.8 DVD Player 16 11.1 About Video on Demand 37 4.9 Game Console 17 11.2 Rent a Movie 37 4.10 Gamepad 17 11.3 Streaming 37 4.11 USB Hard Drive 18 12 TV on Demand 38 4.12 USB Keyboard or Mouse
    [Show full text]
  • Meeting Abstracts
    PROGRAM OF The 118th Meeting of the AcousticalSociety of America Adam's Mark Hotel ß St. Louis, Missouri ß 27 November-1 December 1989 MONDAY EVENING, 27 NOVEMBER 1989 ST. LOUIS BALLROOM D, 7:00 TO 9:00 P.M. Tutorial on Architectural Acoustics Mauro Pierucci, Chairman Departmentof Aerospaceand EngineeringMechanics, San DiegoState University,San Diego, California 92182 TUI. Architecturalacoustics: The forgottendimension. Ewart A. Wetherill (Wilson, lhrig, and Associates, lnc., 5776 Broadway,Oakland, CA 94618) The basic considerationsof architeclural acouslics---isolation from unwanled noise and vibration, control of mechanicalsystem noise, and room acousticsdesign---are all clearly exemplifiedin Sabinc'sdesign for BostonSymphony Hall. Openedin ! 900,this hall isone of theoutstanding successes in musical acoustics. Yet, aswe approachthe hundredthanniversary of Sabine'sfirst experiments, acoustical characteristics remain one of the leastconsidered aspects of buildingdesign. This is due, in part, to the difficultyof visualizingthe acouslicaloutcome of designdecisions, complicated by individualjudgment as to whatconstitutes good acous- tics. However,the lack of a comprehensiveteaching program remains the dominantproblem. Significant advancesover the past 2 or 3 decadesin measurementand evaluationhave refinedthe ability to design predictabilityand to demonsIrateacoustical concerns to others. New techniquessuch as sound intensity measurements,new descriptors for roomacoustics phenomena, and the refinemen t of recording,analysis, and amplificationtechniques provide fresh insights into the behaviorof soundin air and other media.These topics are reviewedwith particularemphasis on the needfor a comparableadvance in translationof acousticprinci- plesinto buildingtechnologies. Sl J. Acoust.Soc. Am. Suppl. 1, VoL86, Fall1989 118thMeeting: Acoustical Society of America S1 TUESDAY MORNING, 28 NOVEMBER 1989 ST. LOUIS BALLROOM C, 8:00 A.M. TO 12:00 NOON SessionA.
    [Show full text]