(12) United States Patent (10) Patent N0.: US 7,788,091 B2 Goudar Et A]

Total Page:16

File Type:pdf, Size:1020Kb

(12) United States Patent (10) Patent N0.: US 7,788,091 B2 Goudar Et A] US007788091B2 (12) United States Patent (10) Patent N0.: US 7,788,091 B2 Goudar et a]. (45) Date of Patent: Aug. 31, 2010 (54) METHODS, DEVICES AND SYSTEMS FOR 6,766,289 B2 7/2004 Kandhadai et a1. IMPROVED PITCH ENHANCEMENT AND 6,789,059 B2 9/2004 Kandhadai et a1. AUTOCORRELATION IN VOICE CODECS 2002/0007269 A1 V2002 G30 2002/0103638 A1 8/2002 Gao (75) Inventors: ChanaveeragoudaV. Goudar, 2003/0078771 A1 4/2003 Jung et 31' Bangalore (IN); Murali M. Deshpande, ghyssen _ ee et al. Bangalore (IN); Pankal Rabha’ 2004/0181400 A1 * 9/2004 Kannan et al. ............ .. 704/223 Bangalore (IN) OTHER PUBLICATIONS (73) Asslgneez Texas Instruments Incorporated’ 3GPP2, “Selectable Mode Vocoder Service Option for Wideband Dallas’ TX (Us) Spread Spectrum Communication Systems,” Dec. 2001, C.S0030-0, _ _ _ _ _ Version 2.0.* ( * ) Nonce: sublect to any dlsclalmer: the term Ofthls 3GPP2, “Selectable Mode Vocoder Service Option for Wideband Pawnt is extended or adjusted under 35 Spread Spectrum Communication Systems,” Dec. 2001, C.S0030-0, U.S.C. 154(1)) by 1315 days. Version 2.0, pp. 3-4, 13-14, 61, 143-145, 159-178, Figs. 5.3-1 and 5.6-1,2. (21) Appl. No.: 11/231,686 Baron, “Five Chips From Tl4or, Is It Six?” Microprocessor Report, Instat-MDR, Mar. 17, 2003, Figs. 1, 3, 4. (22) Filed: Sep. 21, 2005 * Cited by examiner (65) Prior Publication Data Primary ExamineriAngela A Armstrong Us 6’ (74-)AZZOI’I’I6J), Agent, OrFirmiWade Brady, Frederick J. Telecky, Jr. Related US. Application Data (57) ABSTRACT (60) Provisional application No. 60/612,494, ?led on Sep. 22, 2004, provisional application NO_ 60/612,497, An electronic circuit includes a storage circuit and a micro ?led on Sep 22, 2004' processor operable together With the storage circuit as a speech coder. The speech coder has a backward pitch (51) Int. Cl. enhancement in frames or subframes having a length and at G10L 11/04 (2006.01) least one main pulse and at least one backward pitch enhance G10L 19/12 (2006.01) ment pulse preceding the main pulse by a portion of the length (52) US. Cl. ...................... .. 704/207; 704/223; 704/221 Called a pitch lag, and is operable to limit in number any such (58) Field of Classi?cation Search ..................... .. None backward Pitch enhancement Pulse Or pulses to a predeter See application ?le for Complete Search history mined maximum number more than none upon an occurrence When the length divided by the pitch lag is at least one more (56) References Cited than that maximum number. Other forms of the invention involve systems, circuits, devices, processes and methods of U.S. PATENT DOCUMENTS operation. 6,470,309 B1 10/2002 McCree 6,493,665 B1 12/2002 Su et al. 40 Claims, 9 Drawing Sheets 320 330 340 \ \ / SPEECH, PITCH PERCEPTUAL WEIGHTED INFORMATION WEIGHTING SPEECH / \ FILTER MODIFICATION 350 3/65 /360 SELECT RATE PI'JKQON AND FRAME RATE ssof ESTI TYPE DEPENDENT. TYPE- —> . DEPENDENT SPEECH PROCESSING CTRL _’ VAD CLASSIFY LPC / \ 395/ ANALYZE 385 370 L- LSFSMOOTH LSF QUANTIZATION / \ 390 375 US. Patent Aug. 31, 2010 Sheet 1 019 US 7,788,091 B2 1000 CELILAR WLAN AP —DSL INTERNETO— _ BASE (ACCESS CABLE / ——ETHERNET 1040 \ 1060 CELL PHONE PCB 1020 O / 1010 1080 ‘F / VOICE <-> DSL WLAN <—> CABLE @ @ GATEWAY A_ © @D 1085 C) Q Q USB 0 O G Y G O 6 PERSONAL _DSL _ 1055 o O O :lCOMPUTER ——-—CAB|_E @ —I:_I'HERNET :4 \ 1050 FIG. 1 US. Patent Aug. 31, 2010 Sheet 5 of9 US 7,788,091 B2 FIG. 5 320 \ SPEECH PRE-PROCESSING I I 330\ SPEECH RATE /520 WEIGHTING DECISION I WEIGHTED SPEECH 340/ MODIFICATION I v CODEBOOK PROCESSING 550/ ACB/FCB1/FCB2/FCB3 ERROR FUNCTION: TARGET FILTER SIGNAL MATRIX \ /J 2 8 = llTg-glHlcll / \ GAIN EXCITATION vECToR _:_ _0_ ~ 0 FIG. 6 < ~ 0 Ei=c_+pi= - + - 5 0 ~ 1 <— PULSE — ~ 0 —**_ _0_ elNTP V PITCH ENHANCEMENT: | LSF US. Patent Aug. 31, 2010 Sheet 6 of9 US 7,788,091 B2 FIG. 7 710 \ CALCULATION OF IMPULSE RESPONSE OF WEIGHTED SYNTHESIS FILTER I PITCH ENHANCEMENT OF IMPULSE 720 RESPONSE USING FORWARD AND \ BACKWARD ENHANCEMENTS NUMBER OF BACKWARD ENHANCEMENT Pmax=(|nt)Subframe Size/Pitch lag I PITCH ENHANCED IMPULSE RESPONSE 730 / FOR FIXED CODEBOOK SEARCH FIG. 8 310 \ CALCULATION OF IMPULSE RESPONSE OF WEIGHTED SYNTHESIS FILTER I 820 \ CALCULATE NUMBER OF BACKWARD ENHANCEMENT Pmax=(|nt)Subframe Size/Pitch lag 830 IS SUBFRAME SIZE =53 OR 54? Pmax=min (2,Pmax) 840 f I? PITCH ENHANCEMENT OF IMPULSE / RESPONSE USING FORWARD AND 850 BACKWARD ENHANCEMENTS I CALCULATE AUTOCORRELATION (Phi) 860 / MATRIX OF IMPULSE RESPONSES I CODEBOOK SEARCH USING Phi 870 / MATRIX BASED ENERGY CALCULATION US. Patent Aug. 31, 2010 Sheet 7 of9 US 7,788,091 B2 LAG=17 <I> FIG. 9 20% 25-3 30-: 35-: 40-5 45-3 50-: D 510 15 2025 30354045 50 LAG=17 FIG. 10A US. Patent Aug. 31, 2010 Sheet 8 of9 US 7,788,091 B2 LAG=25 FIG. ]0B 10-: 15-: 25: \ 30-: 35—: LAG=40 FIG. ]0C 0 51015 20 25 30 35 39 US. Patent Aug. 31, 2010 Sheet 9 0f 9 US 7,788,091 B2 1100 FIG. 11 / @1105 IDENTIFY REGIONS OF EQUAL /1110 IMPULSE RESPONSE VECTORS V imax =MAX(i) [1120 imi" : N (D IN REGION imax =MAX 7' 1130 SET i‘=imax AND I=ImaX / 1180 COMPUTE <I>(i,j) WHERE i=i' AND I=r AND STORE <1>(i,j) \1150 I INCREMENTALLY COMPUTE-<I>(i,j) (i min ,j'-i'+imin )} AND STORE @(i, V 1170 imin =1‘min imax =jmax '1 \118O I imin=imin+1 imax =imax \1190 _—______l US 7,788,091 B2 1 2 METHODS, DEVICES AND SYSTEMS FOR communications. An example is Voice over Internet Protocol IMPROVED PITCH ENHANCEMENT AND (VoIP) enabling phone calls over the Internet. AUTOCORRELATION IN VOICE CODECS Wireless and Wireline data communications using Wireless local area netWorks (WLAN), such as IEEE 802.11 compli CROSS-REFERENCE TO RELATED ant, have become especially popular in a Wide range of instal APPLICATIONS lations, ranging from home netWorks to commercial estab lishments. Other Wireless netWorks such as IEEE 802.16 This application is related to provisional US. Patent Appli (WiMax) are emerging. Short-range Wireless data communi cation Ser. No. 60/612,494, ?led Sep. 22, 2004, titled “Meth cation according to the “Bluetooth” and other IEEE 802.15 ods, Devices and Systems for Improved Pitch Enhancement technology permits computer peripherals to communicate in Voice Codecs,” for Which priority under 35 U.S.C. 119(e) With a personal computer or Workstation Within the same (1) is hereby claimed and Which is hereby incorporated herein room. by reference. Security is important in both Wireline and Wireless com This application is related to provisional US. Patent Appli munications for improved security of retail and other business cation Ser. No. 60/612,497, ?led Sep. 22, 2004, titled “Meth commercial transactions in electronic commerce and Wher ods, Devices and Systems for Improved Codebook Search for ever personal and/or commercial privacy is desirable. Added Voice Codecs,” for Which priority under 35 U.S.C. 1 19(e)(1) features and security add further processing tasks to the com is hereby claimed and Which is hereby incorporated herein by munications system. These portend added softWare and hard reference. Ware in systems Where affordability and poWer dissipation are This application is co-?led so that the present U.S. non 20 already important concerns. provisional patent application “Methods, Devices and Sys In very general terms, a speech coder or voice coder is tems for Improved Codebook Search for Voice Codecs” Ser. based on the idea that the vocal chords and vocal tract are No. 11/231,643 and the present U.S. non-provisional patent analogous to a ?lter. The vocal chords and vocal tract gener application “Methods, Devices and Systems for Improved ally make a variety of sounds. Some sounds are voiced and Pitch Enhancement and Autocorrelation in Voice Codecs” 25 generally have a pitch level or levels at a given time. Other Ser. No. 11/231,686 each have the same application ?ling sounds are unvoiced and have a rushing or Whispering or date, and each of said patent applications hereby incorporates sudden consonantal sound to them. To facilitate the voice the other by reference. coding process, voice sounds are converted into an electrical Waveform by a microphone and analog to digital converter. STATEMENT REGARDING FEDERALLY 30 The electrical Waveform is conceptually cut up into succes SPONSORED RESEARCH OR DEVELOPMENT sive frames of a feW milliseconds in duration called a target signal. The frames are individually approximated by the voice Not applicable. coder electronics. In speech or voice coder electronics, pulses can be pro BACKGROUND OF THE INVENTION 35 vided at different times to excite a ?lter. Each pulse has a very Wide spectrum of frequencies Which are comprised in the pulse. The ?lter selects some of the frequencies such as by This invention is in the ?eld of information and communi passing only a band of frequencies, thus the term bandpass cations, and is more speci?cally directed to improved pro ?lter. Circuits and/or processes that provide various pulses, cesses, circuits, devices, and systems for information and 40 more or less ?ltered, excite the ?lter to supply as its output an communication processing, and processes of operating and approximation to the voice sounds of a target signal. Finding making them. Without limitation, the background is further the appropriate pulses to use for the excitation pulses for the described in connection With Wireless and Wireline commu voice coder approximation purposes is involved in the subject nications processing.
Recommended publications
  • Packetcable™ 2.0 Codec and Media Specification PKT-SP-CODEC
    PacketCable™ 2.0 Codec and Media Specification PKT-SP-CODEC-MEDIA-I10-120412 ISSUED Notice This PacketCable specification is the result of a cooperative effort undertaken at the direction of Cable Television Laboratories, Inc. for the benefit of the cable industry and its customers. This document may contain references to other documents not owned or controlled by CableLabs. Use and understanding of this document may require access to such other documents. Designing, manufacturing, distributing, using, selling, or servicing products, or providing services, based on this document may require intellectual property licenses from third parties for technology referenced in this document. Neither CableLabs nor any member company is responsible to any party for any liability of any nature whatsoever resulting from or arising out of use or reliance upon this document, or any document referenced herein. This document is furnished on an "AS IS" basis and neither CableLabs nor its members provides any representation or warranty, express or implied, regarding the accuracy, completeness, noninfringement, or fitness for a particular purpose of this document, or any document referenced herein. 2006-2012 Cable Television Laboratories, Inc. All rights reserved. PKT-SP-CODEC-MEDIA-I10-120412 PacketCable™ 2.0 Document Status Sheet Document Control Number: PKT-SP-CODEC-MEDIA-I10-120412 Document Title: Codec and Media Specification Revision History: I01 - Released 04/05/06 I02 - Released 10/13/06 I03 - Released 09/25/07 I04 - Released 04/25/08 I05 - Released 07/10/08 I06 - Released 05/28/09 I07 - Released 07/02/09 I08 - Released 01/20/10 I09 - Released 05/27/10 I10 – Released 04/12/12 Date: April 12, 2012 Status: Work in Draft Issued Closed Progress Distribution Restrictions: Authors CL/Member CL/ Member/ Public Only Vendor Key to Document Status Codes: Work in Progress An incomplete document, designed to guide discussion and generate feedback, that may include several alternative requirements for consideration.
    [Show full text]
  • Of the Cdma2000® System Specifications
    3GPP2 S.P0052-0 Ver.0.2.7 Date: August 21, 2003 1 2 System Release Guide for the 3 Release <ALPHA> 4 of the cdma2000® System Specifications 5 COPYRIGHT © 2003 3GPP2 and its Organizational Partners claim copyright in this document and individual Or- ganizational Partners may copyright and issue documents or standards publications in in- dividual Organizational Partner’s name based on this document. Requests for reproduction of this document should be directed to the 3GPP2 Secretariat at [email protected]. Requests to reproduce individual Organizational Partner’s documents should be directed to that Organizational Partner. See www.3gpp2.org for more information. 6 7 © 2003 3GPP2 - i - S.P0052: CDMA2000® System Release Guide –Release <Alpha> 1 Executive Summary 2 The System Release Guide (SRG) for the Release <ALPHA> provides an overview 3 for and reference to the Release <ALPHA> of the 3GPP2 wireless telecommuni- 4 cation system (cdma2000®) capabilities, features, and services. This document 5 is intended for use by persons and /or companies who are developing and / or 6 deploying cdma2000 systems or by persons who are otherwise interested in 7 cdma2000 systems. 8 Air interface support for HRPD and enhanced IOS are included and provide 9 high-speed forward link data rate service capability up to 2.4576 Mbps in a 10 1.25 MHz. Since cdma2000 uses many IP based protocols to a large degree, it 11 offers various features of IP based services. The system in this release contains 12 support for the Legacy System, and limited support for the 3GPP2 Legacy Mo- 13 bile Station Domain, making use of IP-based transport and signaling.
    [Show full text]
  • A Novel Transcoding Algorithm for SMV and G.723.1 Speech Coders Via Direct Parameter Transformation
    A Novel Transcoding Algorithm for SMV and G.723.1 Speech Coders via Direct Parameter Transformation Seongho Seo, Dalwon Jang, Sunil Lee, and Chang D. Yoo Department of Electrical Engineering and Computer Science, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea [email protected] Abstract as input. The length of each speech frame is 30 ms which cor- responds to 240 samples. Input speech of each frame is first In this paper, a novel transcoding algorithm for the Selectable high pass filtered and then divided into 4 subframes of 60 sam- Mode Vocoder (SMV) and the G.723.1 speech coder is pro- ples each. For every subframe, a 10th order linear prediction posed. In contrast to the conventional tandem transcoding al- coefficients (LPC) are computed. The linear predictive analysis gorithm, the proposed algorithm converts the parameters of one requires a look-ahead of 7.5 ms long. The LPC set for the last coder to the other without going through the decoding and en- subframe is quantized using the Predictive Split Vector Quan- coding process. The proposed algorithm is composed of four tizer (PSVQ) and transmitted. The unquantized LPC sets are parts: the parameter decoding, Line Spectral Pair (LSP) conver- used to obtain the perceptually weighted speech signal. For ev- sion, pitch period conversion and rate selection. The evaluation ery two subframes, the open-loop pitch analysis is performed in results show that the proposed algorithm achieves equivalent the domain of the weighted speech signal. Then, the adaptive speech quality to that of tandem transcoding with reduced com- codebook (ACB) and fixed codebook (FCB) are searched on a putational complexity and delay.
    [Show full text]
  • Network Working Group R. Gellens Request for Comments: 4281 Qualcomm Category: Standards Track D
    Network Working Group R. Gellens Request for Comments: 4281 Qualcomm Category: Standards Track D. Singer Apple P. Frojdh Ericsson November 2005 The Codecs Parameter for "Bucket" Media Types Status of This Memo This document specifies an Internet standards track protocol for the Internet community, and requests discussion and suggestions for improvements. Please refer to the current edition of the "Internet Official Protocol Standards" (STD 1) for the standardization state and status of this protocol. Distribution of this memo is unlimited. Copyright Notice Copyright (C) The Internet Society (2005). Abstract Several MIME type/subtype combinations exist that can contain different media formats. A receiving agent thus needs to examine the details of such media content to determine if the specific elements can be rendered given an available set of codecs. Especially when the end system has limited resources, or the connection to the end system has limited bandwidth, it would be helpful to know from the Content-Type alone if the content can be rendered. This document adds a new parameter, "codecs", to various type/subtype combinations to allow for unambiguous specification of the codecs indicated by the media formats contained within. By labeling content with the specific codecs indicated to render the contained media, receiving systems can determine if the codecs are supported by the end system, and if not, can take appropriate action (such as rejecting the content, sending notification of the situation, transcoding the content to a supported type, fetching and installing the required codecs, further inspection to determine if it will be sufficient to support a subset of the indicated codecs, etc.) Gellens, et al.
    [Show full text]
  • Superseded Packetcable™ Codec and Media Specification (PKT-SP-CODEC-MEDIA-I01-060406)
    PacketCable™ Codec and Media Specification PKT-SP-CODEC-MEDIA-I01-060406 ISSUED Notice This PacketCable specification is a cooperative effort undertaken at the direction of Cable Television Laboratories, Inc. (CableLabs®) for the benefit of the cable industry. Neither CableLabs, nor any other entity participating in the creation of this document, is responsible for any liability of any nature whatsoever resulting from or arising out of use or reliance upon this document by any party. This document is furnished on an AS-IS basis and neither CableLabs, nor other participating entity, provides any representation or warranty, express or implied, regarding its accuracy, completeness, or fitness for a particular purpose. © Copyright 2006 Cable Television Laboratories, Inc. All rights reserved. PKT-SP-CODEC-MEDIA-I01-060406 PacketCable™ Document Status Sheet Document Control Number: PKT-SP-CODEC-MEDIA-I01-060406 Document Title: Codec and Media Specification PacketCable Release: 2 Revision History: I01 – Released 04/05/2006 Date: April 6, 2006 Status: Work in Draft Issued Closed Progress Distribution Restrictions: Authors CL/Member CL/ Member/ Public Only Vendor Key to Document Status Codes: Work in Progress An incomplete document, designed to guide discussion and generate feedback, that may include several alternative requirements for consideration. Draft A document in specification format considered largely complete, but lacking review by Members and vendors. Drafts are susceptible to substantial change during the review process. Issued A stable document, which has undergone rigorous member and vendor review and is suitable for product design and development, cross-vendor interoperability, and for certification testing. Closed A static document, reviewed, tested, validated, and closed to further engineering change requests to the specification through CableLabs.
    [Show full text]
  • (12) United States Patent (10) Patent No.: US 6,445,696 B1 Foodeei Et Al
    USOO644.5696B1 (12) United States Patent (10) Patent No.: US 6,445,696 B1 Foodeei et al. (45) Date of Patent: Sep. 3, 2002 (54) EFFICIENT WARIABLE RATE CODING OF (57) ABSTRACT VOICE OVER ASYNCHRONOUSTRANSFER MODE The invention uses an ATM Adaptation Layer of type 2 (AAL2) standard mechanism to define efficient Support for (75) Inventors: Majid Foodeei, San Francisco; Variable Rate Coding (VRC). The VRC in this context Anthony E. Raetz, Menlo Park, both of typically refers to codecs, which adapt their rate to infor CA (US) mation content variations in Speech and audio. Such VRC (73) Assignee: Network Equipment Technologies, results in lower average rate than the constant rate codecs or Inc., Fremont, CA (US) use of constant rate codecs coupled with Silence Suppression - 0 (SS), currently deployed in voice over ATM schemes using (*) Notice: Subject to any disclaimer, the term of this AAL2. Possible ATM transport, trunking and access appli patent is extended or adjusted under 35 cations encompass both Circuit Emulation Services (CES) U.S.C. 154(b) by 0 days. and Local Loop Emulation Services (LLES). A typical VRC profile encompasses options for all Sub-rates within one or (21) Appl. No.: 09/513,667 more VRC standard or proprietary variable rate codec. The (22) Filed: Feb. 25, 2000 output of a rate determination algorithm (RDA), commonly part of variable rate codec, is fed into present AAL2 inter (51) Int. Cl." ............................. H04J 3/24; HO4L 12/56 working function (IWF). The IWF in AAL2, which normally (52) U.S. Cl. .................... 370/356; 370/395.6; 370/465; Supports SS or multiple rates (as opposed to voice content 370/469; 370/471; 370/474 VRC), is extended to accommodate VRC and thereafter (58) Field of Search ................................
    [Show full text]
  • Supported Codecs and Formats Codecs
    Supported Codecs and Formats Codecs: D..... = Decoding supported .E.... = Encoding supported ..V... = Video codec ..A... = Audio codec ..S... = Subtitle codec ...I.. = Intra frame-only codec ....L. = Lossy compression .....S = Lossless compression ------- D.VI.. 012v Uncompressed 4:2:2 10-bit D.V.L. 4xm 4X Movie D.VI.S 8bps QuickTime 8BPS video .EVIL. a64_multi Multicolor charset for Commodore 64 (encoders: a64multi ) .EVIL. a64_multi5 Multicolor charset for Commodore 64, extended with 5th color (colram) (encoders: a64multi5 ) D.V..S aasc Autodesk RLE D.VIL. aic Apple Intermediate Codec DEVIL. amv AMV Video D.V.L. anm Deluxe Paint Animation D.V.L. ansi ASCII/ANSI art DEVIL. asv1 ASUS V1 DEVIL. asv2 ASUS V2 D.VIL. aura Auravision AURA D.VIL. aura2 Auravision Aura 2 D.V... avrn Avid AVI Codec DEVI.. avrp Avid 1:1 10-bit RGB Packer D.V.L. avs AVS (Audio Video Standard) video DEVI.. avui Avid Meridien Uncompressed DEVI.. ayuv Uncompressed packed MS 4:4:4:4 D.V.L. bethsoftvid Bethesda VID video D.V.L. bfi Brute Force & Ignorance D.V.L. binkvideo Bink video D.VI.. bintext Binary text DEVI.S bmp BMP (Windows and OS/2 bitmap) D.V..S bmv_video Discworld II BMV video D.VI.S brender_pix BRender PIX image D.V.L. c93 Interplay C93 D.V.L. cavs Chinese AVS (Audio Video Standard) (AVS1-P2, JiZhun profile) D.V.L. cdgraphics CD Graphics video D.VIL. cdxl Commodore CDXL video D.V.L. cinepak Cinepak DEVIL. cljr Cirrus Logic AccuPak D.VI.S cllc Canopus Lossless Codec D.V.L.
    [Show full text]
  • Media and Radio Signal Processing for Mobile Communications Kyunghun Jung , Russell M
    Cambridge University Press 978-1-108-42103-4 — Media and Radio Signal Processing for Mobile Communications Kyunghun Jung , Russell M. Mersereau Frontmatter More Information Media and Radio Signal Processing for Mobile Communications Get to grips with the principles and practise of signal processing used in real mobile communications systems. Focusing particularly on speech and video processing, pion- eering experts employ a detailed, top-down analytical approach to outline the network architectures and protocol structures of multiple generations of mobile communications systems, identify the logical ranges where media and radio signal processing occur, and analyze the procedures for capturing, compressing, transmitting and presenting media. Chapters are uniquely structured to show the evolution of network architectures and technical elements between generations up to and including 5G, with an emphasis on maximizing service quality and network capacity through reusing existing infrastruc- ture and technologies. Examples and data taken from commercial networks provide an in-depth insight into the operation of a number of different systems, including GSM, cdma2000, W-CDMA, LTE, and LTE-A, making this a practical, hands-on guide for both practicing engineers and graduate students in wireless communications. Kyunghun Jung is a Principal Engineer at Samsung Electronics, where he leads the research and standardization for bringing immersive media services and vehicular applications to 5G systems. Russell M. Mersereau is Regents Professor Emeritus in the School of Electrical and Computer Engineering at the Georgia Institute of Technology, and a Fellow of the IEEE. © in this web service Cambridge University Press www.cambridge.org Cambridge University Press 978-1-108-42103-4 — Media and Radio Signal Processing for Mobile Communications Kyunghun Jung , Russell M.
    [Show full text]
  • Wideband Speech Coding Standards and Applications
    Wideband Speech Coding Standards and Applications Abstract Increasing the bandwidth of sound signals from the telephone bandwidth of 200-3400 Hz to the wider bandwidth of 50-7000 Hz results in increased intelligibility and naturalness of speech and gives a feeling of transparent communication. The emerging end-to-end digital communication systems enable the use of wideband speech coding in a wide area of applications. Recognizing the need of high quality wideband speech codecs, several standardization activities have been recently conducted, resulting in the selection of a new wideband speech codec, AMR-WB, at bit rates from 6.6 to 23.85 kbit/s by both 3GPP and ITU-T. The adoption of AMR-WB by the two bodies is of significant importance since for the first time the same codec is adopted for wireless as well as wireline services. This will eliminate the need for transcoding, and ease the implementation of wideband voice applications and services across a wide range of communication systems and platforms. This document presents a summary of wideband speech coding standards for wideband telephony applications. The quality advantages and applications of wideband speech coding are first presented, and the issue of telephony over packet networks is discussed. Several wideband speech coding standards are discussed and a special emphasis is given to the AMR-WB standard recently selected by 3GPP and ITU-T. 1. Introduction Most speech coding systems in use today are based on telephone-bandwidth narrowband speech, nominally limited to about 200-3400 Hz and sampled at a rate of 8 kHz. This limitation built into the conventional telephone system dates back to the first transcontinental telephone service established between New-York and San Francisco in 1915.
    [Show full text]
  • Selectable Mode Vocoder Service Option for Wideband Spread Spectrum Communication Systems
    3GPP2 C.S0030-0 Version 2.0 Date: December 2001 Selectable Mode Vocoder Service Option for Wideband Spread Spectrum Communication Systems COPYRIGHT 3GPP2 and its Organizational Partners claim copyright in this document and individual Organizational Partners may copyright and issue documents or standards publications in individual Organizational Partner's name based on this document. Requests for reproduction of this document should be directed to the 3GPP2 Secretariat at [email protected]. Requests to reproduce individual Organizational Partner's documents should be directed to that Organizational Partner. See www.3gpp2.org for more information. Intentionally left blank. C.S0030-0 Version 2.0 1 FOREWORD 2 These technical requirements form a standard for Service Option 56, a variable-rate 3 two-way speech service option. The maximum speech-coding rate of the service option is 4 8.55 kbps. 5 This standard does not address the quality or reliability of Service Option 56, nor does it 6 cover equipment performance or measurement procedures. 7 i C.S0030-0 Version 2.0 1 NOTES 2 1. Accompanying “Recommended Minimum Performance Standard for the Selectable 3 Mode Vocoder, Service Option 56,” provides specifications and measurement methods. 4 2. “Base station” refers to the functions performed on the land-line side, which are 5 typically distributed among a cell, a sector of a cell, a mobile switching center, and a 6 personal communications switching center. 7 3. This document uses the following verbal forms: “Shall” and “shall not” identify 8 requirements to be followed strictly to conform to the standard and from which no 9 deviation is permitted.
    [Show full text]
  • Selectable Mode Vocoder (SMV) Service Option for Wideband Spread Spectrum Communication Systems
    3GPP2 C.S0030-0 Version 3.0 Date: January 2004 1 2 Selectable Mode Vocoder (SMV) Service Option for 3 Wideband Spread Spectrum Communication 4 Systems 5 6 COPYRIGHT 3GPP2 and its Organizational Partners claim copyright in this document and individual Organizational Partners may copyright and issue documents or standards publications in individual Organizational Partner's name based on this document. Requests for reproduction of this document should be directed to the 3GPP2 Secretariat at [email protected]. Requests to reproduce individual Organizational Partner's documents should be directed to that Organizational Partner. See www.3gpp2.org for more information. 7 1 Intentionally left blank. C.S0030-0 v3.0 1 FOREWORD 2 These technical requirements form a standard for Service Option 56, a variable-rate 3 two-way speech service option. The maximum speech-coding rate of the service option is 4 8.55 kbps. 5 This standard does not address the quality or reliability of Service Option 56, nor does it 6 cover equipment performance or measurement procedures. 7 i C.S0030-0 v3.0 1 NOTES 2 1. Accompanying “Recommended Minimum Performance Standard for the Selectable 3 Mode Vocoder, Service Option 56,” provides specifications and measurement methods. 4 2. “Base station” refers to the functions performed on the land-line side, which are 5 typically distributed among a cell, a sector of a cell, a mobile switching center, and a 6 personal communications switching center. 7 3. This document uses the following verbal forms: “Shall” and “shall not” identify 8 requirements to be followed strictly to conform to the standard and from which no 9 deviation is permitted.
    [Show full text]
  • (12) United States Patent (10) Patent No.: US 7.254.533 B1 Jabri Et Al
    US00725.4533B1 (12) United States Patent (10) Patent No.: US 7.254.533 B1 Jabri et al. (45) Date of Patent: Aug. 7, 2007 (54) METHOD AND APPARATUS FOR A THIN 3GPP TS 26.090, “Adaptive Multi-Rate (AMR) speech codec: CELP VOICE CODEC Transcoding fuctions”, Release 5.0.0 (Jun. 2002), 3' Generation Partnership Project (3GPP), http://www.3gpp2.org/. (75) Inventors: Marwan A. Jabri, Broadway (AU); 3GPP TS 26.104 “ANSI-C code for the floating-point AMR speech Nicola Chong-White, Chatswood (AU): codec”. Release 5.00, (Jun. 2002) 3" Gerneration Partnership Jianwei Wang, Killarney Heights (AU) Project (3GPP), http://www.3gpp2.org/. 3GPP TS 26.173 “ANSI-C code for the Adaptive Multi-Rate (73) Assignee: Dilithium Networks Pty Ltd., Sydney, Wideband speech codec, (Mar. 2002) 3" Generation Partnership NSW (AU) Project (3GPP), http://www.3gpp2.org/. *) Notice: Subject to anyy disclaimer, the term of this 3GPP TS 26.190 “AMR Wideband speech codec; Transcoding patent is extended or adjusted under 35 Functions (Release 5), 3 Generation Partnership Project (3GPP); U.S.C. 154(b) by 872 days. Dec. 2001, http://www.3gpp2.org/. (21) Appl. No.: 10/688,857 (Continued) Primary Examiner V. Paul Harper (22) Filed: Oct. 17, 2003 (74) Attorney, Agent, or Firm Townsend and Townsend Related U.S. Application Data and Crew LLP (60) Provisional application No. 60/439.366, filed on Jan. (57) ABSTRACT 9, 2003, provisional application No. 60/419,776, filed on Oct. 17, 2002. An apparatus and method for encoding and decoding a voice (51) Int. Cl. signal.
    [Show full text]