(12) United States Patent (10) Patent No.: US 7,848,922 B1 Jabri Et Al

Total Page:16

File Type:pdf, Size:1020Kb

(12) United States Patent (10) Patent No.: US 7,848,922 B1 Jabri Et Al US007848922B1 (12) United States Patent (10) Patent No.: US 7,848,922 B1 Jabri et al. (45) Date of Patent: *Dec. 7, 2010 (54) METHOD AND APPARATUS FORATHIN 6,115,688 A * 9/2000 Brandenburg et al. ....... TO4,503 AUDIO CODEC 6,115,689 A * 9/2000 Malvar ................ ... TO4,503 6,167,373 A 12/2000 Morii ......................... TO4,219 (76) Inventors: Marwan A. Jabri, 656 Hilary Dr., 6,314.393 B1 1 1/2001 Zheng et al. Tiburon, CA (US) 94920; Nicola 6.424,939 B1* 7/2002 Herre et al. ................. TO4,219 Chong-White, 364 Penshurst St., 6,717,955 B1 4/2004 Holler Chatswood, NSW (AU) 2067: Jianwei 6,799,060 B1* 9/2004 Kim ........................... 455,563 Wang, 2700 Lincoln Village Cir., Apt. 6,807,524 B1 * 10/2004 Bessette et al. .......... 704/200.1 291, Larkspur, CA (US) 94939 6,912,584 B2 * 6/2005 Wang et al. ................. TO9,231 (*) Notice: Subject to any disclaimer, the term of this (Continued) patent is extended or adjusted under 35 U.S.C. 154(b) by 512 days. OTHER PUBLICATIONS 3GPP TS 26.073 ANSI-C code for the Adaptive Multi Rate (AMR This patent is Subject to a terminal dis- speech codec'. Release 5.00, (Mar. 2002) SEN R claimer. Project (3GPP), http://www.3gpp2.org/. (21) Appl. No.: 11/890,263 (Continued) (22) Filed: Aug. 2, 2007 Primary Examiner Vijay B Chawan (74) Attorney, Agent, or Firm Hoffmann & Baron, LLP Related U.S. Application Data (57) ABSTRACT (63) Continuation of application No. 10/688,857, filed on Oct. 17, 2003, now Pat. No. 7,254,533. An apparatus and method for encoding and decoding a voice (60) Provisional application No. 60/439,366, filed on Jan. signal. The apparatus includes an encoder configured togen 9, 2003, provisional application No. 60/419,776, filed erate an output bitstream signal from an input voice signal. on Oct. 17, 2002. The output bitstream signal is associated with at least a first standard of a first plurality of CELP voice compression stan (51) Int. Cl. dards. Additionally, the apparatus includes a decoder config GIOL 9/04 (2006.01) ured to generate an output voice signal from an input bit (52) U.S. Cl. .................... 704/219; 704/220; 704/200.1; Stream signal. The input bitstream signal is associated with at 704/229; 704/223; 704/262 least a first standard of a second plurality of CELP voice (58) Field of Classification Search ................. 704/219, compression standards. The CELP encoder includes a plural 704/220, 230, 200.1, 500-504, 223, 229, ity of codec-specific encoder modules. Additionally, the 704/262, 264 CELP encoder includes a plurality of generic encoder mod See application file for complete search history. ules. The CELP decoder includes a plurality of codec-specific (56) References Cited decoder modules. Additionally, the CELP decoder includes a plurality of generic decoder modules. U.S. PATENT DOCUMENTS 5,787,390 A * 7/1998 Quinquis et al. ............ TO4,219 20 Claims, 23 Drawing Sheets 1712 s(n) PC Perceptual t 1710 processing fairfLP Analysis-> Classification-E Rate 18 pitch search Gain index Calculate Generate Frame Gain random quantization excitation -:FrameType Rate %NELP encoding Determination Rate 4 or Rate 1 CELP encoding LPC to LSP LSP Type 0 Rate Ar Rate 1 conversion quantization ACBlag, ACB gain, FCBlag, 1716 FCBgain, pred. Rate 4 or Rate 1 RCELP encoding switch, interp. path Type 1 Rate: or Rate 1 ACBlag, ACB gain, FCBlag, FCBgain, pred. switch US 7,848.922 B1 Page 2 U.S. PATENT DOCUMENTS ETSI GSM 06.60, “Enhanced Full Rate (EFR) Speech transcoding” version 8.0.1 (Nov. 2000), European Telecommunications Standards 7,254,533 B1* 8/2007 Jabri et al. .................. TO4,219 Institute (ETSI), http:IIwww.etsi.org/. 7,539,612 B2* 5/2009 Thumpudi et al. ....... TO4/2001 ETSI GSM 06.20, “Half rate speech: Half rate speech transcoding”. 2002fOO28670 A1 3/2002 Ohsuge version 8.01 (Nov. 2000). European Telecommunications Standards 2003. O103524 A1 6/2003 Hasegawa Institute (ETSI), http:IIwww.etsi.org/. ISO/IEC 14496-3 MPEG4-CELP Coder, “Information OTHER PUBLICATIONS Technology—coding of Audiovisual Objects, Part3: Audio, Subpart 3GPP TS 26.090, “Adaptive Multi-Rate (AMR) speech codec; 3. CELP, ISO/JTC 1/SC 29 N2203CELP. May 1998. Transcoding functions'. Release 5.0.0 (Jun. 2002), 3rd Generation ITU-T G.723.1 "Speech Coders: Dual rate speech coder for multi Partnership Project (3GPP), http://www.3gpp2.org/. media communications transmission at 5.3 and 6.3 kbit/s', ITU-T 3GPP TS 26.104 “ANSI-C code for the floating-point AMR speech Recommendation G.723.1 (1996), Geneva, http://www.itu.org/. codec', Release 5.00, (Jun. 2002)3rd Generation Partnership Project ITU-T G.723.1 Annex B “Dual rate speech coder for multimedia (3GPP), http://www.3gpp2.org/. communications transmitting at 5.3 and 6.3 kbit/s, Annex B: Alter 3GPP TS 26.173 “ANSI-C code for the Adaptive Multi-Rate native specification based on floating point arithmetic'. ITU-T Rec Wideband speech codec”. (Mar. 2002) 3rd Generation Partnership ommendation G.723.1—Annex B. http:IIwww.itu.org/. Project (3GPP), http://www.3gpp2.org/. ITU-T G.728 "Coding of speech at 16 kbit/s using low-delay code 3GPP TS 26.190 “AMRWideband speech codec; Transcoding Func excited linear prediction'. ITU Recommendation G.728 (1992), tions (Release 5), 3rd Generation Partnership Project (3GPP); Dec. Geneva, http://www.itu.org/. 2001, http://www.3gpp2.org/. ITU-T G.729“Coding of speech at 8 kbit/s using conjugate-Structure 3GPP TS 26.204 “ANSI-C code for the floating-point Adaptive Algebraiccode-excited linear-prediction (CS-ACELP). ITU-T Rec Multi-Rate Wideband (AMR-WB) speech codec, Release 5.0.0. ommendation G.729 (1996), Geneva, http://www.itu.org/. (Mar. 2002)3rd Generation Partnership Project (3GPP): http://www. ITU-T G.729A "Coding of Speech at 8 kbit/s using conjugate struc 3gpp2.org/. ture algebraiccode-excited linear-prediction (CS-ACELP) Annex A: 3GPP2 C.S0030-0 “Selectable Mode Vocoder Service Option for Reduced complexity 8kbit/s. CS-ACELP speech codec'. ITU-T Rec Wideband Spread Spectrum Communication Systems', 3rd Genera ommendation G.729—Annex A, Nov. 1996, http://www.itu.org/. tion Partnership Project (3GPP2), Dec. 2001, http://www.3gpp2. ITU-T G.729C "Annex C: Reference floating-point implementation org/. for G.729 CSACELP 8 kbit/s speech coding”, ITU-T Recommenda ANSI/TIA/EIA-136-Rev.C., part 410. “TDMA Cellular/ tion G.729—Annex C. Sep. 1998, http://www.itu.org/. PCS Radio Interface, Enhance Full Rate Voice Codec (ACELP).” Spanias, A.S. "Speech Coding: A Tutorial Review'. Proc. IEEE, vol. Formerly IS-641. TIA published standard, Jun. 1, 2001, http://www. 82, No. 10, pp. 1541-1582, Oct. 1994. tiaoline.org. TIA/EIA/IS-127-2 “Enhanced Variable Rate Codec, Speech Service Cox, "Speech Coding Standards.” Speech Coding and Synthesis, Option 3 for WidebandSpread Spectrum Digital Systems' Telecom W.B.Kleijn et al., eds., pp. 49-78, Elsevier Science, (1995), The munications Industry Association 1999. Netherlands. TIA/EIAIIS-733, “High Rate Speech Service Option 17 for ETSI, GSM 6.10 "Recommendation GSM 6.10 Full-Rate Speech Wideband Spread Spectrum Communication Systems”. TIA pub Transcoding, version 8.02 (Nov. 2000). European Telecommunica lished standard, Nov. 17, 1997. tions Standards Institute (ETSI), http:IIwww.etsi.org/. * cited by examiner U.S. Patent Dec. 7, 2010 Sheet 1 of 23 US 7,848,922 B1 CODEC 1 EnCOCer EnCOced bitstream Speech Samples aCCOrding to COdec (e.g. PCM) Standard 1 CODEC 2 EnCOOder EnCOded bitstream Speech Samples aCCOrding to COdec (e.g. PCM) Standard 2 CODEC KEnCOCer EnCOced bitstream Speech Samples according to Codec (e.g. PCM) Standard K FIG. 1A U.S. Patent Dec. 7, 2010 Sheet 2 of 23 US 7,848,922 B1 CODEC 1 DeCOOder EnCOced bitstream Speech Samples aCCOrding to COdeC (e.g. PCM) Standard 1 CODEC 2 DeCOcer EnCOced bitstream Speech Samples according to Codec (e.g. PCM) Standard 2 CODECK DeCOCler EnCOded bitstream Speech Samples acCording to COdec (e.g. PCM) Standard K FIG. 1B U.S. Patent Dec. 7, 2010 Sheet 3 of 23 US 7,848,922 B1 210 EnCOced bitStream Speech Samples Thin COCeC aCCOrding to One of the standards (e.g. PCM) (encoder part) {Codec 1, Codec 2, ...Codec K} EnCOced bitstream Speech Samples Thin COdeC acCording to One of (e.g. PCM) (encoder part) the Standards {Codec 1, Codec 2, ...Codec K} 200 FIG. 2 U.S. Patent Dec. 7, 2010 Sheet 4 of 23 US 7,848,922 B1 U.S. Patent US 7,848,922 B1 U.S. Patent Dec. 7, 2010 Sheet 6 of 23 US 7,848,922 B1 520 Input speech sample (PCM) Pre-processing 510 530 LP analysis and quantization 540 Open-loop pitch lag analysis 550 Adaptive codebook lag analysis and Quantization 560 Adaptive Codebook gain analysis and duantization 570 Fixed COcebook index analysis and Quantization 58O Fixed COcebOOk gain analysis and duantization 590 COceC bitStream Bitstream packing FIG. 5 U.S. Patent Dec. 7, 2010 Sheet 7 of 23 US 7,848,922 B1 620 COdeC bitstream BitStream Unpackin 610 O 9 630 ExCitation reconstruction 640 Synthesis filtering 650 Output speech sample (PCM) Post processing 660 FIG. 6 U.S. Patent Dec. 7, 2010 Sheet 8 of 23 US 7,848,922 B1 Codec 1 (C1) 720 C1 Pre-processing C1 Parameter(Q1) 1 Encoding Genericcareece Pre-processing so C1 Parameter 2 Encoding Specific Pre-processing (Q1) Generic Parameter 1 Encoding Specific Parameter 1 C1 Bitstream Packing EnCOding COmbine all COceCS EnCOding into a single Specific Parameter 2 CKPre-processing 9 CK Parameter 1 Encoding (Q1)Q1 Generic Bitstream CK Parameter 2 Encoding Packing (QN) Specific Bitstream Packing CKBitstream Packing Universal Thin COdeC (EnCOder) FIG 7 U.S.
Recommended publications
  • Packetcable™ 2.0 Codec and Media Specification PKT-SP-CODEC
    PacketCable™ 2.0 Codec and Media Specification PKT-SP-CODEC-MEDIA-I10-120412 ISSUED Notice This PacketCable specification is the result of a cooperative effort undertaken at the direction of Cable Television Laboratories, Inc. for the benefit of the cable industry and its customers. This document may contain references to other documents not owned or controlled by CableLabs. Use and understanding of this document may require access to such other documents. Designing, manufacturing, distributing, using, selling, or servicing products, or providing services, based on this document may require intellectual property licenses from third parties for technology referenced in this document. Neither CableLabs nor any member company is responsible to any party for any liability of any nature whatsoever resulting from or arising out of use or reliance upon this document, or any document referenced herein. This document is furnished on an "AS IS" basis and neither CableLabs nor its members provides any representation or warranty, express or implied, regarding the accuracy, completeness, noninfringement, or fitness for a particular purpose of this document, or any document referenced herein. 2006-2012 Cable Television Laboratories, Inc. All rights reserved. PKT-SP-CODEC-MEDIA-I10-120412 PacketCable™ 2.0 Document Status Sheet Document Control Number: PKT-SP-CODEC-MEDIA-I10-120412 Document Title: Codec and Media Specification Revision History: I01 - Released 04/05/06 I02 - Released 10/13/06 I03 - Released 09/25/07 I04 - Released 04/25/08 I05 - Released 07/10/08 I06 - Released 05/28/09 I07 - Released 07/02/09 I08 - Released 01/20/10 I09 - Released 05/27/10 I10 – Released 04/12/12 Date: April 12, 2012 Status: Work in Draft Issued Closed Progress Distribution Restrictions: Authors CL/Member CL/ Member/ Public Only Vendor Key to Document Status Codes: Work in Progress An incomplete document, designed to guide discussion and generate feedback, that may include several alternative requirements for consideration.
    [Show full text]
  • Of the Cdma2000® System Specifications
    3GPP2 S.P0052-0 Ver.0.2.7 Date: August 21, 2003 1 2 System Release Guide for the 3 Release <ALPHA> 4 of the cdma2000® System Specifications 5 COPYRIGHT © 2003 3GPP2 and its Organizational Partners claim copyright in this document and individual Or- ganizational Partners may copyright and issue documents or standards publications in in- dividual Organizational Partner’s name based on this document. Requests for reproduction of this document should be directed to the 3GPP2 Secretariat at [email protected]. Requests to reproduce individual Organizational Partner’s documents should be directed to that Organizational Partner. See www.3gpp2.org for more information. 6 7 © 2003 3GPP2 - i - S.P0052: CDMA2000® System Release Guide –Release <Alpha> 1 Executive Summary 2 The System Release Guide (SRG) for the Release <ALPHA> provides an overview 3 for and reference to the Release <ALPHA> of the 3GPP2 wireless telecommuni- 4 cation system (cdma2000®) capabilities, features, and services. This document 5 is intended for use by persons and /or companies who are developing and / or 6 deploying cdma2000 systems or by persons who are otherwise interested in 7 cdma2000 systems. 8 Air interface support for HRPD and enhanced IOS are included and provide 9 high-speed forward link data rate service capability up to 2.4576 Mbps in a 10 1.25 MHz. Since cdma2000 uses many IP based protocols to a large degree, it 11 offers various features of IP based services. The system in this release contains 12 support for the Legacy System, and limited support for the 3GPP2 Legacy Mo- 13 bile Station Domain, making use of IP-based transport and signaling.
    [Show full text]
  • A Novel Transcoding Algorithm for SMV and G.723.1 Speech Coders Via Direct Parameter Transformation
    A Novel Transcoding Algorithm for SMV and G.723.1 Speech Coders via Direct Parameter Transformation Seongho Seo, Dalwon Jang, Sunil Lee, and Chang D. Yoo Department of Electrical Engineering and Computer Science, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea [email protected] Abstract as input. The length of each speech frame is 30 ms which cor- responds to 240 samples. Input speech of each frame is first In this paper, a novel transcoding algorithm for the Selectable high pass filtered and then divided into 4 subframes of 60 sam- Mode Vocoder (SMV) and the G.723.1 speech coder is pro- ples each. For every subframe, a 10th order linear prediction posed. In contrast to the conventional tandem transcoding al- coefficients (LPC) are computed. The linear predictive analysis gorithm, the proposed algorithm converts the parameters of one requires a look-ahead of 7.5 ms long. The LPC set for the last coder to the other without going through the decoding and en- subframe is quantized using the Predictive Split Vector Quan- coding process. The proposed algorithm is composed of four tizer (PSVQ) and transmitted. The unquantized LPC sets are parts: the parameter decoding, Line Spectral Pair (LSP) conver- used to obtain the perceptually weighted speech signal. For ev- sion, pitch period conversion and rate selection. The evaluation ery two subframes, the open-loop pitch analysis is performed in results show that the proposed algorithm achieves equivalent the domain of the weighted speech signal. Then, the adaptive speech quality to that of tandem transcoding with reduced com- codebook (ACB) and fixed codebook (FCB) are searched on a putational complexity and delay.
    [Show full text]
  • Network Working Group R. Gellens Request for Comments: 4281 Qualcomm Category: Standards Track D
    Network Working Group R. Gellens Request for Comments: 4281 Qualcomm Category: Standards Track D. Singer Apple P. Frojdh Ericsson November 2005 The Codecs Parameter for "Bucket" Media Types Status of This Memo This document specifies an Internet standards track protocol for the Internet community, and requests discussion and suggestions for improvements. Please refer to the current edition of the "Internet Official Protocol Standards" (STD 1) for the standardization state and status of this protocol. Distribution of this memo is unlimited. Copyright Notice Copyright (C) The Internet Society (2005). Abstract Several MIME type/subtype combinations exist that can contain different media formats. A receiving agent thus needs to examine the details of such media content to determine if the specific elements can be rendered given an available set of codecs. Especially when the end system has limited resources, or the connection to the end system has limited bandwidth, it would be helpful to know from the Content-Type alone if the content can be rendered. This document adds a new parameter, "codecs", to various type/subtype combinations to allow for unambiguous specification of the codecs indicated by the media formats contained within. By labeling content with the specific codecs indicated to render the contained media, receiving systems can determine if the codecs are supported by the end system, and if not, can take appropriate action (such as rejecting the content, sending notification of the situation, transcoding the content to a supported type, fetching and installing the required codecs, further inspection to determine if it will be sufficient to support a subset of the indicated codecs, etc.) Gellens, et al.
    [Show full text]
  • Superseded Packetcable™ Codec and Media Specification (PKT-SP-CODEC-MEDIA-I01-060406)
    PacketCable™ Codec and Media Specification PKT-SP-CODEC-MEDIA-I01-060406 ISSUED Notice This PacketCable specification is a cooperative effort undertaken at the direction of Cable Television Laboratories, Inc. (CableLabs®) for the benefit of the cable industry. Neither CableLabs, nor any other entity participating in the creation of this document, is responsible for any liability of any nature whatsoever resulting from or arising out of use or reliance upon this document by any party. This document is furnished on an AS-IS basis and neither CableLabs, nor other participating entity, provides any representation or warranty, express or implied, regarding its accuracy, completeness, or fitness for a particular purpose. © Copyright 2006 Cable Television Laboratories, Inc. All rights reserved. PKT-SP-CODEC-MEDIA-I01-060406 PacketCable™ Document Status Sheet Document Control Number: PKT-SP-CODEC-MEDIA-I01-060406 Document Title: Codec and Media Specification PacketCable Release: 2 Revision History: I01 – Released 04/05/2006 Date: April 6, 2006 Status: Work in Draft Issued Closed Progress Distribution Restrictions: Authors CL/Member CL/ Member/ Public Only Vendor Key to Document Status Codes: Work in Progress An incomplete document, designed to guide discussion and generate feedback, that may include several alternative requirements for consideration. Draft A document in specification format considered largely complete, but lacking review by Members and vendors. Drafts are susceptible to substantial change during the review process. Issued A stable document, which has undergone rigorous member and vendor review and is suitable for product design and development, cross-vendor interoperability, and for certification testing. Closed A static document, reviewed, tested, validated, and closed to further engineering change requests to the specification through CableLabs.
    [Show full text]
  • (12) United States Patent (10) Patent No.: US 6,445,696 B1 Foodeei Et Al
    USOO644.5696B1 (12) United States Patent (10) Patent No.: US 6,445,696 B1 Foodeei et al. (45) Date of Patent: Sep. 3, 2002 (54) EFFICIENT WARIABLE RATE CODING OF (57) ABSTRACT VOICE OVER ASYNCHRONOUSTRANSFER MODE The invention uses an ATM Adaptation Layer of type 2 (AAL2) standard mechanism to define efficient Support for (75) Inventors: Majid Foodeei, San Francisco; Variable Rate Coding (VRC). The VRC in this context Anthony E. Raetz, Menlo Park, both of typically refers to codecs, which adapt their rate to infor CA (US) mation content variations in Speech and audio. Such VRC (73) Assignee: Network Equipment Technologies, results in lower average rate than the constant rate codecs or Inc., Fremont, CA (US) use of constant rate codecs coupled with Silence Suppression - 0 (SS), currently deployed in voice over ATM schemes using (*) Notice: Subject to any disclaimer, the term of this AAL2. Possible ATM transport, trunking and access appli patent is extended or adjusted under 35 cations encompass both Circuit Emulation Services (CES) U.S.C. 154(b) by 0 days. and Local Loop Emulation Services (LLES). A typical VRC profile encompasses options for all Sub-rates within one or (21) Appl. No.: 09/513,667 more VRC standard or proprietary variable rate codec. The (22) Filed: Feb. 25, 2000 output of a rate determination algorithm (RDA), commonly part of variable rate codec, is fed into present AAL2 inter (51) Int. Cl." ............................. H04J 3/24; HO4L 12/56 working function (IWF). The IWF in AAL2, which normally (52) U.S. Cl. .................... 370/356; 370/395.6; 370/465; Supports SS or multiple rates (as opposed to voice content 370/469; 370/471; 370/474 VRC), is extended to accommodate VRC and thereafter (58) Field of Search ................................
    [Show full text]
  • Supported Codecs and Formats Codecs
    Supported Codecs and Formats Codecs: D..... = Decoding supported .E.... = Encoding supported ..V... = Video codec ..A... = Audio codec ..S... = Subtitle codec ...I.. = Intra frame-only codec ....L. = Lossy compression .....S = Lossless compression ------- D.VI.. 012v Uncompressed 4:2:2 10-bit D.V.L. 4xm 4X Movie D.VI.S 8bps QuickTime 8BPS video .EVIL. a64_multi Multicolor charset for Commodore 64 (encoders: a64multi ) .EVIL. a64_multi5 Multicolor charset for Commodore 64, extended with 5th color (colram) (encoders: a64multi5 ) D.V..S aasc Autodesk RLE D.VIL. aic Apple Intermediate Codec DEVIL. amv AMV Video D.V.L. anm Deluxe Paint Animation D.V.L. ansi ASCII/ANSI art DEVIL. asv1 ASUS V1 DEVIL. asv2 ASUS V2 D.VIL. aura Auravision AURA D.VIL. aura2 Auravision Aura 2 D.V... avrn Avid AVI Codec DEVI.. avrp Avid 1:1 10-bit RGB Packer D.V.L. avs AVS (Audio Video Standard) video DEVI.. avui Avid Meridien Uncompressed DEVI.. ayuv Uncompressed packed MS 4:4:4:4 D.V.L. bethsoftvid Bethesda VID video D.V.L. bfi Brute Force & Ignorance D.V.L. binkvideo Bink video D.VI.. bintext Binary text DEVI.S bmp BMP (Windows and OS/2 bitmap) D.V..S bmv_video Discworld II BMV video D.VI.S brender_pix BRender PIX image D.V.L. c93 Interplay C93 D.V.L. cavs Chinese AVS (Audio Video Standard) (AVS1-P2, JiZhun profile) D.V.L. cdgraphics CD Graphics video D.VIL. cdxl Commodore CDXL video D.V.L. cinepak Cinepak DEVIL. cljr Cirrus Logic AccuPak D.VI.S cllc Canopus Lossless Codec D.V.L.
    [Show full text]
  • Media and Radio Signal Processing for Mobile Communications Kyunghun Jung , Russell M
    Cambridge University Press 978-1-108-42103-4 — Media and Radio Signal Processing for Mobile Communications Kyunghun Jung , Russell M. Mersereau Frontmatter More Information Media and Radio Signal Processing for Mobile Communications Get to grips with the principles and practise of signal processing used in real mobile communications systems. Focusing particularly on speech and video processing, pion- eering experts employ a detailed, top-down analytical approach to outline the network architectures and protocol structures of multiple generations of mobile communications systems, identify the logical ranges where media and radio signal processing occur, and analyze the procedures for capturing, compressing, transmitting and presenting media. Chapters are uniquely structured to show the evolution of network architectures and technical elements between generations up to and including 5G, with an emphasis on maximizing service quality and network capacity through reusing existing infrastruc- ture and technologies. Examples and data taken from commercial networks provide an in-depth insight into the operation of a number of different systems, including GSM, cdma2000, W-CDMA, LTE, and LTE-A, making this a practical, hands-on guide for both practicing engineers and graduate students in wireless communications. Kyunghun Jung is a Principal Engineer at Samsung Electronics, where he leads the research and standardization for bringing immersive media services and vehicular applications to 5G systems. Russell M. Mersereau is Regents Professor Emeritus in the School of Electrical and Computer Engineering at the Georgia Institute of Technology, and a Fellow of the IEEE. © in this web service Cambridge University Press www.cambridge.org Cambridge University Press 978-1-108-42103-4 — Media and Radio Signal Processing for Mobile Communications Kyunghun Jung , Russell M.
    [Show full text]
  • Wideband Speech Coding Standards and Applications
    Wideband Speech Coding Standards and Applications Abstract Increasing the bandwidth of sound signals from the telephone bandwidth of 200-3400 Hz to the wider bandwidth of 50-7000 Hz results in increased intelligibility and naturalness of speech and gives a feeling of transparent communication. The emerging end-to-end digital communication systems enable the use of wideband speech coding in a wide area of applications. Recognizing the need of high quality wideband speech codecs, several standardization activities have been recently conducted, resulting in the selection of a new wideband speech codec, AMR-WB, at bit rates from 6.6 to 23.85 kbit/s by both 3GPP and ITU-T. The adoption of AMR-WB by the two bodies is of significant importance since for the first time the same codec is adopted for wireless as well as wireline services. This will eliminate the need for transcoding, and ease the implementation of wideband voice applications and services across a wide range of communication systems and platforms. This document presents a summary of wideband speech coding standards for wideband telephony applications. The quality advantages and applications of wideband speech coding are first presented, and the issue of telephony over packet networks is discussed. Several wideband speech coding standards are discussed and a special emphasis is given to the AMR-WB standard recently selected by 3GPP and ITU-T. 1. Introduction Most speech coding systems in use today are based on telephone-bandwidth narrowband speech, nominally limited to about 200-3400 Hz and sampled at a rate of 8 kHz. This limitation built into the conventional telephone system dates back to the first transcontinental telephone service established between New-York and San Francisco in 1915.
    [Show full text]
  • Selectable Mode Vocoder Service Option for Wideband Spread Spectrum Communication Systems
    3GPP2 C.S0030-0 Version 2.0 Date: December 2001 Selectable Mode Vocoder Service Option for Wideband Spread Spectrum Communication Systems COPYRIGHT 3GPP2 and its Organizational Partners claim copyright in this document and individual Organizational Partners may copyright and issue documents or standards publications in individual Organizational Partner's name based on this document. Requests for reproduction of this document should be directed to the 3GPP2 Secretariat at [email protected]. Requests to reproduce individual Organizational Partner's documents should be directed to that Organizational Partner. See www.3gpp2.org for more information. Intentionally left blank. C.S0030-0 Version 2.0 1 FOREWORD 2 These technical requirements form a standard for Service Option 56, a variable-rate 3 two-way speech service option. The maximum speech-coding rate of the service option is 4 8.55 kbps. 5 This standard does not address the quality or reliability of Service Option 56, nor does it 6 cover equipment performance or measurement procedures. 7 i C.S0030-0 Version 2.0 1 NOTES 2 1. Accompanying “Recommended Minimum Performance Standard for the Selectable 3 Mode Vocoder, Service Option 56,” provides specifications and measurement methods. 4 2. “Base station” refers to the functions performed on the land-line side, which are 5 typically distributed among a cell, a sector of a cell, a mobile switching center, and a 6 personal communications switching center. 7 3. This document uses the following verbal forms: “Shall” and “shall not” identify 8 requirements to be followed strictly to conform to the standard and from which no 9 deviation is permitted.
    [Show full text]
  • Selectable Mode Vocoder (SMV) Service Option for Wideband Spread Spectrum Communication Systems
    3GPP2 C.S0030-0 Version 3.0 Date: January 2004 1 2 Selectable Mode Vocoder (SMV) Service Option for 3 Wideband Spread Spectrum Communication 4 Systems 5 6 COPYRIGHT 3GPP2 and its Organizational Partners claim copyright in this document and individual Organizational Partners may copyright and issue documents or standards publications in individual Organizational Partner's name based on this document. Requests for reproduction of this document should be directed to the 3GPP2 Secretariat at [email protected]. Requests to reproduce individual Organizational Partner's documents should be directed to that Organizational Partner. See www.3gpp2.org for more information. 7 1 Intentionally left blank. C.S0030-0 v3.0 1 FOREWORD 2 These technical requirements form a standard for Service Option 56, a variable-rate 3 two-way speech service option. The maximum speech-coding rate of the service option is 4 8.55 kbps. 5 This standard does not address the quality or reliability of Service Option 56, nor does it 6 cover equipment performance or measurement procedures. 7 i C.S0030-0 v3.0 1 NOTES 2 1. Accompanying “Recommended Minimum Performance Standard for the Selectable 3 Mode Vocoder, Service Option 56,” provides specifications and measurement methods. 4 2. “Base station” refers to the functions performed on the land-line side, which are 5 typically distributed among a cell, a sector of a cell, a mobile switching center, and a 6 personal communications switching center. 7 3. This document uses the following verbal forms: “Shall” and “shall not” identify 8 requirements to be followed strictly to conform to the standard and from which no 9 deviation is permitted.
    [Show full text]
  • (12) United States Patent (10) Patent No.: US 7.254.533 B1 Jabri Et Al
    US00725.4533B1 (12) United States Patent (10) Patent No.: US 7.254.533 B1 Jabri et al. (45) Date of Patent: Aug. 7, 2007 (54) METHOD AND APPARATUS FOR A THIN 3GPP TS 26.090, “Adaptive Multi-Rate (AMR) speech codec: CELP VOICE CODEC Transcoding fuctions”, Release 5.0.0 (Jun. 2002), 3' Generation Partnership Project (3GPP), http://www.3gpp2.org/. (75) Inventors: Marwan A. Jabri, Broadway (AU); 3GPP TS 26.104 “ANSI-C code for the floating-point AMR speech Nicola Chong-White, Chatswood (AU): codec”. Release 5.00, (Jun. 2002) 3" Gerneration Partnership Jianwei Wang, Killarney Heights (AU) Project (3GPP), http://www.3gpp2.org/. 3GPP TS 26.173 “ANSI-C code for the Adaptive Multi-Rate (73) Assignee: Dilithium Networks Pty Ltd., Sydney, Wideband speech codec, (Mar. 2002) 3" Generation Partnership NSW (AU) Project (3GPP), http://www.3gpp2.org/. *) Notice: Subject to anyy disclaimer, the term of this 3GPP TS 26.190 “AMR Wideband speech codec; Transcoding patent is extended or adjusted under 35 Functions (Release 5), 3 Generation Partnership Project (3GPP); U.S.C. 154(b) by 872 days. Dec. 2001, http://www.3gpp2.org/. (21) Appl. No.: 10/688,857 (Continued) Primary Examiner V. Paul Harper (22) Filed: Oct. 17, 2003 (74) Attorney, Agent, or Firm Townsend and Townsend Related U.S. Application Data and Crew LLP (60) Provisional application No. 60/439.366, filed on Jan. (57) ABSTRACT 9, 2003, provisional application No. 60/419,776, filed on Oct. 17, 2002. An apparatus and method for encoding and decoding a voice (51) Int. Cl. signal.
    [Show full text]