ICASSP 2006 - May 15-19, 2006 - Toulouse, France

Technical Program

Paper Detail

Paper:SLP-L1.2
Session:Speech Coding for Network Applications
Time:Tuesday, May 16, 10:50 - 11:10
Presentation: Lecture
Topic: Speech and Spoken Language Processing: Wide-band Speech Coding
Title: An Embedded Scalable Wideband Codec based on the GSM EFR Codec
Authors: Peter Jax, Bernd Geiser, University of Technology Aachen (RWTH), Germany; Stefan Schandl, Hervé Taddei, Siemens AG, Austria; Peter Vary, University of Technology Aachen (RWTH), Germany
Abstract: We present a technique to extend narrowband (NB) speech communication systems, using e.g. the GSM enhanced full rate (EFR) codec, with wideband (WB, 50-7000 Hz) capability. The limited acoustic bandwidth of narrowband speech coding is extended using a fairly coarse description of the missing high frequency band (3.4-7 kHz) in terms of temporal and spectral envelopes. The high-band parameters are quantized, transmitted and then used at the receiver side to regenerate the high frequency components. The parameter encoding is done by applying split vector quantization in a transformed domain. This quantization scheme can be scaled to match any given target bit rate. Several example configurations have been implemented and tested in MUSHRA-style listening tests.



IEEESignal Processing Society

©2018 Conference Management Services, Inc. -||- email: webmaster@icassp2006.org -||- Last updated Friday, August 17, 2012