ICASSP 2006 - May 15-19, 2006 - Toulouse, France

Technical Program

Paper Detail

Paper: SLP-P9.5
Session: Speech Coding
Time: Wednesday, May 17, 14:00 - 16:00
Presentation: Poster
Topic: Speech and Spoken Language Processing: Wide-band Speech Coding
Title: High Rate Design of Transform Coders with Gaussian Mixture Companders
Authors: Ethan Duni, Bhaskar Rao, University of California, San Diego, United States
Abstract: This paper examines the problem of designing fixed-rate transform coders for sources with arbitrary distributions, under input-weighted squared error distortion measures. As a component of this system, a flexible scalar compander using Gaussian Mixtures is proposed. An algorithm is developed to set the parameters of the system using a data-driven technique that automatically balances the source statistics, distortion measure, and structure of the transform coder to minimize the high-rate distortion. The implementation of Gaussian Mixture companders is explored, resulting in a flexible, low-complexity scalar quantizer. The system is illustrated on the problem of wideband speech spectrum quantization with Log Spectral Distortion, where it is shown to provide good performance with very low, rate-independent complexity.
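
Illustration (not from the paper): the abstract does not give the compander's construction, so the following is only a minimal sketch of one common way to build a scalar compander from a Gaussian mixture. It assumes the compressor is the mixture's cumulative distribution function, followed by a uniform quantizer on [0, 1] and a numerically inverted expander. The function names, the bisection-based inverse, and the mixture parameters are all hypothetical choices made for this example, and the data-driven parameter fitting described in the abstract is not shown.

```python
import math

def gm_cdf(x, weights, means, stds):
    """Compressor: CDF of a 1-D Gaussian mixture, mapping the real line to [0, 1]."""
    return sum(
        w * 0.5 * (1.0 + math.erf((x - m) / (s * math.sqrt(2.0))))
        for w, m, s in zip(weights, means, stds)
    )

def gm_cdf_inverse(u, weights, means, stds, tol=1e-10):
    """Expander: invert the mixture CDF by bisection (an assumed, simple method)."""
    lo = min(m - 10.0 * s for m, s in zip(means, stds))
    hi = max(m + 10.0 * s for m, s in zip(means, stds))
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if gm_cdf(mid, weights, means, stds) < u:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def quantize(x, levels, weights, means, stds):
    """Compress x, then apply a uniform quantizer with `levels` cells on [0, 1]."""
    u = gm_cdf(x, weights, means, stds)
    return min(int(u * levels), levels - 1)

def reconstruct(index, levels, weights, means, stds):
    """Expand the midpoint of the selected uniform cell back to the source domain."""
    u_hat = (index + 0.5) / levels
    return gm_cdf_inverse(u_hat, weights, means, stds)

# Hypothetical two-component mixture, chosen only for illustration.
weights, means, stds = [0.6, 0.4], [-1.0, 2.0], [0.5, 1.0]
levels = 64  # a 6-bit scalar quantizer

index = quantize(0.3, levels, weights, means, stds)
x_hat = reconstruct(index, levels, weights, means, stds)
print(index, x_hat)
```

In this sketch the encoder evaluates one mixture CDF and one multiplication regardless of the number of levels, which is one way a compander can have a cost that does not grow with the rate. Whether the paper obtains its rate-independent complexity in the same way is not stated in the abstract.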



IEEE Signal Processing Society
