Paper: | SLP-P9.5 |
Session: | Speech Coding |
Time: | Wednesday, May 17, 14:00 - 16:00 |
Presentation: | Poster |
Topic: | Speech and Spoken Language Processing: Wide-band Speech Coding |
Title: | High Rate Design of Transform Coders with Gaussian Mixture Companders |
Authors: | Ethan Duni, Bhaskar Rao, University of California, San Diego, United States |
Abstract: | This paper examines the problem of designing fixed-rate transform coders for sources with arbitrary distributions, under input-weighted squared error distortion measures. As a component of this system, a flexible scalar compander based on Gaussian mixtures is proposed. An algorithm is developed to set the parameters of the system using a data-driven technique that automatically balances the source statistics, the distortion measure, and the structure of the transform coder to minimize the high-rate distortion. The implementation of Gaussian mixture companders is explored, resulting in a flexible, low-complexity scalar quantizer. The system is illustrated on the problem of wideband speech spectrum quantization under Log Spectral Distortion, and is shown to provide good performance with very low, rate-independent complexity. |
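To make the idea of a scalar compander built from a Gaussian mixture concrete, the following is a minimal sketch, not the authors' implementation. It assumes the compressor is the cumulative distribution function of a scalar Gaussian mixture, that the compressed value is quantized uniformly on (0, 1), and that the expander inverts the CDF numerically by bisection. The function names and the mixture parameters in the usage note are hypothetical, chosen only for illustration.

```python
import math


def gm_cdf(x, weights, means, stds):
    """Compressor (assumed form): CDF of a scalar Gaussian mixture, in [0, 1]."""
    return sum(
        w * 0.5 * (1.0 + math.erf((x - m) / (s * math.sqrt(2.0))))
        for w, m, s in zip(weights, means, stds)
    )


def gm_inverse_cdf(u, weights, means, stds, iters=60):
    """Expander: invert the mixture CDF by bisection (no closed form exists)."""
    # Search interval: 10 standard deviations around every component mean.
    lo = min(m - 10.0 * s for m, s in zip(means, stds))
    hi = max(m + 10.0 * s for m, s in zip(means, stds))
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if gm_cdf(mid, weights, means, stds) < u:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def quantize(x, bits, weights, means, stds):
    """Compress x, then apply a uniform quantizer with 2**bits cells on [0, 1]."""
    levels = 2 ** bits
    u = gm_cdf(x, weights, means, stds)
    return min(int(u * levels), levels - 1)


def dequantize(index, bits, weights, means, stds):
    """Map a cell index to its midpoint in [0, 1], then expand back to the source domain."""
    levels = 2 ** bits
    return gm_inverse_cdf((index + 0.5) / levels, weights, means, stds)
```

For example, with a hypothetical two-component mixture `weights=[0.5, 0.5]`, `means=[-2.0, 2.0]`, `stds=[0.5, 1.0]`, calling `dequantize(quantize(1.7, 8, ...), 8, ...)` returns a value close to 1.7. In this sketch the encoder cost grows with the number of mixture components but does not depend on `bits`, which is one simple way a compander can have rate-independent complexity.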