ICASSP 2006 - May 15-19, 2006 - Toulouse, France

Paper: SPTM-L3.2
Session: Applications to Speech and Audio
Time: Tuesday, May 16, 16:50 - 17:10
Presentation: Lecture
Topic: Signal Processing Theory and Methods: Signal Restoration, Reconstruction, and Enhancement
Title: The Effect of Memory Inclusion on Mutual Information between Speech Frequency Bands
Authors: Amr H. Nour-Eldin, Turaj Zakizadeh Shabestary, Peter Kabal, McGill University, Canada
Abstract: In this paper, we investigate the effect of temporal correlation on the dependence between the narrowband and highband frequency ranges of speech, covering 0.3-3.4 kHz and 3.7-8 kHz, respectively. We follow the technique of modelling spectral envelopes, represented by Mel-frequency cepstral coefficients, with Gaussian mixture models. The dependence between the two disjoint frequency bands is quantified through mutual information (MI) and its ratio to highband entropy. Speech exhibits considerable temporal correlation that static parametrization of spectral envelopes does not explicitly account for. Including memory in the speech parametrization (through delta features) incorporates this temporal information into the model; MI gains are therefore expected, which should result in better bandwidth extension performance. Results show that exploiting delta features can increase certainty about the highband (the ratio of MI to highband entropy) by as much as 216% in relative terms, corresponding to an absolute increase of 12%.
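
The abstract does not say how the delta features are computed. As a minimal sketch only, the following assumes the common regression-style delta formula over a symmetric window of neighbouring frames, with edge frames repeated at the boundaries. The function names `delta_features` and `add_memory` and the `half_window` parameter are hypothetical and are not taken from the paper.

```python
def delta_features(frames, half_window=2):
    """Regression-style deltas for a list of per-frame coefficient vectors.

    Assumed formula (not confirmed by the abstract):
        d_t = sum_k k * (c_{t+k} - c_{t-k}) / (2 * sum_k k^2),  k = 1..half_window
    """
    num_frames = len(frames)
    denom = 2.0 * sum(k * k for k in range(1, half_window + 1))
    deltas = []
    for t in range(num_frames):
        row = []
        for d in range(len(frames[t])):
            acc = 0.0
            for k in range(1, half_window + 1):
                # Clamp indices so edge frames reuse the first/last frame.
                ahead = frames[min(t + k, num_frames - 1)][d]
                behind = frames[max(t - k, 0)][d]
                acc += k * (ahead - behind)
            row.append(acc / denom)
        deltas.append(row)
    return deltas


def add_memory(frames, half_window=2):
    """Append each frame's deltas to its static coefficients."""
    deltas = delta_features(frames, half_window)
    return [list(static) + delta for static, delta in zip(frames, deltas)]
```

Under this assumption, each static vector (for example, a frame of Mel-frequency cepstral coefficients) doubles in dimension once its deltas are appended, and it is this augmented vector that would be modelled in place of the static one.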


