ICASSP 2006 - May 15-19, 2006 - Toulouse, France

Technical Program

Paper Detail

Paper: SLP-P5.5
Session: Feature-based Robust Speech Recognition
Time: Tuesday, May 16, 16:30 - 18:30
Presentation: Poster
Topic: Speech and Spoken Language Processing: Feature-based Robust Speech Recognition (e.g., noise, etc)
Title: Cepstral Statistics Compensation Using Online Pseudo Stereo Codebooks for Robust Speech Recognition in Additive Noise Environments
Authors: Jeih-weih Hung, National Chi Nan University, Taiwan
Abstract: In this paper, we propose the cepstral statistics compensation (CSC) algorithm, which alleviates the effect of additive noise on the cepstral features used for speech recognition. It is a simple but efficient noise reduction technique that makes use of pseudo stereo codebooks constructed online. The statistics of the cepstral features, such as mean and variance, are estimated for both clean and noisy environments from the pseudo stereo codebooks. A transform is then obtained for the noise-corrupted cepstra so that the statistics of the transformed cepstra are close to those of the clean cepstra. In experiments on the Aurora-2 Test Set A noisy digits database over the 0-20 dB noise range, CSC provided a 13% reduction in word error rate compared with cepstral mean and variance normalization (CMVN), and a 34% reduction in error rate compared with baseline processing. In addition, we present several other noise robustness approaches based on the pseudo stereo codebooks and show their effectiveness in noisy speech recognition.
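
The statistics-matching step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact transform or the codebook construction, so everything here is an assumption for illustration: the codebooks are taken as given lists of unweighted codewords, the transform is a per-dimension mean and variance match, and the names codebook_stats and compensate are hypothetical, not taken from the paper.

```python
from statistics import fmean, pstdev


def codebook_stats(codebook):
    """Per-dimension mean and standard deviation over a list of codewords.

    Assumption: codewords are unweighted; the paper may weight them.
    """
    dims = list(zip(*codebook))  # transpose: one tuple per cepstral dimension
    means = [fmean(d) for d in dims]
    stds = [pstdev(d) for d in dims]
    return means, stds


def compensate(frame, noisy_stats, clean_stats, eps=1e-8):
    """Shift and scale one noisy cepstral frame toward the clean statistics.

    Assumed transform (not confirmed by the abstract):
        y = (x - mean_noisy) / std_noisy * std_clean + mean_clean
    """
    (mu_n, sd_n), (mu_c, sd_c) = noisy_stats, clean_stats
    return [
        (x - mn) / max(sn, eps) * sc + mc
        for x, mn, sn, mc, sc in zip(frame, mu_n, sd_n, mu_c, sd_c)
    ]


# Hypothetical 2-dimensional codebooks; the values are illustrative only.
clean_codebook = [[1.0, -2.0], [3.0, 0.0], [5.0, 2.0]]
noisy_codebook = [[2.0, -1.0], [3.0, 0.0], [4.0, 1.0]]

clean_stats = codebook_stats(clean_codebook)
noisy_stats = codebook_stats(noisy_codebook)

# Apply the transform to the noisy codewords themselves as a sanity check.
compensated = [compensate(cw, noisy_stats, clean_stats) for cw in noisy_codebook]
```

In this sketch the compensated codewords have the same per-dimension mean and standard deviation as the clean codebook, which is the property the abstract attributes to the CSC transform. In practice the same mapping would be applied to every frame of a noisy utterance.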



IEEE Signal Processing Society
