ICASSP 2006 - May 15-19, 2006 - Toulouse, France

Technical Program

Paper Detail

Paper:SLP-L2.4
Session:Advances in Robust Speech Recognition
Time:Tuesday, May 16, 15:00 - 15:20
Presentation: Lecture
Topic: Speech and Spoken Language Processing: Model-based robust Speech Recognition
Title: WEIGHTED LIKELIHOOD RATIO (WLR) HIDDEN MARKOV MODEL FOR NOISY SPEECH RECOGNITION
Authors: Chao Huang, Yingchun Huang, Frank K. Soong, Jian-Lai Zhou, Microsoft Research Asia, China
Abstract: In this paper we present a weighted likelihood ratio (WLR) based Hidden Markov Model and apply it to speech recognition in noise. The WLR measure emphasizes spectral peaks than valleys in comparing two given speech spectra. The measure is more consistent with human perception of speech formants where natural resonances of vocal track are and tends to be more robust to broad-band noise interferences than other measures. A complete HMM framework of this measure is derived and a mixture of exponential kernels is used to model the output probability density function. The new WLR-HMM is tested on the Aurora2 connected digits database in noise. It shows more robust performance than the MFCC trained GMM baseline system. When combined with the dynamic cepstral features, the multiple-stream WLR-HMM shows a 39% relative improvement over the baseline system.



IEEESignal Processing Society

©2018 Conference Management Services, Inc. -||- email: webmaster@icassp2006.org -||- Last updated Friday, August 17, 2012