Technical Program

Paper Detail

Paper:	SLP-L13.1
Session:	Missing Data Methods in Robust Speech Recognition
Time:	Friday, May 19, 16:30 - 16:50
Presentation:	Lecture
Topic:	Speech and Spoken Language Processing: Model-based robust Speech Recognition
Title:	RECOGNITION OF REVERBERANT SPEECH USING FULL CEPSTRAL FEATURES AND SPECTRAL MISSING DATA
Authors:	Kalle J. Palomäki, Helsinki University of Technology, Finland; Guy J. Brown, Jon P. Barker, University of Sheffield, United Kingdom
Abstract:	We describe a novel approach to feature combination within the missing data (MD) framework for automatic speech recognition, and show its application to reverberated speech. Likelihoods from a spectral MD classifier are combined with those from a full cepstral feature vector-based recogniser. Even though the performance of the cepstral recogniser is substantially below that of the MD recogniser, the combined recogniser performs better in all conditions. We also describe improvements to the generation of time-frequency masks for the MD recogniser. Our system is compared with a previous approach based on a hybrid MLP-HMM recogniser with MSG and PLP feature vectors. The proposed system has a substantial performance advantage in the most reverberated conditions.