ICASSP 2006 - May 15-19, 2006 - Toulouse, France

Technical Program

Paper Detail

Paper:SLP-P19.4
Session:Model-based Robust Speech Recognition
Time:Friday, May 19, 10:00 - 12:00
Presentation: Poster
Topic: Speech and Spoken Language Processing: Model-based robust Speech Recognition
Title: PATTERN-BASED DYNAMIC COMPENSATION TOWARDS ROBUST SPEECH RECOGNITION IN MOBILE ENVIRONMENTS
Authors: Huayun Zhang, Jun Xu, InfoTalk Technology, Singapore
Abstract: Today, the high mobility provided by wireless networks places users in a wild variety of noise and channel conditions, which poses serious challenge to telephone-base Acoustic Speech Recognition (ASR). In this paper, we propose a Pattern-based Dynamic Compensation (PDC) scheme to improve the robustness of ASR in mobile environments. In PDC, a distortion pattern-set is employed to normalize the environmental variations in training data according to a set of pre-defined application scenarios. At recognition time, instantaneous distortion is calculated as a linear combination of several possible patterns. To online estimate the combination weights robustly, a Bayesian learning process with Speech-conditioned Prior Evolution is introduced into PDC (PDC-SPE). In outdoor experiments, the PDC-SPE method outperforms other commonly used compensation/adaptation methods and leads to 20~25% relative reduction in Word Error Rate (WER) over a well-trained baseline system.



IEEESignal Processing Society

©2018 Conference Management Services, Inc. -||- email: webmaster@icassp2006.org -||- Last updated Friday, August 17, 2012