ICASSP 2006 - May 15-19, 2006 - Toulouse, France

Technical Program

Paper Detail

Paper:SLP-P20.12
Session:Acoustic Modeling and Adaptation
Time:Friday, May 19, 14:00 - 16:00
Presentation: Poster
Topic: Speech and Spoken Language Processing: Speaker adaptation and normalization (e.g., VTLN)
Title: VTLN Warping Factor Estimation Using Accumulation of Sufficient Statistics
Authors: Jonas Lööf, Hermann Ney, University of Technology Aachen (RWTH), Germany; Srinivasan Umesh, Indian Institute of Technology Kanpur, India
Abstract: In this paper we present an efficient and flexible approach to VTLN warping factor estimation. Due to the equivalence of frequency warping and linear transformation of cepstral coefficients, warping factors can be efficiently estimated by accumulating the sufficient statistics for linear transformation estimation, and searching the constrained space of transformations given by the explicit mapping between warping factors and linear transformation matrices. We show that the positive effect of using a properly normalized optimization criterion for warping factor estimation, which has been previously demonstrated for a signal analysis front-end without a filter-bank, carries over to a MFCC front-end, resulting in a net improvement in word error rate.



IEEESignal Processing Society

©2018 Conference Management Services, Inc. -||- email: webmaster@icassp2006.org -||- Last updated Friday, August 17, 2012