Paper: | SLP-P20.12 |
Session: | Acoustic Modeling and Adaptation |
Time: | Friday, May 19, 14:00 - 16:00 |
Presentation: | Poster |
Topic: |
Speech and Spoken Language Processing: Speaker adaptation and normalization (e.g., VTLN) |
Title: |
VTLN Warping Factor Estimation Using Accumulation of Sufficient Statistics |
Authors: |
Jonas Lööf, Hermann Ney, University of Technology Aachen (RWTH), Germany; Srinivasan Umesh, Indian Institute of Technology Kanpur, India |
Abstract: |
In this paper we present an efficient and flexible approach to VTLN warping factor estimation. Because frequency warping is equivalent to a linear transformation of the cepstral coefficients, warping factors can be estimated efficiently by accumulating the sufficient statistics for linear transformation estimation and then searching the constrained space of transformations given by the explicit mapping between warping factors and linear transformation matrices. We show that the positive effect of using a properly normalized optimization criterion for warping factor estimation, previously demonstrated for a signal analysis front-end without a filter-bank, carries over to an MFCC front-end, resulting in a net improvement in word error rate. |
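The estimation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a diagonal-covariance Gaussian mixture with known frame posteriors, and it reads the "properly normalized" criterion as one that keeps the log-determinant (Jacobian) term of the transform, which is an assumption about the abstract's meaning. The function names (`accumulate_stats`, `auxiliary`, `estimate_warp`) and the `candidates` dictionary are hypothetical; the explicit mapping from warping factor to matrix is not reproduced here and must be supplied by the caller.

```python
import numpy as np


def accumulate_stats(frames, posteriors, means, variances):
    """Accumulate sufficient statistics for a linear feature transform.

    frames:     (T, d) observed cepstral vectors
    posteriors: (T, M) Gaussian occupation probabilities per frame
    means, variances: (M, d) diagonal-covariance Gaussian parameters
    """
    # Per-Gaussian second-order statistics: S[m] = sum_t gamma[t, m] * x_t x_t^T
    S = np.einsum("tm,tj,tk->mjk", posteriors, frames, frames)
    # G[i] = sum_m S[m] / var[m, i], one d x d matrix per feature dimension i
    G = np.einsum("mi,mjk->ijk", 1.0 / variances, S)
    # k[i] = sum_m (mu[m, i] / var[m, i]) * sum_t gamma[t, m] * x_t
    first_order = posteriors.T @ frames  # shape (M, d)
    k = np.einsum("mi,mj->ij", means / variances, first_order)
    return G, k, posteriors.sum()


def auxiliary(A, G, k, count):
    """Auxiliary function for transform A, up to terms that do not depend on A."""
    # The log-determinant is the Jacobian term (assumed reading of
    # "properly normalized" in the abstract).
    _, logabsdet = np.linalg.slogdet(A)
    quadratic = np.einsum("ij,ijk,ik->", A, G, A)  # sum_i a_i G_i a_i^T
    linear = np.einsum("ij,ij->", A, k)            # sum_i a_i . k_i
    return count * logabsdet - 0.5 * quadratic + linear


def estimate_warp(candidates, G, k, count):
    """Return the warping factor whose matrix maximizes the auxiliary function.

    candidates: dict mapping warping factor -> (d, d) matrix (hypothetical input;
    in practice these would come from the warping-factor-to-matrix mapping).
    """
    return max(candidates, key=lambda alpha: auxiliary(candidates[alpha], G, k, count))
```

The statistics are accumulated once per speaker, after which each candidate warping factor is scored from `G`, `k` and `count` alone, without revisiting the audio. This is the source of the efficiency the abstract refers to.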