Paper: | SLP-L10.2 |
Session: | Speaker Adaptation |
Time: | Friday, May 19, 10:20 - 10:40 |
Presentation: |
Lecture
|
Topic: |
Speech and Spoken Language Processing: Environmental adaptation |
Title: |
Feature Adaptation Based on Gaussian Posteriors |
Authors: |
Suleyman S. Kozat, Karthik Visweswariah, Ramesh Gopinath, IBM T. J. Watson Research Center, United States |
Abstract: |
In this paper we consider the use of non-linear methods for feature adaptation to reduce the mismatch between test and training conditions. The non-linearity is introduced by using the posteriors of a set of Gaussians to (softly) partition the observation space for feature adaptation. The modeling framework used is based on the fMPE models povey2005 applied to FMLLR matrices directly. However, the parameters are estimated to maximize the likelihood of the test data. We observe a relative gain of 14% on top of FMLLR, which was a 42% relative gain over the baseline. |