Paper: | SS-5.4 |
Session: | Dealing with Intrinsic Speech Variabilities in ASR |
Time: | Wednesday, May 17, 15:00 - 15:20 |
Presentation: |
Special Session Lecture
|
Topic: |
Special Sessions: Dealing with intrinsic speech variabilities in ASR |
Title: |
ADAPTATION OF HYBRID ANN/HMM USING WEIGHTS INTERPOLATION |
Authors: |
Stefano Scanzio, Pietro Laface, Politecnico di Torino, Italy; Roberto Gemello, Franco Mana, Loquendo, Italy |
Abstract: |
Many techniques for speaker or channel adaptation have been successfully applied to automatic speech recognition. Most of these techniques have been proposed for the adaptation of Hidden Markov Models (HMMs). Far less proposals have been made for the adaptation of the Artificial Neural Networks (ANNs) used in the hybrid HMM-ANN approach. This paper presents an adaptation technique for ANNs that, similar to the framework of MAP estimation, tries to exploit in the adaptation process prior information that is particularly useful to deal with the problem of sparse training data. We show that the integration of a priori information can be simply achieved by linear interpolation of the weights of an "a priori" network and of a speaker specific network. Good improvements with respect to the baseline results are reported evaluating this technique on the Wall Street Journal WSJ0 and WSJ1 databases and on TIMIT corpus using different amounts of adaptation data. |