ICASSP 2006 - May 15-19, 2006 - Toulouse, France

Technical Program

Paper Detail

Paper:SLP-P3.8
Session:Novel LVCSR Algorithms
Time:Tuesday, May 16, 14:00 - 16:00
Presentation: Poster
Topic: Speech and Spoken Language Processing: Alternative Statistical and Machine Learning Methods for General ASR (e.g., no-HMM methods)
Title: Towards ASR Based on Hierarchical Posterior-based Keyword Recognition
Authors: Petr Fousek, Hynek Hermansky, IDIAP Research Institute, Switzerland
Abstract: The paper presents an alternative approach to automatic recognition of speech in which each targeted word is classified by a separate binary classifier against all other sounds. No time alignment is done. To build a recognizer for N words, N parallel binary classifiers are applied. The system first estimates uniformly sampled posterior probabilities of phoneme classes, followed by a second step in which a rather long sliding time window is applied to the phoneme posterior estimates and its content is classified by an artificial neural network to yield posterior probability of the keyword. On small vocabulary ASR task, the system still does not reach the performance of the state-of-the-art system but its conceptual simplicity, the ease of adding new target words, and its inherent resistance to out-of-vocabulary sounds may prove significant advantage in many applications.



IEEESignal Processing Society

©2018 Conference Management Services, Inc. -||- email: webmaster@icassp2006.org -||- Last updated Friday, August 17, 2012