ICASSP 2006 - May 15-19, 2006 - Toulouse, France

Technical Program

Paper Detail

Paper:SLP-P6.2
Session:Speech Understanding, Translation, Applications and Systems
Time:Tuesday, May 16, 16:30 - 18:30
Presentation: Poster
Topic: Speech and Spoken Language Processing: Speech Understanding
Title: Discriminatively Trained Gaussian Mixture Models for Sentence Boundary Detection
Authors: Marcus Tomalin, Phil C. Woodland, University of Cambridge, United Kingdom
Abstract: This paper compares the performance of two types of Prosodic Feature Models (PFMs) in a sentence boundary detection task. Specifically, systems are compared that use discriminatively trained Gaussian Mixture Models (MMI-GMMs) and CART-Style Decision Trees (CDT-PFMs), along with task-specific language models, in a lattice-based decoding framework in order automatically to insert Slash Unit (SU) boundaries into Automatic Speech Recognition (ASR) transcriptions of input audio files. It is shown that a system which uses MMI-GMMs performs as well as a system that uses conventional CDT-PFMs. In addition, it is shown that, when the CDT-PFM and MMI-GMM systems are combined by taking weighted averages of their respective probability streams, Error rate improvements of up to 0.8% abs over the CDT-PFM baseline can be obtained for four different test sets.



IEEESignal Processing Society

©2018 Conference Management Services, Inc. -||- email: webmaster@icassp2006.org -||- Last updated Friday, August 17, 2012