Technical Program

Paper Detail

Paper:	SLP-P6.2
Session:	Speech Understanding, Translation, Applications and Systems
Time:	Tuesday, May 16, 16:30 - 18:30
Presentation:	Poster
Topic:	Speech and Spoken Language Processing: Speech Understanding
Title:	Discriminatively Trained Gaussian Mixture Models for Sentence Boundary Detection
Authors:	Marcus Tomalin, Phil C. Woodland, University of Cambridge, United Kingdom
Abstract:	This paper compares the performance of two types of Prosodic Feature Models (PFMs) in a sentence boundary detection task. Specifically, systems are compared that use discriminatively trained Gaussian Mixture Models (MMI-GMMs) and CART-Style Decision Trees (CDT-PFMs), along with task-specific language models, in a lattice-based decoding framework in order automatically to insert Slash Unit (SU) boundaries into Automatic Speech Recognition (ASR) transcriptions of input audio files. It is shown that a system which uses MMI-GMMs performs as well as a system that uses conventional CDT-PFMs. In addition, it is shown that, when the CDT-PFM and MMI-GMM systems are combined by taking weighted averages of their respective probability streams, Error rate improvements of up to 0.8% abs over the CDT-PFM baseline can be obtained for four different test sets.