ICASSP 2006 - May 15-19, 2006 - Toulouse, France

Technical Program

Paper Detail

Paper: SLP-P18.1
Session: LVCSR Systems
Time: Friday, May 19, 10:00 - 12:00
Presentation: Poster
Topic: Speech and Spoken Language Processing: Lattices and Multi-pass strategies
Title: The CU-HTK Mandarin Broadcast News Transcription System
Authors: Rohit Sinha, Mark J. F. Gales, Do Yeong Kim, Xunying Andrew Liu, Khe Chai Sim, Phil C. Woodland, University of Cambridge, United Kingdom
Abstract: This paper discusses the development of the CU-HTK Mandarin Broadcast News (BN) transcription system. The Mandarin BN task includes a significant amount of English data. Hence, techniques have been investigated that allow a single system to handle both Mandarin and English by augmenting the Mandarin training sets with English acoustic and language model training data. A range of acoustic models was built, including models based on Gaussianised features, speaker adaptive training, and feature-space MPE. A multi-branch system architecture is described in which multiple acoustic model types, alternative phone sets, and alternative segmentations can be used in a system combination framework to generate the final output. The final system shows state-of-the-art performance over a range of test sets.



IEEE Signal Processing Society
