ICASSP 2006 - May 15-19, 2006 - Toulouse, France

Technical Program

Paper Detail

Paper: SLP-P17.1
Session: Spoken Language Modeling, Identification and Characterization
Time: Thursday, May 18, 16:30 - 18:30
Presentation: Poster
Topic: Speech and Spoken Language Processing: Language Identification
Title: CHINESE DIALECT IDENTIFICATION USING TONE FEATURES BASED ON PITCH FLUX
Authors: Bin Ma, Donglai Zhu, Rong Tong, Institute for Infocomm Research, Singapore
Abstract: This paper presents a method for extracting tone-relevant features based on pitch flux from continuous speech signals. The autocorrelations of two adjacent frames are calculated, and the covariance between them is estimated to extract multi-dimensional pitch flux features. These features, together with MFCCs, are modeled in a 2-stream GMM and tested on a 3-dialect identification task for Chinese. The pitch flux features are shown to be very effective in identifying tonal languages from short speech segments. For test speech segments of 3 seconds, the 2-stream model achieves more than a 30% error reduction over the MFCC-based model.
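
The abstract gives no formulas, so the following is only a minimal sketch of the idea it describes: compute the autocorrelation of each frame, then estimate the covariance between the autocorrelation sequences of two adjacent frames at several relative lag shifts, yielding one multi-dimensional vector per frame pair. The function names (frame_signal, autocorr, pitch_flux), the frame sizes, the 60-400 Hz lag range, the number of shifts, and the biased covariance estimate are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np

def frame_signal(x, frame_len, hop):
    """Split a 1-D signal into overlapping frames, one frame per row."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def autocorr(frame, min_lag, max_lag):
    """Energy-normalized autocorrelation for lags min_lag..max_lag."""
    frame = frame - frame.mean()
    energy = np.dot(frame, frame) + 1e-12  # guard against silent frames
    return np.array([np.dot(frame[:-lag], frame[lag:]) / energy
                     for lag in range(min_lag, max_lag + 1)])

def pitch_flux(x, sr, frame_len=0.03, hop=0.01,
               f_lo=60.0, f_hi=400.0, max_shift=3):
    """Hypothetical pitch-flux features, shape (n_frames - 1, 2*max_shift + 1).

    Each row holds the covariance between the autocorrelation sequences of
    frame t and frame t + 1, evaluated at lag shifts -max_shift..max_shift.
    A pitch that rises or falls between frames moves the peak away from
    the centre column (shift 0).
    """
    min_lag, max_lag = int(sr / f_hi), int(sr / f_lo)
    frames = frame_signal(x, int(frame_len * sr), int(hop * sr))
    acs = np.array([autocorr(f, min_lag, max_lag) for f in frames])
    acs = acs - acs.mean(axis=1, keepdims=True)  # centre each sequence
    n = acs.shape[1]
    feats = np.empty((len(acs) - 1, 2 * max_shift + 1))
    for t in range(len(acs) - 1):
        a, b = acs[t], acs[t + 1]
        for j, s in enumerate(range(-max_shift, max_shift + 1)):
            if s >= 0:
                prod = np.dot(a[:n - s], b[s:])
            else:
                prod = np.dot(a[-s:], b[:n + s])
            feats[t, j] = prod / n  # biased estimate (assumed choice)
    return feats

# Usage on a synthetic steady 200 Hz tone sampled at 8 kHz.
sr = 8000
t = np.arange(sr) / sr
tone = np.sin(2 * np.pi * 200.0 * t)
feats = pitch_flux(tone, sr)
```

For a steady tone the pitch does not change between frames, so every row peaks in the centre column. In a 2-stream setup such as the one the abstract mentions, vectors like these would form one stream alongside the MFCCs; how the streams are weighted is not stated in the abstract.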



IEEE Signal Processing Society
