ICASSP 2006 - May 15-19, 2006 - Toulouse, France

Technical Program

Paper Detail

Paper: SLP-L9.1
Session: Spoken Language Identification
Time: Thursday, May 18, 16:30 - 16:50
Presentation: Lecture
Topic: Speech and Spoken Language Processing: Language Identification
Title: LANGUAGE IDENTIFICATION USING PITCH CONTOUR INFORMATION IN THE ERGODIC MARKOV MODEL
Authors: Chi-Yueh Lin, Hsiao-Chuan Wang, National Tsing Hua University, Taiwan
Abstract: Earlier work showed that a segment of pitch contour, represented by a set of Legendre polynomial coefficients, is effective for the pair-wise language identification task. Feature vectors comprising these polynomial coefficients were previously modeled by a Gaussian mixture model (GMM) for each language. However, a static model such as a GMM does not take advantage of the temporal information across several pitch contours. Intuitively, the temporal information of prosodic features should help capture the characteristics of a specific language. This paper proposes a novel dynamic model with an ergodic topology. Experiments show that the proposed method significantly improves the identification rate, even for stress-timed and syllable-timed languages.



IEEE Signal Processing Society
