Paper: | SLP-L9.1 |
Session: | Spoken Language Identification |
Time: | Thursday, May 18, 16:30 - 16:50 |
Presentation: |
Lecture
|
Topic: |
Speech and Spoken Language Processing: Language Identification |
Title: |
LANGUAGE IDENTIFICATION USING PITCH CONTOUR INFORMATION IN THE ERGODIC MARKOV MODEL |
Authors: |
Chi-Yueh Lin, Hsiao-Chuan Wang, National Tsing Hua University, Taiwan |
Abstract: |
It had been shown that a segment of pitch contour represented by a set of Legendre polynomial coefficients was successful to the pair-wise language identification task. Feature vectors comprising these polynomial coefficients were formerly modeled by a Gaussian mixture model (GMM) for each language. However, the static model like GMM does not take advantage of the temporal information across several pitch contours. It is intuitive that the temporal information of prosodic features should be used for capturing the characteristics of a specific language. In this paper, a novel dynamic model in ergodic topology is proposed. The experiments show that the proposed method significantly improves the identification rate, even for stress-timed and syllable-timed languages. |