Technical Program

Paper Detail

Paper:	SLP-P15.4
Session:	Spoken Document Search, Navigation and Summarization
Time:	Thursday, May 18, 14:00 - 16:00
Presentation:	Poster
Topic:	Speech and Spoken Language Processing: Speech data mining and document retrieval
Title:	SPOKEN PROPER NAME RETRIEVAL IN AUDIO STREAMS FOR LIMITED-RESOURCE LANGUAGES VIA LATTICE BASED SEARCH USING HYBRID REPRESENTATIONS
Authors:	Murat Akbacak, John H. L. Hansen, University of Texas, Dallas, United States
Abstract:	Research in multilingual speech recognition has shown that current speech recognition technology generalizes across different languages, and that similar modeling assumptions hold, provided that linguistic knowledge (e.g., phoneme inventory, pronunciation dictionary, etc.) and transcribed speech data are available for the target language. Linguists make a very conservative estimate that 4000 languages are spoken today in the world, and in many of these languages, very limited linguistic knowledge and speech data/resources are available. Rapid transition to a new target language becomes a practical concern within the concept of tiered resources. In this study, we present our research efforts towards multilingual spoken information retrieval with limitations in acoustic training data. We propose different retrieval algorithms to leverage existing resources from resource-rich languages as well as the target language using a lattice-based search. We use Latin-American Spanish as the target language. After searching for queries consisting of Spanish proper names in Spanish Broadcast News data, we obtain performance (max-F value of 28.3%) close to that of a Spanish based system (trained on speech data from 36 speakers) using only 25% of all the available speech data from the original target language.