ICASSP 2006 - May 15-19, 2006 - Toulouse, France

Technical Program

Paper Detail

Paper:SLP-P13.5
Session:Speech Synthesis III
Time:Thursday, May 18, 10:00 - 12:00
Presentation: Poster
Topic: Speech and Spoken Language Processing: Segmental-Level and/or concatenative synthesis
Title: EFFICIENT INTERACTIVE WEIGHT TUNING FOR TTS SYNTHESIS: REDUCING USER FATIGUE BY IMPROVING USER CONSISTENCY
Authors: Francesc Alías, Enginyeria i Arquitectura La Salle. Ramon Llull University., Spain; Xavier Llorà, University of Illinois at Urbana-Champaign, United States; Lluís Formiga, Enginyeria i Arquitectura La Salle. Ramon Llull University., Spain; Kumara Sastry, David E. Goldberg, University of Illinois at Urbana-Champaign, United States
Abstract: The quality of corpus-based text-to-speech systems depends on the accuracy of the unit selection process, which in turn relies on the cost function definition. This function should map the user perceptual preference when selecting synthesis units, which is a very difficult task. This paper continues our previous work on fusing the human judgements with the cost function by means of interactive weight tuning. The application of active interactive genetics algorithms mitigates user fatigue by improving user consistency. As a result, the obtained weights generate more natural synthetic speech when compared to previous objective and subjective proposals.



IEEESignal Processing Society

©2018 Conference Management Services, Inc. -||- email: webmaster@icassp2006.org -||- Last updated Friday, August 17, 2012