Paper: | SLP-P18.9 |
Session: | LVCSR Systems |
Time: | Friday, May 19, 10:00 - 12:00 |
Presentation: |
Poster
|
Topic: |
Speech and Spoken Language Processing: Miscellaneous Topics |
Title: |
Efficient Grammar Generation and Tuning for Interactive Voice Response Applications |
Authors: |
Ellis Cave, Intervoice, Inc., United States; Mithun Balakrishna, Dan Moldovan, University of Texas, Dallas, United States |
Abstract: |
This paper presents a procedure to efficiently create and tune context free grammars for directed dialog speech applications using only spoken test user utterances. We present a procedure to transcribe utterances with improved accuracy by post-processing the ASR n-best lists with higher level knowledge sources and additional information from the application prompt. We then present a semantic categorizer for the transcriptions, a statistical filtering mechanism for modifying the grammars and, a mechanism to raise an alarm condition in case of large in-flow of errors. We also illustrate the importance of additional improvements gained by using the semantic classification strength in a feedback loop to the transcription mechanism. |