Paper: | SLP-P11.12 |
Session: | Front-end For Robust Speech Recognition |
Time: | Wednesday, May 17, 16:30 - 18:30 |
Presentation: |
Poster
|
Topic: |
Speech and Spoken Language Processing: End-point detection and barge-in methods |
Title: |
DOUBLE-TALK FREE SPOKEN DIALOGUE INTERFACE COMBINING SOUND FIELD CONTROL WITH SEMI-BLIND SOURCE SEPARATION |
Authors: |
Shigeki Miyabe, Tomoya Takatani, Yoshimitsu Mori, Hiroshi Saruwatari, Kiyohiro Shikano, Nara Institute of Science and Technology, Japan; Yosuke Tatekura, Shizuoka University, Japan |
Abstract: |
In this paper we introduce a new double-talk free spoken dialogue interface combining sound field control and a source separation technique based on independent component analysis (ICA). First, sound field control provides silent zones on the microphone elements and prevents the response sound from being observed. In the second step, we propose a novel semi-blind source separation algorithm to suppress the error caused by fluctuation of the room transfer function. By using a direct input of response sound signal to ICA, a source separation problem can be converted to a supervised learning problem. Since the problem becomes easier, the proposed method showed higher performances than the method using blind source separation. |