Paper: | SLP-P1.10 |
Session: | Feature Extraction and Modeling |
Time: | Tuesday, May 16, 10:30 - 12:30 |
Presentation: |
Poster
|
Topic: |
Speech and Spoken Language Processing: Feature Extraction and Modeling |
Title: |
Maximum Likelihood based Temporal Frame Selection |
Authors: |
Tingyao Wu, Dirk Van Compernolle, Jacques Duchateau, Hugo Van hamme, Katholieke Universiteit Leuven, Belgium |
Abstract: |
In this paper, we propose a maximum likelihood (ML) based frame selection approach. A fixed frame rate adopted in most state-of-the-art speech recognition systems can face some problems, such as accidentally meeting noisy frames, assigning the same importance to each frame, and pitch asynchronous representation. As an attempt to avoid those problems, our approach selects reliable frames from a fine resolution along the time axis. In a phoneme recognition task, we show that significant improvements are achieved with the frame selection approach comparing to a system with a fixed frame rate. |