Paper: | SLP-P19.9 |
Session: | Model-based Robust Speech Recognition |
Time: | Friday, May 19, 10:00 - 12:00 |
Presentation: |
Poster
|
Topic: |
Speech and Spoken Language Processing: Confidence Measures and Rejection algorithms |
Title: |
RANDOM FORESTS-BASED CONFIDENCE ANNOTATION USING NOVEL FEATURES FROM CONFUSION NETWORK |
Authors: |
Jian Xue, Yunxin Zhao, University of missouri-Columbia, United States |
Abstract: |
In this paper, we propose a set of new features for confidence annotation, including three features derived from confusion network and one from statistical significance test. We also propose using Random Forests as confidence classifier. The new features are combined with a set of eight previously proposed confidence features, and the Random Forests is compared with Decision tree and Support Vector Machine. Experiments were conducted on telehealth captioning task with a vocabulary size of 46,489. Average confidence annotation accuracy of 84.69% was achieved on 5 doctors’ test set. In addition, Random Forests was shown useful for feature importance ranking. The proposed features are shown important in confidence annotation and Random Forests achieved best results among the three classifiers. |