Technical Program

Paper Detail

Paper:	SLP-P19.9
Session:	Model-based Robust Speech Recognition
Time:	Friday, May 19, 10:00 - 12:00
Presentation:	Poster
Topic:	Speech and Spoken Language Processing: Confidence Measures and Rejection algorithms
Title:	RANDOM FORESTS-BASED CONFIDENCE ANNOTATION USING NOVEL FEATURES FROM CONFUSION NETWORK
Authors:	Jian Xue, Yunxin Zhao, University of missouri-Columbia, United States
Abstract:	In this paper, we propose a set of new features for confidence annotation, including three features derived from confusion network and one from statistical significance test. We also propose using Random Forests as confidence classifier. The new features are combined with a set of eight previously proposed confidence features, and the Random Forests is compared with Decision tree and Support Vector Machine. Experiments were conducted on telehealth captioning task with a vocabulary size of 46,489. Average confidence annotation accuracy of 84.69% was achieved on 5 doctors’ test set. In addition, Random Forests was shown useful for feature importance ranking. The proposed features are shown important in confidence annotation and Random Forests achieved best results among the three classifiers.