Paper: | SLP-P8.7 |
Session: | Speaker Recognition: Features |
Time: | Wednesday, May 17, 10:00 - 12:00 |
Presentation: |
Poster
|
Topic: |
Speech and Spoken Language Processing: Speaker Identification |
Title: |
RULES BASED FEATURE MODIFICATION FOR AFFECTIVE SPEAKER RECOGNITION |
Authors: |
Zhaohui Wu, Dongdong Li, Yingchun Yang, Zhejiang University, China |
Abstract: |
One of the largest challenges in speaker recognition application is dealing with speaker-emotion variability. In this paper, we further investigate the rules based feature modification for robust speaker recognition with emotional speech. Specifically, we learn the rules of prosodic features modification from a small amount of the content matched source-target pairs. Features with emotion information are adapted from the prevalent neutral features by applying the modification rules. The converted features are trained together with the neutral features to build the speaker models. The effect of individual and combined modifications of duration, pitch and amplitude is also studied using EPST dataset recorded by 8 professional actors with 14 kinds of emotion expressiveness. It demonstrates that duration modifications play the most important role; and that, pitch modifications are more effective than amplitude modifications. Promising result with an improved identification rate by 7.83% is achieved compared to the traditional speaker recognition. |