Paper: | SLP-L11.4 |
Session: | Advances in Speech Analysis and Representations |
Time: | Friday, May 19, 15:00 - 15:20 |
Presentation: |
Lecture
|
Topic: |
Speech and Spoken Language Processing: Speech Analysis |
Title: |
GENERALIZED PERCEPTUAL FEATURES FOR VOCALIZATION ANALYSIS ACROSS MULTIPLE SPECIES |
Authors: |
Patrick Clemins, Marek Trawicki, Kuntoro Adi, Jidong Tao, Michael Johnson, Marquette University, United States |
Abstract: |
This paper introduces the Greenwood Function Cepstral Coefficient (GFCC) and Generalized Perceptual Linear Prediction (GPLP) feature extraction models for the analysis of animal vocalizations across arbitrary species. These features are generalizations of the well-known Mel-Frequency Cepstral Coefficient (MFCC) and Perceptual Linear Prediction (PLP) approaches, tailored to take optimal advantage of available knowledge of each species’ auditory frequency range and/or audiogram data. Illustrative results are presented comparing use of the GFCC and GPLP features versus MFCC features over the same frequency ranges. |