Paper: | AE-L1.5 |
Session: | Audio Structure, Similarity and Segmentation |
Time: | Tuesday, May 16, 11:50 - 12:10 |
Presentation: |
Lecture
|
Topic: |
Audio and Electroacoustics: Audio for Multimedia |
Title: |
AUDIO ELEMENTS BASED AUDITORY SCENE SEGMENTATION |
Authors: |
Lie Lu, Microsoft Research Asia, China; Rui Cai, Tsinghua University, China; Alan Hanjalic, Technical University of Delft, Netherlands |
Abstract: |
Auditory scene segmentation is an important step in the process of high-level semantic inference from audio data streams, and in particular, a prerequisite for auditory scene categorization. In this paper, we analyze the limits of previous works on auditory scene segmentation, and then propose a novel method that, conceptually, is inspired by the ideas used in video scene segmentation, and is based on an analysis of audio elements and key audio elements, which can be seen as equivalents to the words and keywords in a text document, respectively. Experiments performed on 1.5 hours of audio data indicate that the proposed approach is promising. |