Technical Program

Paper Detail

Paper:	AE-L1.5
Session:	Audio Structure, Similarity and Segmentation
Time:	Tuesday, May 16, 11:50 - 12:10
Presentation:	Lecture
Topic:	Audio and Electroacoustics: Audio for Multimedia
Title:	AUDIO ELEMENTS BASED AUDITORY SCENE SEGMENTATION
Authors:	Lie Lu, Microsoft Research Asia, China; Rui Cai, Tsinghua University, China; Alan Hanjalic, Technical University of Delft, Netherlands
Abstract:	Auditory scene segmentation is an important step in the process of high-level semantic inference from audio data streams, and in particular, a prerequisite for auditory scene categorization. In this paper, we analyze the limits of previous works on auditory scene segmentation, and then propose a novel method that, conceptually, is inspired by the ideas used in video scene segmentation, and is based on an analysis of audio elements and key audio elements, which can be seen as equivalents to the words and keywords in a text document, respectively. Experiments performed on 1.5 hours of audio data indicate that the proposed approach is promising.