ICASSP 2006 - May 15-19, 2006 - Toulouse, France

Technical Program

Paper Detail

Paper:AE-L1.6
Session:Audio Structure, Similarity and Segmentation
Time:Tuesday, May 16, 12:10 - 12:30
Presentation: Lecture
Topic: Audio and Electroacoustics: Applications to Music
Title: Comparing Audio and Video Segmentations for Music Videos Indexing
Authors: Olivier Gillet, Gaël Richard, GET / Télécom Paris, France
Abstract: Music videos are good examples of multimedia documents in which the structures of the audio and video streams are highly correlated. This paper presents a system that matches these structures and extracts correlation measures. Audio segmentation is performed at the event level by detecting onsets, and at a higher level by a novelty detection algorithm identifying instrumentation changes. Video segmentation is performed by detecting changes in the motion intensity descriptor, and at the shot level by using a classical histogram-based shot detection algorithm. Audio-visual correlation measures are computed on the extracted structures. Possible applications include video retrieval from audio content or classification of music videos by genre.



IEEESignal Processing Society

©2018 Conference Management Services, Inc. -||- email: webmaster@icassp2006.org -||- Last updated Friday, August 17, 2012