Paper: | AE-L1.6 |
Session: | Audio Structure, Similarity and Segmentation |
Time: | Tuesday, May 16, 12:10 - 12:30 |
Presentation: |
Lecture
|
Topic: |
Audio and Electroacoustics: Applications to Music |
Title: |
Comparing Audio and Video Segmentations for Music Videos Indexing |
Authors: |
Olivier Gillet, Gaël Richard, GET / Télécom Paris, France |
Abstract: |
Music videos are good examples of multimedia documents in which the structures of the audio and video streams are highly correlated. This paper presents a system that matches these structures and extracts correlation measures. Audio segmentation is performed at the event level by detecting onsets, and at a higher level by a novelty detection algorithm identifying instrumentation changes. Video segmentation is performed by detecting changes in the motion intensity descriptor, and at the shot level by using a classical histogram-based shot detection algorithm. Audio-visual correlation measures are computed on the extracted structures. Possible applications include video retrieval from audio content or classification of music videos by genre. |