ICASSP 2006 - May 15-19, 2006 - Toulouse, France

Technical Program

Paper Detail

Paper:IMDSP-P4.5
Session:Image/Video Indexing and Retrieval
Time:Tuesday, May 16, 16:30 - 18:30
Presentation: Poster
Topic: Image and Multidimensional Signal Processing: Video Indexing, Retrieval and Editing
Title: A combined LSTM-RNN - HMM - Approach for Meeting Event Segmentation and Recognition
Authors: Stephan Reiter, Björn Schuller, Gerhard Rigoll, Technische Universität München, Germany
Abstract: Automatic segmentation and classification of recorded meetings provides a basis that enables effective browsing and querying in a meeting archive. Yet, robustness of today's approaches is often not reliable enough. We therefore strive to improve on this task by introduction of a tandem approach combining the discriminative abilities of recurrent neural nets and warping capabilities of hidden markov models. Thereby long short-term memory cells are used for audio-visual frame analysis within the neural net. These help to overcome typical long time lags. Extensive test runs on the public M4 Scripted Meeting Corpus show great performance applying our suggested novel approach.



IEEESignal Processing Society

©2018 Conference Management Services, Inc. -||- email: webmaster@icassp2006.org -||- Last updated Friday, August 17, 2012