Paper: | IMDSP-P4.5 |
Session: | Image/Video Indexing and Retrieval |
Time: | Tuesday, May 16, 16:30 - 18:30 |
Presentation: | Poster |
Topic: |
Image and Multidimensional Signal Processing: Video Indexing, Retrieval and Editing |
Title: |
A combined LSTM-RNN - HMM - Approach for Meeting Event Segmentation and Recognition |
Authors: |
Stephan Reiter, Björn Schuller, Gerhard Rigoll, Technische Universität München, Germany |
Abstract: |
Automatic segmentation and classification of recorded meetings provide a basis for effective browsing and querying of a meeting archive. Yet today's approaches are often not robust enough. We therefore aim to improve on this task by introducing a tandem approach that combines the discriminative abilities of recurrent neural networks with the time-warping capabilities of hidden Markov models. Long short-term memory cells are used within the neural network for audio-visual frame analysis; they help to bridge the long time lags typical of this task. Extensive test runs on the public M4 Scripted Meeting Corpus show strong performance for the proposed approach. |
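
The division of labour described in the abstract, a frame-level classifier followed by a hidden Markov model that handles temporal alignment, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that some frame classifier (an LSTM network in the paper) has already produced per-frame class posteriors, and it simplifies by using those posteriors directly as HMM emission scores with a "sticky" transition model. The function names `viterbi_smooth` and `to_segments` and the parameter `stay_prob` are hypothetical and chosen for illustration only.

```python
import math


def viterbi_smooth(posteriors, stay_prob=0.9):
    """Return the most likely class sequence for a list of per-frame posteriors.

    Assumption: posteriors[t][k] is the classifier's probability of class k
    at frame t. The transition model is hypothetical: remain in the same
    class with probability stay_prob, otherwise switch uniformly.
    """
    n_states = len(posteriors[0])
    if n_states < 2:
        raise ValueError("need at least two classes")
    log_stay = math.log(stay_prob)
    log_switch = math.log((1.0 - stay_prob) / (n_states - 1))
    eps = 1e-12  # avoids log(0) for zero-valued posteriors

    # Uniform initial state distribution (constant term dropped).
    score = [math.log(p + eps) for p in posteriors[0]]
    backpointers = []
    for frame in posteriors[1:]:
        new_score, pointers = [], []
        for j in range(n_states):
            # Best predecessor state for landing in state j at this frame.
            candidates = [
                score[i] + (log_stay if i == j else log_switch)
                for i in range(n_states)
            ]
            best_i = max(range(n_states), key=lambda i: candidates[i])
            new_score.append(candidates[best_i] + math.log(frame[j] + eps))
            pointers.append(best_i)
        score = new_score
        backpointers.append(pointers)

    # Backtrack from the best final state.
    state = max(range(n_states), key=lambda i: score[i])
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


def to_segments(path):
    """Collapse a frame-level path into (start, end_exclusive, class) segments."""
    segments, start = [], 0
    for t in range(1, len(path) + 1):
        if t == len(path) or path[t] != path[start]:
            segments.append((start, t, path[start]))
            start = t
    return segments
```

In this sketch the transition penalty suppresses isolated frame-level misclassifications, so a single noisy frame inside a long run of one class does not create a spurious segment boundary. Segmentation and recognition then fall out of the same decoding pass.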