Open Research will be unavailable from 3am to 7am on Thursday 4th December 2025 AEDT due to scheduled maintenance.
 

Layered Dynamic Mixture Model for Pattern Discovery in Asynchronous Multi-Modal Streams

Date

Authors

Xie, Lexing
Kennedy, Lyndon
Chang, Shih-Fu
Divakaran, Ajay
Sun, Huifang
Lin, Ching-Yung

Journal Title

Journal ISSN

Volume Title

Publisher

Institute of Electrical and Electronics Engineers (IEEE Inc)

Abstract

We propose a layered dynamic mixture model for asynchronous multi-modal fusion for unsupervised pattern discovery in video. The lower layer of the model uses generative temporal structures such as a hierarchical hidden Markov model to convert the audiovisual streams into mid-level labels, it also models the correlations in text with probabilistic latent semantic analysis. The upper layer fuses the statistical evidence across diverse modalities with a flexible meta-mixture model that assumes loose temporal correspondence. Evaluation on a large news database shows that multi-modal clusters have better correspondence to news topics than audio-visual clusters alone; novel analysis techniques suggest that meaningful clusters occur when the prediction of salient features by the model concurs with those shown in the story clusters.

Description

Citation

Source

Proceedings of ICCASP 2005

Book Title

Entity type

Access Statement

License Rights

Restricted until

2037-12-31