Xie, Lexing; Kennedy, Lyndon; Chang, Shih-Fu; Divakaran, Ajay; Sun, Huifang; Lin, Ching-Yung
We propose a layered dynamic mixture model for asynchronous multi-modal fusion, aimed at unsupervised pattern discovery in video. The lower layer of the model uses generative temporal structures such as a hierarchical hidden Markov model to convert the audiovisual streams into mid-level labels; it also models the correlations in text with probabilistic latent semantic analysis. The upper layer fuses the statistical evidence across diverse modalities with a flexible meta-mixture model that assumes...