Data Modeling Strategies for Imbalanced Learning in Visual Search

Date

2007

Authors

Tesic, Jelena
Natsev, Apostol
Xie, Lexing
Smith, John R

Journal Title

Journal ISSN

Volume Title

Publisher

IEEE Computer Society

Abstract

In this paper we examine a novel approach to the difficult problem of querying video databases using visual topics with few examples. Typically with visual topics, the examples are not sufficiently diverse to create a robust model of the user's need. As a result, direct modeling using the provided topic examples as training data is inadequate. Otherwise, systems resort to multiple content-based searches using each example in turn, which typically provides poor results. We propose a new technique of leveraging unlabeled data to expand the diversity of the topic examples as well as provide a robust set of negative examples that allow direct modeling. The approach intelligently models a pseudo-negative space using unbiased and biased methods for data sampling and data selection. We apply the proposed method in a fusion framework to improve discriminative support vector machine modeling, and improve the overall system performance. The result is an enhanced performance over any of the baseline models, as well as improved robustness with respect to training examples, visual features, and visual support of video topics in TRECVID. The proposed method outperforms a baseline retrieval approach by more than 18% on the TRECVID 2006 video collection and query topics.

Description

Keywords

Keywords: Baseline models; Content-based; Data modelling; Data sampling; Data Selection; Direct modeling; Enhanced performance; international conferences; Machine-Model (MM); Negative examples; nove l approach; Retrieval (MIR); Robust modeling; system performances;

Citation

Source

Proceedings IEEE International Conference on Multimedia Communications & Computing (ICME 07)

Type

Conference paper

Book Title

Entity type

Access Statement

License Rights

DOI

Restricted until

2037-12-31