Spatial encoding of visual words for image classification
Loading...
Date
Authors
Liu, Dong
Wang, Shengsheng
Porikli, Fatih
Journal Title
Journal ISSN
Volume Title
Publisher
Society of Photo-optical Instrumentation Engineers (SPIE)
Abstract
Appearance-based bag-of-visual words (BoVW) models are employed to represent the frequency of a
vocabulary of local features in an image. Due to their versatility, they are widely popular, although they ignore the
underlying spatial context and relationships among the features. Here, we present a unified representation that
enhances BoVWs with explicit local and global structure models. Three aspects of our method should be noted in
comparison to the previous approaches. First, we use a local structure feature that encodes the spatial attributes
between a pair of points in a discriminative fashion using class-label information. We introduce a bag-of-structural
words (BoSW) model for the given image set and describe each image with this model on its coarsely
sampled relevant keypoints. We then combine the codebook histograms of BoVW and BoSW to train a classifier.
Rigorous experimental evaluations on four benchmark data sets demonstrate that the unified representation
outperforms the conventional models and compares favorably to more sophisticated scene classification techniques.
Description
Citation
Collections
Source
Journal of Electronic Imaging
Type
Book Title
Entity type
Access Statement
Open Access
License Rights
Restricted until
Downloads
File
Description