Nielsen, Frank; Nock, Richard; Amari, Shun-ichi
Clustering sets of histograms has become popular thanks to the success of the generic method of bag-of-X used in text categorization and in visual categorization applications. In this paper, we investigate the use of a parametric family of distortion measures, called the α-divergences, for clustering histograms. Since it usually makes sense to deal with symmetric divergences in information retrieval systems, we symmetrize the α-divergences using the concept of mixed divergences. First, we...