Direct modeling of image keypoints distribution through copula-based image signatures

Redi, Miriam; Merialdo, Bernard

ICMR 2013, ACM International Conference on Multimedia Retrieval, April 16-19, Dallas, Texas, USA

Local Image Descriptors (LID) aggregation models such as Bag of Words and Fisher Vectors represent an image based on the distribution of its LIDs given a global model, e.g. a
visual codebook or a Gaussian Mixture. Inspired by Copula theory, in this paper we propose a LID-based feature that represents directly the behavior of the image LID distribution, without requiring to compute a global model. Following the definition of Copula, we represent the distribution of the image LIDs by describing, on one side, its marginals, and on the other side, a Copula function. The Copula defines the dependencies between the marginals and their mapping to a multivariate probability distribution function. We test the resulting feature for scene recognition and video retrieval (Trecvid data), showing that our
approach outperforms, in both tasks, the Bag of Words and the Fisher Vectors Model.

Detail

Document

DOI

BIBTEX

Type:

Conference

City:

Dallas

Date:

2013-04-16

Department:

Data Science

Eurecom Ref:

3939

© ACM, 2013. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in ICMR 2013, ACM International Conference on Multimedia Retrieval, April 16-19, Dallas, Texas, USA http://dx.doi.org/10.1145/2461466.2461498