EURECOM and ECNU at TrecVid 2010 : The semantic indexing task

Redi, Miriam; Mérialdo, Bernard; Wang, Feng
TRECVID 2010, 14th International Workshop on Video Retrieval Evaluation, November 15-17, 2010 National Institute of Standards and Technology, Gaithersburg, Maryland USA

This year EURECOM and ECNU participated together at the TRECVID Semantic Indexing

Task. We built four different systems for the light (10 concepts) submission. Three of our runs

are functionally similar to the system used by EURECOM for last year's High Level Feature

Extraction task (see [6] for further details).

We keep as a basic run (Fusebase) the best-performing system from 2009, testing how such

system performs on the new dataset; we then improve the EURECOM Fusebase by adding

a global descriptor, originally built for scene recognition, and proved to be effective in the

TRECVID context for spatially-independent concepts like "Nighttime". We then experiment

with a multi-modal analysis, combining the visual features with the textual metadata that have

been provided with the 2010 video database. As last run, we try a new system based on Hamming

Embedding and Weighted Visual words.


Type:
Conference
City:
Gaithersburg
Date:
2010-11-15
Department:
Data Science
Eurecom Ref:
3358
Copyright:
© NIST. Personal use of this material is permitted. The definitive version of this paper was published in TRECVID 2010, 14th International Workshop on Video Retrieval Evaluation, November 15-17, 2010 National Institute of Standards and Technology, Gaithersburg, Maryland USA and is available at :

PERMALINK : https://www.eurecom.fr/publication/3358