Multi-modal classifier fusion for video shot content retrieval

Souvannavong, Fabrice;Mérialdo, Bernard;Huet, Benoit

WIAMIS 2005, 6th International Workshop on Image Analysis for Multimedia Interactive Services, April 13-15, 2005, Montreux, Switzerland

In this paper we present a new chromosome to solve the problem of classifier fusion using genetic algorithm. Experiments are conducted in the context of TRECVID. In particular we focus on the feature extraction task that consists in retrieving video shots expressing one of predefined semantic concepts. Three modalities (visual, textual and motion) and two features per modality are used to describe the content of a video shot. Thus, we require fusion techniques to efficiently manage all these heterogeneous sources of information. A first step achieves the classification per feature and concept, then a genetic algorithm is used to efficiently fuse the output of all classifiers. For this purpose, a dynamic binary tree is proposed to model the novel chromosome for hierarchical fusion.

Detail

Document

BIBTEX

Type:

Conférence

City:

Montreux

Date:

2005-04-13

Department:

Data Science

Eurecom Ref:

1700