Natural language access to video databases

Francis, Danny; Pidou, Paul; Merialdo, Bernard; Huet, Benoit
BIGMM 2017, 3rd International Conference on Multimedia Big Data, 19-21 April 2017, Laguna Hills, USA

This paper deals with natural language access to video databases. Two approaches are proposed: in the first one we use queries to find images similar to  ideo keyframes, and in
the second one we generate text descriptions from keyframes and compare them with queries. We propose four implementations of these approaches: one implementation of the first approach, two implementations of the second one and one implementation mixing both approaches. The results of our implementations are discussed, in particular regarding the visual content of natural language queries.

DOI
Type:
Conférence
City:
Laguna Hills
Date:
2017-04-19
Department:
Data Science
Eurecom Ref:
5199
Copyright:
© 2017 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

PERMALINK : https://www.eurecom.fr/publication/5199