Natural language access to video databases

Francis, Danny; Pidou, Paul; Merialdo, Bernard; Huet, Benoit

BIGMM 2017, 3rd International Conference on Multimedia Big Data, 19-21 April 2017, Laguna Hills, USA

This paper deals with natural language access to video databases. Two approaches are proposed: in the first one we use queries to find images similar to ideo keyframes, and in

the second one we generate text descriptions from keyframes and compare them with queries. We propose four implementations of these approaches: one implementation of the first approach, two implementations of the second one and one implementation mixing both approaches. The results of our implementations are discussed, in particular regarding the visual content of natural language queries.

Detail

Document

DOI

BIBTEX

Type:

Conférence

City:

Laguna Hills

Date:

2017-04-19

Department:

Data Science

Eurecom Ref:

5199

© 2017 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.