BIGMM 2017, 3rd International Conference on Multimedia Big Data, 19-21 April 2017, Laguna Hills, USA
This paper deals with natural language access to video databases. Two approaches are proposed: in the first one we use queries to find images similar to ideo keyframes, and in
the second one we generate text descriptions from keyframes and compare them with queries. We propose four implementations of these approaches: one implementation of the first approach, two implementations of the second one and one implementation mixing both approaches. The results of our implementations are discussed, in particular regarding the visual content of natural language queries.
© 2017 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.