Understanding videos with face recognition: a complete pipeline and applications

Lisena, Pasquale; Laaksonen, Jorma; Troncy, Raphaël
Multimedia Systems Journal, Special issue on data-driven Personalization of television content, Vol.28, N°6, December 2022

When browsing or studying a video corpus, particularly relevant information consists in knowing who are the people appearing in the scenes. In this paper, we show how a combination of state of the art techniques can be organised in a pipeline for face recognition of celebrities. In particular, we propose a system which combines MTCNN for detecting faces and FaceNet for extracting face embeddings, which are used to train a set of classifiers. The face recognition results obtained at a frame level are then combined with those in consecutive frames, relying on automatic object tracking. Differently from previous work, we use images automatically retrieved by web search engines. We evaluate the systems one three datasets including historical videos from 1945 to 1969 and  contemporary videos, obtaining a good precision score. In addition, we show how the obtained results can be applied to foster historical studies.

DOI
HAL
Type:
Journal
Date:
2022-06-15
Department:
Data Science
Eurecom Ref:
6931
Copyright:
© Springer. Personal use of this material is permitted. The definitive version of this paper was published in Multimedia Systems Journal, Special issue on data-driven Personalization of television content, Vol.28, N°6, December 2022 and is available at : http://dx.doi.org/10.1007/s00530-022-00959-x

PERMALINK : https://www.eurecom.fr/publication/6931