Temporal normalization of videos using visual speech

Saeed, Usman; Dugelay, Jean-Luc

 

 

 

 

 

 

Pose and illumination variation has been considered the major cause of poor recognition results in automatic face recognition as compared to other biometrics. With the advent of video based face recognition a decade ago we were presented with some new opportunities, algorithms were developed to take advantage of the abundance of data and behavioral aspect of recognition. But this modality introduced some new challenges also, one of them was the variation introduced by speech. In this paper we present a novel method for handling this variation by using temporal normalization based on lip motion. Evaluation was carried out by comparing face recognition results from original non-normalized videos and normalized videos.


DOI
Type:
Conférence
City:
Beijing
Date:
2009-10-19
Department:
Sécurité numérique
Eurecom Ref:
2867
Copyright:
© ACM, 2009. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in http://dx.doi.org/10.1145/1631081.1631084

PERMALINK : https://www.eurecom.fr/publication/2867