Temporal normalization of videos using visual speech

Saeed, Usman; Dugelay, Jean-Luc
MIFOR 2009, 1st ACM Workshop on Multimedia in Forensics, October 19-24, 2009, Beijing, China

Pose and illumination variation has been considered the major cause of poor recognition results in automatic face recognition compared with other biometrics. The advent of video-based face recognition a decade ago brought new opportunities: algorithms were developed to take advantage of the abundance of data and the behavioral aspect of recognition. This modality also introduced new challenges, one of which is the variation introduced by speech. In this paper we present a novel method for handling this variation using temporal normalization based on lip motion. The method was evaluated by comparing face recognition results on the original, non-normalized videos with those on the normalized videos.


Type: Conference
City: Beijing
Date: 2009-10-19
Department: Digital Security
Eurecom Ref: 2867
Copyright: © ACM, 2009. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in MIFOR 2009, 1st ACM Workshop on Multimedia in Forensics, October 19-24, 2009, Beijing, China.
DOI: http://dx.doi.org/10.1145/1631081.1631084

Permalink: https://www.eurecom.fr/publication/2867