Temporal normalization of videos using visual speech

Saeed, Usman; Dugelay, Jean-Luc

MIFOR 2009, 1st ACM Workshop on Multimedia in Forensics, October 19-24, 2009, Beijing, China

Pose and illumination variation has been considered the major cause of poor recognition results in automatic face recognition as compared to other biometrics. With the advent of video based face recognition a decade ago we were presented with some new opportunities, algorithms were developed to take advantage of the abundance of data and behavioral aspect of recognition. But this modality introduced some new challenges also, one of them was the variation introduced by speech. In this paper we present a novel method for handling this variation by using temporal normalization based on lip motion. Evaluation was carried out by comparing face recognition results from original non-normalized videos and normalized videos.

Detail

Document

DOI

BIBTEX

Type:

Conference

City:

Beijing

Date:

2009-10-19

Department:

Digital Security

Eurecom Ref:

2867

© ACM, 2009. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in MIFOR 2009, 1st ACM Workshop on Multimedia in Forensics, October 19-24, 2009, Beijing, China
http://dx.doi.org/10.1145/1631081.1631084