Unsupervised multi-view dimensionality reduction with application to audio-visual speaker retrieval

Zhao, Xuran; Evans, Nicholas; Dugelay, Jean-Luc
WIFS 2013, IEEE International Workshop on Information Forensics and Security, 18-21 November, Guangzhou, China

Best Student Paper Award

This paper presents a novel approach to unsupervised multi-view dimensionality reduction and reports its application to multi-modal biometrics retrieval, specifically audiovisual speaker retrieval. We propose a new concept referred to as multi-view subspace agreement, which aims to learn a subspace for each view which respects the similarity relationships between data points in the other view. The proposed algorithm is unsupervised but exhibits discriminative characteristics and is thus well suited to applications such as retrieval and clustering where class labels are generally unavailable. The effectiveness of the proposed algorithm is evaluated under an audio-visual speaker retrieval experiment with the MOBIO database. The retrieval performance of the proposed approach out-performs other single-view or multi-view dimensionality reduction methods with a significant margin.

Sécurité numérique
Eurecom Ref:
© 2013 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

PERMALINK : https://www.eurecom.fr/publication/4143