Open-set semi-supervised audio-visual speaker recognition using co-training LDA and sparse representation classifiers

Zhao, Xuran; Evans, Nicholas; Dugelay, Jean-Luc

ICASSP 2013, 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, May 26-31, Vancouver, Canada

Semi-supervised learning is attracting growing interest within the biometrics community. Almost all prior work focuses on closedset scenarios, in which samples labelled automatically are assumed to belong to an enrolled class. This is often not the case in realistic applications and thus open-set alternatives are needed. This paper proposes a new approach to open-set, semi-supervised learning based on co-training, Linear Discriminant Analysis (LDA) subspaces and Sparse Representation Classifiers (SRCs). Experiments on the standard MOBIO dataset show how the new approach can utilize automatically labelled data to augment a smaller, manually labelled dataset and thus improve the performance of an open-set audio-visual person recognition system.

Detail

Document

DOI

BIBTEX

Type:

Conference

City:

Vancouver

Date:

2013-05-26

Department:

Digital Security

Eurecom Ref:

4016

© 2013 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.