Speaker diarization using unsupervised discriminant analysis of inter-channel delay features

Evans, Nicholas; Fredouille, Corinne; Bonastre, Jean-François
ICASSP 2009, International conference on Acoustics, Speech and Signal Processing, April 19-24, 2009, Taipei, Taiwan

 

 

 

 

 

 

When multiple microphones are available estimates of inter-channel delay, which characterise a speaker's location, can be used as features for speaker diarization. Background noise and reverberation can, however, lead to noisy features and poor performance. To ameliorate these problems, this paper presents a new approach to the discriminant analysis of delay features for speaker diarization. This novel and onetheless unsupervised approach aims to increase speaker separability in delay-space. We assess the approach on subsets of four standard NIST RT datasets and demonstrate a relative improvement in diarization error rate of 25% on a separate evaluation set using delay features alone.


DOI
HAL
Type:
Conference
City:
Taipei
Date:
2009-04-19
Department:
Digital Security
Eurecom Ref:
2654
Copyright:
© 2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
See also:

PERMALINK : https://www.eurecom.fr/publication/2654