Utterance verification for text-dependent speaker recognition: a comparative assessment using the RedDots corpus

Kinnunen, Tomi; Sahidullah, Md; Kukanov, Ivan; Delgado, Héctor; Todisco, Massimiliano; Sarkar, Achintya; Thomsen, Nicolai; Hautamaki, Ville; Evans, Nicholas; Tan, Zheng-Hua
INTERSPEECH 2016, Annual Conference of the International Speech Communication Association, September 8-12, 2016, San Francisco, USA

Text-dependent automatic speaker verification naturally calls for the simultaneous verification of speaker identity and spoken content. These two tasks can be achieved with automatic speaker verification (ASV) and utterance verification (UV) technologies. While both have been addressed previously in the literature, a treatment of simultaneous speaker and utterance verification with a modern, standard database is so far lacking. This is despite the burgeoning demand for voice biometrics in a plethora of practical security applications. With the goal of improving overall verification performance, this paper reports different strategies for simultaneous ASV and UV in the context of short-duration, text-dependent speaker verification. Experiments performed on the recently released RedDots corpus are reported for three different ASV systems and four different
UV systems. Results show that the combination of utterance verification with automatic speaker verification is (almost) universally beneficial with significant performance improvements being observed.

DOI
Type:
Conférence
City:
San Francisco
Date:
2016-09-08
Department:
Sécurité numérique
Eurecom Ref:
4934
Copyright:
© ISCA. Personal use of this material is permitted. The definitive version of this paper was published in INTERSPEECH 2016, Annual Conference of the International Speech Communication Association, September 8-12, 2016, San Francisco, USA and is available at : http://dx.doi.org/10.21437/Interspeech.2016-1125

PERMALINK : https://www.eurecom.fr/publication/4934