Vocoder drift compensation by x-vector alignment in speaker anonymisation

Panariello, Michele; Todisco, Massimiliano; Evans, Nicholas
SIG-SPSC 2023, 3rd ISCA Security and Privacy in Speech Communication Symposium, co-located with 3rd VoicePrivacy Challenge Workshop, 19 August 2023, Dublin, Ireland

For the most popular x-vector-based approaches to speaker anonymisation, the bulk of the anonymisation can stem from vocoding rather than from the core anonymisation function which is used to substitute an original speaker x-vector with that of a fictitious pseudo-speaker. This phenomenon can impede the design of better anonymisation systems since there is a lack of fine-grained control over the x-vector space. The work reported in this paper explores the origin of so-called vocoder drift and shows that it is due to the mismatch between the substituted x-vector and the original representations of the linguistic content, intonation and prosody. Also reported is an original approach to vocoder drift compensation. While anonymisation performance degrades as expected, compensation reduces vocoder drift substantially, offers improved control over the x-vector space and lays a foundation for the design of better anonymisation functions in the future.
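As a rough illustration of the idea of vocoder drift, the sketch below measures how far the x-vector re-extracted from vocoded speech lies from the pseudo-speaker x-vector that was supplied to the vocoder. The use of cosine distance, the function names and the toy 3-dimensional vectors are all assumptions made for illustration. The abstract does not state how the paper quantifies drift, and real x-vectors have far more dimensions.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / (norm_a * norm_b)


def vocoder_drift(target_xvector, reextracted_xvector):
    """Hypothetical drift measure (an assumption, not the paper's definition):
    distance between the x-vector fed to the vocoder and the x-vector
    extracted again from the vocoder's output speech."""
    return cosine_distance(target_xvector, reextracted_xvector)


# Toy vectors standing in for real x-vectors.
target = [1.0, 0.0, 0.0]        # pseudo-speaker x-vector given to the vocoder
same_direction = [2.0, 0.0, 0.0]  # re-extracted vector pointing the same way
orthogonal = [0.0, 1.0, 0.0]      # re-extracted vector pointing elsewhere

drift_none = vocoder_drift(target, same_direction)
drift_large = vocoder_drift(target, orthogonal)
```

Under this assumed measure, a value near 0 would mean the vocoder output still carries the intended pseudo-speaker identity, while larger values would indicate that vocoding itself has moved the speaker representation away from the one the anonymisation function selected.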

Type:
Conference
City:
Dublin
Date:
2023-08-19
Department:
Digital Security
Eurecom Ref:
7376
Copyright:
© ISCA. Personal use of this material is permitted. The definitive version of this paper was published in SIG-SPSC 2023, 3rd ISCA Security and Privacy in Speech Communication Symposium, co-located with 3rd VoicePrivacy Challenge Workshop, 19 August 2023, Dublin, Ireland and is available at :

PERMALINK : https://www.eurecom.fr/publication/7376