Graduate School and Research Center in Digital Sciences

Articulation rate filtering of CQCC features for automatic speaker verification

Todisco, Massimiliano; Delgado, Héctor; Evans, Nicholas

INTERSPEECH 2016, Annual Conference of the International Speech Communication Association, September 8-12, 2016, San Francisco, USA

This paper introduces a new articulation rate filter and reports its combination with recently proposed constant Q cepstral coefficients (CQCCs) in their first application to automatic speaker verification (ASV). CQCC features are extracted with the constant Q transform (CQT), a perceptually-inspired alternative to Fourier-based approaches to time-frequency analysis. The CQT offers greater frequency resolution at lower frequencies and greater time resolution at higher frequencies. When coupled with cepstral analysis and the new articulation rate filter, the resulting CQCC features are readily modelled using conventional techniques. A comparative assessment of CQCCs and mel frequency cepstral coefficients (MFCC) for a short-duration speaker verification scenario shows that CQCCs generally outperform MFCCs and that the two feature representations are highly complementary; fusion experiments with the RSR2015 and RedDots databases show relative reductions in equal error rates of as much as 60% compared to an MFCC baseline.

Document Bibtex

Title:Articulation rate filtering of CQCC features for automatic speaker verification
Keywords:Automatic speaker verification, constant Q cepstral coefficients, articulatory filter
Type:Conference
Language:English
City:San Francisco
Country:UNITED STATES
Date:
Department:Digital Security
Eurecom ref:4937
Copyright: © ISCA. Personal use of this material is permitted. The definitive version of this paper was published in INTERSPEECH 2016, Annual Conference of the International Speech Communication Association, September 8-12, 2016, San Francisco, USA and is available at :
Bibtex: @inproceedings{EURECOM+4937, year = {2016}, title = {{A}rticulation rate filtering of {CQCC} features for automatic speaker verification }, author = {{T}odisco, {M}assimiliano and {D}elgado, {H}{\'e}ctor and {E}vans, {N}icholas}, booktitle = {{INTERSPEECH} 2016, {A}nnual {C}onference of the {I}nternational {S}peech {C}ommunication {A}ssociation, {S}eptember 8-12, 2016, {S}an {F}rancisco, {USA}}, address = {{S}an {F}rancisco, {UNITED} {STATES}}, month = {09}, url = {http://www.eurecom.fr/publication/4937} }
See also: