Graduate School and Research Center in Digital Sciences

ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements

Delgado, Hector; Todisco, Massimiliano; Sahidullah, Md; Evans, Nicholas; Kinnunen, Tomi; Lee, Kong Aik; Yamagishi, Junichi

ODYSSEY 2018, The Speaker and Language Recognition Workshop, June 26-29, 2018, Les Sables d'Olonne, France

The now-acknowledged vulnerabilities of automatic speaker verification (ASV) technology to spoofing attacks have spawned interests to develop so-called spoofing countermeasures. By providing common databases, protocols and metrics for their assessment, the ASVspoof initiative was born to spearhead research in this area. The first competitive ASVspoof challenge held in 2015 focused on the assessment of countermeasures to protect ASV technology from voice conversion and speech synthesis spoofing attacks. The second challenge switched focus to the consideration of replay spoofing attacks and countermeasures. This paper describes Version 2.0 of the ASVspoof 2017 database which was released to correct data anomalies detected post-evaluation. The paper contains as-yet unpublished meta-data which describes recording and playback devices and acoustic environments. These support the analysis of replay detection performance and limits. Also described are new results for the official ASVspoof baseline system which is based upon a constant Q cesptral coefficient frontend and a Gaussian mixture model backend. Reported are enhancements to the baseline system in the form of log-energy coefficients and cepstral mean and variance normalisation in addition to an alternative i-vector backend. The best results correspond to a 48% relative reduction in equal error rate when compared to the original baseline system. 

Document Hal Bibtex

Title:ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements
Type:Conference
Language:English
City:Les Sables d'Olonne
Country:FRANCE
Date:
Department:Digital Security
Eurecom ref:5504
Copyright: © ISCA. Personal use of this material is permitted. The definitive version of this paper was published in ODYSSEY 2018, The Speaker and Language Recognition Workshop, June 26-29, 2018, Les Sables d'Olonne, France and is available at :
Bibtex: @inproceedings{EURECOM+5504, year = {2018}, title = {{ASV}spoof 2017 {V}ersion 2.0: meta-data analysis and baseline enhancements}, author = {{D}elgado, {H}ector and {T}odisco, {M}assimiliano and {S}ahidullah, {M}d and {E}vans, {N}icholas and {K}innunen, {T}omi and {L}ee, {K}ong {A}ik and {Y}amagishi, {J}unichi}, booktitle = {{ODYSSEY} 2018, {T}he {S}peaker and {L}anguage {R}ecognition {W}orkshop, {J}une 26-29, 2018, {L}es {S}ables d'{O}lonne, {F}rance}, address = {{L}es {S}ables d'{O}lonne, {FRANCE}}, month = {06}, url = {http://www.eurecom.fr/publication/5504} }
See also: