Engineering school and research center in digital sciences

Artificial bandwidth extension using conditional variational auto-encoders and adversarial learning

Bachhav, Pramod; Todisco, Massimiliano; Evans, Nicholas

ICASSP 2020, 45th International Conference on Acoustics, Speech, and Signal Processing, 4-8 May 2020, Barcelona, Spain

Artificial bandwidth extension (ABE) algorithms have been developed to estimate missing highband frequency components (4-8 kHz) in order to improve the quality of narrowband (0-4 kHz) telephone calls. Most ABE solutions employ deep neural networks (DNNs) due to their well-known ability to model highly complex, non-linear relationships between narrowband and highband features. Generative models such as conditional variational auto-encoders (CVAEs) are capable of modelling complex data distributions via latent representation learning. This paper reports their application to ABE. CVAEs, a form of directed graphical model, are exploited to model the probability distribution of highband features conditioned on narrowband features. While CVAEs are trained with the standard mean square error (MSE) criterion, their combination with adversarial learning gives further improvements. When compared to results obtained with the baseline approach, the wideband PESQ is improved significantly by 0.21 points. Performance is also compared on an automatic speech recognition (ASR) task on the TIMIT dataset, where the word error rate (WER) is decreased by an absolute value of 0.3%.
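The abstract does not spell out the training objective, so the following is only a minimal sketch of how an MSE-trained CVAE might be combined with an adversarial term. It assumes the usual CVAE formulation (reconstruction error plus a KL divergence between a diagonal Gaussian posterior and a standard normal prior) and a non-saturating generator loss. The function names, the weights `kl_weight` and `adv_weight`, and the scalar `disc_score` are all hypothetical and are not taken from the paper. In a real system, `hb_est`, `mu` and `log_var` would be produced by networks that take the narrowband features as their conditioning input.

```python
import math
import random


def mse(target, estimate):
    """Mean square error between two equal-length feature vectors."""
    return sum((t - e) ** 2 for t, e in zip(target, estimate)) / len(target)


def kl_to_standard_normal(mu, log_var):
    """Closed-form KL( N(mu, diag(exp(log_var))) || N(0, I) )."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv
                     for m, lv in zip(mu, log_var))


def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, 1) (reparameterization trick)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]


def cvae_adversarial_loss(hb_true, hb_est, mu, log_var, disc_score,
                          kl_weight=1.0, adv_weight=0.1):
    """Hypothetical generator-side objective: MSE + KL + adversarial term.

    disc_score is assumed to be a discriminator's probability (0..1)
    that the estimated highband features hb_est are real.
    """
    adversarial = -math.log(max(disc_score, 1e-12))  # guard against log(0)
    return (mse(hb_true, hb_est)
            + kl_weight * kl_to_standard_normal(mu, log_var)
            + adv_weight * adversarial)


# Toy usage with made-up two-dimensional features.
rng = random.Random(0)
mu, log_var = [0.0, 0.0], [0.0, 0.0]
z = reparameterize(mu, log_var, rng)
loss = cvae_adversarial_loss([1.0, 2.0], [1.0, 2.0], mu, log_var, disc_score=1.0)
```

With a perfect reconstruction, a posterior equal to the prior, and a fully fooled discriminator, every term vanishes, which makes the toy call a convenient sanity check for the three components.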


Title: Artificial bandwidth extension using conditional variational auto-encoders and adversarial learning
Keywords: Variational auto-encoder, generative adversarial network, latent variable, artificial bandwidth extension, speech quality
Type: Poster / Demo
Language: English
City: Barcelona
Country: SPAIN
Date:
Department: Digital Security
Eurecom ref: 6177
Copyright: © 2020 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
Bibtex:
@misc{EURECOM+6177,
  year = {2020},
  title = {{A}rtificial bandwidth extension using conditional variational auto-encoders and adversarial learning},
  author = {{B}achhav, {P}ramod and {T}odisco, {M}assimiliano and {E}vans, {N}icholas},
  number = {EURECOM+6177},
  month = {05},
  institution = {Eurecom},
  address = {{B}arcelona, {SPAIN}},
  url = {http://www.eurecom.fr/publication/6177}
}