Ecole d'ingénieur et centre de recherche en Sciences du numérique

Artificial bandwidth extension with memory inclusion using semi-supervised stacked auto-encoders

Bachhav, Pramod; Todisco, Massimiliano; Evans, Nicholas

INTERSPEECH 2018, 19th Annual Conference of the International Speech Communication Association, September 2-6, 2018, Hyderabad, India

Artificial bandwidth extension (ABE) algorithms have been developed to improve quality when wideband devices receive speech signals from narrowband devices or infrastructure. The utilisation of contextual information in the form of dynamic features or explicit memory captured from neighbouring frames is common to ABE research, however the use of additional cues augments complexity and can introduce latency. Previous work shows that unsupervised, linear dimensionality reduction techniques help to reduce complexity. This paper reports a semisupervised, non-linear approach to dimensionality reduction using a stacked auto-encoder. In further contrast to previous work, it operates on raw spectra from which a low dimensional narrowband representation is learned in a data-driven manner. Three different objective speech quality measures show that the new features can be used with a standard regression model to improve ABE performance. Improvements in the mutual information between learned features and missing higher frequency components are also observed whereas improvements in speech quality are corroborated by informal listening tests.

Document Bibtex

Titre:Artificial bandwidth extension with memory inclusion using semi-supervised stacked auto-encoders
Mots Clés:artificial bandwidth extension, auto-encoder, dimensionality reduction, mutual information
Type:Conférence
Langue:English
Ville:Hyderabad
Pays:INDE
Date:
Département:Sécurité numérique
Eurecom ref:5592
Copyright: © ISCA. Personal use of this material is permitted. The definitive version of this paper was published in INTERSPEECH 2018, 19th Annual Conference of the International Speech Communication Association, September 2-6, 2018, Hyderabad, India and is available at :
Bibtex: @inproceedings{EURECOM+5592, year = {2018}, title = {{A}rtificial bandwidth extension with memory inclusion using semi-supervised stacked auto-encoders}, author = {{B}achhav, {P}ramod and {T}odisco, {M}assimiliano and {E}vans, {N}icholas}, booktitle = {{INTERSPEECH} 2018, 19th {A}nnual {C}onference of the {I}nternational {S}peech {C}ommunication {A}ssociation, {S}eptember 2-6, 2018, {H}yderabad, {I}ndia }, address = {{H}yderabad, {INDE}}, month = {09}, url = {http://www.eurecom.fr/publication/5592} }
Voir aussi: