Improving robustness of a likelihood-based beamformer in a real environment for automatic speech recognition

Brayda, Luca;Wellekens, Christian J;Omologo, Maurizio
SPECOM 2006, 11th International Conference Speech and Computer, June 25-29, 2006, Saint-Petersburg, Russia

Performance of distant-talking speech recognizers in real noisy environments can be increased using a microphone array. In this work we propose an N-best extension of the Limabeam algorithm, which is a likelihood-based adaptive lter-and-sum beamformer. We show that this algorithm can be used to optimize the noisy acoustic features using in parallel the N-best hypothesized transcriptions generated at a rst recognition step. The parallel and independent optimizations increase the likelihood of minimal word error rate hypotheses and the resulting N-best hypotheses list is automatically re-ranked. Results show improvements over delay-and-sum beamforming and Unsupervised Limabeam on a real database with considerable amount of noise and limited reverberation.


Type:
Conference
City:
Saint-Petersburg
Date:
2006-06-25
Department:
Digital Security
Eurecom Ref:
2054
Copyright:
© Elsevier. Personal use of this material is permitted. The definitive version of this paper was published in SPECOM 2006, 11th International Conference Speech and Computer, June 25-29, 2006, Saint-Petersburg, Russia and is available at :

PERMALINK : https://www.eurecom.fr/publication/2054