Variational bayesian GMM for speech recognition

Valente, Fabio; Wellekens, Christian J
EUROSPEECH 2003, 8th european conference on speech communication and technology, September 1-4, 2003, Geneva, Switzerland

In this paper, we explore the potentialities of Variational Bayesian (VB) learning for speech recognition problems. VB methods deal in a more rigorous way with model selection and are a generalization of MAP learning. VB training for Gaussian Mixture Models is less affected than EM-ML training by over- fitting and singular solutions. We compare two types of Variational Bayesian Gaussian Mixture Models (VBGMM) with classical EM-ML GMM in a phoneme recognition task on the TIMIT database. VB learning performs better than EM-ML learning and is less affected by the initial model guess.


Type:
Conference
City:
Geneva
Date:
2003-09-01
Department:
Digital Security
Eurecom Ref:
1194
Copyright:
© ISCA. Personal use of this material is permitted. The definitive version of this paper was published in EUROSPEECH 2003, 8th european conference on speech communication and technology, September 1-4, 2003, Geneva, Switzerland and is available at :

PERMALINK : https://www.eurecom.fr/publication/1194