In this paper we study the use of Variational Bayesian (VB) methods for speaker change detection and we compare results with the classical BIC solution. VB methods are approximated learning algorithms for fully bayesian inference that cannot be achieved in an exact form. They embed in the objective function (also known as free energy) a term that penalizes more complex models. Experiments are run on the Hub4 1996 evaluation data set and show that the VB outperforms the BIC of almost 7%. Anyway as long as the decision must be taken on a limited amount of data the VB based method must be tuned as the BIC based method in order to produce reasonable results.
Variational Bayesian speaker change detection
Interspeech 2005, 9th European Conference on Speech Communication and Technology, September 4-8, 2005, Lisbon, Portugal
© ISCA. Personal use of this material is permitted. The definitive version of this paper was published in Interspeech 2005, 9th European Conference on Speech Communication and Technology, September 4-8, 2005, Lisbon, Portugal and is available at :
PERMALINK : https://www.eurecom.fr/publication/1691