Detection of speaker changes in an audio document

Delacourt, Perrine;Kryze, David;Wellekens, Christian J
EUROSPEECH 1999, ISCA Conference on Speech Communication and Technology, 4-10 September 1999, Budapest, Hungary

This paper addresses the problem of speaker-based segmentation.
The aim is to segment the audio data with respect to the speakers. In our study, we assume that no prior information on speakers is available and that people do not speak simultaneously. Our segmentation technique is operated in two passes: first, the most likely speaker changes are detected and then, they are validated or discarded during the second pass. The practical significance of this study is illustrated by applying our technique to synthesized and real data to show its efficiency and to compare its performances with another segmentation technique.


Type:
Conférence
City:
Budapest
Date:
1999-09-04
Department:
Sécurité numérique
Eurecom Ref:
177
Copyright:
© ISCA. Personal use of this material is permitted. The definitive version of this paper was published in EUROSPEECH 1999, ISCA Conference on Speech Communication and Technology, 4-10 September 1999, Budapest, Hungary and is available at :

PERMALINK : https://www.eurecom.fr/publication/177