Keyword spotting enhancement for video soundtrack indexing

Gelin, Philippe; Wellekens, Christian J
ICSLP 1996, IEEE 4th International Conference on Spoken Language Processing, October 3-6, 1996, Philadelphia, USA

Multimedia databases contain an increasing number of videos that are not easily accessible by semantic content. Among the useful indices that can be extracted from the soundtrack, the presence of a keyword at a given position plays a prominent role. This paper deals with the specific requirements of such a keyword spotter and the enhancements brought to our previous technique (1996) based on frame labeling. To be useful, such a keyword spotter has to be speaker-independent. Moreover, it has to be able to detect any word from an open vocabulary, which directly implies the use of a phonemic representation of the word. These constraints usually lead to an excessively time-consuming tool. Dividing the indexing process into two parts, the first one off-line and the second one at query time, allows a faster response.


Type:
Conference
City:
Philadelphia
Date:
1996-10-03
Department:
Digital Security
Eurecom Ref:
577
Copyright:
© 1996 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

PERMALINK: https://www.eurecom.fr/publication/577