Weighting informativeness of bag-of-visual-words by Kernel optimization for video concept detection
VLS-MCMR 2010, International Workshop on Very-Large-Scale Multimedia Corpus, Mining and Retrieval, 29 October 2010, Florence, Italy
Bag-of-Visual-Words (BoW) feature has been demonstrated e®ective and widely used in video concept detection due to its discriminative ability by capturing the local information in images. In the current approaches, all the words in the visual vocabulary are treated equally for the detection of dif- ferent concepts. This cannot highlight the concept-speci¯c visual information, and thus limits the discriminative ability of BoW feature. In this paper, we propose an approach to boost the performance of video concept detection based on BoW. This is achieved by assigning di®erent weights to the visual words according to their informativeness for the de- tection of di®erent concepts. Kernel alignment score (KAS) is used to measure the discriminative ability of SVM kernels, and the visual words are weighted as a kernel optimization problem. We show that the SVMs based on weighted visual words with our approach outperform the uniformly weight- ing and TF-IDF weighting schemes, and the MAP for the 20 concepts from TRECVID 2009 high-level feature extraction is signi¯cantly improved.
| Mots Clés: | Bag-of-Visual-Words, Kernel Optimization, Concept Detection |
| Type: | Conférence |
| Langue: | English |
| Ville: | Florence |
| Pays: | ITALIE |
| Date: | Octobre 2010 |
| Département: | Communications Multimédia |
| Eurecom ref: | 3245 |
| Copyright: | © ACM, 2010. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in VLS-MCMR 2010, International Workshop on Very-Large-Scale Multimedia Corpus, Mining and Retrieval, 29 October 2010, Florence, Italy http://dx.doi.org/10.1145/1878137.1878150 |
| Bibtex: | @inproceedings{EURECOM+3245, doi = {http://dx.doi.org/10.1145/1878137.1878150 }, year = {2010}, title = {{W}eighting informativeness of bag-of-visual-words by {K}ernel optimization for video concept detection}, author = {{W}ang, {F}eng and {M}{\'e}rialdo, {B}ernard }, booktitle = {{VLS}-{MCMR} 2010, {I}nternational {W}orkshop on {V}ery-{L}arge-{S}cale {M}ultimedia {C}orpus, {M}ining and {R}etrieval, 29 {O}ctober 2010, {F}lorence, {I}taly }, address = {{F}lorence, {ITALIE}}, month = {10}, url = {http://www.eurecom.fr/publication/3245} } |
| Voir aussi: |
|
Permalink: http://www.eurecom.fr/publication/3245


