Multi-level fusion for semantic video content indexing and retrieval