Multi-modal classifier fusion for video shot content retrieval