Towards effective spatio-temporal analysis for content-based video retrieval