User variance and its impact on video retrieval benchmarking