Visual versus textual embedding for video retrieval