When textual and visual information join forces for multimedia retrieval