Temporally consistent key frame selection from video for face recognition