Knowledge extraction in web media: At the frontier of NLP, machine learning and semantics