A novel ensemble method for named entity recognition and disambiguation based on neural network

Canale, Lorenzo; Lisena, Pasquale; Troncy, Raphaël
ISWC 2018, International forum for the Semantic Web and Linked Data Community, 8-12 october 2018, Monterey, CA, USA / Also published in LNCS, Vol.11136

Named entity recognition (NER) and disambiguation (NED) are subtasks of information extraction that aim to recognize named entities mentioned in text, to assign them pre-defi ned types, and to link them with their matching entities in a knowledge base. Many approaches, often exposed as web APIs, have been proposed to solve these tasks during the last years. These APIs classify entities using different taxonomies and disambiguate them with different knowledge bases. In this paper, we describe Ensemble Nerd, a framework that collects numerous extractors responses, normalizes them and combines them in order to produce a  nal entity list according to the pattern (surface form, type, link). The presented approach is based on representing the extractors responses as real-value vectors and on using them as input samples for two Deep Learning networks: ENNTR (Ensemble Neural Network for Type Recognition) and ENND (Ensemble Neural Network for Disambiguation). We train these networks using speci c gold standards. We show that the models produced outperform each single extractor responses in terms of micro and macro F1 measures computed by the GERBIL framework.

DOI
HAL
Type:
Conférence
City:
Monterey
Date:
2018-10-08
Department:
Data Science
Eurecom Ref:
5564
Copyright:
© Springer. Personal use of this material is permitted. The definitive version of this paper was published in ISWC 2018, International forum for the Semantic Web and Linked Data Community, 8-12 october 2018, Monterey, CA, USA / Also published in LNCS, Vol.11136 and is available at : http://doi.org/10.1007/978-3-030-00671-6_6

PERMALINK : https://www.eurecom.fr/publication/5564