Methods for automated dataset interlinking

Scharffe, François; Fan, Zhengjie; Ferrara, Alfio; Khrouf, Houda; Nikolov, Andriy
Report Datalift/2011/D4.1/v0.2

Interlinking data is a crucial step in the Datalift platform framework. It ensures that the published datasets are connected with others on the Web. Many techniques are developed on this topic in order to automate the task of finding similar entities in two datasets. In this deliverable, we first clarify terminology in the field of linking data. Then we classify and overview many techniques used to automate data linking on the web. We finally review 11 state-of-the-art tools and classify them according to which technique they use.


HAL
Type:
Rapport
Date:
2011-12-24
Department:
Data Science
Eurecom Ref:
3936
Copyright:
© EURECOM. Personal use of this material is permitted. The definitive version of this paper was published in Report Datalift/2011/D4.1/v0.2 and is available at :
See also:

PERMALINK : https://www.eurecom.fr/publication/3936