Zero-shot information extraction to enhance a knowledge graph describing silk textiles

Schleider, Thomas; Troncy, Raphaël
LaTeCH-CLfL 2021, 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, 11 November 2021, Punta Cana, Dominican Republic (Online Event)

The knowledge of the European silk textile production is a typical case for which the information collected is heterogeneous, spread across many museums and sparse since rarely complete. Knowledge Graphs for this cultural heritage domain, when being developed with appropriate ontologies and vocabularies, enable to integrate and reconcile this diverse information. However, many of these original museum records still have some metadata gaps. In this paper, we present a zero-shot learning approach that leverages the ConceptNet common sense knowledge graph to predict categorical metadata informing about the silk objects production. We compared the performance of our approach with traditional supervised deep learning-based methods that do require training data. We demonstrate promising and competitive performance for similar datasets and circumstances and the ability to predict sometimes more fine-grained information. Our results can be reproduced using the code and datasets published at https://github.com/silknow/ZSL-KG-silk.

DOI
Type:
Conference
City:
Punta Cana
Date:
2021-11-11
Department:
Data Science
Eurecom Ref:
6734
Copyright:
Copyright ACL. Personal use of this material is permitted. The definitive version of this paper was published in LaTeCH-CLfL 2021, 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, 11 November 2021, Punta Cana, Dominican Republic (Online Event) and is available at : http://dx.doi.org/10.18653/v1/2021.latechclfl-1.16

PERMALINK : https://www.eurecom.fr/publication/6734