RuDiK: Rule discovery in knowledge bases

Papotti, Paolo; Ortona, Stefano; Meduri; Venkata Vamsikrishna
VLDB 2018, 44th International Conference on Very Large Data Bases, 27-31 August 2018, Rio de Janeiro, Brazil / Proceedings of the VLDB Endowment, Vol.11, N°12, August 2018


RuDiK is a system for the discovery of declarative rules over knowledge-bases (KBs). RuDiK discovers both positive rules, which identify relationships between entities, e.g., "if two persons have the same parent, they are siblings", and negative rules, which identify data contradictions, e.g., "if two persons are married, one cannot be the child of the other". Rules help domain experts to curate data in large KBs. Positive rules suggest new facts to mitigate incompleteness and negative rules detect erroneous facts. Also, negative rules are useful to generate negative examples for learning algorithms. RuDiK goes beyond existing solutions since it discovers rules with a more expressive rule language w.r.t. previous approaches, which leads to wide coverage of the facts in the KB, and its mining is robust to existing errors and incompleteness in the KB. The system has been deployed for multiple KBs, including Yago, DBpedia, Freebase and WikiData, and identifies new facts and real errors with 85% to 97% accuracy, respectively. This demonstration shows how RuDiK can be used to interact with domain experts. Once the audience pick a KB and a predicate, they will add new facts, remove errors, and train a machine learning system with automatically generated examples. 


DOI
Type:
Conference
City:
Rio de Janeiro
Date:
2018-08-27
Department:
Data Science
Eurecom Ref:
5608
Copyright:
© ACM, 2018. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in VLDB 2018, 44th International Conference on Very Large Data Bases, 27-31 August 2018, Rio de Janeiro, Brazil / Proceedings of the VLDB Endowment, Vol.11, N°12, August 2018
 http://doi.org/10.14778/3229863.3236231
See also:

PERMALINK : https://www.eurecom.fr/publication/5608