Several applications, such as text-to-SQL and computational fact checking, exploit the relationship between relational data and natural language text. However, state of the art solutions simply fail in managing “data-ambiguity", i.e., the case when there are multiple interpretations of the relationship between text and data. Given the ambiguity in language, text can be mapped to different subsets of data, but existing training corpora only have examples in which every sentence/question is annotated precisely w.r.t. the relation. This unrealistic assumption leaves the target applications unable to handle ambiguous cases. To tackle this problem, we present a deep learning method that identifies every pair of data ambiguous attributes and a label that describes both columns. Such metadata can then be used to generate examples with data ambiguities for any input table.
Attribute ambiguity discovery: A deep learning approach via unsupervised learning
SEBD 2023, 31st Symposium on Advanced Database Systems, 02-05 July 2023, Galzignano Terme, Padua, Italy
PERMALINK : https://www.eurecom.fr/publication/7432