Attribute ambiguity discovery: A deep learning approach via unsupervised learning

Veltri, Enzo; Badaro, Gilbert; Saeed, Mohammed; Papotti, Paolo

SEBD 2023, 31st Symposium on Advanced Database Systems, 02-05 July 2023, Galzignano Terme, Padua, Italy

Several applications, such as text-to-SQL and computational fact checking, exploit the relationship between relational data and natural language text. However, state-of-the-art solutions fail to handle “data-ambiguity”, i.e., the case in which the relationship between text and data admits multiple interpretations. Given the ambiguity of language, a text can be mapped to different subsets of the data, but existing training corpora only contain examples in which every sentence or question is annotated precisely with respect to the relation. This unrealistic assumption leaves the target applications unable to handle ambiguous cases. To tackle this problem, we present a deep learning method that identifies every pair of data-ambiguous attributes and a label that describes both columns. Such metadata can then be used to generate examples with data ambiguities for any input table.
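To make the notion of a data-ambiguous attribute pair concrete, the following minimal Python sketch shows what such metadata might look like and how it could be used to generate ambiguous examples from a table. It is an illustration only, not the method presented in the paper: the `AmbiguousPair` class, the `ambiguous_examples` helper, the question template, and the toy table with its column names and values are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AmbiguousPair:
    """Hypothetical metadata record: two columns and a label describing both."""
    attr_a: str
    attr_b: str
    label: str


def ambiguous_examples(rows, key, pair):
    """Build one ambiguous question per row.

    Each question mentions only the shared label, so it can be answered
    from either column; both readings are kept as valid interpretations.
    """
    examples = []
    for row in rows:
        question = f"What is the {pair.label} of {row[key]}?"
        interpretations = {
            pair.attr_a: row[pair.attr_a],
            pair.attr_b: row[pair.attr_b],
        }
        examples.append({"question": question, "interpretations": interpretations})
    return examples


# Toy table (invented values, for illustration only).
rows = [
    {"player": "Player A", "fg_pct": 47.3, "three_pt_pct": 38.1},
    {"player": "Player B", "fg_pct": 51.0, "three_pt_pct": 33.4},
]

# Assumed output of an ambiguity-discovery step: both columns can be
# described as a "shooting percentage".
pair = AmbiguousPair("fg_pct", "three_pt_pct", "shooting percentage")

examples = ambiguous_examples(rows, "player", pair)
for example in examples:
    print(example["question"], example["interpretations"])
```

In this sketch, a question such as "What is the shooting percentage of Player A?" maps to two different cells of the table, which is the kind of example that precisely annotated corpora do not contain.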

Type: Conference

City: Galzignano Terme

Date: 2023-07-02

Department: Data Science

Eurecom Ref: 7432

CEUR