Meta-mappings for schema mapping reuse

Atzeni, Paolo; Bellomarini, Luigi; Papotti, Paolo; Torlone, Riccardo
VLDB Endowment, Volume 12, Issue 5, January 2019

The de nition of mappings between heterogeneous schemas is a critical activity of any database application. Existing tools provide high level interfaces for the discovery of correspondences between elements of schemas, but schema mappings need to be manually speci ed every time from scratch, even if the scenario at hand is similar to one that has already been addressed. The problem is that schema mappings are precisely de ned over a pair of schemas and cannot directly be reused on different scenarios. We tackle this challenge by generalizing schema mappings as meta-mappings: formalisms that describe transformations between generic data structures called meta-schemas. We formally characterize schema mapping reuse and explain how meta-mappings are able to: (i) capture enterprise knowledge from previously de ned schema mappings and (ii) use this knowledge to suggest new mappings. We develop techniques to infer metamappings from existing mappings, to organize them into a searchable repository, and to leverage the repository to propose to users mappings suitable for their needs. We study effectiveness and efficiency in an extensive evaluation over real-world scenarios and show that our system can infer, store, and search millions of meta-mappings in seconds.

Data Science
Eurecom Ref:
© ACM, 2019. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in VLDB Endowment, Volume 12, Issue 5, January 2019