Scalable data exchange with functional dependencies

Bruno Marnette, Giansalvatore Mecca, Paolo Papotti

Research output: Chapter in Book/Report/Conference proceedingChapter

30 Citations (Scopus)

Abstract

The recent literature has provided a solid theoretical foundation for the use of schema mappings in data-exchange applications. Following this formalization, new algorithms have been developed to generate optimal solutions for mapping scenarios in a highly scalable way, by relying on SQL. However, these algorithms suffer from a serious drawback: they are not able to handle key constraints and functional dependencies on the target, i.e., equality generating dependencies (egds). While egds play a crucial role in the generation of optimal solutions, handling them with first-order languages is a difficult problem. In fact, we start from a negative result: it is not always possible to compute solutions for scenarios with egds using an SQL script. Then, we identify many practical cases in which this is possible, and develop a best-effort algorithm to do this. Experimental results show that our algorithm produces solutions of better quality with respect to those produced by previous algorithms, and scales nicely to large databases.

Original languageEnglish
Title of host publicationProceedings of the VLDB Endowment
Pages105-116
Number of pages12
Volume3
Edition1
Publication statusPublished - Sep 2010
Externally publishedYes

    Fingerprint

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Computer Science(all)

Cite this

Marnette, B., Mecca, G., & Papotti, P. (2010). Scalable data exchange with functional dependencies. In Proceedings of the VLDB Endowment (1 ed., Vol. 3, pp. 105-116)