Scalable data exchange with functional dependencies

Bruno Marnette, Giansalvatore Mecca, Paolo Papotti

Research output: Chapter in Book/Report/Conference proceeding › Chapter

30 Citations (Scopus)

Abstract

The recent literature has provided a solid theoretical foundation for the use of schema mappings in data-exchange applications. Following this formalization, new algorithms have been developed to generate optimal solutions for mapping scenarios in a highly scalable way, by relying on SQL. However, these algorithms suffer from a serious drawback: they are not able to handle key constraints and functional dependencies on the target, i.e., equality generating dependencies (egds). While egds play a crucial role in the generation of optimal solutions, handling them with first-order languages is a difficult problem. In fact, we start from a negative result: it is not always possible to compute solutions for scenarios with egds using an SQL script. Then, we identify many practical cases in which this is possible, and develop a best-effort algorithm to do this. Experimental results show that our algorithm produces solutions of better quality with respect to those produced by previous algorithms, and scales nicely to large databases.
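To make the notion of an equality generating dependency concrete, the sketch below enforces a single functional dependency on a small in-memory target instance that contains labeled nulls. It follows the textbook chase step (equate a null with a constant, fail on two distinct constants). It is not the SQL-based best-effort algorithm described in the abstract. The names `Null`, `ChaseFailure`, and `chase_fd`, and the sample rows, are hypothetical and introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Null:
    """A labeled null: a placeholder for an unknown value (illustrative type)."""
    label: str


class ChaseFailure(Exception):
    """Raised when the dependency would equate two distinct constants."""


def chase_fd(rows, lhs, rhs):
    """Enforce the functional dependency lhs -> rhs on a list of tuples.

    rows: equal-length tuples whose values are constants or Null objects.
    lhs, rhs: tuples of column positions.
    Returns a duplicate-free list of tuples that satisfies the dependency.
    """
    rows = list(dict.fromkeys(rows))  # drop duplicates, keep order
    changed = True
    while changed:
        changed = False
        seen = {}  # lhs values -> first row seen with those values
        for row in rows:
            key = tuple(row[i] for i in lhs)
            first = seen.setdefault(key, row)
            for j in rhs:
                a, b = first[j], row[j]
                if a == b:
                    continue
                if isinstance(a, Null):
                    old, new = a, b
                elif isinstance(b, Null):
                    old, new = b, a
                else:
                    raise ChaseFailure(f"cannot equate {a!r} and {b!r}")
                # Replace the null everywhere, then restart the scan.
                rows = list(dict.fromkeys(
                    tuple(new if v == old else v for v in r) for r in rows
                ))
                changed = True
                break
            if changed:
                break
    return rows


# Column 0 acts as a key for columns 1 and 2.
rows = [
    ("alice", Null("N1"), "Rome"),
    ("alice", "1980", Null("N2")),
    ("bob", Null("N3"), Null("N4")),
]
result = chase_fd(rows, lhs=(0,), rhs=(1, 2))
```

Here the two "alice" tuples collapse into one tuple with both constants filled in, while the "bob" tuple keeps its nulls. Each substitution removes one null from the instance, so the loop terminates. The repeated rescanning is what makes this naive version hard to express as a fixed SQL script, which is the difficulty the abstract refers to.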

Original language: English
Title of host publication: Proceedings of the VLDB Endowment
Pages: 105-116
Number of pages: 12
Volume: 3
Edition: 1
Publication status: Published - Sep 2010
Externally published: Yes

Fingerprint

Electronic data interchange

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Computer Science (all)

Cite this

Marnette, B., Mecca, G., & Papotti, P. (2010). Scalable data exchange with functional dependencies. In Proceedings of the VLDB Endowment (1 ed., Vol. 3, pp. 105-116).

@inbook{1566bb1e60394d499f4850d031215192,
title = "Scalable data exchange with functional dependencies",
abstract = "The recent literature has provided a solid theoretical foundation for the use of schema mappings in data-exchange applications. Following this formalization, new algorithms have been developed to generate optimal solutions for mapping scenarios in a highly scalable way, by relying on SQL. However, these algorithms suffer from a serious drawback: they are not able to handle key constraints and functional dependencies on the target, i.e., equality generating dependencies (egds). While egds play a crucial role in the generation of optimal solutions, handling them with first-order languages is a difficult problem. In fact, we start from a negative result: it is not always possible to compute solutions for scenarios with egds using an SQL script. Then, we identify many practical cases in which this is possible, and develop a best-effort algorithm to do this. Experimental results show that our algorithm produces solutions of better quality with respect to those produced by previous algorithms, and scales nicely to large databases.",
author = "Bruno Marnette and Giansalvatore Mecca and Paolo Papotti",
year = "2010",
month = "9",
language = "English",
volume = "3",
pages = "105--116",
booktitle = "Proceedings of the VLDB Endowment",
edition = "1",

}

TY - CHAP

T1 - Scalable data exchange with functional dependencies

AU - Marnette, Bruno

AU - Mecca, Giansalvatore

AU - Papotti, Paolo

PY - 2010/9

Y1 - 2010/9

N2 - The recent literature has provided a solid theoretical foundation for the use of schema mappings in data-exchange applications. Following this formalization, new algorithms have been developed to generate optimal solutions for mapping scenarios in a highly scalable way, by relying on SQL. However, these algorithms suffer from a serious drawback: they are not able to handle key constraints and functional dependencies on the target, i.e., equality generating dependencies (egds). While egds play a crucial role in the generation of optimal solutions, handling them with first-order languages is a difficult problem. In fact, we start from a negative result: it is not always possible to compute solutions for scenarios with egds using an SQL script. Then, we identify many practical cases in which this is possible, and develop a best-effort algorithm to do this. Experimental results show that our algorithm produces solutions of better quality with respect to those produced by previous algorithms, and scales nicely to large databases.

UR - http://www.scopus.com/inward/record.url?scp=79952774500&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79952774500&partnerID=8YFLogxK

M3 - Chapter

VL - 3

SP - 105

EP - 116

BT - Proceedings of the VLDB Endowment

ER -