μBE: User guided source selection and schema mediation for internet scale data integration

Ashraf Aboulnaga, Kareem El Gebaly

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Citations (Scopus)

Abstract

The typical approach to data integration is to start by defining a common mediated schema, and then to map the data sources being integrated to this schema. In Internetscale data integration tasks, where there may be hundreds or thousands of data sources providing data of relevance to a particular domain, a better approach is to allow the user to discover the mediated schema and the set of sources to use through an iterative exploration of the space of possible schemas and sources. In this paper, we present μBE, a data integration tool that helps in this iterative exploratory process by automatically choosing the data sources to include in a data integration system and defining a mediated schema on these sources. The data integration system desired by the user may depend on several subjective and objective criteria, and the user guides μBE towards finding this system by iteratively solving a series of constrained non-linear optimization problems, and modifying the parameters and constraints of the problem in the next iteration based on the solution found in the current iteration. Our formulation of the optimization problem is designed to make it easy for the user to provide such feedback. A simple, intuitive user interface helps the user in this process. We experimentally demonstrate that μBE is efficient and finds highquality data integration solutions.

Original languageEnglish
Title of host publication23rd International Conference on Data Engineering, ICDE 2007
Pages186-195
Number of pages10
DOIs
Publication statusPublished - 24 Sep 2007
Event23rd International Conference on Data Engineering, ICDE 2007 - Istanbul, Turkey
Duration: 15 Apr 200720 Apr 2007

Publication series

NameProceedings - International Conference on Data Engineering
ISSN (Print)1084-4627

Other

Other23rd International Conference on Data Engineering, ICDE 2007
CountryTurkey
CityIstanbul
Period15/4/0720/4/07

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Information Systems

Fingerprint Dive into the research topics of 'μBE: User guided source selection and schema mediation for internet scale data integration'. Together they form a unique fingerprint.

  • Cite this

    Aboulnaga, A., & El Gebaly, K. (2007). μBE: User guided source selection and schema mediation for internet scale data integration. In 23rd International Conference on Data Engineering, ICDE 2007 (pp. 186-195). [4221667] (Proceedings - International Conference on Data Engineering). https://doi.org/10.1109/ICDE.2007.367864