μBE

User guided source selection and schema mediation for internet scale data integration

Ashraf Aboulnaga, Kareem El Gebaly

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Citations (Scopus)

Abstract

The typical approach to data integration is to start by defining a common mediated schema, and then to map the data sources being integrated to this schema. In Internetscale data integration tasks, where there may be hundreds or thousands of data sources providing data of relevance to a particular domain, a better approach is to allow the user to discover the mediated schema and the set of sources to use through an iterative exploration of the space of possible schemas and sources. In this paper, we present μBE, a data integration tool that helps in this iterative exploratory process by automatically choosing the data sources to include in a data integration system and defining a mediated schema on these sources. The data integration system desired by the user may depend on several subjective and objective criteria, and the user guides μBE towards finding this system by iteratively solving a series of constrained non-linear optimization problems, and modifying the parameters and constraints of the problem in the next iteration based on the solution found in the current iteration. Our formulation of the optimization problem is designed to make it easy for the user to provide such feedback. A simple, intuitive user interface helps the user in this process. We experimentally demonstrate that μBE is efficient and finds highquality data integration solutions.

Original languageEnglish
Title of host publicationProceedings - International Conference on Data Engineering
Pages186-195
Number of pages10
DOIs
Publication statusPublished - 24 Sep 2007
Externally publishedYes
Event23rd International Conference on Data Engineering, ICDE 2007 - Istanbul, Turkey
Duration: 15 Apr 200720 Apr 2007

Other

Other23rd International Conference on Data Engineering, ICDE 2007
CountryTurkey
CityIstanbul
Period15/4/0720/4/07

Fingerprint

Data integration
Internet
User interfaces
Feedback

ASJC Scopus subject areas

  • Software
  • Engineering(all)
  • Engineering (miscellaneous)

Cite this

Aboulnaga, A., & El Gebaly, K. (2007). μBE: User guided source selection and schema mediation for internet scale data integration. In Proceedings - International Conference on Data Engineering (pp. 186-195). [4221667] https://doi.org/10.1109/ICDE.2007.367864

μBE : User guided source selection and schema mediation for internet scale data integration. / Aboulnaga, Ashraf; El Gebaly, Kareem.

Proceedings - International Conference on Data Engineering. 2007. p. 186-195 4221667.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Aboulnaga, A & El Gebaly, K 2007, μBE: User guided source selection and schema mediation for internet scale data integration. in Proceedings - International Conference on Data Engineering., 4221667, pp. 186-195, 23rd International Conference on Data Engineering, ICDE 2007, Istanbul, Turkey, 15/4/07. https://doi.org/10.1109/ICDE.2007.367864
Aboulnaga A, El Gebaly K. μBE: User guided source selection and schema mediation for internet scale data integration. In Proceedings - International Conference on Data Engineering. 2007. p. 186-195. 4221667 https://doi.org/10.1109/ICDE.2007.367864
Aboulnaga, Ashraf ; El Gebaly, Kareem. / μBE : User guided source selection and schema mediation for internet scale data integration. Proceedings - International Conference on Data Engineering. 2007. pp. 186-195
@inproceedings{abbe7cf5dfe540d194d11a70b6b9e76b,
title = "μBE: User guided source selection and schema mediation for internet scale data integration",
abstract = "The typical approach to data integration is to start by defining a common mediated schema, and then to map the data sources being integrated to this schema. In Internetscale data integration tasks, where there may be hundreds or thousands of data sources providing data of relevance to a particular domain, a better approach is to allow the user to discover the mediated schema and the set of sources to use through an iterative exploration of the space of possible schemas and sources. In this paper, we present μBE, a data integration tool that helps in this iterative exploratory process by automatically choosing the data sources to include in a data integration system and defining a mediated schema on these sources. The data integration system desired by the user may depend on several subjective and objective criteria, and the user guides μBE towards finding this system by iteratively solving a series of constrained non-linear optimization problems, and modifying the parameters and constraints of the problem in the next iteration based on the solution found in the current iteration. Our formulation of the optimization problem is designed to make it easy for the user to provide such feedback. A simple, intuitive user interface helps the user in this process. We experimentally demonstrate that μBE is efficient and finds highquality data integration solutions.",
author = "Ashraf Aboulnaga and {El Gebaly}, Kareem",
year = "2007",
month = "9",
day = "24",
doi = "10.1109/ICDE.2007.367864",
language = "English",
isbn = "1424408032",
pages = "186--195",
booktitle = "Proceedings - International Conference on Data Engineering",

}

TY - GEN

T1 - μBE

T2 - User guided source selection and schema mediation for internet scale data integration

AU - Aboulnaga, Ashraf

AU - El Gebaly, Kareem

PY - 2007/9/24

Y1 - 2007/9/24

N2 - The typical approach to data integration is to start by defining a common mediated schema, and then to map the data sources being integrated to this schema. In Internetscale data integration tasks, where there may be hundreds or thousands of data sources providing data of relevance to a particular domain, a better approach is to allow the user to discover the mediated schema and the set of sources to use through an iterative exploration of the space of possible schemas and sources. In this paper, we present μBE, a data integration tool that helps in this iterative exploratory process by automatically choosing the data sources to include in a data integration system and defining a mediated schema on these sources. The data integration system desired by the user may depend on several subjective and objective criteria, and the user guides μBE towards finding this system by iteratively solving a series of constrained non-linear optimization problems, and modifying the parameters and constraints of the problem in the next iteration based on the solution found in the current iteration. Our formulation of the optimization problem is designed to make it easy for the user to provide such feedback. A simple, intuitive user interface helps the user in this process. We experimentally demonstrate that μBE is efficient and finds highquality data integration solutions.

AB - The typical approach to data integration is to start by defining a common mediated schema, and then to map the data sources being integrated to this schema. In Internetscale data integration tasks, where there may be hundreds or thousands of data sources providing data of relevance to a particular domain, a better approach is to allow the user to discover the mediated schema and the set of sources to use through an iterative exploration of the space of possible schemas and sources. In this paper, we present μBE, a data integration tool that helps in this iterative exploratory process by automatically choosing the data sources to include in a data integration system and defining a mediated schema on these sources. The data integration system desired by the user may depend on several subjective and objective criteria, and the user guides μBE towards finding this system by iteratively solving a series of constrained non-linear optimization problems, and modifying the parameters and constraints of the problem in the next iteration based on the solution found in the current iteration. Our formulation of the optimization problem is designed to make it easy for the user to provide such feedback. A simple, intuitive user interface helps the user in this process. We experimentally demonstrate that μBE is efficient and finds highquality data integration solutions.

UR - http://www.scopus.com/inward/record.url?scp=34548731868&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34548731868&partnerID=8YFLogxK

U2 - 10.1109/ICDE.2007.367864

DO - 10.1109/ICDE.2007.367864

M3 - Conference contribution

SN - 1424408032

SN - 9781424408030

SP - 186

EP - 195

BT - Proceedings - International Conference on Data Engineering

ER -