UFeed: Refining web data integration based on user feedback

Ahmed El-Roby, Ashraf Aboulnaga

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

One of the main challenges in large-scale data integration for relational schemas is creating an accurate mediated schema, and generating accurate semantic mappings between heterogeneous data sources and this mediated schema. Some applications can start with a moderately accurate mediated schema and mappings and reffne them over time, which is referred to as the pay-as-you-go approach to data integration. Creating the mediated schema and mappings automatically to bootstrap the pay-as-you-go approach has been extensively studied. However, reffning the mediated schema and mappings is still an open challenge because the data sources are usually heterogeneous and use diverse and sometimes ambiguous vocabularies. In this paper, we introduce UFeed, a system that re- ffnes relational mediated schemas and mappings based on user feedback over query answers. UFeed translates user actions into reffnement operations that are applied to the mediated schema and mappings to improve their quality. We experimentally verify that UFeed improves the quality of query answers over real heterogeneous data sources extracted from the web.

Original languageEnglish
Title of host publicationCIKM 2017 - Proceedings of the 2017 ACM Conference on Information and Knowledge Management
PublisherAssociation for Computing Machinery
Pages187-196
Number of pages10
VolumePart F131841
ISBN (Electronic)9781450349185
DOIs
Publication statusPublished - 6 Nov 2017
Event26th ACM International Conference on Information and Knowledge Management, CIKM 2017 - Singapore, Singapore
Duration: 6 Nov 201710 Nov 2017

Other

Other26th ACM International Conference on Information and Knowledge Management, CIKM 2017
CountrySingapore
CitySingapore
Period6/11/1710/11/17

Fingerprint

World Wide Web
Data integration
Data sources
Query
Pay-as-you-go tax
Bootstrap

ASJC Scopus subject areas

  • Business, Management and Accounting(all)
  • Decision Sciences(all)

Cite this

El-Roby, A., & Aboulnaga, A. (2017). UFeed: Refining web data integration based on user feedback. In CIKM 2017 - Proceedings of the 2017 ACM Conference on Information and Knowledge Management (Vol. Part F131841, pp. 187-196). Association for Computing Machinery. https://doi.org/10.1145/3132847.3132887

UFeed : Refining web data integration based on user feedback. / El-Roby, Ahmed; Aboulnaga, Ashraf.

CIKM 2017 - Proceedings of the 2017 ACM Conference on Information and Knowledge Management. Vol. Part F131841 Association for Computing Machinery, 2017. p. 187-196.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

El-Roby, A & Aboulnaga, A 2017, UFeed: Refining web data integration based on user feedback. in CIKM 2017 - Proceedings of the 2017 ACM Conference on Information and Knowledge Management. vol. Part F131841, Association for Computing Machinery, pp. 187-196, 26th ACM International Conference on Information and Knowledge Management, CIKM 2017, Singapore, Singapore, 6/11/17. https://doi.org/10.1145/3132847.3132887
El-Roby A, Aboulnaga A. UFeed: Refining web data integration based on user feedback. In CIKM 2017 - Proceedings of the 2017 ACM Conference on Information and Knowledge Management. Vol. Part F131841. Association for Computing Machinery. 2017. p. 187-196 https://doi.org/10.1145/3132847.3132887
El-Roby, Ahmed ; Aboulnaga, Ashraf. / UFeed : Refining web data integration based on user feedback. CIKM 2017 - Proceedings of the 2017 ACM Conference on Information and Knowledge Management. Vol. Part F131841 Association for Computing Machinery, 2017. pp. 187-196
@inproceedings{2d973c6c4e5e446cb768eecac8ee7473,
title = "UFeed: Refining web data integration based on user feedback",
abstract = "One of the main challenges in large-scale data integration for relational schemas is creating an accurate mediated schema, and generating accurate semantic mappings between heterogeneous data sources and this mediated schema. Some applications can start with a moderately accurate mediated schema and mappings and reffne them over time, which is referred to as the pay-as-you-go approach to data integration. Creating the mediated schema and mappings automatically to bootstrap the pay-as-you-go approach has been extensively studied. However, reffning the mediated schema and mappings is still an open challenge because the data sources are usually heterogeneous and use diverse and sometimes ambiguous vocabularies. In this paper, we introduce UFeed, a system that re- ffnes relational mediated schemas and mappings based on user feedback over query answers. UFeed translates user actions into reffnement operations that are applied to the mediated schema and mappings to improve their quality. We experimentally verify that UFeed improves the quality of query answers over real heterogeneous data sources extracted from the web.",
author = "Ahmed El-Roby and Ashraf Aboulnaga",
year = "2017",
month = "11",
day = "6",
doi = "10.1145/3132847.3132887",
language = "English",
volume = "Part F131841",
pages = "187--196",
booktitle = "CIKM 2017 - Proceedings of the 2017 ACM Conference on Information and Knowledge Management",
publisher = "Association for Computing Machinery",

}

TY - GEN

T1 - UFeed

T2 - Refining web data integration based on user feedback

AU - El-Roby, Ahmed

AU - Aboulnaga, Ashraf

PY - 2017/11/6

Y1 - 2017/11/6

N2 - One of the main challenges in large-scale data integration for relational schemas is creating an accurate mediated schema, and generating accurate semantic mappings between heterogeneous data sources and this mediated schema. Some applications can start with a moderately accurate mediated schema and mappings and reffne them over time, which is referred to as the pay-as-you-go approach to data integration. Creating the mediated schema and mappings automatically to bootstrap the pay-as-you-go approach has been extensively studied. However, reffning the mediated schema and mappings is still an open challenge because the data sources are usually heterogeneous and use diverse and sometimes ambiguous vocabularies. In this paper, we introduce UFeed, a system that re- ffnes relational mediated schemas and mappings based on user feedback over query answers. UFeed translates user actions into reffnement operations that are applied to the mediated schema and mappings to improve their quality. We experimentally verify that UFeed improves the quality of query answers over real heterogeneous data sources extracted from the web.

AB - One of the main challenges in large-scale data integration for relational schemas is creating an accurate mediated schema, and generating accurate semantic mappings between heterogeneous data sources and this mediated schema. Some applications can start with a moderately accurate mediated schema and mappings and reffne them over time, which is referred to as the pay-as-you-go approach to data integration. Creating the mediated schema and mappings automatically to bootstrap the pay-as-you-go approach has been extensively studied. However, reffning the mediated schema and mappings is still an open challenge because the data sources are usually heterogeneous and use diverse and sometimes ambiguous vocabularies. In this paper, we introduce UFeed, a system that re- ffnes relational mediated schemas and mappings based on user feedback over query answers. UFeed translates user actions into reffnement operations that are applied to the mediated schema and mappings to improve their quality. We experimentally verify that UFeed improves the quality of query answers over real heterogeneous data sources extracted from the web.

UR - http://www.scopus.com/inward/record.url?scp=85037369302&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85037369302&partnerID=8YFLogxK

U2 - 10.1145/3132847.3132887

DO - 10.1145/3132847.3132887

M3 - Conference contribution

AN - SCOPUS:85037369302

VL - Part F131841

SP - 187

EP - 196

BT - CIKM 2017 - Proceedings of the 2017 ACM Conference on Information and Knowledge Management

PB - Association for Computing Machinery

ER -