Query processing and optimization on the web

Mourad Ouzzani, Athman Bouguettaya

Research output: Contribution to journalArticle

22 Citations (Scopus)

Abstract

The advent of the Internet and the Web and their subsequent ubiquity have brought forth opportunities to connect information sources across all types of boundaries (local, regional, organizational, etc.). Examples of such information sources include databases, XML documents, and other unstructured sources. Uniformly querying those information sources has been extensively investigated. A major challenge relates to query optimization. Indeed, querying multiple information sources scattered on the Web raises several barriers for achieving efficiency. This is due to the characteristics of Web information sources that include volatility, heterogeneity, and autonomy. Those characteristics impede a straightforward application of classical query optimization techniques. They add new dimensions to the optimization problem such as the choice of objective function, selection of relevant information sources, limited query capabilities, and unpredictable events. In this paper, we survey the current research on fundamental problems to efficiently process queries over Web data integration systems. We also outline a classification for optimization techniques and a framework for evaluating them.

Original languageEnglish
Pages (from-to)187-218
Number of pages32
JournalDistributed and Parallel Databases
Volume15
Issue number3
DOIs
Publication statusPublished - 1 May 2004
Externally publishedYes

Fingerprint

Query Optimization
Query processing
Query Processing
Data integration
Optimization Techniques
XML
Query
XML Database
Internet
Data Integration
Query optimization
Information sources
World Wide Web
Volatility
Objective function
Optimization Problem

Keywords

  • Data integration
  • Databases
  • Mediators
  • Query optimization
  • Web

ASJC Scopus subject areas

  • Information Systems
  • Theoretical Computer Science
  • Computational Theory and Mathematics

Cite this

Query processing and optimization on the web. / Ouzzani, Mourad; Bouguettaya, Athman.

In: Distributed and Parallel Databases, Vol. 15, No. 3, 01.05.2004, p. 187-218.

Research output: Contribution to journalArticle

Ouzzani, Mourad ; Bouguettaya, Athman. / Query processing and optimization on the web. In: Distributed and Parallel Databases. 2004 ; Vol. 15, No. 3. pp. 187-218.
@article{b471105f347d4dbb821f2cddf69b316d,
title = "Query processing and optimization on the web",
abstract = "The advent of the Internet and the Web and their subsequent ubiquity have brought forth opportunities to connect information sources across all types of boundaries (local, regional, organizational, etc.). Examples of such information sources include databases, XML documents, and other unstructured sources. Uniformly querying those information sources has been extensively investigated. A major challenge relates to query optimization. Indeed, querying multiple information sources scattered on the Web raises several barriers for achieving efficiency. This is due to the characteristics of Web information sources that include volatility, heterogeneity, and autonomy. Those characteristics impede a straightforward application of classical query optimization techniques. They add new dimensions to the optimization problem such as the choice of objective function, selection of relevant information sources, limited query capabilities, and unpredictable events. In this paper, we survey the current research on fundamental problems to efficiently process queries over Web data integration systems. We also outline a classification for optimization techniques and a framework for evaluating them.",
keywords = "Data integration, Databases, Mediators, Query optimization, Web",
author = "Mourad Ouzzani and Athman Bouguettaya",
year = "2004",
month = "5",
day = "1",
doi = "10.1023/B:DAPD.0000018574.71588.06",
language = "English",
volume = "15",
pages = "187--218",
journal = "Distributed and Parallel Databases",
issn = "0926-8782",
publisher = "Springer Netherlands",
number = "3",

}

TY - JOUR

T1 - Query processing and optimization on the web

AU - Ouzzani, Mourad

AU - Bouguettaya, Athman

PY - 2004/5/1

Y1 - 2004/5/1

N2 - The advent of the Internet and the Web and their subsequent ubiquity have brought forth opportunities to connect information sources across all types of boundaries (local, regional, organizational, etc.). Examples of such information sources include databases, XML documents, and other unstructured sources. Uniformly querying those information sources has been extensively investigated. A major challenge relates to query optimization. Indeed, querying multiple information sources scattered on the Web raises several barriers for achieving efficiency. This is due to the characteristics of Web information sources that include volatility, heterogeneity, and autonomy. Those characteristics impede a straightforward application of classical query optimization techniques. They add new dimensions to the optimization problem such as the choice of objective function, selection of relevant information sources, limited query capabilities, and unpredictable events. In this paper, we survey the current research on fundamental problems to efficiently process queries over Web data integration systems. We also outline a classification for optimization techniques and a framework for evaluating them.

AB - The advent of the Internet and the Web and their subsequent ubiquity have brought forth opportunities to connect information sources across all types of boundaries (local, regional, organizational, etc.). Examples of such information sources include databases, XML documents, and other unstructured sources. Uniformly querying those information sources has been extensively investigated. A major challenge relates to query optimization. Indeed, querying multiple information sources scattered on the Web raises several barriers for achieving efficiency. This is due to the characteristics of Web information sources that include volatility, heterogeneity, and autonomy. Those characteristics impede a straightforward application of classical query optimization techniques. They add new dimensions to the optimization problem such as the choice of objective function, selection of relevant information sources, limited query capabilities, and unpredictable events. In this paper, we survey the current research on fundamental problems to efficiently process queries over Web data integration systems. We also outline a classification for optimization techniques and a framework for evaluating them.

KW - Data integration

KW - Databases

KW - Mediators

KW - Query optimization

KW - Web

UR - http://www.scopus.com/inward/record.url?scp=1942518311&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=1942518311&partnerID=8YFLogxK

U2 - 10.1023/B:DAPD.0000018574.71588.06

DO - 10.1023/B:DAPD.0000018574.71588.06

M3 - Article

AN - SCOPUS:1942518311

VL - 15

SP - 187

EP - 218

JO - Distributed and Parallel Databases

JF - Distributed and Parallel Databases

SN - 0926-8782

IS - 3

ER -