XSEED: Accurate and fast cardinality estimation for XPath queries

Ning Zhang, M. Tamer Özsu, Ashraf Aboulnaga, Ihab F. Ilyas

Research output: Chapter in Book/Report/Conference proceedingConference contribution

42 Citations (Scopus)

Abstract

We propose XSEED, a synopsis of path queries for cardinality estimation that is accurate, robust, efficient, and adaptive to memory budgets. XSEED starts from a very small kernel, and then incrementally updates information of the synopsis. With such an incremental construction, a synopsis structure can be dynamically configured to accommodate different memory budgets. Cardinality estimation based on XSEED can be performed very efficiently and accurately. Extensive experiments on both synthetic and real data sets show that even with less memory, XSEED could achieve accuracy that is an order of magnitude better than that of other synopsis structures. The cardinality estimation time is under 2% of the actual querying time for a wide range of queries in all test cases.

Original languageEnglish
Title of host publicationProceedings - International Conference on Data Engineering
Volume2006
DOIs
Publication statusPublished - 17 Oct 2006
Externally publishedYes
Event22nd International Conference on Data Engineering, ICDE '06 - Atlanta, GA, United States
Duration: 3 Apr 20067 Apr 2006

Other

Other22nd International Conference on Data Engineering, ICDE '06
CountryUnited States
CityAtlanta, GA
Period3/4/067/4/06

Fingerprint

Data storage equipment
Experiments

ASJC Scopus subject areas

  • Software
  • Engineering(all)
  • Engineering (miscellaneous)

Cite this

Zhang, N., Özsu, M. T., Aboulnaga, A., & Ilyas, I. F. (2006). XSEED: Accurate and fast cardinality estimation for XPath queries. In Proceedings - International Conference on Data Engineering (Vol. 2006). [1617429] https://doi.org/10.1109/ICDE.2006.178

XSEED : Accurate and fast cardinality estimation for XPath queries. / Zhang, Ning; Özsu, M. Tamer; Aboulnaga, Ashraf; Ilyas, Ihab F.

Proceedings - International Conference on Data Engineering. Vol. 2006 2006. 1617429.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Zhang, N, Özsu, MT, Aboulnaga, A & Ilyas, IF 2006, XSEED: Accurate and fast cardinality estimation for XPath queries. in Proceedings - International Conference on Data Engineering. vol. 2006, 1617429, 22nd International Conference on Data Engineering, ICDE '06, Atlanta, GA, United States, 3/4/06. https://doi.org/10.1109/ICDE.2006.178
Zhang N, Özsu MT, Aboulnaga A, Ilyas IF. XSEED: Accurate and fast cardinality estimation for XPath queries. In Proceedings - International Conference on Data Engineering. Vol. 2006. 2006. 1617429 https://doi.org/10.1109/ICDE.2006.178
Zhang, Ning ; Özsu, M. Tamer ; Aboulnaga, Ashraf ; Ilyas, Ihab F. / XSEED : Accurate and fast cardinality estimation for XPath queries. Proceedings - International Conference on Data Engineering. Vol. 2006 2006.
@inproceedings{8237909ff0ed4bc293e3f59caa45a176,
title = "XSEED: Accurate and fast cardinality estimation for XPath queries",
abstract = "We propose XSEED, a synopsis of path queries for cardinality estimation that is accurate, robust, efficient, and adaptive to memory budgets. XSEED starts from a very small kernel, and then incrementally updates information of the synopsis. With such an incremental construction, a synopsis structure can be dynamically configured to accommodate different memory budgets. Cardinality estimation based on XSEED can be performed very efficiently and accurately. Extensive experiments on both synthetic and real data sets show that even with less memory, XSEED could achieve accuracy that is an order of magnitude better than that of other synopsis structures. The cardinality estimation time is under 2{\%} of the actual querying time for a wide range of queries in all test cases.",
author = "Ning Zhang and {\"O}zsu, {M. Tamer} and Ashraf Aboulnaga and Ilyas, {Ihab F.}",
year = "2006",
month = "10",
day = "17",
doi = "10.1109/ICDE.2006.178",
language = "English",
isbn = "0769525709",
volume = "2006",
booktitle = "Proceedings - International Conference on Data Engineering",

}

TY - GEN

T1 - XSEED

T2 - Accurate and fast cardinality estimation for XPath queries

AU - Zhang, Ning

AU - Özsu, M. Tamer

AU - Aboulnaga, Ashraf

AU - Ilyas, Ihab F.

PY - 2006/10/17

Y1 - 2006/10/17

N2 - We propose XSEED, a synopsis of path queries for cardinality estimation that is accurate, robust, efficient, and adaptive to memory budgets. XSEED starts from a very small kernel, and then incrementally updates information of the synopsis. With such an incremental construction, a synopsis structure can be dynamically configured to accommodate different memory budgets. Cardinality estimation based on XSEED can be performed very efficiently and accurately. Extensive experiments on both synthetic and real data sets show that even with less memory, XSEED could achieve accuracy that is an order of magnitude better than that of other synopsis structures. The cardinality estimation time is under 2% of the actual querying time for a wide range of queries in all test cases.

AB - We propose XSEED, a synopsis of path queries for cardinality estimation that is accurate, robust, efficient, and adaptive to memory budgets. XSEED starts from a very small kernel, and then incrementally updates information of the synopsis. With such an incremental construction, a synopsis structure can be dynamically configured to accommodate different memory budgets. Cardinality estimation based on XSEED can be performed very efficiently and accurately. Extensive experiments on both synthetic and real data sets show that even with less memory, XSEED could achieve accuracy that is an order of magnitude better than that of other synopsis structures. The cardinality estimation time is under 2% of the actual querying time for a wide range of queries in all test cases.

UR - http://www.scopus.com/inward/record.url?scp=33749635065&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33749635065&partnerID=8YFLogxK

U2 - 10.1109/ICDE.2006.178

DO - 10.1109/ICDE.2006.178

M3 - Conference contribution

AN - SCOPUS:33749635065

SN - 0769525709

SN - 9780769525709

VL - 2006

BT - Proceedings - International Conference on Data Engineering

ER -