Spatio-temporal histograms

Hicham G. Elmongui, Mohamed Mokbel, Walid G. Aref

Research output: Contribution to journalConference article

7 Citations (Scopus)

Abstract

This paper presents a framework for building and continuously maintaining spatio-temporal histograms (ST-Histograms, for short). ST-Histograms are used for selectivity estimation of continuous pipelined query operators. Unlike traditional histograms that examine and/or sample all incoming data tuples, ST-Histograms are built by monitoring the actual selectivities of the outstanding continuous queries. ST-Histograms have three main features: (1) The ST-Histograms are built with (almost) no overhead to the system. We use only feedback (i.e., the actual selectivity) from the existing continuous queries. (2) Rather than wasting system resources in maintaining accurate histograms for the whole spatial space, we only maintain accurate histograms for that part of the space that is relevant to the current existing queries. The rest of the space has less accurate histograms. (3) The ST-Histograms are equipped with a periodicity detection procedure that predicts the future execution of the continuous queries. Hence, the query processing engine can continuously adapt the continuous query pipeline to reflect this prediction. Experimental results based on a real implementation inside a data stream management system show a superior performance of ST-Histograms in terms of providing accurate operator selectivity estimations with no extra overhead.

Original languageEnglish
Pages (from-to)19-36
Number of pages18
JournalLecture Notes in Computer Science
Volume3633
Publication statusPublished - 18 Oct 2005
Externally publishedYes

Fingerprint

Histogram
Mathematical operators
Query processing
Continuous Queries
Pipelines
Selectivity
Engines
Feedback
Monitoring
Query Processing
Operator
Data Streams
Periodicity
Engine
Query
Predict
Resources
Prediction

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Elmongui, H. G., Mokbel, M., & Aref, W. G. (2005). Spatio-temporal histograms. Lecture Notes in Computer Science, 3633, 19-36.

Spatio-temporal histograms. / Elmongui, Hicham G.; Mokbel, Mohamed; Aref, Walid G.

In: Lecture Notes in Computer Science, Vol. 3633, 18.10.2005, p. 19-36.

Research output: Contribution to journalConference article

Elmongui, HG, Mokbel, M & Aref, WG 2005, 'Spatio-temporal histograms', Lecture Notes in Computer Science, vol. 3633, pp. 19-36.
Elmongui HG, Mokbel M, Aref WG. Spatio-temporal histograms. Lecture Notes in Computer Science. 2005 Oct 18;3633:19-36.
Elmongui, Hicham G. ; Mokbel, Mohamed ; Aref, Walid G. / Spatio-temporal histograms. In: Lecture Notes in Computer Science. 2005 ; Vol. 3633. pp. 19-36.
@article{b865287855074890a6f6c2bdbaf9a99b,
title = "Spatio-temporal histograms",
abstract = "This paper presents a framework for building and continuously maintaining spatio-temporal histograms (ST-Histograms, for short). ST-Histograms are used for selectivity estimation of continuous pipelined query operators. Unlike traditional histograms that examine and/or sample all incoming data tuples, ST-Histograms are built by monitoring the actual selectivities of the outstanding continuous queries. ST-Histograms have three main features: (1) The ST-Histograms are built with (almost) no overhead to the system. We use only feedback (i.e., the actual selectivity) from the existing continuous queries. (2) Rather than wasting system resources in maintaining accurate histograms for the whole spatial space, we only maintain accurate histograms for that part of the space that is relevant to the current existing queries. The rest of the space has less accurate histograms. (3) The ST-Histograms are equipped with a periodicity detection procedure that predicts the future execution of the continuous queries. Hence, the query processing engine can continuously adapt the continuous query pipeline to reflect this prediction. Experimental results based on a real implementation inside a data stream management system show a superior performance of ST-Histograms in terms of providing accurate operator selectivity estimations with no extra overhead.",
author = "Elmongui, {Hicham G.} and Mohamed Mokbel and Aref, {Walid G.}",
year = "2005",
month = "10",
day = "18",
language = "English",
volume = "3633",
pages = "19--36",
journal = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
issn = "0302-9743",
publisher = "Springer Verlag",

}

TY - JOUR

T1 - Spatio-temporal histograms

AU - Elmongui, Hicham G.

AU - Mokbel, Mohamed

AU - Aref, Walid G.

PY - 2005/10/18

Y1 - 2005/10/18

N2 - This paper presents a framework for building and continuously maintaining spatio-temporal histograms (ST-Histograms, for short). ST-Histograms are used for selectivity estimation of continuous pipelined query operators. Unlike traditional histograms that examine and/or sample all incoming data tuples, ST-Histograms are built by monitoring the actual selectivities of the outstanding continuous queries. ST-Histograms have three main features: (1) The ST-Histograms are built with (almost) no overhead to the system. We use only feedback (i.e., the actual selectivity) from the existing continuous queries. (2) Rather than wasting system resources in maintaining accurate histograms for the whole spatial space, we only maintain accurate histograms for that part of the space that is relevant to the current existing queries. The rest of the space has less accurate histograms. (3) The ST-Histograms are equipped with a periodicity detection procedure that predicts the future execution of the continuous queries. Hence, the query processing engine can continuously adapt the continuous query pipeline to reflect this prediction. Experimental results based on a real implementation inside a data stream management system show a superior performance of ST-Histograms in terms of providing accurate operator selectivity estimations with no extra overhead.

AB - This paper presents a framework for building and continuously maintaining spatio-temporal histograms (ST-Histograms, for short). ST-Histograms are used for selectivity estimation of continuous pipelined query operators. Unlike traditional histograms that examine and/or sample all incoming data tuples, ST-Histograms are built by monitoring the actual selectivities of the outstanding continuous queries. ST-Histograms have three main features: (1) The ST-Histograms are built with (almost) no overhead to the system. We use only feedback (i.e., the actual selectivity) from the existing continuous queries. (2) Rather than wasting system resources in maintaining accurate histograms for the whole spatial space, we only maintain accurate histograms for that part of the space that is relevant to the current existing queries. The rest of the space has less accurate histograms. (3) The ST-Histograms are equipped with a periodicity detection procedure that predicts the future execution of the continuous queries. Hence, the query processing engine can continuously adapt the continuous query pipeline to reflect this prediction. Experimental results based on a real implementation inside a data stream management system show a superior performance of ST-Histograms in terms of providing accurate operator selectivity estimations with no extra overhead.

UR - http://www.scopus.com/inward/record.url?scp=26444549007&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=26444549007&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:26444549007

VL - 3633

SP - 19

EP - 36

JO - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

JF - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SN - 0302-9743

ER -