Accurate estimation of the cost of spatial selections

Ashraf Aboulnaga, Jeffrey F. Naughton

Research output: Chapter in Book/Report/Conference proceedingChapter

45 Citations (Scopus)

Abstract

Optimizing queries that involve operations on spatial data requires estimating the selectivity and cost of these operations. In this paper, we focus on estimating the cost of spatial selections, or window queries, where the query windows and data objects are general polygons. Cost estimation techniques previously proposed in the literature only handle rectangular query windows over rectangular data objects, thus ignoring the very significant cost of exact geometry, comparison (the refinement step in a `filter and refine' query processing strategy). The cost of the exact geometry comparison depends on the selectivity of the filtering step and the average number of vertices in the candidate objects identified by this step. In this paper, we introduce a new type of histogram for spatial data that captures the complexity and size of the spatial objects as well as their location. Capturing these attributes makes this type of histogram useful for accurate estimation, as we experimentally demonstrate. We also investigate sampling-based estimation approaches. Sampling can yield better selectivity estimates than histograms for polygon data, but at the high cost of performing exact geometry comparisons for all the sampled objects.

Original languageEnglish
Title of host publicationProceedings - International Conference on Data Engineering
Place of PublicationLos Alamitos, CA, United States
PublisherIEEE
Pages123-134
Number of pages12
Publication statusPublished - 1 Jan 2000
Externally publishedYes
Event2000 IEEE 16th International Conference on Data Engineering (ICDE'00) - San Diego, CA, USA
Duration: 29 Feb 20003 Mar 2000

Other

Other2000 IEEE 16th International Conference on Data Engineering (ICDE'00)
CitySan Diego, CA, USA
Period29/2/003/3/00

Fingerprint

Costs
Geometry
Sampling
Query processing
Data acquisition

ASJC Scopus subject areas

  • Software
  • Engineering(all)
  • Engineering (miscellaneous)

Cite this

Aboulnaga, A., & Naughton, J. F. (2000). Accurate estimation of the cost of spatial selections. In Proceedings - International Conference on Data Engineering (pp. 123-134). Los Alamitos, CA, United States: IEEE.

Accurate estimation of the cost of spatial selections. / Aboulnaga, Ashraf; Naughton, Jeffrey F.

Proceedings - International Conference on Data Engineering. Los Alamitos, CA, United States : IEEE, 2000. p. 123-134.

Research output: Chapter in Book/Report/Conference proceedingChapter

Aboulnaga, A & Naughton, JF 2000, Accurate estimation of the cost of spatial selections. in Proceedings - International Conference on Data Engineering. IEEE, Los Alamitos, CA, United States, pp. 123-134, 2000 IEEE 16th International Conference on Data Engineering (ICDE'00), San Diego, CA, USA, 29/2/00.
Aboulnaga A, Naughton JF. Accurate estimation of the cost of spatial selections. In Proceedings - International Conference on Data Engineering. Los Alamitos, CA, United States: IEEE. 2000. p. 123-134
Aboulnaga, Ashraf ; Naughton, Jeffrey F. / Accurate estimation of the cost of spatial selections. Proceedings - International Conference on Data Engineering. Los Alamitos, CA, United States : IEEE, 2000. pp. 123-134
@inbook{c2c5dd90b0f44c4f9a5298939704c46a,
title = "Accurate estimation of the cost of spatial selections",
abstract = "Optimizing queries that involve operations on spatial data requires estimating the selectivity and cost of these operations. In this paper, we focus on estimating the cost of spatial selections, or window queries, where the query windows and data objects are general polygons. Cost estimation techniques previously proposed in the literature only handle rectangular query windows over rectangular data objects, thus ignoring the very significant cost of exact geometry, comparison (the refinement step in a `filter and refine' query processing strategy). The cost of the exact geometry comparison depends on the selectivity of the filtering step and the average number of vertices in the candidate objects identified by this step. In this paper, we introduce a new type of histogram for spatial data that captures the complexity and size of the spatial objects as well as their location. Capturing these attributes makes this type of histogram useful for accurate estimation, as we experimentally demonstrate. We also investigate sampling-based estimation approaches. Sampling can yield better selectivity estimates than histograms for polygon data, but at the high cost of performing exact geometry comparisons for all the sampled objects.",
author = "Ashraf Aboulnaga and Naughton, {Jeffrey F.}",
year = "2000",
month = "1",
day = "1",
language = "English",
pages = "123--134",
booktitle = "Proceedings - International Conference on Data Engineering",
publisher = "IEEE",

}

TY - CHAP

T1 - Accurate estimation of the cost of spatial selections

AU - Aboulnaga, Ashraf

AU - Naughton, Jeffrey F.

PY - 2000/1/1

Y1 - 2000/1/1

N2 - Optimizing queries that involve operations on spatial data requires estimating the selectivity and cost of these operations. In this paper, we focus on estimating the cost of spatial selections, or window queries, where the query windows and data objects are general polygons. Cost estimation techniques previously proposed in the literature only handle rectangular query windows over rectangular data objects, thus ignoring the very significant cost of exact geometry, comparison (the refinement step in a `filter and refine' query processing strategy). The cost of the exact geometry comparison depends on the selectivity of the filtering step and the average number of vertices in the candidate objects identified by this step. In this paper, we introduce a new type of histogram for spatial data that captures the complexity and size of the spatial objects as well as their location. Capturing these attributes makes this type of histogram useful for accurate estimation, as we experimentally demonstrate. We also investigate sampling-based estimation approaches. Sampling can yield better selectivity estimates than histograms for polygon data, but at the high cost of performing exact geometry comparisons for all the sampled objects.

AB - Optimizing queries that involve operations on spatial data requires estimating the selectivity and cost of these operations. In this paper, we focus on estimating the cost of spatial selections, or window queries, where the query windows and data objects are general polygons. Cost estimation techniques previously proposed in the literature only handle rectangular query windows over rectangular data objects, thus ignoring the very significant cost of exact geometry, comparison (the refinement step in a `filter and refine' query processing strategy). The cost of the exact geometry comparison depends on the selectivity of the filtering step and the average number of vertices in the candidate objects identified by this step. In this paper, we introduce a new type of histogram for spatial data that captures the complexity and size of the spatial objects as well as their location. Capturing these attributes makes this type of histogram useful for accurate estimation, as we experimentally demonstrate. We also investigate sampling-based estimation approaches. Sampling can yield better selectivity estimates than histograms for polygon data, but at the high cost of performing exact geometry comparisons for all the sampled objects.

UR - http://www.scopus.com/inward/record.url?scp=0033877659&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0033877659&partnerID=8YFLogxK

M3 - Chapter

AN - SCOPUS:0033877659

SP - 123

EP - 134

BT - Proceedings - International Conference on Data Engineering

PB - IEEE

CY - Los Alamitos, CA, United States

ER -