Duplicate elimination in space-partitioning tree indexes

M. Y. Eltabakh, Mourad Ouzzani, Walid G. Aref

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Space-partitioning trees, like the disk-based trie, quadtree, kd-tree and their variants, are a family of access methods that index multi-dimensional objects. In the case of indexing non-zero extent objects, e.g., line segments and rectangles, space-partitioning trees may replicate objects over multiple space partitions, e.g., PMR quadtree, expanded MX-CIF quadtree, and extended kd-tree. As a result, the answer to a query over these indexes may include duplicates that need to be eliminated, i.e., the same object may be reported more than once. In this paper, we propose generic duplicate elimination techniques for the class of space-partitioning trees in the context of SP-GiST; an extensible indexing framework for realizing space-partitioning trees. The proposed techniques are embedded inside the INDEX-SCAN operator. Therefore, duplicate copies of the same object do not propagate in the query plan, and the elimination process is transparent to the end-users. Two cases for the index structures are considered based on whether or not the objects' coordinates are stored inside the index tree. The theoretical and experimental analysis illustrate that the proposed techniques achieve savings in the storage requirements, I/O operations, and processing time when compared to adding a separate duplicate elimination operator in the query plan.

Original languageEnglish
Title of host publication19th International Conference on Scientific and Statistical Database Management, SSDBM 2007
DOIs
Publication statusPublished - 1 Dec 2007
Event19th International Conference on Scientific and Statistical Database Management, SSDBM 2007 - Banff, AB, Canada
Duration: 9 Jul 200711 Jul 2007

Publication series

NameProceedings of the International Conference on Scientific and Statistical Database Management, SSDBM
ISSN (Print)1099-3371

Other

Other19th International Conference on Scientific and Statistical Database Management, SSDBM 2007
CountryCanada
CityBanff, AB
Period9/7/0711/7/07

    Fingerprint

ASJC Scopus subject areas

  • Software
  • Applied Mathematics

Cite this

Eltabakh, M. Y., Ouzzani, M., & Aref, W. G. (2007). Duplicate elimination in space-partitioning tree indexes. In 19th International Conference on Scientific and Statistical Database Management, SSDBM 2007 [4274963] (Proceedings of the International Conference on Scientific and Statistical Database Management, SSDBM). https://doi.org/10.1109/SSDBM.2007.10