Optimal data-space partitioning of spatial data for parallel I/O

Hakan Ferhatosmanoǧlu, Divyakant Agrawal, Ömer Eǧecioǧlu, Amr El Abbadi

Research output: Contribution to journalArticle

4 Citations (Scopus)

Abstract

It is desirable to design partitioning methods that minimize the I/O time incurred during query execution in spatial databases. This paper explores optimal partitioning for two-dimensional data for a class of queries and develops multi-disk allocation techniques that maximize the degree of I/O parallelism obtained in each case. We show that hexagonal partitioning has optimal I/O performance for circular queries among all partitioning methods that use convex non-overlapping regions. An analysis and extension of this result to all possible partitioning techniques is also given. For rectangular queries, we show that hexagonal partitioning has overall better I/O performance for a general class of range queries, except for rectilinear queries, in which case rectangular grid partitioning is superior. By using current algorithms for rectangular grid partitioning, parallel storage and retrieval algorithms for hexagonal partitioning can be constructed. Some of these results carry over to circular partitioning of the data-which is an example of a non-convex region.

Original languageEnglish
Pages (from-to)75-101
Number of pages27
JournalDistributed and Parallel Databases
Volume17
Issue number1
DOIs
Publication statusPublished - 1 Jan 2005
Externally publishedYes

    Fingerprint

Keywords

  • Data-space partitioning
  • Disk and page allocation
  • Parallel I/O
  • Range query
  • Two-dimensional data

ASJC Scopus subject areas

  • Information Systems
  • Theoretical Computer Science
  • Computational Theory and Mathematics

Cite this