A demonstration of spatialhadoop

An efficient mapreduce framework for spatial data

Ahmed Eldawy, Mohamed Mokbel

Research output: Contribution to journalArticle

142 Citations (Scopus)

Abstract

This demo presents SpatialHadoop as the first full-fledged MapReduce framework with native support for spatial data. Spatial- Hadoop is a comprehensive extension to Hadoop that pushes spatial data inside the core functionality of Hadoop. SpatialHadoop runs existing Hadoop programs as is, yet, it achieves order(s) of magnitude better performance than Hadoop when dealing with spatial data. SpatialHadoop employs a simple spatial high level language, a two-level spatial index structure, basic spatial components built inside the MapReduce layer, and three basic spatial operations: range queries, k-NN queries, and spatial join. Other spatial operations can be similarly deployed in SpatialHadoop. We demonstrate a real system prototype of SpatialHadoop running on an Amazon EC2 cluster against two sets of real spatial data obtained from Tiger Files and OpenStreetMap with sizes 60GB and 300GB, respectively.

Original languageEnglish
Pages (from-to)1230-1233
Number of pages4
JournalProceedings of the VLDB Endowment
Volume6
Issue number12
DOIs
Publication statusPublished - 1 Jan 2013
Externally publishedYes

Fingerprint

High level languages
Demonstrations

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Computer Science(all)

Cite this

A demonstration of spatialhadoop : An efficient mapreduce framework for spatial data. / Eldawy, Ahmed; Mokbel, Mohamed.

In: Proceedings of the VLDB Endowment, Vol. 6, No. 12, 01.01.2013, p. 1230-1233.

Research output: Contribution to journalArticle

@article{d265d0782e504ceabc7b98e7d9192ff7,
title = "A demonstration of spatialhadoop: An efficient mapreduce framework for spatial data",
abstract = "This demo presents SpatialHadoop as the first full-fledged MapReduce framework with native support for spatial data. Spatial- Hadoop is a comprehensive extension to Hadoop that pushes spatial data inside the core functionality of Hadoop. SpatialHadoop runs existing Hadoop programs as is, yet, it achieves order(s) of magnitude better performance than Hadoop when dealing with spatial data. SpatialHadoop employs a simple spatial high level language, a two-level spatial index structure, basic spatial components built inside the MapReduce layer, and three basic spatial operations: range queries, k-NN queries, and spatial join. Other spatial operations can be similarly deployed in SpatialHadoop. We demonstrate a real system prototype of SpatialHadoop running on an Amazon EC2 cluster against two sets of real spatial data obtained from Tiger Files and OpenStreetMap with sizes 60GB and 300GB, respectively.",
author = "Ahmed Eldawy and Mohamed Mokbel",
year = "2013",
month = "1",
day = "1",
doi = "10.14778/2536274.2536283",
language = "English",
volume = "6",
pages = "1230--1233",
journal = "Proceedings of the VLDB Endowment",
issn = "2150-8097",
publisher = "Very Large Data Base Endowment Inc.",
number = "12",

}

TY - JOUR

T1 - A demonstration of spatialhadoop

T2 - An efficient mapreduce framework for spatial data

AU - Eldawy, Ahmed

AU - Mokbel, Mohamed

PY - 2013/1/1

Y1 - 2013/1/1

N2 - This demo presents SpatialHadoop as the first full-fledged MapReduce framework with native support for spatial data. Spatial- Hadoop is a comprehensive extension to Hadoop that pushes spatial data inside the core functionality of Hadoop. SpatialHadoop runs existing Hadoop programs as is, yet, it achieves order(s) of magnitude better performance than Hadoop when dealing with spatial data. SpatialHadoop employs a simple spatial high level language, a two-level spatial index structure, basic spatial components built inside the MapReduce layer, and three basic spatial operations: range queries, k-NN queries, and spatial join. Other spatial operations can be similarly deployed in SpatialHadoop. We demonstrate a real system prototype of SpatialHadoop running on an Amazon EC2 cluster against two sets of real spatial data obtained from Tiger Files and OpenStreetMap with sizes 60GB and 300GB, respectively.

AB - This demo presents SpatialHadoop as the first full-fledged MapReduce framework with native support for spatial data. Spatial- Hadoop is a comprehensive extension to Hadoop that pushes spatial data inside the core functionality of Hadoop. SpatialHadoop runs existing Hadoop programs as is, yet, it achieves order(s) of magnitude better performance than Hadoop when dealing with spatial data. SpatialHadoop employs a simple spatial high level language, a two-level spatial index structure, basic spatial components built inside the MapReduce layer, and three basic spatial operations: range queries, k-NN queries, and spatial join. Other spatial operations can be similarly deployed in SpatialHadoop. We demonstrate a real system prototype of SpatialHadoop running on an Amazon EC2 cluster against two sets of real spatial data obtained from Tiger Files and OpenStreetMap with sizes 60GB and 300GB, respectively.

UR - http://www.scopus.com/inward/record.url?scp=84891071699&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84891071699&partnerID=8YFLogxK

U2 - 10.14778/2536274.2536283

DO - 10.14778/2536274.2536283

M3 - Article

VL - 6

SP - 1230

EP - 1233

JO - Proceedings of the VLDB Endowment

JF - Proceedings of the VLDB Endowment

SN - 2150-8097

IS - 12

ER -