GeoTrend

Spatial trending queries on real-time Microblogs

Amr Magdy, Ahmed M. Aly, Mohamed Mokbel, Sameh Elnikety, Yuxiong He, Suman Nath, Walid G. Aref

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

This paper presents GeoTrend; a system for scalable support of spatial trend discovery on recent microblogs, e.g., tweets and online reviews, that come in real time. GeoTrend is distinguished from existing techniques in three aspects: (1) It discovers trends in arbitrary spatial regions, e.g., city blocks. (2) It supports trending measures that effectively capture trending items under a variety of definitions that suit different applications. (3) It promotes recent microblogs as first-class citizens and optimizes its system components to digest a continuous flow of fast data in main-memory while removing old data efficiently. GeoTrend queries are top-k queries that discover the most trending k keywords that are posted within an arbitrary spatial region and during the last T time units. To support its queries efficiently, GeoTrend employs an in-memory spatial index that is able to efficiently digest incoming data and expire data that is beyond the last T time units. The index also materializes top-k keywords in different spatial regions so that incoming queries can be processed with low latency. In case of peak times, a main-memory optimization technique is employed to shed less important data, so that the system still sustains high query accuracy with limited memory resources. Experimental results based on real Twitter feed and Bing Mobile spatial search queries show the scalability of GeoTrend to support arrival rates of up to 50,000 microblog/second, average query latency of 3 milli-seconds, and at least 90+% query accuracy even under limited memory resources.

Original languageEnglish
Title of host publication24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2016
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450345897
DOIs
Publication statusPublished - 31 Oct 2016
Externally publishedYes
Event24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2016 - Burlingame, United States
Duration: 31 Oct 20163 Nov 2016

Other

Other24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2016
CountryUnited States
CityBurlingame
Period31/10/163/11/16

Fingerprint

Query
Real-time
Data storage equipment
Latency
resource
Spatial Index
Scalability
Resources
Unit
Arbitrary
Optimization Techniques
Optimise
Experimental Results
index
trend
Trends

Keywords

  • Indexing
  • Microblogs
  • Query Processing
  • Real-time
  • Spatial
  • Trend

ASJC Scopus subject areas

  • Earth-Surface Processes
  • Computer Science Applications
  • Modelling and Simulation
  • Computer Graphics and Computer-Aided Design
  • Information Systems

Cite this

Magdy, A., Aly, A. M., Mokbel, M., Elnikety, S., He, Y., Nath, S., & Aref, W. G. (2016). GeoTrend: Spatial trending queries on real-time Microblogs. In 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2016 [7] Association for Computing Machinery. https://doi.org/10.1145/2996913.2996986

GeoTrend : Spatial trending queries on real-time Microblogs. / Magdy, Amr; Aly, Ahmed M.; Mokbel, Mohamed; Elnikety, Sameh; He, Yuxiong; Nath, Suman; Aref, Walid G.

24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2016. Association for Computing Machinery, 2016. 7.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Magdy, A, Aly, AM, Mokbel, M, Elnikety, S, He, Y, Nath, S & Aref, WG 2016, GeoTrend: Spatial trending queries on real-time Microblogs. in 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2016., 7, Association for Computing Machinery, 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2016, Burlingame, United States, 31/10/16. https://doi.org/10.1145/2996913.2996986
Magdy A, Aly AM, Mokbel M, Elnikety S, He Y, Nath S et al. GeoTrend: Spatial trending queries on real-time Microblogs. In 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2016. Association for Computing Machinery. 2016. 7 https://doi.org/10.1145/2996913.2996986
Magdy, Amr ; Aly, Ahmed M. ; Mokbel, Mohamed ; Elnikety, Sameh ; He, Yuxiong ; Nath, Suman ; Aref, Walid G. / GeoTrend : Spatial trending queries on real-time Microblogs. 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2016. Association for Computing Machinery, 2016.
@inproceedings{09966c8c32374c75952193dfe0d4beb4,
title = "GeoTrend: Spatial trending queries on real-time Microblogs",
abstract = "This paper presents GeoTrend; a system for scalable support of spatial trend discovery on recent microblogs, e.g., tweets and online reviews, that come in real time. GeoTrend is distinguished from existing techniques in three aspects: (1) It discovers trends in arbitrary spatial regions, e.g., city blocks. (2) It supports trending measures that effectively capture trending items under a variety of definitions that suit different applications. (3) It promotes recent microblogs as first-class citizens and optimizes its system components to digest a continuous flow of fast data in main-memory while removing old data efficiently. GeoTrend queries are top-k queries that discover the most trending k keywords that are posted within an arbitrary spatial region and during the last T time units. To support its queries efficiently, GeoTrend employs an in-memory spatial index that is able to efficiently digest incoming data and expire data that is beyond the last T time units. The index also materializes top-k keywords in different spatial regions so that incoming queries can be processed with low latency. In case of peak times, a main-memory optimization technique is employed to shed less important data, so that the system still sustains high query accuracy with limited memory resources. Experimental results based on real Twitter feed and Bing Mobile spatial search queries show the scalability of GeoTrend to support arrival rates of up to 50,000 microblog/second, average query latency of 3 milli-seconds, and at least 90+{\%} query accuracy even under limited memory resources.",
keywords = "Indexing, Microblogs, Query Processing, Real-time, Spatial, Trend",
author = "Amr Magdy and Aly, {Ahmed M.} and Mohamed Mokbel and Sameh Elnikety and Yuxiong He and Suman Nath and Aref, {Walid G.}",
year = "2016",
month = "10",
day = "31",
doi = "10.1145/2996913.2996986",
language = "English",
booktitle = "24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2016",
publisher = "Association for Computing Machinery",

}

TY - GEN

T1 - GeoTrend

T2 - Spatial trending queries on real-time Microblogs

AU - Magdy, Amr

AU - Aly, Ahmed M.

AU - Mokbel, Mohamed

AU - Elnikety, Sameh

AU - He, Yuxiong

AU - Nath, Suman

AU - Aref, Walid G.

PY - 2016/10/31

Y1 - 2016/10/31

N2 - This paper presents GeoTrend; a system for scalable support of spatial trend discovery on recent microblogs, e.g., tweets and online reviews, that come in real time. GeoTrend is distinguished from existing techniques in three aspects: (1) It discovers trends in arbitrary spatial regions, e.g., city blocks. (2) It supports trending measures that effectively capture trending items under a variety of definitions that suit different applications. (3) It promotes recent microblogs as first-class citizens and optimizes its system components to digest a continuous flow of fast data in main-memory while removing old data efficiently. GeoTrend queries are top-k queries that discover the most trending k keywords that are posted within an arbitrary spatial region and during the last T time units. To support its queries efficiently, GeoTrend employs an in-memory spatial index that is able to efficiently digest incoming data and expire data that is beyond the last T time units. The index also materializes top-k keywords in different spatial regions so that incoming queries can be processed with low latency. In case of peak times, a main-memory optimization technique is employed to shed less important data, so that the system still sustains high query accuracy with limited memory resources. Experimental results based on real Twitter feed and Bing Mobile spatial search queries show the scalability of GeoTrend to support arrival rates of up to 50,000 microblog/second, average query latency of 3 milli-seconds, and at least 90+% query accuracy even under limited memory resources.

AB - This paper presents GeoTrend; a system for scalable support of spatial trend discovery on recent microblogs, e.g., tweets and online reviews, that come in real time. GeoTrend is distinguished from existing techniques in three aspects: (1) It discovers trends in arbitrary spatial regions, e.g., city blocks. (2) It supports trending measures that effectively capture trending items under a variety of definitions that suit different applications. (3) It promotes recent microblogs as first-class citizens and optimizes its system components to digest a continuous flow of fast data in main-memory while removing old data efficiently. GeoTrend queries are top-k queries that discover the most trending k keywords that are posted within an arbitrary spatial region and during the last T time units. To support its queries efficiently, GeoTrend employs an in-memory spatial index that is able to efficiently digest incoming data and expire data that is beyond the last T time units. The index also materializes top-k keywords in different spatial regions so that incoming queries can be processed with low latency. In case of peak times, a main-memory optimization technique is employed to shed less important data, so that the system still sustains high query accuracy with limited memory resources. Experimental results based on real Twitter feed and Bing Mobile spatial search queries show the scalability of GeoTrend to support arrival rates of up to 50,000 microblog/second, average query latency of 3 milli-seconds, and at least 90+% query accuracy even under limited memory resources.

KW - Indexing

KW - Microblogs

KW - Query Processing

KW - Real-time

KW - Spatial

KW - Trend

UR - http://www.scopus.com/inward/record.url?scp=85011028508&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85011028508&partnerID=8YFLogxK

U2 - 10.1145/2996913.2996986

DO - 10.1145/2996913.2996986

M3 - Conference contribution

BT - 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2016

PB - Association for Computing Machinery

ER -