Taghreed

A system for querying, analyzing, and visualizing geotagged microblogs

Amr Magdy, Louai Alarabi, Saif Al-Harthi, Mashaal Musleh, Thanaa M. Ghanem, Sohaib Ghani, Mohamed Mokbel

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

35 Citations (Scopus)

Abstract

This paper presents Taghreed, a full-fledged system for efficient and scalable querying, analyzing, and visualizing of geotagged microblogs, e.g., tweets. Taghreed supports arbitrary queries on a large number (billions) of microblogs that go back up to several months in the past. Taghreed consists of four main components: (1) indexer, (2) query engine, (3) recovery manager, and (4) visualizer. The Taghreed indexer efficiently digests incoming microblogs with high arrival rates in lightweight memory-resident indexes. When the memory becomes full, a flushing policy manager transfers the memory contents to disk indexes, which manage billions of microblogs spanning several months. On memory failure, the recovery manager restores the system status from replicated copies of the main-memory content. The Taghreed query engine consists of two modules: a query optimizer and a query processor. The query optimizer generates an optimal query plan to be executed by the query processor through efficient retrieval techniques to provide low query response times, i.e., on the order of milliseconds. The Taghreed visualizer allows end users to issue a wide variety of spatiotemporal queries. It then graphically presents the answers and allows interactive exploration through them. Taghreed is the first system that addresses all these challenges collectively for microblog data. In the paper, each system component is described in detail.
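
The memory-to-disk flow described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the class names `Microblog` and `TwoTierIndex`, the uniform grid with a `cell_deg` cell size, the `capacity` threshold, and the "evict the oldest half" flushing policy are all assumptions made for illustration, and the "disk" tier is just a second in-process dictionary.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Microblog:
    ts: int      # arrival timestamp (arbitrary integer units)
    lat: float
    lon: float
    text: str


class TwoTierIndex:
    """Toy grid index with a memory tier and a stand-in 'disk' tier (hypothetical)."""

    def __init__(self, capacity: int, cell_deg: float = 1.0):
        self.capacity = capacity            # max microblogs kept in memory
        self.cell_deg = cell_deg            # grid cell size in degrees
        self.memory = defaultdict(list)     # grid cell -> microblogs
        self.disk = defaultdict(list)       # stand-in for disk-resident index
        self.size = 0                       # microblogs currently in memory

    def _cell(self, lat: float, lon: float) -> tuple:
        return (int(lat // self.cell_deg), int(lon // self.cell_deg))

    def insert(self, m: Microblog) -> None:
        self.memory[self._cell(m.lat, m.lon)].append(m)
        self.size += 1
        if self.size > self.capacity:
            self._flush()

    def _flush(self) -> None:
        # Assumed policy: move the oldest half of the memory contents to disk.
        items = sorted((m for ms in self.memory.values() for m in ms),
                       key=lambda m: m.ts)
        half = len(items) // 2
        self.memory = defaultdict(list)
        for m in items[:half]:
            self.disk[self._cell(m.lat, m.lon)].append(m)
        for m in items[half:]:
            self.memory[self._cell(m.lat, m.lon)].append(m)
        self.size = len(items) - half

    def query(self, lat_min, lat_max, lon_min, lon_max, t_min, t_max):
        """Spatio-temporal range query over both tiers, newest first."""
        i0, j0 = self._cell(lat_min, lon_min)
        i1, j1 = self._cell(lat_max, lon_max)
        hits = []
        for tier in (self.memory, self.disk):
            for i in range(i0, i1 + 1):
                for j in range(j0, j1 + 1):
                    for m in tier.get((i, j), ()):
                        if (lat_min <= m.lat <= lat_max
                                and lon_min <= m.lon <= lon_max
                                and t_min <= m.ts <= t_max):
                            hits.append(m)
        return sorted(hits, key=lambda m: m.ts, reverse=True)
```

A query here always visits both tiers. A real query optimizer, as the abstract suggests, would presumably decide which indexes to touch for a given time range, and that logic is omitted from this sketch.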

Original language: English
Title of host publication: 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2014
Publisher: Association for Computing Machinery
Pages: 163-172
Number of pages: 10
Volume: 04-07-November-2014
ISBN (Electronic): 9781450331319
DOIs: https://doi.org/10.1145/2666310.2666397
Publication status: Published - 4 Nov 2014
Externally published: Yes
Event: 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2014 - Dallas, United States
Duration: 4 Nov 2014 → 7 Nov 2014

Other

Other: 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2014
Country: United States
City: Dallas
Period: 4/11/14 → 7/11/14

Keywords

  • Indexing
  • Microblogs
  • Query Processing
  • Spatio-temporal

ASJC Scopus subject areas

  • Earth-Surface Processes
  • Computer Science Applications
  • Modelling and Simulation
  • Computer Graphics and Computer-Aided Design
  • Information Systems

Cite this

Magdy, A., Alarabi, L., Al-Harthi, S., Musleh, M., Ghanem, T. M., Ghani, S., & Mokbel, M. (2014). Taghreed: A system for querying, analyzing, and visualizing geotagged microblogs. In 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2014 (Vol. 04-07-November-2014, pp. 163-172). Association for Computing Machinery. https://doi.org/10.1145/2666310.2666397

@inproceedings{948703dbcfd44d54ba359c4877a5a858,
title = "Taghreed: A system for querying, analyzing, and visualizing geotagged microblogs",
abstract = "This paper presents Taghreed, a full-fledged system for efficient and scalable querying, analyzing, and visualizing of geotagged microblogs, e.g., tweets. Taghreed supports arbitrary queries on a large number (billions) of microblogs that go back up to several months in the past. Taghreed consists of four main components: (1) indexer, (2) query engine, (3) recovery manager, and (4) visualizer. The Taghreed indexer efficiently digests incoming microblogs with high arrival rates in lightweight memory-resident indexes. When the memory becomes full, a flushing policy manager transfers the memory contents to disk indexes, which manage billions of microblogs spanning several months. On memory failure, the recovery manager restores the system status from replicated copies of the main-memory content. The Taghreed query engine consists of two modules: a query optimizer and a query processor. The query optimizer generates an optimal query plan to be executed by the query processor through efficient retrieval techniques to provide low query response times, i.e., on the order of milliseconds. The Taghreed visualizer allows end users to issue a wide variety of spatiotemporal queries. It then graphically presents the answers and allows interactive exploration through them. Taghreed is the first system that addresses all these challenges collectively for microblog data. In the paper, each system component is described in detail.",
keywords = "Indexing, Microblogs, Query Processing, Spatio-temporal",
author = "Amr Magdy and Louai Alarabi and Saif Al-Harthi and Mashaal Musleh and Ghanem, {Thanaa M.} and Sohaib Ghani and Mohamed Mokbel",
year = "2014",
month = "11",
day = "4",
doi = "10.1145/2666310.2666397",
language = "English",
volume = "04-07-November-2014",
pages = "163--172",
booktitle = "22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2014",
publisher = "Association for Computing Machinery",

}

TY - GEN

T1 - Taghreed

T2 - A system for querying, analyzing, and visualizing geotagged microblogs

AU - Magdy, Amr

AU - Alarabi, Louai

AU - Al-Harthi, Saif

AU - Musleh, Mashaal

AU - Ghanem, Thanaa M.

AU - Ghani, Sohaib

AU - Mokbel, Mohamed

PY - 2014/11/4

Y1 - 2014/11/4

N2 - This paper presents Taghreed, a full-fledged system for efficient and scalable querying, analyzing, and visualizing of geotagged microblogs, e.g., tweets. Taghreed supports arbitrary queries on a large number (billions) of microblogs that go back up to several months in the past. Taghreed consists of four main components: (1) indexer, (2) query engine, (3) recovery manager, and (4) visualizer. The Taghreed indexer efficiently digests incoming microblogs with high arrival rates in lightweight memory-resident indexes. When the memory becomes full, a flushing policy manager transfers the memory contents to disk indexes, which manage billions of microblogs spanning several months. On memory failure, the recovery manager restores the system status from replicated copies of the main-memory content. The Taghreed query engine consists of two modules: a query optimizer and a query processor. The query optimizer generates an optimal query plan to be executed by the query processor through efficient retrieval techniques to provide low query response times, i.e., on the order of milliseconds. The Taghreed visualizer allows end users to issue a wide variety of spatiotemporal queries. It then graphically presents the answers and allows interactive exploration through them. Taghreed is the first system that addresses all these challenges collectively for microblog data. In the paper, each system component is described in detail.

KW - Indexing

KW - Microblogs

KW - Query Processing

KW - Spatio-temporal

UR - http://www.scopus.com/inward/record.url?scp=84961219499&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84961219499&partnerID=8YFLogxK

U2 - 10.1145/2666310.2666397

DO - 10.1145/2666310.2666397

M3 - Conference contribution

VL - 04-07-November-2014

SP - 163

EP - 172

BT - 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2014

PB - Association for Computing Machinery

ER -