SOLE

Scalable on-line execution of continuous queries on spatio-temporal data streams

Mohamed Mokbel, Walid G. Aref

Research output: Contribution to journal › Article

68 Citations (Scopus)

Abstract

This paper presents the scalable on-line execution (SOLE) algorithm for continuous and on-line evaluation of concurrent continuous spatio-temporal queries over data streams. Incoming spatio-temporal data streams are processed in-memory against a set of outstanding continuous queries. The SOLE algorithm utilizes the scarce memory resource efficiently by keeping track of only the significant objects. In-memory stored objects are expired (i.e., dropped) from memory once they become insignificant. SOLE is a scalable algorithm where all the continuous outstanding queries share the same buffer pool. In addition, SOLE is presented as a spatio-temporal join between two input streams, a stream of spatio-temporal objects and a stream of spatio-temporal queries. To cope with intervals of high arrival rates of objects and/or queries, SOLE utilizes a load-shedding approach where some of the stored objects are dropped from memory. SOLE is implemented as a pipelined query operator that can be combined with traditional query operators in a query execution plan to support a wide variety of continuous queries. Performance experiments based on a real implementation of SOLE inside a prototype of a data stream management system show the scalability and efficiency of SOLE in highly dynamic environments.
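The mechanisms named in the abstract (a buffer shared by all queries, keeping only significant objects, expiring insignificant ones, and shedding load) can be illustrated with a toy sketch. This is not the authors' implementation. The names `RangeQuery` and `SharedBuffer`, the restriction to rectangular range queries, the definition of "significant" as "inside at least one outstanding query", and the drop-oldest shedding policy are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RangeQuery:
    """Hypothetical rectangular continuous query (an assumption, not from the paper)."""
    qid: int
    xmin: float
    ymin: float
    xmax: float
    ymax: float

    def contains(self, x: float, y: float) -> bool:
        return self.xmin <= x <= self.xmax and self.ymin <= y <= self.ymax


class SharedBuffer:
    """One in-memory buffer shared by every outstanding query."""

    def __init__(self, capacity: int):
        self.capacity = capacity
        self.queries = {}  # qid -> RangeQuery
        self.objects = {}  # oid -> (x, y); only significant objects are stored

    def on_query(self, query: RangeQuery) -> None:
        """Handle one tuple from the query stream."""
        self.queries[query.qid] = query

    def on_object(self, oid, x: float, y: float) -> list:
        """Handle one tuple from the object stream; return ids of matching queries."""
        matches = [q.qid for q in self.queries.values() if q.contains(x, y)]
        # Remove any previous entry so a re-insert moves the object to the end.
        self.objects.pop(oid, None)
        if not matches:
            # Assumed rule: an object matching no query is insignificant and expires.
            return matches
        self.objects[oid] = (x, y)
        # Assumed load-shedding policy: drop the least recently updated objects.
        while len(self.objects) > self.capacity:
            oldest = next(iter(self.objects))
            del self.objects[oldest]
        return matches
```

Each incoming object is joined against the stored queries in a single pass, which mirrors the abstract's framing of the operator as a join between an object stream and a query stream. A real system would replace the linear scan with a spatial index and would also re-evaluate stored objects when a query arrives or moves.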

Original language: English
Pages (from-to): 971-995
Number of pages: 25
Journal: VLDB Journal
Volume: 17
Issue number: 5
DOIs: 10.1007/s00778-007-0046-1
Publication status: Published - 1 Aug 2008
Externally published: Yes

Fingerprint

Data storage equipment
Scalability
Experiments

ASJC Scopus subject areas

  • Information Systems
  • Hardware and Architecture

Cite this

SOLE: Scalable on-line execution of continuous queries on spatio-temporal data streams. / Mokbel, Mohamed; Aref, Walid G.

In: VLDB Journal, Vol. 17, No. 5, 01.08.2008, p. 971-995.


@article{5e82b35a993947388bdf606e0fd450a1,
title = "SOLE: Scalable on-line execution of continuous queries on spatio-temporal data streams",
abstract = "This paper presents the scalable on-line execution (SOLE) algorithm for continuous and on-line evaluation of concurrent continuous spatio-temporal queries over data streams. Incoming spatio-temporal data streams are processed in-memory against a set of outstanding continuous queries. The SOLE algorithm utilizes the scarce memory resource efficiently by keeping track of only the significant objects. In-memory stored objects are expired (i.e., dropped) from memory once they become insignificant. SOLE is a scalable algorithm where all the continuous outstanding queries share the same buffer pool. In addition, SOLE is presented as a spatio-temporal join between two input streams, a stream of spatio-temporal objects and a stream of spatio-temporal queries. To cope with intervals of high arrival rates of objects and/or queries, SOLE utilizes a load-shedding approach where some of the stored objects are dropped from memory. SOLE is implemented as a pipelined query operator that can be combined with traditional query operators in a query execution plan to support a wide variety of continuous queries. Performance experiments based on a real implementation of SOLE inside a prototype of a data stream management system show the scalability and efficiency of SOLE in highly dynamic environments.",
author = "Mokbel, Mohamed and Aref, {Walid G.}",
year = "2008",
month = "8",
day = "1",
doi = "10.1007/s00778-007-0046-1",
language = "English",
volume = "17",
pages = "971--995",
journal = "VLDB Journal",
issn = "1066-8888",
publisher = "Springer New York",
number = "5",
}

TY - JOUR

T1 - SOLE

T2 - Scalable on-line execution of continuous queries on spatio-temporal data streams

AU - Mokbel, Mohamed

AU - Aref, Walid G.

PY - 2008/8/1

Y1 - 2008/8/1

N2 - This paper presents the scalable on-line execution (SOLE) algorithm for continuous and on-line evaluation of concurrent continuous spatio-temporal queries over data streams. Incoming spatio-temporal data streams are processed in-memory against a set of outstanding continuous queries. The SOLE algorithm utilizes the scarce memory resource efficiently by keeping track of only the significant objects. In-memory stored objects are expired (i.e., dropped) from memory once they become insignificant. SOLE is a scalable algorithm where all the continuous outstanding queries share the same buffer pool. In addition, SOLE is presented as a spatio-temporal join between two input streams, a stream of spatio-temporal objects and a stream of spatio-temporal queries. To cope with intervals of high arrival rates of objects and/or queries, SOLE utilizes a load-shedding approach where some of the stored objects are dropped from memory. SOLE is implemented as a pipelined query operator that can be combined with traditional query operators in a query execution plan to support a wide variety of continuous queries. Performance experiments based on a real implementation of SOLE inside a prototype of a data stream management system show the scalability and efficiency of SOLE in highly dynamic environments.

AB - This paper presents the scalable on-line execution (SOLE) algorithm for continuous and on-line evaluation of concurrent continuous spatio-temporal queries over data streams. Incoming spatio-temporal data streams are processed in-memory against a set of outstanding continuous queries. The SOLE algorithm utilizes the scarce memory resource efficiently by keeping track of only the significant objects. In-memory stored objects are expired (i.e., dropped) from memory once they become insignificant. SOLE is a scalable algorithm where all the continuous outstanding queries share the same buffer pool. In addition, SOLE is presented as a spatio-temporal join between two input streams, a stream of spatio-temporal objects and a stream of spatio-temporal queries. To cope with intervals of high arrival rates of objects and/or queries, SOLE utilizes a load-shedding approach where some of the stored objects are dropped from memory. SOLE is implemented as a pipelined query operator that can be combined with traditional query operators in a query execution plan to support a wide variety of continuous queries. Performance experiments based on a real implementation of SOLE inside a prototype of a data stream management system show the scalability and efficiency of SOLE in highly dynamic environments.

UR - http://www.scopus.com/inward/record.url?scp=46749110489&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=46749110489&partnerID=8YFLogxK

U2 - 10.1007/s00778-007-0046-1

DO - 10.1007/s00778-007-0046-1

M3 - Article

VL - 17

SP - 971

EP - 995

JO - VLDB Journal

JF - VLDB Journal

SN - 1066-8888

IS - 5

ER -