OceanRT: Real-time analytics over large temporal data

Shiming Zhang, Yin Yang, Wei Fan, Liang Lan, Mingxuan Yuan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

16 Citations (Scopus)

Abstract

We demonstrate OceanRT, a novel cloud-based infrastructure that performs online analytics in real time, over large-scale temporal data such as call logs from a telecommunication company. Apart from proprietary systems for which few details have been revealed, most existing big-data analytics systems are built on top of an offline, MapReduce-style infrastructure, which inherently limits their efficiency. In contrast, OceanRT employs a novel computing architecture consisting of interconnected Access Query Engines (AQEs), as well as a new storage scheme that ensures data locality and fast access for temporal data. Our preliminary evaluation shows that OceanRT can be up to 10× faster than Impala [10], 12× faster than Shark [5], and 200× faster than Hive [13]. The demo will show how OceanRT manages a real call log dataset (around 5TB per day) from a large mobile network operator in China. Besides presenting the processing of a few preset queries, we also allow the audience to issue ad hoc HiveQL [13] queries, watch how OceanRT answers them, and compare the speed of OceanRT with its competitors.

Original languageEnglish
Title of host publicationSIGMOD 2014 - Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data
PublisherAssociation for Computing Machinery
Pages1099-1102
Number of pages4
ISBN (Print)9781450323765
DOIs
Publication statusPublished - 2014
Externally publishedYes
Event2014 ACM SIGMOD International Conference on Management of Data, SIGMOD 2014 - Snowbird, UT, United States
Duration: 22 Jun 201427 Jun 2014

Other

Other2014 ACM SIGMOD International Conference on Management of Data, SIGMOD 2014
CountryUnited States
CitySnowbird, UT
Period22/6/1427/6/14

    Fingerprint

Keywords

  • Design
  • Management
  • Performance

ASJC Scopus subject areas

  • Software
  • Information Systems

Cite this

Zhang, S., Yang, Y., Fan, W., Lan, L., & Yuan, M. (2014). OceanRT: Real-time analytics over large temporal data. In SIGMOD 2014 - Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data (pp. 1099-1102). Association for Computing Machinery. https://doi.org/10.1145/2588555.2594513