StarDB: A large-scale DBMS for strings

Majed Sahli, Essam Mansour, Panos Kalnis

Research output: Chapter in Book/Report/Conference proceedingChapter

2 Citations (Scopus)

Abstract

Strings and applications using them are proliferating in science and business. Currently, strings are stored in file systems and processed using ad-hoc procedural code. Existing techniques are not flexible and cannot efficiently handle complex queries or large datasets. In this paper, we demonstrate StarDB, a distributed database system for analytics on strings. StarDB hides data and system complexities and allows users to focus on analytics. It uses a comprehensive set of parallel string operations and provides a declarative query language to solve complex queries. StarDB automatically tunes itself and runs with over 90% efficiency on supercomputers, public clouds, clusters, and workstations. We test StarDB using real datasets that are 2 orders of magnitude larger than the datasets reported by previous works.

Original languageEnglish
Title of host publicationProceedings of the VLDB Endowment
PublisherAssociation for Computing Machinery
Pages1844-1847
Number of pages4
Volume8
Edition12
Publication statusPublished - 2015
Event3rd Workshop on Spatio-Temporal Database Management, STDBM 2006, Co-located with the 32nd International Conference on Very Large Data Bases, VLDB 2006 - Seoul, Korea, Republic of
Duration: 11 Sep 200611 Sep 2006

Other

Other3rd Workshop on Spatio-Temporal Database Management, STDBM 2006, Co-located with the 32nd International Conference on Very Large Data Bases, VLDB 2006
CountryKorea, Republic of
CitySeoul
Period11/9/0611/9/06

Fingerprint

Distributed database systems
Computer workstations
Query languages
Supercomputers
Industry

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Computer Science(all)

Cite this

Sahli, M., Mansour, E., & Kalnis, P. (2015). StarDB: A large-scale DBMS for strings. In Proceedings of the VLDB Endowment (12 ed., Vol. 8, pp. 1844-1847). Association for Computing Machinery.

StarDB : A large-scale DBMS for strings. / Sahli, Majed; Mansour, Essam; Kalnis, Panos.

Proceedings of the VLDB Endowment. Vol. 8 12. ed. Association for Computing Machinery, 2015. p. 1844-1847.

Research output: Chapter in Book/Report/Conference proceedingChapter

Sahli, M, Mansour, E & Kalnis, P 2015, StarDB: A large-scale DBMS for strings. in Proceedings of the VLDB Endowment. 12 edn, vol. 8, Association for Computing Machinery, pp. 1844-1847, 3rd Workshop on Spatio-Temporal Database Management, STDBM 2006, Co-located with the 32nd International Conference on Very Large Data Bases, VLDB 2006, Seoul, Korea, Republic of, 11/9/06.
Sahli M, Mansour E, Kalnis P. StarDB: A large-scale DBMS for strings. In Proceedings of the VLDB Endowment. 12 ed. Vol. 8. Association for Computing Machinery. 2015. p. 1844-1847
Sahli, Majed ; Mansour, Essam ; Kalnis, Panos. / StarDB : A large-scale DBMS for strings. Proceedings of the VLDB Endowment. Vol. 8 12. ed. Association for Computing Machinery, 2015. pp. 1844-1847
@inbook{71f2a7f662a54f91803d770d37eb592d,
title = "StarDB: A large-scale DBMS for strings",
abstract = "Strings and applications using them are proliferating in science and business. Currently, strings are stored in file systems and processed using ad-hoc procedural code. Existing techniques are not flexible and cannot efficiently handle complex queries or large datasets. In this paper, we demonstrate StarDB, a distributed database system for analytics on strings. StarDB hides data and system complexities and allows users to focus on analytics. It uses a comprehensive set of parallel string operations and provides a declarative query language to solve complex queries. StarDB automatically tunes itself and runs with over 90{\%} efficiency on supercomputers, public clouds, clusters, and workstations. We test StarDB using real datasets that are 2 orders of magnitude larger than the datasets reported by previous works.",
author = "Majed Sahli and Essam Mansour and Panos Kalnis",
year = "2015",
language = "English",
volume = "8",
pages = "1844--1847",
booktitle = "Proceedings of the VLDB Endowment",
publisher = "Association for Computing Machinery",
edition = "12",

}

TY - CHAP

T1 - StarDB

T2 - A large-scale DBMS for strings

AU - Sahli, Majed

AU - Mansour, Essam

AU - Kalnis, Panos

PY - 2015

Y1 - 2015

N2 - Strings and applications using them are proliferating in science and business. Currently, strings are stored in file systems and processed using ad-hoc procedural code. Existing techniques are not flexible and cannot efficiently handle complex queries or large datasets. In this paper, we demonstrate StarDB, a distributed database system for analytics on strings. StarDB hides data and system complexities and allows users to focus on analytics. It uses a comprehensive set of parallel string operations and provides a declarative query language to solve complex queries. StarDB automatically tunes itself and runs with over 90% efficiency on supercomputers, public clouds, clusters, and workstations. We test StarDB using real datasets that are 2 orders of magnitude larger than the datasets reported by previous works.

AB - Strings and applications using them are proliferating in science and business. Currently, strings are stored in file systems and processed using ad-hoc procedural code. Existing techniques are not flexible and cannot efficiently handle complex queries or large datasets. In this paper, we demonstrate StarDB, a distributed database system for analytics on strings. StarDB hides data and system complexities and allows users to focus on analytics. It uses a comprehensive set of parallel string operations and provides a declarative query language to solve complex queries. StarDB automatically tunes itself and runs with over 90% efficiency on supercomputers, public clouds, clusters, and workstations. We test StarDB using real datasets that are 2 orders of magnitude larger than the datasets reported by previous works.

UR - http://www.scopus.com/inward/record.url?scp=84953885877&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84953885877&partnerID=8YFLogxK

M3 - Chapter

AN - SCOPUS:84953885877

VL - 8

SP - 1844

EP - 1847

BT - Proceedings of the VLDB Endowment

PB - Association for Computing Machinery

ER -