Logbase

A scalable logstructured database system in the cloud

Hoang Tam Vo, Sheng Wang, Divyakant Agrawal, Gang Chen, Beng Chin Ooi

Research output: Chapter in Book/Report/Conference proceedingChapter

39 Citations (Scopus)

Abstract

Numerous applications such as financial transactions (e.g., stock trading) are write-heavy in nature. The shift from reads to writes in web applications has also been accelerating in recent years. Writeahead-logging is a common approach for providing recovery capability while improving performance in most storage systems. However, the separation of log and application data incurs write overheads observed in write-heavy environments and hence adversely affects the write throughput and recovery time in the system. In this paper, we introduce LogBase-a scalable log-structured database system that adopts log-only storage for removing the write bottleneck and supporting fast system recovery. It is designed to be dynamically deployed on commodity clusters to take advantage of elastic scaling property of cloud environments. LogBase provides in-memory multiversion indexes for supporting efficient access to data maintained in the log. LogBase also supports transactions that bundle read and write operations spanning across multiple records. We implemented the proposed system and compared it with HBase and a disk-based log-structured record-oriented system modeled after RAMCloud. The experimental results show that LogBase is able to provide sustained write throughput, efficient data access out of the cache, and effective system recovery.

Original languageEnglish
Title of host publicationProceedings of the VLDB Endowment
Pages1004-1015
Number of pages12
Volume5
Edition10
Publication statusPublished - Jun 2012
Externally publishedYes

Fingerprint

Recovery
Throughput
Data storage equipment

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Computer Science(all)

Cite this

Vo, H. T., Wang, S., Agrawal, D., Chen, G., & Ooi, B. C. (2012). Logbase: A scalable logstructured database system in the cloud. In Proceedings of the VLDB Endowment (10 ed., Vol. 5, pp. 1004-1015)

Logbase : A scalable logstructured database system in the cloud. / Vo, Hoang Tam; Wang, Sheng; Agrawal, Divyakant; Chen, Gang; Ooi, Beng Chin.

Proceedings of the VLDB Endowment. Vol. 5 10. ed. 2012. p. 1004-1015.

Research output: Chapter in Book/Report/Conference proceedingChapter

Vo, HT, Wang, S, Agrawal, D, Chen, G & Ooi, BC 2012, Logbase: A scalable logstructured database system in the cloud. in Proceedings of the VLDB Endowment. 10 edn, vol. 5, pp. 1004-1015.
Vo HT, Wang S, Agrawal D, Chen G, Ooi BC. Logbase: A scalable logstructured database system in the cloud. In Proceedings of the VLDB Endowment. 10 ed. Vol. 5. 2012. p. 1004-1015
Vo, Hoang Tam ; Wang, Sheng ; Agrawal, Divyakant ; Chen, Gang ; Ooi, Beng Chin. / Logbase : A scalable logstructured database system in the cloud. Proceedings of the VLDB Endowment. Vol. 5 10. ed. 2012. pp. 1004-1015
@inbook{61146a8b9ff44f168e13075508709f81,
title = "Logbase: A scalable logstructured database system in the cloud",
abstract = "Numerous applications such as financial transactions (e.g., stock trading) are write-heavy in nature. The shift from reads to writes in web applications has also been accelerating in recent years. Writeahead-logging is a common approach for providing recovery capability while improving performance in most storage systems. However, the separation of log and application data incurs write overheads observed in write-heavy environments and hence adversely affects the write throughput and recovery time in the system. In this paper, we introduce LogBase-a scalable log-structured database system that adopts log-only storage for removing the write bottleneck and supporting fast system recovery. It is designed to be dynamically deployed on commodity clusters to take advantage of elastic scaling property of cloud environments. LogBase provides in-memory multiversion indexes for supporting efficient access to data maintained in the log. LogBase also supports transactions that bundle read and write operations spanning across multiple records. We implemented the proposed system and compared it with HBase and a disk-based log-structured record-oriented system modeled after RAMCloud. The experimental results show that LogBase is able to provide sustained write throughput, efficient data access out of the cache, and effective system recovery.",
author = "Vo, {Hoang Tam} and Sheng Wang and Divyakant Agrawal and Gang Chen and Ooi, {Beng Chin}",
year = "2012",
month = "6",
language = "English",
volume = "5",
pages = "1004--1015",
booktitle = "Proceedings of the VLDB Endowment",
edition = "10",

}

TY - CHAP

T1 - Logbase

T2 - A scalable logstructured database system in the cloud

AU - Vo, Hoang Tam

AU - Wang, Sheng

AU - Agrawal, Divyakant

AU - Chen, Gang

AU - Ooi, Beng Chin

PY - 2012/6

Y1 - 2012/6

N2 - Numerous applications such as financial transactions (e.g., stock trading) are write-heavy in nature. The shift from reads to writes in web applications has also been accelerating in recent years. Writeahead-logging is a common approach for providing recovery capability while improving performance in most storage systems. However, the separation of log and application data incurs write overheads observed in write-heavy environments and hence adversely affects the write throughput and recovery time in the system. In this paper, we introduce LogBase-a scalable log-structured database system that adopts log-only storage for removing the write bottleneck and supporting fast system recovery. It is designed to be dynamically deployed on commodity clusters to take advantage of elastic scaling property of cloud environments. LogBase provides in-memory multiversion indexes for supporting efficient access to data maintained in the log. LogBase also supports transactions that bundle read and write operations spanning across multiple records. We implemented the proposed system and compared it with HBase and a disk-based log-structured record-oriented system modeled after RAMCloud. The experimental results show that LogBase is able to provide sustained write throughput, efficient data access out of the cache, and effective system recovery.

AB - Numerous applications such as financial transactions (e.g., stock trading) are write-heavy in nature. The shift from reads to writes in web applications has also been accelerating in recent years. Writeahead-logging is a common approach for providing recovery capability while improving performance in most storage systems. However, the separation of log and application data incurs write overheads observed in write-heavy environments and hence adversely affects the write throughput and recovery time in the system. In this paper, we introduce LogBase-a scalable log-structured database system that adopts log-only storage for removing the write bottleneck and supporting fast system recovery. It is designed to be dynamically deployed on commodity clusters to take advantage of elastic scaling property of cloud environments. LogBase provides in-memory multiversion indexes for supporting efficient access to data maintained in the log. LogBase also supports transactions that bundle read and write operations spanning across multiple records. We implemented the proposed system and compared it with HBase and a disk-based log-structured record-oriented system modeled after RAMCloud. The experimental results show that LogBase is able to provide sustained write throughput, efficient data access out of the cache, and effective system recovery.

UR - http://www.scopus.com/inward/record.url?scp=84873167171&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84873167171&partnerID=8YFLogxK

M3 - Chapter

VL - 5

SP - 1004

EP - 1015

BT - Proceedings of the VLDB Endowment

ER -