Practical hardening of crash-tolerant systems

Miguel Correia, Daniel Gómez Ferro, Flavio P. Junqueira, Marco Serafini

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

Recent failures of production systems have highlighted the importance of tolerating faults beyond crashes. The industry has so far addressed this problem by hardening crash-tolerant systems with ad hoc error detection checks, potentially overlooking critical fault scenarios. We propose a generic and principled hardening technique for Arbitrary State Corruption (ASC) faults, which specifically model the effects of realistic data corruptions on distributed processes. Hardening does not require the use of trusted components or the replication of the process over multiple physical servers. We implemented a wrapper library to transparently harden distributed processes. To exercise our library and evaluate our technique, we obtained ASC-tolerant versions of Paxos, of a subset of the ZooKeeper API, and of an eventually consistent storage by implementing crash-tolerant protocols and automatically hardening them using our library. Our evaluation shows that the throughput of our ASC-hardened state machine replication outperforms its Byzantine-tolerant counterpart by up to 70%.

Original languageEnglish
Title of host publicationProceedings of the 2012 USENIX Annual Technical Conference, USENIX ATC 2012
PublisherUSENIX Association
Pages453-466
Number of pages14
ISBN (Electronic)9781931971935
Publication statusPublished - 1 Jan 2019
Event2012 USENIX Annual Technical Conference, USENIX ATC 2012 - Boston, United States
Duration: 13 Jun 201215 Jun 2012

Publication series

NameProceedings of the 2012 USENIX Annual Technical Conference, USENIX ATC 2012

Conference

Conference2012 USENIX Annual Technical Conference, USENIX ATC 2012
CountryUnited States
CityBoston
Period13/6/1215/6/12

    Fingerprint

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Correia, M., Ferro, D. G., Junqueira, F. P., & Serafini, M. (2019). Practical hardening of crash-tolerant systems. In Proceedings of the 2012 USENIX Annual Technical Conference, USENIX ATC 2012 (pp. 453-466). (Proceedings of the 2012 USENIX Annual Technical Conference, USENIX ATC 2012). USENIX Association.