Recovering from Multiple Process Failures in the Time Warp Mechanism

Divyakant Agrawal, Jonathan R. Agre

Research output: Contribution to journalArticle

10 Citations (Scopus)

Abstract

In this paper we describe a recovery protocol for distributed systems using the Time Warp control mechanism. The proposed protocol is fault tolerant to multiple process failures. Time Warp is an optimistic execution technique in which synchronization is achieved using rollback. Our recovery protocol exploits the redundancy already available to implement process rollback in the Time Warp mechanism. Thus, the recovery protocol has little additional bookkeeping overhead, which contrasts with many other recovery protocols.

Original languageEnglish
Pages (from-to)1504-1514
Number of pages11
JournalIEEE Transactions on Computers
Volume41
Issue number12
DOIs
Publication statusPublished - Dec 1992

    Fingerprint

Keywords

  • Distributed processing
  • distributed simulation
  • fault tolerance
  • optimistic synchronization
  • parallel processing
  • rollback recovery
  • virtual time

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Hardware and Architecture
  • Computational Theory and Mathematics

Cite this