Application-level diagnostic and membership protocols for generic time-triggered systems

Marco Serafini, Péter Bokor, Neeraj Suri, Jonny Vinter, Astrit Ademaj, Wolfgang Brandstätter, Fulvio Tagliabo, Jens Koch

Research output: Contribution to journalArticle

10 Citations (Scopus)

Abstract

We present online tunable diagnostic and membership protocols for generic time-triggered (TT) systems to detect crashes, send/receive omission faults, and network partitions. Compared to existing diagnostic and membership protocols for TT systems, our protocols do not rely on the single-fault assumption and also tolerate non-fail-silent (Byzantine) faults. They run at the application level and can be added on top of any TT system (possibly as a middleware component) without requiring modifications at the system level. The information on detected faults is accumulated using a penalty/reward algorithm to handle transient faults. After a fault is detected, the likelihood of node isolation can be adapted to different system configurations, including configurations where functions with different criticality levels are integrated. All protocols are formally verified using model checking. Using actual automotive and aerospace parameters, we also experimentally demonstrate the transient fault handling capabilities of the protocols.

Original languageEnglish
Article number5692145
Pages (from-to)177-193
Number of pages17
JournalIEEE Transactions on Dependable and Secure Computing
Volume8
Issue number2
DOIs
Publication statusPublished - 26 Jan 2011
Externally publishedYes

Fingerprint

Computer systems
Network protocols
Model checking
Middleware

Keywords

  • Diagnosis
  • membership
  • time-triggered systems
  • transient faults

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Cite this

Application-level diagnostic and membership protocols for generic time-triggered systems. / Serafini, Marco; Bokor, Péter; Suri, Neeraj; Vinter, Jonny; Ademaj, Astrit; Brandstätter, Wolfgang; Tagliabo, Fulvio; Koch, Jens.

In: IEEE Transactions on Dependable and Secure Computing, Vol. 8, No. 2, 5692145, 26.01.2011, p. 177-193.

Research output: Contribution to journalArticle

Serafini, M, Bokor, P, Suri, N, Vinter, J, Ademaj, A, Brandstätter, W, Tagliabo, F & Koch, J 2011, 'Application-level diagnostic and membership protocols for generic time-triggered systems', IEEE Transactions on Dependable and Secure Computing, vol. 8, no. 2, 5692145, pp. 177-193. https://doi.org/10.1109/TDSC.2010.23
Serafini, Marco ; Bokor, Péter ; Suri, Neeraj ; Vinter, Jonny ; Ademaj, Astrit ; Brandstätter, Wolfgang ; Tagliabo, Fulvio ; Koch, Jens. / Application-level diagnostic and membership protocols for generic time-triggered systems. In: IEEE Transactions on Dependable and Secure Computing. 2011 ; Vol. 8, No. 2. pp. 177-193.
@article{afe50a93918b4d7bab1c2a3d994fd8b4,
title = "Application-level diagnostic and membership protocols for generic time-triggered systems",
abstract = "We present online tunable diagnostic and membership protocols for generic time-triggered (TT) systems to detect crashes, send/receive omission faults, and network partitions. Compared to existing diagnostic and membership protocols for TT systems, our protocols do not rely on the single-fault assumption and also tolerate non-fail-silent (Byzantine) faults. They run at the application level and can be added on top of any TT system (possibly as a middleware component) without requiring modifications at the system level. The information on detected faults is accumulated using a penalty/reward algorithm to handle transient faults. After a fault is detected, the likelihood of node isolation can be adapted to different system configurations, including configurations where functions with different criticality levels are integrated. All protocols are formally verified using model checking. Using actual automotive and aerospace parameters, we also experimentally demonstrate the transient fault handling capabilities of the protocols.",
keywords = "Diagnosis, membership, time-triggered systems, transient faults",
author = "Marco Serafini and P{\'e}ter Bokor and Neeraj Suri and Jonny Vinter and Astrit Ademaj and Wolfgang Brandst{\"a}tter and Fulvio Tagliabo and Jens Koch",
year = "2011",
month = "1",
day = "26",
doi = "10.1109/TDSC.2010.23",
language = "English",
volume = "8",
pages = "177--193",
journal = "IEEE Transactions on Dependable and Secure Computing",
issn = "1545-5971",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "2",

}

TY - JOUR

T1 - Application-level diagnostic and membership protocols for generic time-triggered systems

AU - Serafini, Marco

AU - Bokor, Péter

AU - Suri, Neeraj

AU - Vinter, Jonny

AU - Ademaj, Astrit

AU - Brandstätter, Wolfgang

AU - Tagliabo, Fulvio

AU - Koch, Jens

PY - 2011/1/26

Y1 - 2011/1/26

N2 - We present online tunable diagnostic and membership protocols for generic time-triggered (TT) systems to detect crashes, send/receive omission faults, and network partitions. Compared to existing diagnostic and membership protocols for TT systems, our protocols do not rely on the single-fault assumption and also tolerate non-fail-silent (Byzantine) faults. They run at the application level and can be added on top of any TT system (possibly as a middleware component) without requiring modifications at the system level. The information on detected faults is accumulated using a penalty/reward algorithm to handle transient faults. After a fault is detected, the likelihood of node isolation can be adapted to different system configurations, including configurations where functions with different criticality levels are integrated. All protocols are formally verified using model checking. Using actual automotive and aerospace parameters, we also experimentally demonstrate the transient fault handling capabilities of the protocols.

AB - We present online tunable diagnostic and membership protocols for generic time-triggered (TT) systems to detect crashes, send/receive omission faults, and network partitions. Compared to existing diagnostic and membership protocols for TT systems, our protocols do not rely on the single-fault assumption and also tolerate non-fail-silent (Byzantine) faults. They run at the application level and can be added on top of any TT system (possibly as a middleware component) without requiring modifications at the system level. The information on detected faults is accumulated using a penalty/reward algorithm to handle transient faults. After a fault is detected, the likelihood of node isolation can be adapted to different system configurations, including configurations where functions with different criticality levels are integrated. All protocols are formally verified using model checking. Using actual automotive and aerospace parameters, we also experimentally demonstrate the transient fault handling capabilities of the protocols.

KW - Diagnosis

KW - membership

KW - time-triggered systems

KW - transient faults

UR - http://www.scopus.com/inward/record.url?scp=78751658192&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78751658192&partnerID=8YFLogxK

U2 - 10.1109/TDSC.2010.23

DO - 10.1109/TDSC.2010.23

M3 - Article

VL - 8

SP - 177

EP - 193

JO - IEEE Transactions on Dependable and Secure Computing

JF - IEEE Transactions on Dependable and Secure Computing

SN - 1545-5971

IS - 2

M1 - 5692145

ER -