Modelling Fault Tolerance of Transient Faults

Dubravka Ilic, Elena Troubitsyna, Modelling Fault Tolerance of Transient Faults. In: Cliff Jones, Alexander Romanovsky, Elena Troubitsyna, Michael Butler (Eds.), Proceedings of the Workshop on Rigorous Engineering of Fault-Tolerant Systems (REFT 2005), 84-92, 2005.

Abstract:

In this paper we focus on analysing transient physical faults and designing mechanisms to tolerate them. Transient faults are temporal faults that appear for some time and might disappear and reappear later. They are common in control systems. However, a transient fault appearing even for a short time might result in a system error. Hence fault tolerance mechanisms for detecting and recovering from temporal faults are of great importance in the design of control systems. The system module that detects errors and performs error recovery is often called a Failure Management System. Its purpose is to prevent the propagation of errors in the system. In this paper we propose a formal approach to specifying the Failure Management System in the B Method. We focus on deriving a general specification and development pattern for Failure Management Systems for tolerating transient faults.

BibTeX entry:

@INPROCEEDINGS{inpIlTr05b,
  title = {Modelling Fault Tolerance of Transient Faults},
  booktitle = {Proceedings of the Workshop on Rigorous Engineering of Fault-Tolerant Systems (REFT 2005)},
  author = {Ilic, Dubravka and Troubitsyna, Elena},
  editor = {Butler, Michael and Jones, Cliff and Romanovsky, Alexander and Troubitsyna, Elena},
  pages = {84--92},
  year = {2005},
  keywords = {fault tolerance, transient fault, B-Method, refinement},
}

Belongs to TUCS Research Unit(s): Distributed Systems Laboratory (DS Lab)
