A single-version scheme of fault tolerant computing

This paper describes how to design low-cost reliable computing software for various application systems, by incorporating a single-version fault tolerant scheme along with run-time signature-based control-flow checking. Most of the ordinary systems lack fault tolerant software fix. The conventional...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autor principal: Saha, Goutan Kumar
Formato: Articulo
Lenguaje:Inglés
Publicado: 2006
Materias:
Acceso en línea:http://sedici.unlp.edu.ar/handle/10915/9514
http://journal.info.unlp.edu.ar/wp-content/uploads/JCST-Apr06-4.pdf
Aporte de:
id I19-R120-10915-9514
record_format dspace
institution Universidad Nacional de La Plata
institution_str I-19
repository_str R-120
collection SEDICI (UNLP)
language Inglés
topic Ciencias Informáticas
bit errors in memory and register
Fault tolerance
spellingShingle Ciencias Informáticas
bit errors in memory and register
Fault tolerance
Saha, Goutan Kumar
A single-version scheme of fault tolerant computing
topic_facet Ciencias Informáticas
bit errors in memory and register
Fault tolerance
description This paper describes how to design low-cost reliable computing software for various application systems, by incorporating a single-version fault tolerant scheme along with run-time signature-based control-flow checking. Most of the ordinary systems lack fault tolerant software fix. The conventional fault tolerant approaches viz., Recovery Block (RB), N Version Programming (NVP) etc., are too costly to fix in an ordinary low-cost application system because, both the RB and NVP rely on multiple (at least three) versions of both software and computing machines. However, the proposed approach needs a single version (SV) of an enhanced application program that gets executed on one computing machine only. It is common that we often face interrupted service (caused either by an intermittent fault in an application program or in hardware), during the service delivery period of an ordinary cheaper application system. Execution of an application program often show malfunctions or it gets interrupted due to memory bit errors. Error Correction Codes (ECC) (viz., parity, Hamming codes, CRC etc.,) that are used in memory, are not as effective for online correction of multiple bit errors, as they are, for the detection of few bit errors. Again, software implemented ECC has a significant overhead over both time and code redundancy. In other words, built in ECC in memory, cannot recover all bit errors but can detect only. As a result, if an error is detected by ECC, the application program needs to be restarted for its re-execution afresh in various microprocessor based application systems. So, the ECC alone is useful for designing a fail-stop kind of system but it suffers from high time redundancy. Other software implemented fault- tolerance schemes are also towards fail-stop kind. But, the proposed (SV) based approach is capable of tolerating such errors without stopping the execution of an application. This SV Scheme (SVS) aims to provide an uninterrupted service at no extra money, but at an acceptable more execution time and memory space. This SV is a non- fail-stop kind fault tolerance scheme that can be implemented in various computing systems without spending an additional money, and as a result, major part of common people in our society, can gain reliable service from the low-cost, SV-based computing system.
format Articulo
Articulo
author Saha, Goutan Kumar
author_facet Saha, Goutan Kumar
author_sort Saha, Goutan Kumar
title A single-version scheme of fault tolerant computing
title_short A single-version scheme of fault tolerant computing
title_full A single-version scheme of fault tolerant computing
title_fullStr A single-version scheme of fault tolerant computing
title_full_unstemmed A single-version scheme of fault tolerant computing
title_sort single-version scheme of fault tolerant computing
publishDate 2006
url http://sedici.unlp.edu.ar/handle/10915/9514
http://journal.info.unlp.edu.ar/wp-content/uploads/JCST-Apr06-4.pdf
work_keys_str_mv AT sahagoutankumar asingleversionschemeoffaulttolerantcomputing
AT sahagoutankumar singleversionschemeoffaulttolerantcomputing
bdutipo_str Repositorios
_version_ 1764820491295522816