A multipath routing method for tolerating permanent and non-permanent faults

The intensive and continuous use of high-performance computers for executing computationally intensive applications, coupled with the large number of elements that make them up, dramatically increases the likelihood of failures during their operation. The interconnection network is a critical part o...

Bibliographic Details
Main authors: Zarza, Gonzalo, Lugones, Diego, Franco, Daniel, Luque Fadón, Emilio
Format: Conference object
Language: English
Published: 2009
Subjects:
Online access: http://sedici.unlp.edu.ar/handle/10915/20912
Contributed by:
id I19-R120-10915-20912
record_format dspace
institution Universidad Nacional de La Plata
institution_str I-19
repository_str R-120
collection SEDICI (UNLP)
language English
topic Computer Science
high-performance computers
high-speed interconnection
fault-tolerant routing method
Communications Applications
Reliability, Testing, and Fault-Tolerance
description The intensive and continuous use of high-performance computers for executing computationally intensive applications, coupled with the large number of elements that make them up, dramatically increases the likelihood of failures during their operation. The interconnection network is a critical part of such systems; network faults therefore have an extremely high impact, because most routing algorithms are not designed to tolerate them. In such algorithms, a single fault may stall messages in the network, preventing applications from finishing, or may lead to deadlocked configurations. This work addresses fault tolerance in high-speed interconnection networks by designing a fault-tolerant routing method that handles an unbounded number of dynamic faults (permanent and non-permanent). To accomplish this, we exploit communication path redundancy by means of a multipath routing approach. Experiments show that our method allows applications to complete their execution in the presence of several faults, with an average performance of 97% relative to fault-free scenarios.
format Conference object
author Zarza, Gonzalo
Lugones, Diego
Franco, Daniel
Luque Fadón, Emilio
author_sort Zarza, Gonzalo
title A multipath routing method for tolerating permanent and non-permanent faults
publishDate 2009
url http://sedici.unlp.edu.ar/handle/10915/20912
bdutipo_str Repositories
_version_ 1764820465095802881