A multipath routing method for tolerating permanent and non-permanent faults
The intensive and continuous use of high-performance computers for executing computationally intensive applications, coupled with the large number of elements that make them up, dramatically increase the likelihood of failures during their operation. The interconnection network is a critical part o...
Autores principales: | , , , |
---|---|
Formato: | Objeto de conferencia |
Lenguaje: | Inglés |
Publicado: |
2009
|
Materias: | |
Acceso en línea: | http://sedici.unlp.edu.ar/handle/10915/20912 |
Aporte de: |
id |
I19-R120-10915-20912 |
---|---|
record_format |
dspace |
institution |
Universidad Nacional de La Plata |
institution_str |
I-19 |
repository_str |
R-120 |
collection |
SEDICI (UNLP) |
language |
Inglés |
topic |
Ciencias Informáticas high-performance computers high-speed interconnection fault-tolerant routing method Communications Applications Reliability, Testing, and Fault-Tolerance |
spellingShingle |
Ciencias Informáticas high-performance computers high-speed interconnection fault-tolerant routing method Communications Applications Reliability, Testing, and Fault-Tolerance Zarza, Gonzalo Lugones, Diego Franco, Daniel Luque Fadón, Emilio A multipath routing method for tolerating permanent and non-permanent faults |
topic_facet |
Ciencias Informáticas high-performance computers high-speed interconnection fault-tolerant routing method Communications Applications Reliability, Testing, and Fault-Tolerance |
description |
The intensive and continuous use of high-performance computers for executing computationally intensive applications, coupled with the large number of elements that make them up, dramatically increase the likelihood of failures during their operation.
The interconnection network is a critical part of such systems, therefore, network faults have an extremely high impact because most routing algorithms are not designed to tolerate faults. In such algorithms, just a single fault may stall messages in the network, preventing the finalization of applications, or may lead to deadlocked confi gurations.
This work focuses on the problem of fault tolerance for high-speed interconnection networks by designing a fault-tolerant routing method to solve an unbounded number of dynamic faults (permanent and non- permanent). To accomplish this task we take advantage of the communication path redundancy, by means of a multipath routing approach.
Experiments show that our method allows applications to finalize their execution in the presence of several number of faults, with an average performance value of 97% compared to the fault-free scenarios. |
format |
Objeto de conferencia Objeto de conferencia |
author |
Zarza, Gonzalo Lugones, Diego Franco, Daniel Luque Fadón, Emilio |
author_facet |
Zarza, Gonzalo Lugones, Diego Franco, Daniel Luque Fadón, Emilio |
author_sort |
Zarza, Gonzalo |
title |
A multipath routing method for tolerating permanent and non-permanent faults |
title_short |
A multipath routing method for tolerating permanent and non-permanent faults |
title_full |
A multipath routing method for tolerating permanent and non-permanent faults |
title_fullStr |
A multipath routing method for tolerating permanent and non-permanent faults |
title_full_unstemmed |
A multipath routing method for tolerating permanent and non-permanent faults |
title_sort |
multipath routing method for tolerating permanent and non-permanent faults |
publishDate |
2009 |
url |
http://sedici.unlp.edu.ar/handle/10915/20912 |
work_keys_str_mv |
AT zarzagonzalo amultipathroutingmethodfortoleratingpermanentandnonpermanentfaults AT lugonesdiego amultipathroutingmethodfortoleratingpermanentandnonpermanentfaults AT francodaniel amultipathroutingmethodfortoleratingpermanentandnonpermanentfaults AT luquefadonemilio amultipathroutingmethodfortoleratingpermanentandnonpermanentfaults AT zarzagonzalo multipathroutingmethodfortoleratingpermanentandnonpermanentfaults AT lugonesdiego multipathroutingmethodfortoleratingpermanentandnonpermanentfaults AT francodaniel multipathroutingmethodfortoleratingpermanentandnonpermanentfaults AT luquefadonemilio multipathroutingmethodfortoleratingpermanentandnonpermanentfaults |
bdutipo_str |
Repositorios |
_version_ |
1764820465095802881 |