A tool for detecting transient faults in execution of parallel scientific applications on multicore clusters
Transient faults are becoming a critical concern among current trends of design of general-purpose multiprocessors. Because of their capability to corrupt programs outputs, their impact gains importance when considering long duration, parallel scientific applications, due to the high cost of relaunc...
Guardado en:
| Autores principales: | Montezanti, Diego Miguel, Rucci, Enzo, Rexachs del Rosario, Dolores, Luque Fadón, Emilio, Naiouf, Marcelo, De Giusti, Armando Eduardo |
|---|---|
| Formato: | Objeto de conferencia |
| Lenguaje: | Inglés |
| Publicado: |
2013
|
| Materias: | |
| Acceso en línea: | http://sedici.unlp.edu.ar/handle/10915/31729 |
| Aporte de: |
Ejemplares similares
-
A tool for detecting transient faults in execution of parallel scientific applications on multicore clusters
por: Montezanti, Diego Miguel, et al.
Publicado: (2014) -
SMCV: a Methodology for Detecting Transient Faults in Multicore Clusters
por: Montezanti, Diego Miguel, et al.
Publicado: (2012) -
Characterizing a Detection Strategy for Transient Faults in HPC
por: Montezanti, Diego Miguel, et al.
Publicado: (2016) -
Fault Tolerance in Multicore Clusters. Techniques to Balance Performance and
Dependability
por: Meyer, Hugo
Publicado: (2016) -
Managing receiver-based message logging overheads in parallel applications
por: Meyer, Hugo, et al.
Publicado: (2013)