A tool for detecting transient faults in execution of parallel scientific applications on multicore clusters
Transient faults are becoming a critical concern among current trends of design of generalpurpose multiprocessors. Because of their capability to corrupt programs outputs, their impact gains importance when considering long duration, parallel scientific applications, due to the high cost of re-launc...
Guardado en:
| Autores principales: | Montezanti, Diego Miguel, Rucci, Enzo, Rexachs del Rosario, Dolores, Luque Fadón, Emilio, Naiouf, Marcelo, De Giusti, Armando Eduardo |
|---|---|
| Formato: | Articulo |
| Lenguaje: | Inglés |
| Publicado: |
2014
|
| Materias: | |
| Acceso en línea: | http://sedici.unlp.edu.ar/handle/10915/34544 http://journal.info.unlp.edu.ar/wp-content/uploads/JCST-Apr14-5.pdf |
| Aporte de: |
Ejemplares similares
-
A tool for detecting transient faults in execution of parallel scientific applications on multicore clusters
por: Montezanti, Diego Miguel, et al.
Publicado: (2013) -
SMCV: a Methodology for Detecting Transient Faults in Multicore Clusters
por: Montezanti, Diego Miguel, et al.
Publicado: (2012) -
Characterizing a Detection Strategy for Transient Faults in HPC
por: Montezanti, Diego Miguel, et al.
Publicado: (2016) -
Fault Tolerance in Multicore Clusters. Techniques to Balance Performance and
Dependability
por: Meyer, Hugo
Publicado: (2016) -
Managing receiver-based message logging overheads in parallel applications
por: Meyer, Hugo, et al.
Publicado: (2013)