Improving the performance of matrix inversion with a Tesla GPU

We study two different techniques for the computation of a matrix inverse, the traditional approach based on Gaussian factorization and the Gauss-Jordan elimination alternative more suitable for parallel architectures. The target architecture is a current general-purpose multi-core processor (CPU) c...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Ezzatti, Pablo, Quintana Ortí, Enrique S., Remón, Alfredo
Formato: Objeto de conferencia
Lenguaje:Inglés
Publicado: 2010
Materias:
GPU
CPU
Acceso en línea:http://sedici.unlp.edu.ar/handle/10915/152637
http://39jaiio.sadio.org.ar/sites/default/files/39jaiio-hpc-03.pdf
Aporte de:
Descripción
Sumario:We study two different techniques for the computation of a matrix inverse, the traditional approach based on Gaussian factorization and the Gauss-Jordan elimination alternative more suitable for parallel architectures. The target architecture is a current general-purpose multi-core processor (CPU) connected to a graphics processor (GPU). Parallelism is obtained from the use of libraries MKL (for the CPU) and CUBLAS (for the GPU), as well as, performing simultaneously operations in both architectures. Numerical experiments performed on a system equipped with two Intel QuadCore processors and a Tesla C1060 GPU, illustrate the efficiency attained by the Gauss-Jordan elimination implementation.