Strategies to optimize the LU factorization algorithm on multicore computers

The number of cores in multicore computers has an irreversible tendency to increase. Also, computers with multiple sockets to insert multicore chips are based on a complex hardware design and are becoming more common. To parallelize the algorithms that run on this type of computers in order to obtai...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Soler, Janet, Ortiz, Javier, Wolfmann, Aaron Gustavo
Formato: conferenceObject
Lenguaje:Inglés
Publicado: 2022
Materias:
Acceso en línea:http://hdl.handle.net/11086/28443
Aporte de:
Descripción
Sumario:The number of cores in multicore computers has an irreversible tendency to increase. Also, computers with multiple sockets to insert multicore chips are based on a complex hardware design and are becoming more common. To parallelize the algorithms that run on this type of computers in order to obtain a higher performance rate, is a goal that can only be achieved by taking into account hardware architecture. As hardware evolves, so must software. This leads to old parallelization strategies quickly become obsolete. This paper presents a series of alternatives for parallelization the LU factorization algorithm and its results intended to running on a multicore system. Simple strategies lead to poor results. This study presents complex strategies that merge double levels of parallelism with asynchronous scheduling whose results reach up to the State-of-the-art in the field and even go further.