IJPAM: Volume 22, No. 2 (2005)


M.R. Guarracino$^1$, F. Perla$^2$, P. Zanetti$^3$
$^1$Institute for High Performance Computing and Networking - ICAR-CNR
Italian National Research Council
Via P. Castellino, Naples, 111 - 80131, ITALY
e-mail: mario.guarracino@icar.cnr.it
$^{2,3}$ University of Naples Parthenope
Via Medina, Naples, 40 - 80133, ITALY
$^2$e-mail: francesca.perla@uniparthenope.it
$^3$e-mail: paolo.zanetti@uniparthenope.it

Abstract.The aim of this paper is to show an effective reorganization of the nonsymmetric block lanczos algorithm efficient, portable and scalable for multiple instructions multiple data (MIMD) distributed memory message passing architectures. Basic operations implemented here are matrix-matrix multiplications, eventually with a transposed and a sparse factor, LU factorisation and triangular systems solution. Since the communication overhead of the algorithm inhibits an efficient parallel implementation, we propose a reorganization of the block algorithm which reduces the total amount of communication involved in linear algebra operations. Then, we develop an efficient parallelization of the matrix-matrix multiplication when one of the factor is sparse. Some other linear algebra operations are performed using ScaLAPACK library. The parallel eigensolver has been tested on a cluster of PCs. All reported results show the proposed algorithm is efficient and scalable on the target architectures for problems of adequate dimension, and it can be the computational kernel of a robust software for large sparse eigenvalue problems.

Received: June 28, 2005

AMS Subject Classification: 65F15, 65F50, 68W10

Key Words and Phrases: nonsymmetric eigensolver, parallel block Lanczos algorithm, distributed memory architectures

Source: International Journal of Pure and Applied Mathematics
ISSN: 1311-8080
Year: 2005
Volume: 22
Issue: 2