Parallel Sparse Matrix Algorithms for Numerical Computing: Matrix-Vector Multiplication

Introduction

• Matrix computation is an important part of numerical computing.

• Sparse matrices play a central role in matrix computation.

• Sparse matrices arise in many kinds of computation.

Outline

• Importance of Parallel Sparse Matrix Computing

• Introduction to Sparse Matrix

• Introduction to Matrix-Vector Multiplication

• How can parallel technology be used to solve it?

• Conclusion

Sparse Matrix

• Concept

• Sparse Matrix Storage/Save

Concept

• In the mathematical subfield of numerical analysis, a sparse matrix is a matrix populated primarily with zeros (Stoer & Bulirsch 2002, p. 619).

• Let A[m*n] be a matrix with m rows and n columns, and let NZ be the number of its nonzero elements.

• If NZ << m*n, then A is a sparse matrix.
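The NZ << m*n condition can be checked by counting nonzero entries. The following is a minimal sketch in C, not taken from the slides: the function names `count_nonzero` and `density` are hypothetical, and the matrix is assumed to be stored row by row in one flat array. The slides give no numeric threshold for "<<", so the sketch only reports the ratio NZ / (m*n) and leaves the decision to the caller.

```c
#include <assert.h>
#include <stddef.h>

/* Count the nonzero entries (NZ) of an m-by-n matrix stored row by row.
   Assumption: flat row-major storage, chosen here for brevity. */
size_t count_nonzero(const int *a, size_t m, size_t n)
{
    size_t nz = 0;
    for (size_t k = 0; k < m * n; k++) {
        if (a[k] != 0) {
            nz++;
        }
    }
    return nz;
}

/* Fraction of entries that are nonzero: NZ / (m * n).
   A value much smaller than 1 indicates a sparse matrix. */
double density(const int *a, size_t m, size_t n)
{
    assert(m > 0 && n > 0);
    return (double)count_nonzero(a, m, n) / (double)(m * n);
}
```

Calling `density(a, m, n)` on a matrix that is mostly zeros returns a value close to 0.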

Sparse Matrix Storage/Save

• A dedicated data structure is used to store the sparse matrix.

• The new data structure is easy to build from the traditional data structure.

• It needs less storage space.

• The address of an element can be found quickly.

Example

• Matrix A[4*4] with elements:

    0, 0, 0, 2
    1, 0, 0, 6
    0, 1, 0, 0
    0, 0, 0, 0

Space: 4 x 4 x B = 16B, where B is the storage size of one element.

The new structure stores only the nonzero elements, one (row, column, value) triplet per line:

    0, 3, 2
    1, 0, 1
    1, 3, 6
    2, 1, 1

Space: 3 x 4 x B = 12B
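The space comparison generalizes to any m-by-n matrix with NZ nonzero elements. The following is a short derivation, assuming each row index, column index, and value occupies the same B bytes as one dense element (the symbols S_dense and S_triplet are introduced here for illustration):

```latex
S_{\text{dense}} = m \cdot n \cdot B, \qquad
S_{\text{triplet}} = 3 \cdot NZ \cdot B, \qquad
S_{\text{triplet}} < S_{\text{dense}} \iff NZ < \frac{m \cdot n}{3}
```

For the 4 x 4 example, NZ = 4 and m*n/3 is about 5.33, so the triplet form is smaller (12B versus 16B). With six or more nonzero elements it would no longer save space.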

Matrix-vector multiplication

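• Matrix-vector multiplication is defined by: $y_i = \sum_{j} A_{ij} X_j$

• where $A_{ij}$ is the element in row i and column j of the matrix,

• and $X_j$ is element j of the vector, which has one element per matrix column.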
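For a matrix stored as (row, column, value) triplets, the product only needs one pass over the nonzero elements. The following is a minimal sketch in C under that assumption. The name `sparse_mult` and its signature are hypothetical and are not the author's `SparseMult` routine, whose body the slides do not show.

```c
#include <assert.h>
#include <stddef.h>

/* Compute y = A * x, where A is given as nz (row, column, value) triplets
   stored back to back in one flat int array (3 ints per nonzero element).
   rows is the number of rows of A, i.e. the length of y. */
void sparse_mult(const int *triplets, size_t nz,
                 const int *x, int *y, size_t rows)
{
    assert(triplets != NULL && x != NULL && y != NULL);

    /* Rows without any nonzero element must still produce 0. */
    for (size_t i = 0; i < rows; i++) {
        y[i] = 0;
    }

    /* Each triplet (i, j, v) contributes v * x[j] to y[i]. */
    for (size_t k = 0; k < nz; k++) {
        int i = triplets[3 * k];
        int j = triplets[3 * k + 1];
        int v = triplets[3 * k + 2];
        y[i] += v * x[j];
    }
}
```

The work is proportional to the number of nonzero elements rather than to m*n, which is the reason for converting the matrix before multiplying.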

Parallel Method

• Produce Matrix

• Produce Vector

• Transform Matrix to Sparse Matrix

• Broadcast Vector to each slave processor

• Partition Sparse Matrix

• Send each buffer to each slave processor

• Each slave does Matrix-Vector Multiplication

• Send result to master

• Done

Data structures and input parameters

User input matrix: 2D array

User input vector: 1D array

Usage: exefilename <matrix height> <matrix width> <vector size> <nonzero element number>

Produce Matrix

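    #include <stdlib.h>

    /* Fill _mt (_row x _col) with random values so that _zero entries are zero. */
    void producematrix(int **_mt, int _row, int _col, int _zero) {
        int residualelements = _row * _col;  /* cells not yet filled */
        int residualzero = _zero;            /* zeros still to be placed */
        for (int i = 0; i < _row; i++) {
            for (int j = 0; j < _col; j++) {
                /* place a zero with probability residualzero / residualelements */
                if (rand() % residualelements < residualzero) {
                    _mt[i][j] = 0;
                    residualzero--;
                } else {
                    _mt[i][j] = rand() % 49 + 1;  /* random nonzero value in 1..49 */
                }
                residualelements--;
            }
        }
    }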

Transform Matrix to Sparse Matrix

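    /* Pack the nonzero entries of _mt into a flat array of
       (row, column, value) triplets; the caller frees the result. */
    int *producesparse1d(int **_mt, int _mtrow, int _mtcol, int _nonzero) {
        int *tempsp = (int *)malloc(sizeof(int) * _nonzero * 3);
        int m = 0;  /* next free slot in tempsp */
        if (tempsp == NULL) {
            return NULL;
        }
        for (int i = 0; i < _mtrow; i++) {
            for (int j = 0; j < _mtcol; j++) {
                /* the bound check guards against more nonzeros than _nonzero */
                if (_mt[i][j] != 0 && m + 2 < _nonzero * 3) {
                    tempsp[m] = i;
                    tempsp[m + 1] = j;
                    tempsp[m + 2] = _mt[i][j];
                    m = m + 3;
                }
            }
        }
        return tempsp;
    }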

Broadcast Vector to each slave processor

Partition Sparse Matrix

Send each buffer to each slave processor

Parallel logic

    /* outline only: MPI call arguments are abbreviated */
    MPI_Status stat;
    /* regular MPI setup */
    MPI_Init(&argc, &argv);
    MPI_Comm_size(MPI_COMM_WORLD, &numprocs);
    MPI_Comm_rank(MPI_COMM_WORLD, &myid);

    MPI_Bcast(vect);                  //broadcast vector to each slave

    if (myid == 0) {                  //master
        MPI_Send(sendbuffer);         //send each buffer to each slave
        MPI_Recv(slaveresult);        //receive result from each slave
    } else {                          //slave
        MPI_Recv(recvbuffer);         //receive buffer from master
        SparseMult(recvbuffer, vect); //multiplication
        MPI_Send(slaveresult);        //send result to master
    }

    MPI_Finalize();
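The partition-and-combine data flow can be checked without an MPI installation by simulating master and slaves in one process. The following is a minimal sketch in C and makes no MPI calls. It assumes the triplet array is cut into contiguous chunks of nearly equal size and that the master sums the partial result vectors; the slides do not show how the author partitions the matrix or combines the results. All function names are hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Index of the first triplet owned by worker w when nz triplets are
   split into nworkers contiguous chunks; w == nworkers gives the end. */
size_t chunk_begin(size_t nz, size_t nworkers, size_t w)
{
    assert(nworkers > 0 && w <= nworkers);
    return (nz * w) / nworkers;
}

/* Work of one slave: multiply its own triplets with x into partial. */
static void partial_mult(const int *triplets, size_t count,
                         const int *x, int *partial, size_t rows)
{
    for (size_t i = 0; i < rows; i++) {
        partial[i] = 0;
    }
    for (size_t k = 0; k < count; k++) {
        partial[triplets[3 * k]] += triplets[3 * k + 2] * x[triplets[3 * k + 1]];
    }
}

/* Single-process stand-in for master plus nworkers slaves.
   Returns 0 on success, -1 if the scratch buffer cannot be allocated. */
int simulate_parallel_mult(const int *triplets, size_t nz, const int *x,
                           int *y, size_t rows, size_t nworkers)
{
    assert(rows > 0 && nworkers > 0);
    int *partial = malloc(rows * sizeof *partial);
    if (partial == NULL) {
        return -1;
    }
    for (size_t i = 0; i < rows; i++) {
        y[i] = 0;
    }
    for (size_t w = 0; w < nworkers; w++) {
        size_t begin = chunk_begin(nz, nworkers, w);
        size_t end = chunk_begin(nz, nworkers, w + 1);
        /* "send" the chunk, let the slave multiply, "receive" its result */
        partial_mult(triplets + 3 * begin, end - begin, x, partial, rows);
        for (size_t i = 0; i < rows; i++) {
            y[i] += partial[i];  /* master accumulates partial results */
        }
    }
    free(partial);
    return 0;
}
```

Summing partial vectors stays correct even when a chunk boundary falls in the middle of a matrix row, at the cost of every slave returning a full-length result vector.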

Results

Conclusions & Analysis

• Communication between the processors takes up more of the run time than the computation itself.

• The imbalance between communication and computation is the bottleneck.

Bibliography

• [1] Blaise, B. (2009). Lawrence Livermore National Laboratory. Retrieved May 2009 from the World Wide Web: https://computing.llnl.gov/tutorials/parallel_comp/

• [2] Bruce Hendrickson, Robert Leland and Steve Plimpton, An Efficient Parallel Algorithm for Matrix-Vector Multiplication, Sandia National Laboratories, Albuquerque, NM 87185

• [3] Stoer, Josef; Bulirsch, Roland (2002), Introduction to Numerical Analysis (3rd ed.), Berlin, New York: Springer-Verlag, ISBN 978-0-387-95452-3.

• [4] L.M. Romero and E.L. Zapata, Data Distributions for Sparse Matrix Vector Multiplication, University of Malaga, J. Parallel Computing, vol. 21, no. 4, April 1999

• [5] Martin Johnson, Numerical Algorithm, Lecture notes in Parallel Computing, IIMS Massey University at Albany, Auckland, New Zealand. 2009

• [6] Message Passing Interface http://en.wikipedia.org/wiki/Message_Passing_Interface

• [7] R. Raz. On the complexity of matrix product. SIAM Journal on Computing, 32:1356-1369, 2003

• [8] SPARSE MATRIX http://en.wikipedia.org/wiki/Sparse_matrix

• [9] SPARSE MATRIX http://baike.baidu.com/view/891721.htm

• [10] V. Pan. How to multiply matrices faster. Lecture Notes in Computer Science, volume 179. Springer-Verlag, 1985

• [11] Wang Shun and Wang Xiao Ge, Parallel Algorithm for Matrix-Vector Multiplication, Tsinghua University Library, CHINA

Questions
