CSE5304—Project Proposal Parallel Matrix Multiplication

CSE5304—Project Proposal

Parallel Matrix Multiplication

Tian Mi

An naive version with MPI

Result:

An naive version with MPI

Processor0 reads input fileProcessor0 distributes one matrixProcessor0 broadcasts the other matrixAll processors in parallel

Do the multiplication of each piece of data

Processor0 gathers the resultProcessor0 writes result to output file

MPI_Scatter

MPI_Bcast

MPI_Gather

Data generation

Data generation in R with package “igraph”

Integer in range of [-1000, 1000]Matrix size:

Matrix 512*512 1024*1024 2048*2048 4096*4096

File size 2.69 MB 10.7 MB 43.1 MB 172 MB

Result

Data size: 1024*1024# Processors Experiments(second) Average(s) Speedup

1 44 41 45 37 42 41.8 1

2 23 20 21 19 22 21 1.99

4 11 10 19 18 16 14.8 2.82

8 10 9 8 9 10 9.2 4.54

16 9 9 11 9 6 8.8 4.75

32 8 10 8 7 7 8 5.23

64 8 8 8 8 8 8 5.23

128 10 9 6 8 9 8.4 4.98

Result

Data size: 1024*1024

1015202530354045

1 2 4 8 16 32 64 128

# processors

Result

1 2 4 8 16 32 64 128

# processors

Result

# Processors Time(s) Speedup

1 751 1

2 498 1.508032

4 258 2.910853

8 127 5.913386

16 84 8.940476

32 51 14.72549

64 55 13.65455

128 48 15.64583

Result

0100200300400

500600700800

1 2 4 8 16 32 64 128

# processors

Result

1012141618

1 2 4 8 16 32 64 128

# processors

Result

# Processors Time(s) Speedup

1 5920 1

2 3630 1.630854

4 2813 2.104515

8 925 6.4

16 745 7.946309

32 576 10.27778

64 #DIV/0!

128 #DIV/0!

Analysis

To see the superlinear speedup increase the computation, which is not dominan

t enough larger matrix and larger integer

However, larger matrix or long integer will also increase the communication time (broadcast, scatter, gather)

Cannon's algorithm--Example

http://www.vampire.vanderbilt.edu/education-outreach/me343_fall2008/notes/parallelMM_10_09.pdf

Cannon's algorithm

Still Implementing and debuggingNo result to share at present

Thank you

Questions & Comments?

CSE5304—Project Proposal Parallel Matrix Multiplication

Documents

Transcript of CSE5304—Project Proposal Parallel Matrix Multiplication

Communication-Avoiding Parallel Recursive …...Communication-Avoiding Parallel Recursive Algorithms for Matrix Multiplication Benjamin Lipshitz Electrical Engineering and Computer

Chapter 7-Matrix Multiplication from the book Parallel Computing by Michael J. Quinn

Parallel CREW matrix multiplication · Parallel CREW matrix multiplication Contents I Reminder: Array total on EREW-PRAM I Reminder: How to multiply matrices I CREW matrix vector

7. Parallel Methods for Matrix-Vector Multiplication. Parallel Methods for Matrix-Vector Multiplication 7. Parallel Methods for Matrix-Vector Multiplication 1 7.1. Introduction ...

Red-Blue Pebbling Revisited: Near Optimal Parallel Matrix ... · Red-Blue Pebbling Revisited: Near Optimal Parallel Matrix-Matrix Multiplication Technical Report Grzegorz Kwasniewski1,

Parallel Programming Parallel Matrix Multiplication klauserc/FS10/PP

CS 140 : Matrix multiplication Linear algebra problems Matrix multiplication I : cache issues Matrix multiplication II: parallel issues Thanks to Jim Demmel.

PARALLEL MATRIX MULTIPLICATION: A SYSTEMATIC ......matrix multiplication algorithms. The journey starts with a description of how matrices are dis-tributed to meshes of nodes (e.g.,

Parallel Methods for Matrix Multiplication

PARALLEL COMPUTING OF MATRIX MULTIPLICATION IN OPEN … · 2019-11-26 · A matrix multiplication program in Open MP has been used for it [4]. In some research papers, matrix multiplication

Matrix-Matrix Multiplication › users › flame › LAFF › Notes › Week5.pdfWeek 5. Matrix-Matrix Multiplication 164 Is matrix-matrix multiplication associative? Homework 5.2.2.1

CS 240A : Matrix multiplication Matrix multiplication I : parallel issues Matrix multiplication II: cache issues Thanks to Jim Demmel and Kathy Yelick.

Parallel Algorithms for Sparse Matrix Multiplication and ...xh102/pods057.pdf · In this paper, we design massively parallel algorithms for sparse ma-trix multiplication, as well

Matrices multiplication using MPI · 2020-02-11 · Ideas to Parallel Matrix Multiplication: • 1 single task: Multiplication of 1 row in matrix A to 1 column in matrix B • A x

Parallel Methods for Matrix-Vector Multiplication

Lecture 5: Parallel Matrix Algorithms (part 3)zxu2/acms60212-40212/Lec-06-3.pdf · Algorithms (part 3) 1 . A Simple Parallel Dense Matrix-Matrix Multiplication Let =[ ] × and =[

Lab 1: Parallel Algorithms of Matrix-Vector Multiplication

PARALLEL MATRIX MULTIPLICATION: A SYSTEMATIC JOURNEY · tributed to meshes of nodes (e.g., MPI processes), relates these distributions to scalable parallel implementation of matrix-vector

Communication-optimal parallel 2.5D matrix multiplication ...€¦ · Communication-optimal parallel 2.5D matrix multiplication and LU factorization algorithms Edgar Solomonik and

Parallel Matrix Multiplication - Cannon's Algorithm and 2 ...