n b P0 P1 P2 P3 m - University of Minnesotakumar/papers/sparse-2D-trisolve.pdf4 6 5 18 3 3 3 3 4 10...

7

Transcript of n b P0 P1 P2 P3 m - University of Minnesotakumar/papers/sparse-2D-trisolve.pdf4 6 5 18 3 3 3 3 4 10...

Page 1: n b P0 P1 P2 P3 m - University of Minnesotakumar/papers/sparse-2D-trisolve.pdf4 6 5 18 3 3 3 3 4 10 11 15 16 5 5 5 5 10 12 14 17 18 6 6 6 6 12 3 5 8 18 2 2 2 2 3 1 2 6 7 1 1 1 1 1
Page 2: n b P0 P1 P2 P3 m - University of Minnesotakumar/papers/sparse-2D-trisolve.pdf4 6 5 18 3 3 3 3 4 10 11 15 16 5 5 5 5 10 12 14 17 18 6 6 6 6 12 3 5 8 18 2 2 2 2 3 1 2 6 7 1 1 1 1 1

n

m

b

1

22

22

23

24

24

23

22

22

23

21

21

20

20

20

20

19

19

2

3

4

5

6

7

8

9

11

12

13

14

15

16

17

18

19

17

3

4

4

5

5

5

6

6

6

7

7

7

8

8

9

9

10

9

10

11

12

12

12

13

13

13

14

14

15

15

18

18

18

17

16

15

11

11

2 4

17

16

19

21

14

10

8

3

P0 P1

P2 P3v

h blks

blks

(c)

solutionflow

updateflow

(a)

solution

flow

flow

update

(b)

Page 3: n b P0 P1 P2 P3 m - University of Minnesotakumar/papers/sparse-2D-trisolve.pdf4 6 5 18 3 3 3 3 4 10 11 15 16 5 5 5 5 10 12 14 17 18 6 6 6 6 12 3 5 8 18 2 2 2 2 3 1 2 6 7 1 1 1 1 1

0123456789

101112131415

1718

16

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18

6

7

8

15

16

17

0 1 3 4 9 10 12 13

2 5 11 14

18: Supernode

: Node

: Subtree

Page 4: n b P0 P1 P2 P3 m - University of Minnesotakumar/papers/sparse-2D-trisolve.pdf4 6 5 18 3 3 3 3 4 10 11 15 16 5 5 5 5 10 12 14 17 18 6 6 6 6 12 3 5 8 18 2 2 2 2 3 1 2 6 7 1 1 1 1 1
Page 5: n b P0 P1 P2 P3 m - University of Minnesotakumar/papers/sparse-2D-trisolve.pdf4 6 5 18 3 3 3 3 4 10 11 15 16 5 5 5 5 10 12 14 17 18 6 6 6 6 12 3 5 8 18 2 2 2 2 3 1 2 6 7 1 1 1 1 1

4

65

18

33334

10111516

5555

10

12141718

66666666

12

358

18

22223

1267

11111

0

78

20000

13141518

777713

9111617

4444

9

22

03

576

7

01818

0P 1P 2P 3P 4P 5P 6P

0 1P 6 7P

T : Dense Supernodal Matrix

4 5P2 3P

7P

8

7

6

2

3 9

16

17

15

18

5 11 14

0

T T TTTT T T

02 3

10 00 01

678

186 7 8

T

575

4

46 7

5

15161718

15 16 17

T

0

00

678

3333

678

18

555

151617

111

222

333

444

555

666

777

000

267

58

1865

18

111617

111516

141718

78

2 141518

0

U UU

U U U U U U UU

U U

018 518

0 1 3 4 5 6

0 35

2

7

7

7 2

32 54 6701

0

05

6666

15161718

UT

6666

15161718

14

14

600

00

2678

2

T33333

5678

185

T

555

11

511151617

T

T

02

13P

4 576

and the computed solution for index k:

kpk

U : Update vector

processor p owns the right-hand side

P : Subcube as the Logical Grid

6 7

4 5P

0 1

2 3P

12 131 4 10

Page 6: n b P0 P1 P2 P3 m - University of Minnesotakumar/papers/sparse-2D-trisolve.pdf4 6 5 18 3 3 3 3 4 10 11 15 16 5 5 5 5 10 12 14 17 18 6 6 6 6 12 3 5 8 18 2 2 2 2 3 1 2 6 7 1 1 1 1 1
Page 7: n b P0 P1 P2 P3 m - University of Minnesotakumar/papers/sparse-2D-trisolve.pdf4 6 5 18 3 3 3 3 4 10 11 15 16 5 5 5 5 10 12 14 17 18 6 6 6 6 12 3 5 8 18 2 2 2 2 3 1 2 6 7 1 1 1 1 1

0

2

4

6

8

10

0 20 40 60 80 100 120 140

Tim

e (

sec.)

Number of Processors

bcsstk15bcsstk30

copter2hsct2

144pf3D

0

0.5

1

1.5

2

0 20 40 60 80 100 120 140

Tim

e (

sec.)

Number of Processors

bcsstk15bcsstk30

copter2hsct2

144pf3D

(a) (b)

(a) (b)

0

200

400

600

800

1000

1200

1400

1600

0 20 40 60 80 100 120 140

Perf

orm

ance (

MF

LO

PS

)

Number of Processors

bcsstk15bcsstk30

copter2hsct2

144pf3D

0

500

1000

1500

2000

2500

3000

3500

4000

4500

0 20 40 60 80 100 120 140

Perf

orm

ance (

MF

LO

PS

)

Number of Processors

bcsstk15bcsstk30

copter2hsct2

144pf3D