n b P0 P1 P2 P3 m - University of Minnesotakumar/papers/sparse-2D-trisolve.pdf4 6 5 18 3 3 3 3 4 10...
Transcript of n b P0 P1 P2 P3 m - University of Minnesotakumar/papers/sparse-2D-trisolve.pdf4 6 5 18 3 3 3 3 4 10...
n
m
b
1
22
22
23
24
24
23
22
22
23
21
21
20
20
20
20
19
19
2
3
4
5
6
7
8
9
11
12
13
14
15
16
17
18
19
17
3
4
4
5
5
5
6
6
6
7
7
7
8
8
9
9
10
9
10
11
12
12
12
13
13
13
14
14
15
15
18
18
18
17
16
15
11
11
2 4
17
16
19
21
14
10
8
3
P0 P1
P2 P3v
h blks
blks
(c)
solutionflow
updateflow
(a)
solution
flow
flow
update
(b)
0123456789
101112131415
1718
16
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18
6
7
8
15
16
17
0 1 3 4 9 10 12 13
2 5 11 14
18: Supernode
: Node
: Subtree
4
65
18
33334
10111516
5555
10
12141718
66666666
12
358
18
22223
1267
11111
0
78
20000
13141518
777713
9111617
4444
9
22
03
576
7
01818
0P 1P 2P 3P 4P 5P 6P
0 1P 6 7P
T : Dense Supernodal Matrix
4 5P2 3P
7P
8
7
6
2
3 9
16
17
15
18
5 11 14
0
T T TTTT T T
02 3
10 00 01
678
186 7 8
T
575
4
46 7
5
15161718
15 16 17
T
0
00
678
3333
678
18
555
151617
111
222
333
444
555
666
777
000
267
58
1865
18
111617
111516
141718
78
2 141518
0
U UU
U U U U U U UU
U U
018 518
0 1 3 4 5 6
0 35
2
7
7
7 2
32 54 6701
0
05
6666
15161718
UT
6666
15161718
14
14
600
00
2678
2
T33333
5678
185
T
555
11
511151617
T
T
02
13P
4 576
and the computed solution for index k:
kpk
U : Update vector
processor p owns the right-hand side
P : Subcube as the Logical Grid
6 7
4 5P
0 1
2 3P
12 131 4 10
0
2
4
6
8
10
0 20 40 60 80 100 120 140
Tim
e (
sec.)
Number of Processors
bcsstk15bcsstk30
copter2hsct2
144pf3D
0
0.5
1
1.5
2
0 20 40 60 80 100 120 140
Tim
e (
sec.)
Number of Processors
bcsstk15bcsstk30
copter2hsct2
144pf3D
(a) (b)
(a) (b)
0
200
400
600
800
1000
1200
1400
1600
0 20 40 60 80 100 120 140
Perf
orm
ance (
MF
LO
PS
)
Number of Processors
bcsstk15bcsstk30
copter2hsct2
144pf3D
0
500
1000
1500
2000
2500
3000
3500
4000
4500
0 20 40 60 80 100 120 140
Perf
orm
ance (
MF
LO
PS
)
Number of Processors
bcsstk15bcsstk30
copter2hsct2
144pf3D