OpenFOAM-2.3.0の 全チュートアリアル・ベンチマーク
-
Upload
masashi-imano -
Category
Engineering
-
view
1.204 -
download
5
description
Transcript of OpenFOAM-2.3.0の 全チュートアリアル・ベンチマーク
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)1
OpenFOAM-2.3.0の 全チュートアリアル・ベンチマーク
今野 雅(株式会社OCAEL)
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)2
OpenFOAMのベンチマークテスト
✓コンパイラ種別(intel/gnu)やコンパイラ最適化オプションを変えて,できるだけ多くのケースでベンチマークテストを行い,可能ならば性能を最適化したい.
✓全チュートリアルケースを対象にベンチマークテストを行うのが理想的. ✓全実行(Allrun)は時間かかりすぎ. ✓テスト実行(Alltest)は1ステップ実行のため,時間比較の精度が悪い. ✓24時間以内で終了させるため,Alltestを500ステップ実行するように改造して,ベンチマークテストを行い,計算時間の比較を行なった.
✓ただし,実行時間が莫大な foamyHexMesh/mixerVessel や,実行に失敗するケースは除いた.
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)3
OpenFOAMのビルド
✓対象システム
• Aシステム (Intel Xeon L5640, 2.26GHz) ベクトル化: SSE4.2
• Dシステム (Intel Xeon E5-2670 v2, 2.5GHz) ベクトル化: AVX ✓intelコンパイラ (Ver 14.0.0)
• boost: 標準設定の1_55_0はビルド失敗 → 1_54_0
• 最適化オプション: -O3はfoamyHexMeshが動かない → -O2 ✓gnuコンパイラ
• Ver: 4.8.2はCGALやfoamyHexMeshのビルド失敗 → 4.8.3
• 最適化オプション: -O3に-msse4.2を付けても0.1%の違い→ -O3
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)4
intelコンパイラの最適化オプション検討✓対象システム
• A システム (Intel Xeon L5640 , 2.26GHz) ~SSE4.2
• D システム (Intel Xeon E5-2670 v2, 2.5 GHz) ~SSE4.2, AVX ✓ ケース I1
• 最適化オプション: -axAVX,SSE4.2 -O2 -no-prec-div
• 全てのシステムで動作.DシステムでAVXが有効(-axオプションで切替) ✓ケース I2
• 最適化オプション: -xSSE4.2 -O2 -no-prec-div
• 全てのシステムで動作.DシステムでAVXが有効ではない ✓ケース I3
• 最適化オプション: -xHost -O2 -no-prec-div (OpenFOAMデフォルト)
• Aシステムでは動作しない.DシステムではAVXが有効.
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)
0
1800
3600
5400
7200
0 1800 3600 5400 7200C
PU
tim
e [s]
(C
ase
I3: In
tel -
xHost
)
CPU time [s] (Case I1: Intel -axAVX,SSE4.2)
D system (Intel Xeon E5-2670 v2, 2.5GHz)
1.015 x 0
1800
3600
5400
7200
0 1800 3600 5400 7200
CP
U tim
e [s]
(C
ase
I2: In
tel -
xSS
E4.2
)
CPU time [s] (Case I1: Intel -axAVX,SSE4.2)
D system (Intel Xeon E5-2670 v2, 2.5GHz)
1.020 x
5
計算時間の比較 (Dシステム)
intelコンパイラでベクトル化オプションを変えても計算時間はほぼ変らない
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)
0
1800
3600
5400
7200
0 1800 3600 5400 7200
CP
U t
ime
[s]
(C
ase
I2
: In
tel -
xSS
E4
.2)
CPU time [s] (Case I1: Intel -axAVX,SSE4.2)
A system (Intel Xeon L5640, 2.26GHz)
0.995 x
6
計算時間の比較 (Aシステム)
intelコンパイラでベクトル化オプションを変えても計算時間はほぼ変らない
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)7
intel vs gnu
0
1800
3600
5400
7200
0 1800 3600 5400 7200
CP
U t
ime
[s]
(C
ase
G2
: g
nu
-O
3)
CPU time [s] (Case I1: Intel -axAVX,SSE4.2)
D system (Intel Xeon E5-2670 v2, 2.5GHz)
1.147 x 0
1800
3600
5400
7200
0 1800 3600 5400 7200C
PU
tim
e [
s] (
Ca
se G
2:
gn
u -
O3
)CPU time [s] (Case I1: Intel -axAVX,SSE4.2)
A system (Intel Xeon L5640, 2.26GHz)
1.154 x
intel版に対するgnu版の計算時間の回帰式の係数は1.15で15%の増加
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)8
計算時間の比(Dシステム, intelコンパイラ)
0.85
0.9
0.95
1
1.05
1.1
1.15
DP
MF
oa
m/G
old
schm
idt
LT
SR
ea
cting
Fo
am
/ras/co
un
terF
low
Fla
me
2D
LT
SR
ea
cting
Pa
rcelF
oa
m/co
un
terF
low
Fla
me
2D
LT
SR
ea
cting
Pa
rcelF
oa
m/ve
rticalC
ha
nn
el
MP
PIC
Fo
am
/Go
ldsch
mid
t
MP
PIC
Fo
am
/colu
mn
MP
PIC
Fo
am
/cyclon
e
MP
PIC
Fo
am
/inje
ction
Ch
an
ne
l
MR
FIn
terF
oa
m/m
ixerV
esse
l2D
MR
FM
ultip
ha
seIn
terF
oa
m/m
ixerV
esse
l2D
PD
RF
oa
m/fla
me
Pro
pa
ga
tion
With
Ob
stacle
s
SR
FP
imp
leF
oa
m/ro
tor2
D
SR
FS
imp
leF
oa
m/m
ixer
XiF
oa
m/ra
s/mo
riyosh
iHo
mo
ge
ne
ou
s
XiF
oa
m/ra
s/mo
riyosh
iHo
mo
ge
ne
ou
sHyd
rog
en
XiF
oa
m/ra
s/mo
riyosh
iHo
mo
ge
ne
ou
sPa
rt2
ad
join
tSh
ap
eO
ptim
izatio
nF
oa
m/p
itzDa
ily
bo
un
da
ryFo
am
/bo
un
da
ryLa
un
de
rSh
arm
a
bo
un
da
ryFo
am
/bo
un
da
ryWa
llFu
nctio
ns
bu
oya
ntB
ou
ssine
sqP
imp
leF
oa
m/h
otR
oo
m
bu
oya
ntB
ou
ssine
sqS
imp
leF
oa
m/h
otR
oo
m
bu
oya
ntB
ou
ssine
sqS
imp
leF
oa
m/ig
loo
With
Frid
ge
s
bu
oya
ntP
imp
leF
oa
m/h
otR
oo
m
bu
oya
ntS
imp
leF
oa
m/b
uo
yan
tCa
vity
bu
oya
ntS
imp
leF
oa
m/circu
itBo
ard
Co
olin
g
bu
oya
ntS
imp
leF
oa
m/e
xtern
alC
ou
ple
dC
avity
bu
oya
ntS
imp
leF
oa
m/h
otR
ad
iatio
nR
oo
m
bu
oya
ntS
imp
leF
oa
m/h
otR
ad
iatio
nR
oo
mF
vDO
M
che
mF
oa
m/g
ri
che
mF
oa
m/ic8
h1
8
che
mF
oa
m/n
c7h
16
chtM
ultiR
eg
ion
Fo
am
/mu
ltiRe
gio
nH
ea
ter
chtM
ultiR
eg
ion
Fo
am
/sna
pp
yMu
ltiRe
gio
nH
ea
ter
chtM
ultiR
eg
ion
Sim
ple
Fo
am
/he
atE
xcha
ng
er
chtM
ultiR
eg
ion
Sim
ple
Fo
am
/mu
ltiRe
gio
nH
ea
terR
ad
iatio
n
coa
lCh
em
istryFo
am
/simp
lified
Siw
ek
com
pre
ssible
Mu
ltiph
ase
Inte
rFo
am
/lam
ina
r/da
mB
rea
k4p
ha
se
dn
sFo
am
/bo
xTu
rb1
6
ele
ctrosta
ticFo
am
/cha
rge
dW
ire
fina
ncia
lFo
am
/eu
rop
ea
nC
all
fireF
oa
m/le
s/flam
eS
pre
ad
Wa
terS
up
pre
ssion
Pa
ne
l
fireF
oa
m/le
s/op
po
siteB
urn
ing
Pa
ne
ls
fireF
oa
m/le
s/sma
llPo
olF
ire2
D
fireF
oa
m/le
s/sma
llPo
olF
ire3
D
foa
myH
exM
esh
/blo
b
foa
myH
exM
esh
/flan
ge
foa
myH
exM
esh
/simp
leS
ha
pe
s
icoF
oa
m/ca
vity
icoF
oa
m/ca
vityClip
pe
d
icoF
oa
m/ca
vityFin
e
icoF
oa
m/ca
vityGra
de
icoF
oa
m/ca
vityHig
hR
e
icoF
oa
m/e
lbo
w
CP
U tim
e r
atio
FOCUS D system (Intel Xeon E5-2670 v2, 2.5GHz)
I2/I1, average:1.017I3/I1, average:1.004
I1 Better
I1 Worse
I1 > I2,I1 ≒ I3
I1:-axAVX,SSE4.2 I2:-xSSE4.2 I3:-xHost
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)9
0.85
0.9
0.95
1
1.05
1.1
1.15
icoU
nco
up
led
Kin
em
aticP
arce
lFo
am
/ho
pp
er/h
op
pe
rEm
ptyin
g
icoU
nco
up
led
Kin
em
aticP
arce
lFo
am
/ho
pp
er/h
op
pe
rInitia
lSta
te
inte
rDyM
Fo
am
/ras/d
am
Bre
akW
ithO
bsta
cle
inte
rDyM
Fo
am
/ras/flo
atin
gO
bje
ct
inte
rDyM
Fo
am
/ras/m
ixerV
esse
lAM
I
inte
rDyM
Fo
am
/ras/slo
shin
gT
an
k2D
3D
oF
inte
rDyM
Fo
am
/ras/slo
shin
gT
an
k2D
inte
rDyM
Fo
am
/ras/slo
shin
gT
an
k3D
3D
oF
inte
rDyM
Fo
am
/ras/slo
shin
gT
an
k3D
6D
oF
inte
rDyM
Fo
am
/ras/slo
shin
gT
an
k3D
inte
rDyM
Fo
am
/ras/te
stTu
be
Mixe
r
inte
rFo
am
/lam
ina
r/cap
illaryR
ise
inte
rFo
am
/lam
ina
r/da
mB
rea
k
inte
rFo
am
/lam
ina
r/da
mB
rea
kFin
e
inte
rFo
am
/les/n
ozzle
Flo
w2
D
inte
rFo
am
/ras/d
am
Bre
ak
inte
rFo
am
/ras/d
am
Bre
akF
ine
inte
rFo
am
/ras/d
am
Bre
akP
oro
usB
affle
inte
rFo
am
/ras/w
eirO
verflo
w
inte
rMixin
gF
oa
m/la
min
ar/d
am
Bre
ak
inte
rPh
ase
Ch
an
ge
DyM
Fo
am
/pro
pe
ller
inte
rPh
ase
Ch
an
ge
Fo
am
/cavita
ting
Bu
llet
lap
lacia
nF
oa
m/fla
ng
e
mo
veD
yna
micM
esh
/Sn
ake
Rive
rCa
nyo
n
mu
ltiph
ase
Eu
lerF
oa
m/b
ub
ble
Co
lum
n
mu
ltiph
ase
Eu
lerF
oa
m/d
am
Bre
ak4
ph
ase
mu
ltiph
ase
Eu
lerF
oa
m/d
am
Bre
ak4
ph
ase
Fin
e
mu
ltiph
ase
Eu
lerF
oa
m/m
ixerV
esse
l2D
mu
ltiph
ase
Inte
rFo
am
/lam
ina
r/da
mB
rea
k4p
ha
se
mu
ltiph
ase
Inte
rFo
am
/lam
ina
r/da
mB
rea
k4p
ha
seF
ine
no
nN
ew
ton
ian
IcoF
oa
m/o
ffsetC
ylind
er
pim
ple
DyM
Fo
am
/mixe
rVe
sselA
MI2
D
pim
ple
DyM
Fo
am
/mo
ving
Co
ne
pim
ple
DyM
Fo
am
/oscilla
ting
Inle
tAC
MI2
D
pim
ple
DyM
Fo
am
/win
gM
otio
n/w
ing
Mo
tion
2D
_p
imp
leD
yMF
oa
m
pim
ple
Fo
am
/TJu
nctio
n
pim
ple
Fo
am
/TJu
nctio
nF
an
pim
ple
Fo
am
/cha
nn
el3
95
pim
ple
Fo
am
/elip
sekkL
Om
eg
a
pim
ple
Fo
am
/pitzD
aily
piso
Fo
am
/les/p
itzDa
ily
piso
Fo
am
/les/p
itzDa
ilyMa
pp
ed
piso
Fo
am
/ras/ca
vity
piso
Fo
am
/ras/ca
vityCo
up
led
U
po
rou
sSim
ple
Fo
am
/an
gle
dD
uctE
xplicit
po
rou
sSim
ple
Fo
am
/an
gle
dD
uctIm
plicit
po
rou
sSim
ple
Fo
am
/straig
htD
uctIm
plicit
po
ten
tialF
oa
m/cylin
de
r
po
ten
tialF
oa
m/p
itzDa
ily
CP
U t
ime
ra
tio
FOCUS D system (Intel Xeon E5-2670 v2, 2.5GHz)
I2/I1, average:1.017I3/I1, average:1.004
I1:-axAVX,SSE4.2 I2:-xSSE4.2 I3:-xHost
I1 > I2,I1 ≒ I3I1 Better
I1 Worse
計算時間の比(Dシステム, intelコンパイラ)
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)10
0.85
0.9
0.95
1
1.05
1.1
1.15
rea
cting
Fo
am
/ras/co
un
terF
low
Fla
me
2D
rea
cting
Pa
rcelF
ilmF
oa
m/cylin
de
r
rea
cting
Pa
rcelF
ilmF
oa
m/h
otB
oxe
s
rea
cting
Pa
rcelF
ilmF
oa
m/rivu
letP
an
el
rea
cting
Pa
rcelF
ilmF
oa
m/sp
lash
Pa
ne
l
rea
cting
Pa
rcelF
oa
m/filte
r
rea
cting
Pa
rcelF
oa
m/p
arce
lInB
ox
rea
cting
Pa
rcelF
oa
m/ve
rticalC
ha
nn
el
rho
Ce
ntra
lFo
am
/La
de
nb
urg
Jet6
0p
si
rho
Ce
ntra
lFo
am
/forw
ard
Ste
p
rho
Ce
ntra
lFo
am
/ob
liqu
eS
ho
ck
rho
Ce
ntra
lFo
am
/sho
ckTu
be
rho
Ce
ntra
lFo
am
/we
dg
e1
5M
a5
rho
LT
SP
imp
leF
oa
m/a
ng
led
Du
ct
rho
Pim
ple
DyM
Fo
am
/an
nu
larT
he
rma
lMixe
r
rho
Pim
ple
Fo
am
/les/p
itzDa
ily
rho
Pim
ple
Fo
am
/ras/a
ng
led
Du
ct
rho
Pim
ple
Fo
am
/ras/ca
vity
rho
Pim
ple
Fo
am
/ras/m
ixerV
esse
l2D
rho
Pim
ple
cFo
am
/an
gle
dD
uct
rho
Po
rou
sSim
ple
Fo
am
/an
gle
dD
uctE
xplicit
rho
Po
rou
sSim
ple
Fo
am
/an
gle
dD
uctIm
plicit
rho
Sim
ple
Fo
am
/an
gle
dD
uctE
xplicitF
ixed
Co
eff
rho
Sim
ple
cFo
am
/squ
are
Be
nd
settlin
gF
oa
m/ra
s/da
hl
settlin
gF
oa
m/ra
s/tan
k3D
sha
llow
Wa
terF
oa
m/sq
ua
reB
um
p
simp
leF
oa
m/a
irFo
il2D
simp
leF
oa
m/m
ixerV
esse
l2D
simp
leF
oa
m/p
ipe
Cyclic
simp
leF
oa
m/p
itzDa
ily
simp
leF
oa
m/p
itzDa
ilyExp
tInle
t
simp
leF
oa
m/tu
rbin
eS
iting
simp
leR
ea
cting
Pa
rcelF
oa
m/ve
rticalC
ha
nn
el
solid
Disp
lace
me
ntF
oa
m/p
late
Ho
le
solid
Eq
uilib
rium
Disp
lace
me
ntF
oa
m/b
ea
mE
nd
Lo
ad
son
icFo
am
/lam
ina
r/forw
ard
Ste
p
son
icFo
am
/lam
ina
r/sho
ckTu
be
son
icFo
am
/ras/p
rism
son
icLiq
uid
Fo
am
/de
com
pre
ssion
Ta
nk
son
icLiq
uid
Fo
am
/de
com
pre
ssion
Ta
nkF
ine
spra
yFo
am
/aa
che
nB
om
b
two
Liq
uid
Mixin
gF
oa
m/lo
ckExch
an
ge
two
Ph
ase
Eu
lerF
oa
m/L
ES
/bu
bb
leC
olu
mn
two
Ph
ase
Eu
lerF
oa
m/R
AS
/bu
bb
leC
olu
mn
two
Ph
ase
Eu
lerF
oa
m/R
AS
/fluid
ised
Be
d
two
Ph
ase
Eu
lerF
oa
m/la
min
ar/b
ub
ble
Co
lum
n
two
Ph
ase
Eu
lerF
oa
m/la
min
ar/b
ub
ble
Co
lum
nIA
TE
two
Ph
ase
Eu
lerF
oa
m/la
min
ar/flu
idise
dB
ed
two
Ph
ase
Eu
lerF
oa
m/la
min
ar/m
ixerV
esse
l2D
CP
U t
ime
ra
tio
FOCUS D system (Intel Xeon E5-2670 v2, 2.5GHz)
I2/I1, average:1.017I3/I1, average:1.004
I1:-axAVX,SSE4.2 I2:-xSSE4.2 I3:-xHost
I1 > I2,I1 ≒ I3I1 Better
I1 Worse
計算時間の比(Dシステム, intelコンパイラ)
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)11
0.85
0.9
0.95
1
1.05
1.1
1.15
DP
MF
oa
m/G
old
schm
idt
LT
SR
ea
cting
Fo
am
/ras/co
un
terF
low
Fla
me
2D
LT
SR
ea
cting
Pa
rcelF
oa
m/co
un
terF
low
Fla
me
2D
LT
SR
ea
cting
Pa
rcelF
oa
m/ve
rticalC
ha
nn
el
MP
PIC
Fo
am
/Go
ldsch
mid
t
MP
PIC
Fo
am
/colu
mn
MP
PIC
Fo
am
/cyclon
e
MP
PIC
Fo
am
/inje
ction
Ch
an
ne
l
MR
FIn
terF
oa
m/m
ixerV
esse
l2D
MR
FM
ultip
ha
seIn
terF
oa
m/m
ixerV
esse
l2D
PD
RF
oa
m/fla
me
Pro
pa
ga
tion
With
Ob
stacle
s
SR
FP
imp
leF
oa
m/ro
tor2
D
SR
FS
imp
leF
oa
m/m
ixer
XiF
oa
m/ra
s/mo
riyosh
iHo
mo
ge
ne
ou
s
XiF
oa
m/ra
s/mo
riyosh
iHo
mo
ge
ne
ou
sHyd
rog
en
XiF
oa
m/ra
s/mo
riyosh
iHo
mo
ge
ne
ou
sPa
rt2
ad
join
tSh
ap
eO
ptim
izatio
nF
oa
m/p
itzDa
ily
bo
un
da
ryFo
am
/bo
un
da
ryLa
un
de
rSh
arm
a
bo
un
da
ryFo
am
/bo
un
da
ryWa
llFu
nctio
ns
bu
oya
ntB
ou
ssine
sqP
imp
leF
oa
m/h
otR
oo
m
bu
oya
ntB
ou
ssine
sqS
imp
leF
oa
m/h
otR
oo
m
bu
oya
ntB
ou
ssine
sqS
imp
leF
oa
m/ig
loo
With
Frid
ge
s
bu
oya
ntP
imp
leF
oa
m/h
otR
oo
m
bu
oya
ntS
imp
leF
oa
m/b
uo
yan
tCa
vity
bu
oya
ntS
imp
leF
oa
m/circu
itBo
ard
Co
olin
g
bu
oya
ntS
imp
leF
oa
m/e
xtern
alC
ou
ple
dC
avity
bu
oya
ntS
imp
leF
oa
m/h
otR
ad
iatio
nR
oo
m
bu
oya
ntS
imp
leF
oa
m/h
otR
ad
iatio
nR
oo
mF
vDO
M
che
mF
oa
m/g
ri
che
mF
oa
m/ic8
h1
8
che
mF
oa
m/n
c7h
16
chtM
ultiR
eg
ion
Fo
am
/mu
ltiRe
gio
nH
ea
ter
chtM
ultiR
eg
ion
Fo
am
/sna
pp
yMu
ltiRe
gio
nH
ea
ter
chtM
ultiR
eg
ion
Sim
ple
Fo
am
/he
atE
xcha
ng
er
chtM
ultiR
eg
ion
Sim
ple
Fo
am
/mu
ltiRe
gio
nH
ea
terR
ad
iatio
n
coa
lCh
em
istryFo
am
/simp
lified
Siw
ek
com
pre
ssible
Mu
ltiph
ase
Inte
rFo
am
/lam
ina
r/da
mB
rea
k4p
ha
se
dn
sFo
am
/bo
xTu
rb1
6
ele
ctrosta
ticFo
am
/cha
rge
dW
ire
fina
ncia
lFo
am
/eu
rop
ea
nC
all
fireF
oa
m/le
s/flam
eS
pre
ad
Wa
terS
up
pre
ssion
Pa
ne
l
fireF
oa
m/le
s/op
po
siteB
urn
ing
Pa
ne
ls
fireF
oa
m/le
s/sma
llPo
olF
ire2
D
fireF
oa
m/le
s/sma
llPo
olF
ire3
D
foa
myH
exM
esh
/blo
b
foa
myH
exM
esh
/flan
ge
foa
myH
exM
esh
/simp
leS
ha
pe
s
icoF
oa
m/ca
vity
icoF
oa
m/ca
vityClip
pe
d
icoF
oa
m/ca
vityFin
e
icoF
oa
m/ca
vityGra
de
icoF
oa
m/ca
vityHig
hR
e
icoF
oa
m/e
lbo
w
CP
U tim
e r
atio
FOCUS A system (Intel Xeon L5640, 2.26GHz)
I2/I1, average:0.991
I1:-axAVX,SSE4.2 I2:-xSSE4.2
I1 ≒ I2I1 Better
I1 Worse
計算時間の比(Aシステム, intelコンパイラ)
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)12
0.85
0.9
0.95
1
1.05
1.1
1.15
icoU
nco
uple
dK
inem
aticP
arce
lFoam
/hopper/h
opperE
mptyin
g
icoU
nco
uple
dK
inem
aticP
arce
lFoam
/hopper/h
opperIn
itialS
tate
inte
rDyM
Foam
/ras/d
am
Bre
akW
ithO
bsta
cle
inte
rDyM
Foam
/ras/flo
atin
gO
bje
ct
inte
rDyM
Foam
/ras/m
ixerV
esse
lAM
I
inte
rDyM
Foam
/ras/slo
shin
gT
ank2
D3D
oF
inte
rDyM
Foam
/ras/slo
shin
gT
ank2
D
inte
rDyM
Foam
/ras/slo
shin
gT
ank3
D3D
oF
inte
rDyM
Foam
/ras/slo
shin
gT
ank3
D6D
oF
inte
rDyM
Foam
/ras/slo
shin
gT
ank3
D
inte
rDyM
Foam
/ras/te
stTubeM
ixer
inte
rFoam
/lam
inar/ca
pilla
ryRise
inte
rFoam
/lam
inar/d
am
Bre
ak
inte
rFoam
/lam
inar/d
am
Bre
akF
ine
inte
rFoam
/les/n
ozzle
Flo
w2D
inte
rFoam
/ras/d
am
Bre
ak
inte
rFoam
/ras/d
am
Bre
akF
ine
inte
rFoam
/ras/d
am
Bre
akP
oro
usB
affle
inte
rFoam
/ras/w
eirO
verflo
w
inte
rMixin
gF
oam
/lam
inar/d
am
Bre
ak
inte
rPhase
ChangeD
yMF
oam
/pro
pelle
r
inte
rPhase
ChangeF
oam
/cavita
tingB
ulle
t
lapla
cianF
oam
/flange
move
Dyn
am
icMesh
/Snake
Rive
rCanyo
n
multip
hase
Eule
rFoam
/bubble
Colu
mn
multip
hase
Eule
rFoam
/dam
Bre
ak4
phase
multip
hase
Eule
rFoam
/dam
Bre
ak4
phase
Fin
e
multip
hase
Eule
rFoam
/mixe
rVesse
l2D
multip
hase
Inte
rFoam
/lam
inar/d
am
Bre
ak4
phase
multip
hase
Inte
rFoam
/lam
inar/d
am
Bre
ak4
phase
Fin
e
nonN
ew
tonia
nIco
Foam
/offse
tCylin
der
pim
ple
DyM
Foam
/mixe
rVesse
lAM
I2D
pim
ple
DyM
Foam
/movin
gC
one
pim
ple
DyM
Foam
/oscilla
tingIn
letA
CM
I2D
pim
ple
DyM
Foam
/win
gM
otio
n/w
ingM
otio
n2D
_pim
ple
DyM
Foam
pim
ple
Foam
/TJu
nctio
n
pim
ple
Foam
/TJu
nctio
nF
an
pim
ple
Foam
/channel3
95
pim
ple
Foam
/elip
sekkL
Om
ega
pim
ple
Foam
/pitzD
aily
piso
Foam
/les/p
itzDaily
piso
Foam
/les/p
itzDailyM
apped
piso
Foam
/ras/ca
vity
piso
Foam
/ras/ca
vityCouple
dU
poro
usS
imple
Foam
/angle
dD
uctE
xplicit
poro
usS
imple
Foam
/angle
dD
uctIm
plicit
poro
usS
imple
Foam
/straig
htD
uctIm
plicit
pote
ntia
lFoam
/cylinder
pote
ntia
lFoam
/pitzD
aily
CP
U t
ime
ra
tio
FOCUS A system (Intel Xeon L5640, 2.26GHz)
I2/I1, average:0.991
I1 Better
I1 Worse
I1:-axAVX,SSE4.2 I2:-xSSE4.2
I1 ≒ I2
計算時間の比(Aシステム, intelコンパイラ)
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)13
0.85
0.9
0.95
1
1.05
1.1
1.15
rea
cting
Fo
am
/ras/co
un
terF
low
Fla
me
2D
rea
cting
Pa
rcelF
ilmF
oa
m/cylin
de
r
rea
cting
Pa
rcelF
ilmF
oa
m/h
otB
oxe
s
rea
cting
Pa
rcelF
ilmF
oa
m/rivu
letP
an
el
rea
cting
Pa
rcelF
ilmF
oa
m/sp
lash
Pa
ne
l
rea
cting
Pa
rcelF
oa
m/filte
r
rea
cting
Pa
rcelF
oa
m/p
arce
lInB
ox
rea
cting
Pa
rcelF
oa
m/ve
rticalC
ha
nn
el
rho
Ce
ntra
lFo
am
/La
de
nb
urg
Jet6
0p
si
rho
Ce
ntra
lFo
am
/forw
ard
Ste
p
rho
Ce
ntra
lFo
am
/ob
liqu
eS
ho
ck
rho
Ce
ntra
lFo
am
/sho
ckTu
be
rho
Ce
ntra
lFo
am
/we
dg
e1
5M
a5
rho
LT
SP
imp
leF
oa
m/a
ng
led
Du
ct
rho
Pim
ple
DyM
Fo
am
/an
nu
larT
he
rma
lMixe
r
rho
Pim
ple
Fo
am
/les/p
itzDa
ily
rho
Pim
ple
Fo
am
/ras/a
ng
led
Du
ct
rho
Pim
ple
Fo
am
/ras/ca
vity
rho
Pim
ple
Fo
am
/ras/m
ixerV
esse
l2D
rho
Pim
ple
cFo
am
/an
gle
dD
uct
rho
Po
rou
sSim
ple
Fo
am
/an
gle
dD
uctE
xplicit
rho
Po
rou
sSim
ple
Fo
am
/an
gle
dD
uctIm
plicit
rho
Sim
ple
Fo
am
/an
gle
dD
uctE
xplicitF
ixed
Co
eff
rho
Sim
ple
cFo
am
/squ
are
Be
nd
settlin
gF
oa
m/ra
s/da
hl
settlin
gF
oa
m/ra
s/tan
k3D
sha
llow
Wa
terF
oa
m/sq
ua
reB
um
p
simp
leF
oa
m/a
irFo
il2D
simp
leF
oa
m/m
ixerV
esse
l2D
simp
leF
oa
m/p
ipe
Cyclic
simp
leF
oa
m/p
itzDa
ily
simp
leF
oa
m/p
itzDa
ilyExp
tInle
t
simp
leF
oa
m/tu
rbin
eS
iting
simp
leR
ea
cting
Pa
rcelF
oa
m/ve
rticalC
ha
nn
el
solid
Disp
lace
me
ntF
oa
m/p
late
Ho
le
solid
Eq
uilib
rium
Disp
lace
me
ntF
oa
m/b
ea
mE
nd
Lo
ad
son
icFo
am
/lam
ina
r/forw
ard
Ste
p
son
icFo
am
/lam
ina
r/sho
ckTu
be
son
icFo
am
/ras/p
rism
son
icLiq
uid
Fo
am
/de
com
pre
ssion
Ta
nk
son
icLiq
uid
Fo
am
/de
com
pre
ssion
Ta
nkF
ine
spra
yFo
am
/aa
che
nB
om
b
two
Liq
uid
Mixin
gF
oa
m/lo
ckExch
an
ge
two
Ph
ase
Eu
lerF
oa
m/L
ES
/bu
bb
leC
olu
mn
two
Ph
ase
Eu
lerF
oa
m/R
AS
/bu
bb
leC
olu
mn
two
Ph
ase
Eu
lerF
oa
m/R
AS
/fluid
ised
Be
d
two
Ph
ase
Eu
lerF
oa
m/la
min
ar/b
ub
ble
Co
lum
n
two
Ph
ase
Eu
lerF
oa
m/la
min
ar/b
ub
ble
Co
lum
nIA
TE
two
Ph
ase
Eu
lerF
oa
m/la
min
ar/flu
idise
dB
ed
two
Ph
ase
Eu
lerF
oa
m/la
min
ar/m
ixerV
esse
l2D
CP
U t
ime
ra
tio
FOCUS A system (Intel Xeon L5640, 2.26GHz)
I2/I1, average:0.991
I1 Better
I1 Worse
I1:-axAVX,SSE4.2 I2:-xSSE4.2
I1 ≒ I2
計算時間の比(Aシステム, intelコンパイラ)
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)14
MPPICFoam/cyclone chemFoam/nc7h16
I1 Better
I1 Worse
I1:intel compiler -axAVX,SSE4.2 G2:gnu compiler -O3
計算時間の比(Dシステム,intel vs gnu)
0.6
0.8
1
1.2
1.4
1.6
1.8
2
DP
MF
oam
/Gold
schm
idt
LT
SR
eactin
gF
oam
/ras/co
unte
rFlo
wF
lam
e2D
LT
SR
eactin
gP
arce
lFoam
/counte
rFlo
wF
lam
e2D
LT
SR
eactin
gP
arce
lFoam
/vertica
lChannel
MP
PIC
Foam
/Gold
schm
idt
MP
PIC
Foam
/colu
mn
MP
PIC
Foam
/cyclone
MP
PIC
Foam
/inje
ctionC
hannel
MR
FIn
terF
oam
/mixe
rVesse
l2D
MR
FM
ultip
hase
Inte
rFoam
/mixe
rVesse
l2D
PD
RF
oam
/flam
eP
ropagatio
nW
ithO
bsta
cles
SR
FP
imple
Foam
/roto
r2D
SR
FS
imple
Foam
/mixe
r
XiF
oam
/ras/m
oriyo
shiH
om
ogeneous
XiF
oam
/ras/m
oriyo
shiH
om
ogeneousH
ydro
gen
XiF
oam
/ras/m
oriyo
shiH
om
ogeneousP
art2
adjo
intS
hapeO
ptim
izatio
nF
oam
/pitzD
aily
boundaryF
oam
/boundaryL
aunderS
harm
a
boundaryF
oam
/boundaryW
allF
unctio
ns
buoya
ntB
oussin
esq
Pim
ple
Foam
/hotR
oom
buoya
ntB
oussin
esq
Sim
ple
Foam
/hotR
oom
buoya
ntB
oussin
esq
Sim
ple
Foam
/iglo
oW
ithF
ridges
buoya
ntP
imple
Foam
/hotR
oom
buoya
ntS
imple
Foam
/buoya
ntC
avity
buoya
ntS
imple
Foam
/circuitB
oard
Coolin
g
buoya
ntS
imple
Foam
/exte
rnalC
ouple
dC
avity
buoya
ntS
imple
Foam
/hotR
adia
tionR
oom
buoya
ntS
imple
Foam
/hotR
adia
tionR
oom
FvD
OM
chem
Foam
/gri
chem
Foam
/ic8h18
chem
Foam
/nc7
h16
chtM
ultiR
egio
nF
oam
/multiR
egio
nH
eate
r
chtM
ultiR
egio
nF
oam
/snappyM
ultiR
egio
nH
eate
r
chtM
ultiR
egio
nS
imple
Foam
/heatE
xchanger
chtM
ultiR
egio
nS
imple
Foam
/multiR
egio
nH
eate
rRadia
tion
coalC
hem
istryFoam
/simplifie
dS
iwek
com
pre
ssible
Multip
hase
Inte
rFoam
/lam
inar/d
am
Bre
ak4
phase
dnsF
oam
/boxT
urb
16
ele
ctrosta
ticFoam
/charg
edW
ire
financia
lFoam
/euro
peanC
all
fireF
oam
/les/fla
meS
pre
adW
ate
rSuppre
ssionP
anel
fireF
oam
/les/o
pposite
Burn
ingP
anels
fireF
oam
/les/sm
allP
oolF
ire2D
fireF
oam
/les/sm
allP
oolF
ire3D
foam
yHexM
esh
/blo
b
foam
yHexM
esh
/flange
foam
yHexM
esh
/simple
Shapes
icoF
oam
/cavity
icoF
oam
/cavityC
lipped
icoF
oam
/cavityF
ine
icoF
oam
/cavityG
rade
icoF
oam
/cavityH
ighR
e
icoF
oam
/elb
ow
CP
U t
ime
ra
tio
FOCUS D system (Intel Xeon E5-2670 v2, 2.5GHz)
G2/I1, average:1.066
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)15
interDyMFoam/ras/mixerVesselAMI
I1 Better
I1 Worse
I1:intel compiler -axAVX,SSE4.2 G2:gnu compiler -O3
計算時間の比(Dシステム,intel vs gnu)
0.6
0.8
1
1.2
1.4
1.6
1.8
2
icoU
nco
uple
dK
inem
aticP
arce
lFoam
/hopper/h
opperE
mptyin
g
icoU
nco
uple
dK
inem
aticP
arce
lFoam
/hopper/h
opperIn
itialS
tate
inte
rDyM
Foam
/ras/d
am
Bre
akW
ithO
bsta
cle
inte
rDyM
Foam
/ras/flo
atin
gO
bje
ct
inte
rDyM
Foam
/ras/m
ixerV
esse
lAM
I
inte
rDyM
Foam
/ras/slo
shin
gT
ank2
D3D
oF
inte
rDyM
Foam
/ras/slo
shin
gT
ank2
D
inte
rDyM
Foam
/ras/slo
shin
gT
ank3
D3D
oF
inte
rDyM
Foam
/ras/slo
shin
gT
ank3
D6D
oF
inte
rDyM
Foam
/ras/slo
shin
gT
ank3
D
inte
rDyM
Foam
/ras/te
stTubeM
ixer
inte
rFoam
/lam
inar/ca
pilla
ryRise
inte
rFoam
/lam
inar/d
am
Bre
ak
inte
rFoam
/lam
inar/d
am
Bre
akF
ine
inte
rFoam
/les/n
ozzle
Flo
w2D
inte
rFoam
/ras/d
am
Bre
ak
inte
rFoam
/ras/d
am
Bre
akF
ine
inte
rFoam
/ras/d
am
Bre
akP
oro
usB
affle
inte
rFoam
/ras/w
eirO
verflo
w
inte
rMixin
gF
oam
/lam
inar/d
am
Bre
ak
inte
rPhase
ChangeD
yMF
oam
/pro
pelle
r
inte
rPhase
ChangeF
oam
/cavita
tingB
ulle
t
lapla
cianF
oam
/flange
move
Dyn
am
icMesh
/Snake
Rive
rCanyo
n
multip
hase
Eule
rFoam
/bubble
Colu
mn
multip
hase
Eule
rFoam
/dam
Bre
ak4
phase
multip
hase
Eule
rFoam
/dam
Bre
ak4
phase
Fin
e
multip
hase
Eule
rFoam
/mixe
rVesse
l2D
multip
hase
Inte
rFoam
/lam
inar/d
am
Bre
ak4
phase
multip
hase
Inte
rFoam
/lam
inar/d
am
Bre
ak4
phase
Fin
e
nonN
ew
tonia
nIco
Foam
/offse
tCylin
der
pim
ple
DyM
Foam
/mixe
rVesse
lAM
I2D
pim
ple
DyM
Foam
/movin
gC
one
pim
ple
DyM
Foam
/oscilla
tingIn
letA
CM
I2D
pim
ple
DyM
Foam
/win
gM
otio
n/w
ingM
otio
n2D
_pim
ple
DyM
Foam
pim
ple
Foam
/TJu
nctio
n
pim
ple
Foam
/TJu
nctio
nF
an
pim
ple
Foam
/channel3
95
pim
ple
Foam
/elip
sekkL
Om
ega
pim
ple
Foam
/pitzD
aily
piso
Foam
/les/p
itzDaily
piso
Foam
/les/p
itzDailyM
apped
piso
Foam
/ras/ca
vity
piso
Foam
/ras/ca
vityCouple
dU
poro
usS
imple
Foam
/angle
dD
uctE
xplicit
poro
usS
imple
Foam
/angle
dD
uctIm
plicit
poro
usS
imple
Foam
/straig
htD
uctIm
plicit
pote
ntia
lFoam
/cylinder
pote
ntia
lFoam
/pitzD
aily
CP
U tim
e r
atio
FOCUS D system (Intel Xeon E5-2670 v2, 2.5GHz)
G2/I1, average:1.066
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)16
I1:intel compiler -axAVX,SSE4.2 G2:gnu compiler -O3
sprayFoam/aachenBomb
I1 Better
I1 Worse
計算時間の比(Dシステム,intel vs gnu)
0.6
0.8
1
1.2
1.4
1.6
1.8
2
rea
cting
Foam
/ras/co
un
terF
low
Fla
me2D
rea
cting
Parce
lFilm
Fo
am
/cylind
er
rea
cting
Parce
lFilm
Fo
am
/ho
tBo
xes
rea
cting
Parce
lFilm
Fo
am
/rivule
tPa
nel
rea
cting
Parce
lFilm
Fo
am
/spla
shP
an
el
rea
cting
Parce
lFo
am
/filter
rea
cting
Parce
lFo
am
/pa
rcelIn
Box
rea
cting
Parce
lFo
am
/vertica
lCha
nnel
rho
Ce
ntra
lFo
am
/La
denb
urg
Jet6
0psi
rho
Ce
ntra
lFo
am
/forw
ard
Ste
p
rho
Ce
ntra
lFo
am
/ob
liqueS
hock
rho
Ce
ntra
lFo
am
/sho
ckTu
be
rho
Ce
ntra
lFo
am
/wed
ge15
Ma
5
rho
LT
SP
imple
Fo
am
/an
gle
dD
uct
rho
Pim
ple
DyM
Foa
m/a
nn
ula
rThe
rma
lMixe
r
rho
Pim
ple
Foa
m/le
s/pitzD
aily
rho
Pim
ple
Foa
m/ra
s/an
gle
dD
uct
rho
Pim
ple
Foa
m/ra
s/cavity
rho
Pim
ple
Foa
m/ra
s/mixe
rVesse
l2D
rho
Pim
ple
cFoa
m/a
ng
ledD
uct
rho
Po
rou
sSim
ple
Fo
am
/an
gle
dD
uctE
xplicit
rho
Po
rou
sSim
ple
Fo
am
/an
gle
dD
uctIm
plicit
rho
Sim
ple
Foa
m/a
ng
ledD
uctE
xplicitF
ixed
Coeff
rho
Sim
ple
cFoa
m/sq
ua
reB
en
d
settlin
gF
oam
/ras/d
ah
l
settlin
gF
oam
/ras/ta
nk3
D
sha
llow
Wa
terF
oa
m/sq
ua
reB
um
p
simp
leF
oa
m/a
irFoil2
D
simp
leF
oa
m/m
ixerV
esse
l2D
simp
leF
oa
m/p
ipe
Cyclic
simp
leF
oa
m/p
itzDa
ily
simp
leF
oa
m/p
itzDa
ilyExp
tInle
t
simp
leF
oa
m/tu
rbin
eS
iting
simp
leR
ea
ctingP
arce
lFoa
m/ve
rticalC
hann
el
solid
Disp
lace
mentF
oam
/pla
teH
ole
solid
Eq
uilib
rium
Disp
lace
me
ntF
oa
m/b
ea
mE
ndL
oad
son
icFoa
m/la
min
ar/fo
rwa
rdS
tep
son
icFoa
m/la
min
ar/sh
ockT
ub
e
son
icFoa
m/ra
s/prism
son
icLiq
uid
Fo
am
/de
com
pre
ssion
Tank
son
icLiq
uid
Fo
am
/de
com
pre
ssion
TankF
ine
spra
yFoa
m/a
ach
enB
om
b
two
Liq
uid
Mixin
gF
oam
/lockE
xchan
ge
two
Ph
ase
Eu
lerF
oam
/LE
S/b
ubb
leC
olu
mn
two
Ph
ase
Eu
lerF
oam
/RA
S/b
ubble
Co
lum
n
two
Ph
ase
Eu
lerF
oam
/RA
S/flu
idise
dB
ed
two
Ph
ase
Eu
lerF
oam
/lam
inar/b
ubb
leC
olu
mn
two
Ph
ase
Eu
lerF
oam
/lam
inar/b
ubb
leC
olu
mn
IAT
E
two
Ph
ase
Eu
lerF
oam
/lam
inar/flu
idise
dB
ed
two
Ph
ase
Eu
lerF
oam
/lam
inar/m
ixerV
esse
l2D
CP
U t
ime
ra
tio
FOCUS D system (Intel Xeon E5-2670 v2, 2.5GHz)
G2/I1, average:1.066
twoPhaseEulerFoam/laminar/bubbleColumnIATE
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)17
MPPICFoam/cyclone
I1 Better
I1 Worse
I1:intel compiler -axAVX,SSE4.2 G2:gnu compiler -O3
計算時間の比(Aシステム,intel vs gnu)
0.6
0.8
1
1.2
1.4
1.6
1.8
2
DP
MF
oam
/Gold
schm
idt
LT
SR
eactin
gF
oam
/ras/co
unte
rFlo
wF
lam
e2D
LT
SR
eactin
gP
arce
lFoam
/counte
rFlo
wF
lam
e2D
LT
SR
eactin
gP
arce
lFoam
/vertica
lChannel
MP
PIC
Foam
/Gold
schm
idt
MP
PIC
Foam
/colu
mn
MP
PIC
Foam
/cyclone
MP
PIC
Foam
/inje
ctionC
hannel
MR
FIn
terF
oam
/mixe
rVesse
l2D
MR
FM
ultip
hase
Inte
rFoam
/mixe
rVesse
l2D
PD
RF
oam
/flam
eP
ropagatio
nW
ithO
bsta
cles
SR
FP
imple
Foam
/roto
r2D
SR
FS
imple
Foam
/mixe
r
XiF
oam
/ras/m
oriyo
shiH
om
ogeneous
XiF
oam
/ras/m
oriyo
shiH
om
ogeneousH
ydro
gen
XiF
oam
/ras/m
oriyo
shiH
om
ogeneousP
art2
adjo
intS
hapeO
ptim
izatio
nF
oam
/pitzD
aily
boundaryF
oam
/boundaryL
aunderS
harm
a
boundaryF
oam
/boundaryW
allF
unctio
ns
buoya
ntB
oussin
esq
Pim
ple
Foam
/hotR
oom
buoya
ntB
oussin
esq
Sim
ple
Foam
/hotR
oom
buoya
ntB
oussin
esq
Sim
ple
Foam
/iglo
oW
ithF
ridges
buoya
ntP
imple
Foam
/hotR
oom
buoya
ntS
imple
Foam
/buoya
ntC
avity
buoya
ntS
imple
Foam
/circuitB
oard
Coolin
g
buoya
ntS
imple
Foam
/exte
rnalC
ouple
dC
avity
buoya
ntS
imple
Foam
/hotR
adia
tionR
oom
buoya
ntS
imple
Foam
/hotR
adia
tionR
oom
FvD
OM
chem
Foam
/gri
chem
Foam
/ic8h18
chem
Foam
/nc7
h16
chtM
ultiR
egio
nF
oam
/multiR
egio
nH
eate
r
chtM
ultiR
egio
nF
oam
/snappyM
ultiR
egio
nH
eate
r
chtM
ultiR
egio
nS
imple
Foam
/heatE
xchanger
chtM
ultiR
egio
nS
imple
Foam
/multiR
egio
nH
eate
rRadia
tion
coalC
hem
istryFoam
/simplifie
dS
iwek
com
pre
ssible
Multip
hase
Inte
rFoam
/lam
inar/d
am
Bre
ak4
phase
dnsF
oam
/boxT
urb
16
ele
ctrosta
ticFoam
/charg
edW
ire
financia
lFoam
/euro
peanC
all
fireF
oam
/les/fla
meS
pre
adW
ate
rSuppre
ssionP
anel
fireF
oam
/les/o
pposite
Burn
ingP
anels
fireF
oam
/les/sm
allP
oolF
ire2D
fireF
oam
/les/sm
allP
oolF
ire3D
foam
yHexM
esh
/blo
b
foam
yHexM
esh
/flange
foam
yHexM
esh
/simple
Shapes
icoF
oam
/cavity
icoF
oam
/cavityC
lipped
icoF
oam
/cavityF
ine
icoF
oam
/cavityG
rade
icoF
oam
/cavityH
ighR
e
icoF
oam
/elb
ow
CP
U tim
e r
atio
FOCUS A system (Intel Xeon L5640, 2.26GHz)
G2/I1, average:1.038
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)18
multiphaseInterFoam/ laminar/ damBreak4phaseFine
I1 Better
I1 Worse
I1:intel compiler -axAVX,SSE4.2 G2:gnu compiler -O3
計算時間の比(Aシステム,intel vs gnu)
0.6
0.8
1
1.2
1.4
1.6
1.8
2
icoU
nco
uple
dK
inem
aticP
arce
lFoam
/hopper/h
opperE
mptyin
g
icoU
nco
uple
dK
inem
aticP
arce
lFoam
/hopper/h
opperIn
itialS
tate
inte
rDyM
Foam
/ras/d
am
Bre
akW
ithO
bsta
cle
inte
rDyM
Foam
/ras/flo
atin
gO
bje
ct
inte
rDyM
Foam
/ras/m
ixerV
esse
lAM
I
inte
rDyM
Foam
/ras/slo
shin
gT
ank2
D3D
oF
inte
rDyM
Foam
/ras/slo
shin
gT
ank2
D
inte
rDyM
Foam
/ras/slo
shin
gT
ank3
D3D
oF
inte
rDyM
Foam
/ras/slo
shin
gT
ank3
D6D
oF
inte
rDyM
Foam
/ras/slo
shin
gT
ank3
D
inte
rDyM
Foam
/ras/te
stTubeM
ixer
inte
rFoam
/lam
inar/ca
pilla
ryRise
inte
rFoam
/lam
inar/d
am
Bre
ak
inte
rFoam
/lam
inar/d
am
Bre
akF
ine
inte
rFoam
/les/n
ozzle
Flo
w2D
inte
rFoam
/ras/d
am
Bre
ak
inte
rFoam
/ras/d
am
Bre
akF
ine
inte
rFoam
/ras/d
am
Bre
akP
oro
usB
affle
inte
rFoam
/ras/w
eirO
verflo
w
inte
rMixin
gF
oam
/lam
inar/d
am
Bre
ak
inte
rPhase
ChangeD
yMF
oam
/pro
pelle
r
inte
rPhase
ChangeF
oam
/cavita
tingB
ulle
t
lapla
cianF
oam
/flange
move
Dyn
am
icMesh
/Snake
Rive
rCanyo
n
multip
hase
Eule
rFoam
/bubble
Colu
mn
multip
hase
Eule
rFoam
/dam
Bre
ak4
phase
multip
hase
Eule
rFoam
/dam
Bre
ak4
phase
Fin
e
multip
hase
Eule
rFoam
/mixe
rVesse
l2D
multip
hase
Inte
rFoam
/lam
inar/d
am
Bre
ak4
phase
multip
hase
Inte
rFoam
/lam
inar/d
am
Bre
ak4
phase
Fin
e
nonN
ew
tonia
nIco
Foam
/offse
tCylin
der
pim
ple
DyM
Foam
/mixe
rVesse
lAM
I2D
pim
ple
DyM
Foam
/movin
gC
one
pim
ple
DyM
Foam
/oscilla
tingIn
letA
CM
I2D
pim
ple
DyM
Foam
/win
gM
otio
n/w
ingM
otio
n2D
_pim
ple
DyM
Foam
pim
ple
Foam
/TJu
nctio
n
pim
ple
Foam
/TJu
nctio
nF
an
pim
ple
Foam
/channel3
95
pim
ple
Foam
/elip
sekkL
Om
ega
pim
ple
Foam
/pitzD
aily
piso
Foam
/les/p
itzDaily
piso
Foam
/les/p
itzDailyM
apped
piso
Foam
/ras/ca
vity
piso
Foam
/ras/ca
vityCouple
dU
poro
usS
imple
Foam
/angle
dD
uctE
xplicit
poro
usS
imple
Foam
/angle
dD
uctIm
plicit
poro
usS
imple
Foam
/straig
htD
uctIm
plicit
pote
ntia
lFoam
/cylinder
pote
ntia
lFoam
/pitzD
aily
CP
U t
ime
ra
tio
FOCUS A system (Intel Xeon L5640, 2.26GHz)
G2/I1, average:1.038multiphaseEulerFoam/ damBreak4phaseFine
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)19
I1:intel compiler -axAVX,SSE4.2 G2:gnu compiler -O3
sprayFoam/aachenBomb
I1 Better
I1 Worse
計算時間の比(Aシステム,intel vs gnu)
twoPhaseEulerFoam/laminar/bubbleColumnIATE 0.6
0.8
1
1.2
1.4
1.6
1.8
2
reactin
gF
oam
/ras/co
unte
rFlo
wF
lam
e2D
reactin
gP
arce
lFilm
Foam
/cylinder
reactin
gP
arce
lFilm
Foam
/hotB
oxe
s
reactin
gP
arce
lFilm
Foam
/rivule
tPanel
reactin
gP
arce
lFilm
Foam
/spla
shP
anel
reactin
gP
arce
lFoam
/filter
reactin
gP
arce
lFoam
/parce
lInB
ox
reactin
gP
arce
lFoam
/vertica
lChannel
rhoC
entra
lFoam
/Ladenburg
Jet6
0psi
rhoC
entra
lFoam
/forw
ard
Ste
p
rhoC
entra
lFoam
/obliq
ueS
hock
rhoC
entra
lFoam
/shockT
ube
rhoC
entra
lFoam
/wedge15M
a5
rhoLT
SP
imple
Foam
/angle
dD
uct
rhoP
imple
DyM
Foam
/annula
rTherm
alM
ixer
rhoP
imple
Foam
/les/p
itzDaily
rhoP
imple
Foam
/ras/a
ngle
dD
uct
rhoP
imple
Foam
/ras/ca
vity
rhoP
imple
Foam
/ras/m
ixerV
esse
l2D
rhoP
imple
cFoam
/angle
dD
uct
rhoP
oro
usS
imple
Foam
/angle
dD
uctE
xplicit
rhoP
oro
usS
imple
Foam
/angle
dD
uctIm
plicit
rhoS
imple
Foam
/angle
dD
uctE
xplicitF
ixedC
oeff
rhoS
imple
cFoam
/square
Bend
settlin
gF
oam
/ras/d
ahl
settlin
gF
oam
/ras/ta
nk3
D
shallo
wW
ate
rFoam
/square
Bum
p
simple
Foam
/airF
oil2
D
simple
Foam
/mixe
rVesse
l2D
simple
Foam
/pip
eC
yclic
simple
Foam
/pitzD
aily
simple
Foam
/pitzD
ailyE
xptIn
let
simple
Foam
/turb
ineS
iting
simple
Reactin
gP
arce
lFoam
/vertica
lChannel
solid
Disp
lace
mentF
oam
/pla
teH
ole
solid
Equilib
rium
Disp
lace
mentF
oam
/beam
EndLoad
sonicF
oam
/lam
inar/fo
rward
Ste
p
sonicF
oam
/lam
inar/sh
ockT
ube
sonicF
oam
/ras/p
rism
sonicL
iquid
Foam
/deco
mpre
ssionT
ank
sonicL
iquid
Foam
/deco
mpre
ssionT
ankF
ine
spra
yFoam
/aach
enB
om
b
twoLiq
uid
Mixin
gF
oam
/lockE
xchange
twoP
hase
Eule
rFoam
/LE
S/b
ubble
Colu
mn
twoP
hase
Eule
rFoam
/RA
S/b
ubble
Colu
mn
twoP
hase
Eule
rFoam
/RA
S/flu
idise
dB
ed
twoP
hase
Eule
rFoam
/lam
inar/b
ubble
Colu
mn
twoP
hase
Eule
rFoam
/lam
inar/b
ubble
Colu
mnIA
TE
twoP
hase
Eule
rFoam
/lam
inar/flu
idise
dB
ed
twoP
hase
Eule
rFoam
/lam
inar/m
ixerV
esse
l2D
CP
U t
ime
ra
tio
FOCUS A system (Intel Xeon L5640, 2.26GHz)
G2/I1, average:1.038
OpenCAE Local Workshop@Kansai 20th Sep 2014 Masashi Imano (OCAEL Co. Ltd.)20
まとめ
✓intelコンパイラの最適化オプションとして-axAVX,SSE4.2を用いることで,DシステムではAVXが有効,AシステムではSSE4.2が自動的に有効になるバイナリができる.
✓-axAVX,SSE4.2オプション版は,-xAVXや-xSSE4.2の版と速度がほぼ同等である.
✓intel版に対するgnu版の計算時間の比の平均は,Dシステムで約1.07,Aシステムで約1.04であり,平均するとgnu版の速度低下は10%以下である.ただし,gnu版のほうが40%以上遅いケースがA,D両システムとも5ケースあった.