Computer Science | Harvard John A. Paulson School of ...parkes/cs286r/spring06/... · 1.22 1.24...
Transcript of Computer Science | Harvard John A. Paulson School of ...parkes/cs286r/spring06/... · 1.22 1.24...
FeasibleSolutions
InfeasibleSolutions
25.2
29.4
18.3
30.6
19.5
Optimal Solution
Initial state
Final state
Capacity Cj,k
Capacity Cj,k
Time t
1.22
1.24
1.26
1.28
1.3
1.32
1.34
1.36
1.38
1024 2048 4096 8192 16384 32768 65536 131072
Aver
age
RDF
Running Time (seconds)
TDIR
40
50
60
70
80
90
100
4096 8192 16384 32768 65536 131072
Perc
enta
ge o
f TD
Win
s
Running Time (seconds)
TD Win+TieTD Win
0.02
0.025
0.03
0.035
0.04
0.045
0.05
0.055
0 1000 2000 3000 4000 5000Tr
aini
ng E
rror
Training Epochs
P30P10aP10bP10c
1.15
1.2
1.25
1.3
1.35
1.4
1.45
0 1000 2000 3000 4000 5000
CV
Aver
age
RD
F
Training Epochs
P30P10aP10bP10c
20
40
60
80
100
120
140
160
0 1000 2000 3000 4000 5000
CV
Aver
age
#Mov
es
Training Epochs
P30P10aP10bP10c
00.020.040.060.08
0.10.120.140.160.18
0 1000 2000 3000 4000 5000
CV
Fina
l TD
Erro
r
Training Epochs
P30P10aP10bP10c
0
0.005
0.01
0.015
0.02
0.025
0 1000 2000 3000 4000 5000
CV
Non
-Fin
al T
D E
rror
Training Epochs
P30P10aP10bP10c
1
1.1
1.2
1.3
1.4
1.5
1.6
0 5 10 15 20 25 30 35 40 45 50
Reso
urce
Dila
tion
Fact
or
Position along repair path
Current State RDF
Predicted Final RDF
1.15
1.2
1.25
1.3
1.35
1.4
1.45
0 500 1000 1500 2000
Aver
age
RD
F
Training Epochs
No Final Reward
With Reward
20406080
100120140160180200
0 500 1000 1500 2000
Aver
age
Num
ber o
f Rep
airs
Training Epochs
No Final Reward
With Reward
1.2
1.22
1.24
1.26
1.28
1.3Av
erag
e RD
FPure IR TD1Q
TD2Q TD3QTD4Q
Pure TD
1.2
1.22
1.24
1.26
1.28
1.3
Aver
age
RDF
Pure TD
IR1Q
IR2Q IR3QIR4Q
Pure IR