
What ROC Curves Can't Do (and Cost Curves Can)

Robert Holte, Computing Science Dept., University of Alberta
Chris Drummond, Research Officer NRC, Adjunct Professor SITE

Cost Curves

Scalar Measures Don’t Even Come Close

How to Evaluate Performance?

• Scalar measure summarizing performance

– Accuracy

– Expected cost

• Performance Visualization Techniques

– ROC curve

– Area under the ROC curve

– Cost Curve

The Lure of Scalar Measures

“…it is often preferable to employ a single value measure which summarizes the performance of a classifier, e.g. because there are several classifiers to be compared and there is no clear dominance of one ROC curve above the others. The most widely used single measure is the Area Under the ROC Curve …”

– paraphrase from a workshop paper


What’s Genuinely Good About Scalar Measures?

• We know how to

– average them,

– compute confidence intervals,

– test for significance, etc.

• and there is off-the-shelf software to do these calculations for us.

• But they often hide important details.

2 Splitting Criteria for C4.5

[ROC plot: True Positive Rate vs. False Positive Rate, one curve for Criterion-A and one for Criterion-D]

The key question is NOT “Is A better than D?” BUT “When is A better than D?”

Appendicitis Dataset

Is AUC = 0.95 better than AUC = 0.75?

[ROC plot on the Appendicitis dataset: True Positive Rate vs. False Positive Rate, comparing a classifier with AUC = 0.95 and one with AUC = 0.75]

When positives outnumber negatives 25:1, the AUC = 0.95 classifier has more than twice the error rate of the AUC = 0.75 classifier.

[Labeled points: the AUC = 0.75 curve reaches FP = 0.75, TP = 1.0; the AUC = 0.95 curve reaches FP = 0, TP = 0.95]
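To see the effect concretely, here is a quick check (a sketch, not from the slides) of expected error at a 25:1 positive-to-negative ratio, under the simplifying assumption that each classifier is used at its labeled operating point:

    # Expected error rate when a classifier operates at a single
    # (FP, TP) point: error = FN*p(+) + FP*(1 - p(+)), with FN = 1 - TP.
    def error_rate(fp, tp, p_pos):
        return (1.0 - tp) * p_pos + fp * (1.0 - p_pos)

    p_pos = 25.0 / 26.0  # positives outnumber negatives 25:1

    print(error_rate(fp=0.0, tp=0.95, p_pos=p_pos))   # AUC=0.95 point: ~0.048
    print(error_rate(fp=0.75, tp=1.0, p_pos=p_pos))   # AUC=0.75 point: ~0.029
    # At this class ratio the higher-AUC classifier has the higher expected error.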

Cost Curves

Error Curves

[Plot: Error Rate vs. Probability of Positive P(+); each classifier's error curve is a straight line from FP at P(+) = 0 to FN = 1 - TP at P(+) = 1]

Classifier 1: TP = 0.4, FP = 0.3
Classifier 2: TP = 0.7, FP = 0.5
Classifier 3: TP = 0.6, FP = 0.2
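As a sketch of how these lines are compared (helper names are mine): each classifier's error curve is error(p) = FN•p + FP•(1-p), and evaluating the three lines over a grid of p(+) shows which classifier is best where:

    import numpy as np

    # Error curve of a classifier: a straight line from FP at p(+) = 0
    # to FN = 1 - TP at p(+) = 1.
    def error_curve(tp, fp, p):
        return (1.0 - tp) * p + fp * (1.0 - p)

    p = np.linspace(0.0, 1.0, 101)
    names = ["Classifier 1", "Classifier 2", "Classifier 3"]
    points = [(0.4, 0.3), (0.7, 0.5), (0.6, 0.2)]  # (TP, FP) from the plot
    lines = np.vstack([error_curve(tp, fp, p) for tp, fp in points])

    best = lines.argmin(axis=0)  # index of the lowest line at each p(+)
    for i, name in enumerate(names):
        where = p[best == i]
        if where.size:
            print(f"{name} is best for p(+) in [{where.min():.2f}, {where.max():.2f}]")

Running this shows Classifier 3 is best for small p(+), Classifier 2 for large p(+), and Classifier 1 is never best: exactly the "when is A better than D" question the scalar measures hide.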

Operating Range

[Plot: Error Rate vs. Probability of Positive P(+); the diagonal lines are the trivial classifiers, “always negative” (error = P(+)) and “always positive” (error = 1 - P(+)). The operating range is the interval of P(+) where the classifier's error curve lies below both trivial lines.]
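A sketch (function name mine) of how an operating range can be found numerically: keep the values of p(+) where the classifier's error line falls below both trivial classifiers.

    import numpy as np

    def operating_range(tp, fp, n=1001):
        """p(+) interval where the classifier beats both trivial classifiers."""
        p = np.linspace(0.0, 1.0, n)
        error = (1.0 - tp) * p + fp * (1.0 - p)
        trivial = np.minimum(p, 1.0 - p)  # "always negative" costs p, "always positive" costs 1-p
        useful = p[error < trivial]
        return (useful.min(), useful.max()) if useful.size else None

    print(operating_range(tp=0.6, fp=0.2))  # roughly (0.25, 0.67)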

Lower Envelope

[Plot: Error Rate vs. Probability of Positive P(+); the lower envelope is the pointwise minimum of all the classifiers' error curves, bounded by the “always negative” and “always positive” lines]

The lower envelope is a biased estimate of performance.

Fresh data is needed to get an unbiased estimate.
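A minimal sketch of the lower-envelope construction: take the pointwise minimum over all the error lines at each value of p(+). Because the envelope picks the best classifier per point on the same data, it is optimistically biased, as the slide notes.

    import numpy as np

    p = np.linspace(0.0, 1.0, 101)
    points = [(0.4, 0.3), (0.7, 0.5), (0.6, 0.2)]  # (TP, FP) pairs as before
    lines = np.vstack([(1.0 - tp) * p + fp * (1.0 - p) for tp, fp in points])

    lower_envelope = lines.min(axis=0)  # best achievable error at each p(+)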

Taking Costs Into Account

Error curves use X = p(+) and Y = error rate. Cost curves generalize both axes.

X becomes PC(+), the probability cost:

PC(+) = p(+)•C(-|+) / ( p(+)•C(-|+) + p(-)•C(+|-) )

Y becomes the normalized expected cost, E[cost] scaled to [0,1]:

Y = FN•X + FP•(1-X)

[Plot: Normalized Expected Cost vs. PC(+) - Probability Cost; each classifier is a straight line from FP at PC(+) = 0 to FN at PC(+) = 1]
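A sketch of the two transforms (function names are mine): map a deployment situation to PC(+), then map an operating point (FP, FN) to its normalized expected cost.

    def probability_cost(p_pos, c_fn, c_fp):
        """PC(+) = p(+)*C(-|+) / (p(+)*C(-|+) + p(-)*C(+|-))."""
        num = p_pos * c_fn
        return num / (num + (1.0 - p_pos) * c_fp)

    def normalized_expected_cost(fp, fn, pc_pos):
        """Y = FN*PC(+) + FP*(1 - PC(+)), always in [0, 1]."""
        return fn * pc_pos + fp * (1.0 - pc_pos)

    # Example: 10% positives, a false negative costing 5x a false positive.
    pc = probability_cost(p_pos=0.1, c_fn=5.0, c_fp=1.0)        # ~0.357
    print(normalized_expected_cost(fp=0.2, fn=0.3, pc_pos=pc))  # ~0.236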

Comparing Cost Curves

[Plot: Normalized Expected Cost vs. PC(+) - Probability Cost, overlaying the cost curves of the classifiers being compared]

ROC, Selection Procedure

[ROC plot: True Positive Rate vs. False Positive Rate]

Suppose this classifier was produced by a training set with a class ratio of 10:1, and was used whenever the deployment situation had a 10:1 class ratio.

Cost Curves, Selection Procedure

[Plot: Normalized Expected Cost vs. PC(+) - Probability Cost]

Averaging Cost Curves

[Plot: Normalized Expected Cost vs. PC(+) - Probability Cost; the average cost curve is the pointwise mean of the individual curves]
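This is where cost curves pay off: curves defined over the same PC(+) grid can simply be averaged pointwise (vertically), as in this sketch with hypothetical per-fold operating points; ROC curves have no equally natural averaging direction.

    import numpy as np

    pc = np.linspace(0.0, 1.0, 101)
    folds = [(0.25, 0.30), (0.35, 0.20), (0.30, 0.28)]  # hypothetical (FN, FP) per fold
    curves = np.vstack([fn * pc + fp * (1.0 - pc) for fn, fp in folds])

    average_curve = curves.mean(axis=0)  # pointwise (vertical) average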

Averaging ROC Curves

[ROC plot: True Positive Rate vs. False Positive Rate, showing the ROC curves being averaged]

Confidence Intervals

Original (TP = 0.78, FP = 0.40):

            Predicted
  True      pos   neg
  pos        78    22
  neg        40    60

Resample #1 (TP = 0.75, FP = 0.45):

            Predicted
  True      pos   neg
  pos        75    25
  neg        45    55

Resample #2 (TP = 0.83, FP = 0.38):

            Predicted
  True      pos   neg
  pos        83    17
  neg        38    62

Resample the confusion matrix 10000 times and take the 95% envelope.
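A sketch of the resampling (my implementation, assuming each row of the confusion matrix is redrawn from its observed rate): bootstrap TP and FP, turn each draw into a cost-curve line, and keep the central 95% at each PC(+).

    import numpy as np

    rng = np.random.default_rng(0)

    n_pos = n_neg = 100          # row totals of the original matrix
    tp_hat, fp_hat = 0.78, 0.40  # observed rates

    # Redraw the confusion matrix 10000 times.
    tp = rng.binomial(n_pos, tp_hat, size=10000) / n_pos
    fp = rng.binomial(n_neg, fp_hat, size=10000) / n_neg

    # Each resample gives a cost-curve line Y = FN*x + FP*(1-x); the 95%
    # envelope at each PC(+) spans the 2.5th to 97.5th percentiles.
    x = np.linspace(0.0, 1.0, 101)
    lines = (1.0 - tp)[:, None] * x + fp[:, None] * (1.0 - x)
    lower, upper = np.percentile(lines, [2.5, 97.5], axis=0)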

Confidence Interval Example

[Plot: Normalized Expected Cost vs. PC(+) - Probability Cost, with the 95% confidence band around the classifier's cost curve]

Paired Resampling to Test

Statistical Significance

For the 100 test examples in the negative class:

                       Predicted by Classifier2
  Predicted by           pos   neg
  Classifier1    pos      30    10
                 neg       0    60

FP for classifier1: (30 + 10)/100 = 0.40
FP for classifier2: (30 + 0)/100 = 0.30
FP2 – FP1 = –0.10

Resample this matrix 10000 times to get (FP2 – FP1) values. Do the same for the matrix based on positive test examples. Plot and take the 95% envelope as before.
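A sketch of the paired resampling for the negative class (my code, using a multinomial redraw of the joint table): resample the four joint counts and record FP2 – FP1 each time.

    import numpy as np

    rng = np.random.default_rng(0)

    # Joint predictions on the 100 negative examples, in the order
    # (C1 pos, C2 pos), (C1 pos, C2 neg), (C1 neg, C2 pos), (C1 neg, C2 neg).
    counts = np.array([30, 10, 0, 60])
    n = counts.sum()

    samples = rng.multinomial(n, counts / n, size=10000)
    fp1 = (samples[:, 0] + samples[:, 1]) / n  # classifier1 predicts pos
    fp2 = (samples[:, 0] + samples[:, 2]) / n  # classifier2 predicts pos
    diff = fp2 - fp1

    lo, hi = np.percentile(diff, [2.5, 97.5])
    print(f"FP2 - FP1: 95% interval [{lo:.3f}, {hi:.3f}]")
    # An interval excluding 0 indicates a significant difference in FP rate.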

Low Correlation = Low Significance

[Plots: the cost curves of classifier1 and classifier2 (Normalized Expected Cost vs. PC(+) - Probability Cost), and a scatter of the resampled (FP2 – FP1, FN2 – FN1) differences centered near 0]

Comparing J48 and AdaBoost

Conclusions

• Scalar performance measures, including

AUC, do not indicate when one classifier is

better than another.

• Cost/ROC curve software is available.

• Cost curves enable easy visualization of

– average performance (expected cost)

– operating range

– confidence intervals on performance

– difference in performance and its significance.