
1

Pattern recognition (4)

2

Things we have discussed until now

Statistical pattern recognition: building simple classifiers

Supervised classification: minimum distance classifier, Bayesian classifier (1D and multi-dimensional), building discriminant functions

Unsupervised classification: the K-means algorithm

3

Equivalence between classifiers

Pattern recognition using multivariate normal class-conditional densities, equal priors, and a common covariance matrix is simply a minimum Mahalanobis distance classifier.

4

Today

Classifier design: errors and risk in the classification process

Performance evaluation of classification systems

Reading: slides and blackboard derivations only

5

Error

How often will we be wrong? In the two-class case:
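With decision regions $R_1$ and $R_2$, a standard formulation of the error probability is:

$$P(\text{error}) = P(x \in R_2, \omega_1) + P(x \in R_1, \omega_2) = \int_{R_2} p(x \mid \omega_1)\,P(\omega_1)\,dx + \int_{R_1} p(x \mid \omega_2)\,P(\omega_2)\,dx$$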

6

Global error

Suppose that, using our training samples, we have partitioned the feature space into regions R1, …, RN.

Region Ri corresponds to class ωi if every sample falling in this region of the feature space is classified as ωi.

We can then compute the overall error of the classification process by integrating the class-specific errors over their corresponding regions.
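In symbols, summing each class's error over the regions where it is misclassified:

$$P(\text{error}) = \sum_{i=1}^{N} \sum_{j \neq i} \int_{R_j} p(x \mid \omega_i)\,P(\omega_i)\,dx$$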

7

Risk

Not every mistake has the same cost.
Classification strategy: minimize the expected loss (the risk) instead of the probability of error.

8

Example

Computer-assisted diagnosis of suspicious lesions in a CT scan: Two ways to go wrong

Alpha risk: labeling a lesion as malignant when in fact it is benign (consequence: unnecessary biopsy).

Beta risk: labeling a lesion as benign when in fact it is malignant (consequence: progression of cancer).

9

Another example

Quality control for manufactured parts: accept or reject a part. Two ways to go wrong:

Alpha risk: accepting a bad part (consequence: loses customers).

Beta risk: rejecting a good part (consequence: wastes money).

10

Loss tables

Suppose that the cost of classifying into class ωj when the actual class is ωi is Lij

We can summarize this in a loss table
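For two classes the table takes the standard form (row i = actual class ωi, column j = assigned class ωj):

$$L = \begin{pmatrix} L_{11} & L_{12} \\ L_{21} & L_{22} \end{pmatrix}$$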

11

Example (cont’d)

Let ω1 be the class for good parts and ω2 be the class for bad parts
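As an illustration, assume hypothetical costs (these values are assumed, not taken from the slide): correct decisions cost nothing, rejecting a good part wastes money (cost 1), and accepting a bad part loses customers (cost 5):

$$L = \begin{pmatrix} 0 & 1 \\ 5 & 0 \end{pmatrix}$$

Here row/column 1 corresponds to ω1 (good) and row/column 2 to ω2 (bad).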

12

Risk for a specific pattern X

Loss tables are useful for computing the risk of making a specific choice αi (assigning class ωi) given pattern X:

$$R(\alpha_i \mid X) = \sum_{j} L_{ji}\,P(\omega_j \mid X)$$

We compute this risk by adding up the cost of each possible true class, weighted by its posterior probability.

13

Computing risks for our example

Suppose that for our earlier example we have computed the posterior probabilities

$$P(\omega_1 \mid X) = 0.8, \qquad P(\omega_2 \mid X) = 0.2$$

We will compute the risks of classifying X into ω1 and into ω2 respectively.
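Using the hypothetical loss table above (L11 = L22 = 0, L12 = 1, L21 = 5), the two risks work out to:

$$R(\alpha_1 \mid X) = L_{11}\,P(\omega_1 \mid X) + L_{21}\,P(\omega_2 \mid X) = 0 \cdot 0.8 + 5 \cdot 0.2 = 1.0$$

$$R(\alpha_2 \mid X) = L_{12}\,P(\omega_1 \mid X) + L_{22}\,P(\omega_2 \mid X) = 1 \cdot 0.8 + 0 \cdot 0.2 = 0.8$$

The minimum-risk decision is α2 (reject the part) even though ω1 has the higher posterior probability.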

14

Bayesian classifiers revisited

Instead of maximizing the posterior probability P(ωj | X), minimize the risk R(αj | X).

Given N classes, we compute N risk values rj = R(αj | X).
We assign X to the class corresponding to the minimum risk (see the sketch below).
Derivation…
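A minimal sketch of this decision rule in Python (NumPy), assuming the loss-table convention above (loss[i, j] = L_ij, the cost of assigning class j when the true class is i):

```python
import numpy as np

def min_risk_class(posteriors, loss):
    """Minimum-risk Bayesian decision.

    posteriors : shape (N,), P(omega_j | X) for each of the N classes
    loss       : shape (N, N), loss[i, j] = cost of assigning class j
                 when the true class is i
    Returns the index of the class with minimum conditional risk.
    """
    # r_j = R(alpha_j | X) = sum_i L_ij * P(omega_i | X)
    risks = loss.T @ posteriors
    return int(np.argmin(risks))

# The quality-control example with the hypothetical loss table above:
print(min_risk_class(np.array([0.8, 0.2]),
                     np.array([[0.0, 1.0],
                               [5.0, 0.0]])))   # -> 1, i.e. omega_2
```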

15

The 0-1 Loss rule

Under this rule, the Bayesian classifier maximizes the posterior probability (as we have learned in the previous lecture) and can be expressed as a minimum distance classifier.

The most common assumption in Computer Vision classifiers!
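Concretely, the 0-1 rule sets $L_{ij} = 1 - \delta_{ij}$ (correct decisions cost nothing, every error costs 1), so the conditional risk reduces to

$$R(\alpha_i \mid X) = \sum_{j \neq i} P(\omega_j \mid X) = 1 - P(\omega_i \mid X)$$

and minimizing the risk is equivalent to maximizing the posterior probability.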

16

Performance evaluation paradigms

Against ground truth (manually generated segmentation/classification)

The preferred method in medical image segmentation

Benchmarking: for mature/maturing subfields in computer vision

Example 1: “The gait identification challenge problem: datasets and baseline algorithm”, International Conference on Pattern Recognition, 2002.

Example 2: “Benchmark Studies on Face Recognition”, International Workshop on Automatic Face- and Gesture-Recognition, 1995.

17

Evaluation of classifiers

ROC analysis
Precision and recall
Confusion matrices

18

ROC analysis

ROC stands for receiver operating characteristic; ROC analysis was initially used to analyze and compare the performance of human radar operators.

A ROC curve is a plot of the true positive rate against the false positive rate as some parameter (e.g., a decision threshold) is varied.

From around 1970, ROC curves were used in medical studies; they are useful in bringing out the sensitivity (true positive rate) versus the specificity (true negative rate, i.e., 1 − false positive rate) of diagnostic trials.

Computer vision performs ROC analysis on algorithms; we can also compare different algorithms designed for the same task. A sketch of the computation follows below.
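A minimal sketch for a score-thresholding detector, assuming NumPy arrays of per-sample scores and binary labels (the names here are hypothetical):

```python
import numpy as np

def roc_points(scores, labels):
    """Return (false positive rate, true positive rate) pairs as the
    decision threshold sweeps over every observed score value."""
    pts = []
    for t in np.unique(scores):
        pred = scores >= t                # detector says "yes" at/above t
        tpr = np.mean(pred[labels == 1])  # hit rate on the true positives
        fpr = np.mean(pred[labels == 0])  # false alarm rate on the negatives
        pts.append((fpr, tpr))
    return sorted(pts)
```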

19

ROC terminology

Four kinds of outcomes:
TP: we say “yes” and are right (true positive, a “hit”)
TN: we say “no” and are right (true negative, a “correct rejection”)
FP: we say “yes” and are wrong (false positive, a “false alarm”)
FN: we say “no” and are wrong (false negative, a “miss”)

We do not actually need all four rates, because

FN rate = 1 − TP rate
TN rate = 1 − FP rate
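In terms of raw counts, the two independent rates are

$$\text{TP rate} = \frac{TP}{TP + FN}, \qquad \text{FP rate} = \frac{FP}{FP + TN}$$

and the other two follow as the complements above.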

20

False positives, false negatives

21

ROC curves

There is a trade-off between the true positive rate and the false positive rate: an increase in the true positive rate is accompanied by an increase in the false positive rate.

The area under each curve (AUC) gives a measure of accuracy.
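A sketch of the area computation with the trapezoidal rule, using (fpr, tpr) pairs such as those from the roc_points sketch earlier (the curve samples here are hypothetical):

```python
import numpy as np

# Hypothetical ROC curve samples, sorted by increasing false positive rate
fpr = np.array([0.0, 0.1, 0.3, 0.6, 1.0])
tpr = np.array([0.0, 0.5, 0.8, 0.95, 1.0])

# Trapezoidal rule: sum of trapezoid areas between consecutive points
auc = np.sum(np.diff(fpr) * (tpr[1:] + tpr[:-1]) / 2)
print(f"AUC = {auc:.3f}")   # 1.0 = perfect detector, 0.5 = chance
```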

22

ROC curve

- the closer the curve approaches the top left-hand corner of the plot, the more accurate the classifier;
- the closer the curve is to the 45° diagonal, the worse the classifier.

23

Where are ROC curves helpful?

Detection-type problems:
Face detection in images/video data
Event detection in video data
Lesion detection in medical images
Etc.

24

Precision and recall

Also used mostly for detection-type problems.
In a multiple-class case, they can be measured for each class.

$$\text{recall} = \frac{\text{no. of correct detections}}{\text{total number of } C_1 \text{ samples in the database}} = \frac{\text{true } C_1}{\text{true } C_1 + \text{missed detections}}$$

$$\text{precision} = \frac{\text{no. of correct detections}}{\text{total number of detections}} = \frac{\text{true } C_1}{\text{true } C_1 + \text{false alarms}}$$

25

Trade-off between precision and recall

Example: content-based image retrieval.
Suppose we aim to detect all sunset images in an image database.
The database contains 200 sunset images.
The classifier retrieves 150 of the 200 relevant images, plus 100 images of no interest to the user.
Precision = 150/250 = 60%
Recall = 150/200 = 75%

The system could obtain 100 percent recall if it returned all images in the database, but its precision would be terrible.
If we aim at a low false-alarm rate, precision will be high but recall will be low.
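A quick check of these numbers in Python, using the counts from the slide:

```python
relevant_total = 200   # sunset images in the database
hits = 150             # relevant images actually retrieved
false_alarms = 100     # retrieved images of no interest to the user

precision = hits / (hits + false_alarms)   # 150/250 = 0.60
recall = hits / relevant_total             # 150/200 = 0.75
```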

26

Confusion matrix

Used for visualizing/reporting results of a classification system

27

The binary confusion matrix

We can construct a binary confusion matrix for one class
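For a single class, the standard 2×2 layout is (rows = actual, columns = predicted):

              predicted “yes”   predicted “no”
actual “yes”        TP                FN
actual “no”         FP                TN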

28

Calculating the precision and recall from the confusion matrix

Example: consider the confusion matrix of an OCR system that produces the following output over a test document set.

Calculate the precision and recall for class a.
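As a sketch, assume a hypothetical three-class confusion matrix (these counts are illustrative, not the slide's); precision for a class comes from its column and recall from its row:

```python
import numpy as np

# Hypothetical OCR confusion matrix (illustrative values only):
# rows = actual class, columns = class assigned by the OCR.
classes = ["a", "b", "c"]
conf = np.array([[90,  5,  5],   # actual 'a'
                 [ 8, 85,  7],   # actual 'b'
                 [ 2, 10, 88]])  # actual 'c'

i = classes.index("a")
precision = conf[i, i] / conf[:, i].sum()  # correct 'a' / everything labelled 'a'
recall    = conf[i, i] / conf[i, :].sum()  # correct 'a' / everything actually 'a'
print(f"class a: precision = {precision:.2f}, recall = {recall:.2f}")
```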