Combining efficient object localization and image classification

31
Combining efficient object localization and image classi cation H. Harzallah, F. Jurie and C. Schmid LEAR, INRIA Grenoble, LJK

description

Combining efficient object localization and image classification. H. Harzallah, F. Jurie and C. Schmid LEAR, INRIA Grenoble, LJK. Tasks. Image classification: assigning labels to the image. Car: present Cow: present Bike: not present Horse: not present …. Cow. Car. Tasks. - PowerPoint PPT Presentation

Transcript of Combining efficient object localization and image classification

Page 1: Combining efficient object localization  and image classification

Combining efficient object localization and image classification

H. Harzallah, F. Jurie and C. Schmid

LEAR, INRIA Grenoble, LJK

Page 2: Combining efficient object localization  and image classification

• Image classification: assigning labels to the image

Tasks

Car: presentCow: presentBike: not presentHorse: not present…

Page 3: Combining efficient object localization  and image classification

• Image classification: assigning labels to the image

Tasks

Car: presentCow: presentBike: not presentHorse: not present…

• Object localization: define the location and the category

Car CowLocatio

n

Category

Page 4: Combining efficient object localization  and image classification

Contributions

• Object class localization method

• Combining image classification and object localization

Localization++ Classification-- Localization-- Classification++

Page 5: Combining efficient object localization  and image classification

Overview

• Related work and datasets

• Efficient object localization– Experimental results

• Combining image classification and localization– Experimental results

• Conclusion

Page 6: Combining efficient object localization  and image classification

Related work

• Object localization– Sliding window [Dalal06] [Rowley95]– Implicit shape model [Leibe04]– SVM classifiers [Chum07] [Ferrari08]– Cascade of classifiers [Viola01] [Vedaldi09]

• Context information– Combination of context sources [Divvala09]– Graphical model of events in images [Li07]– Local segmentation + global classification [Shotton08] [Heitz08]

Page 7: Combining efficient object localization  and image classification

PASCAL VOC dataset

• PASCAL VOC dataset 2007 and 2008

• Two tasks : classification and localization

• Fixed train/test set-up for the 20 object classes

• Standard evaluation measure – Area of overlap as detection matching criterion– Average precision for performance evaluation

Page 8: Combining efficient object localization  and image classification

Overview

• Related work and datasets

• Efficient object localization– Experimental results

• Combining image classification and localization– Experimental results

• Conclusion

Page 9: Combining efficient object localization  and image classification

Efficient object localization

• Sliding window based approach

• Image representation• Combination of features• Extensive parameters evaluation

Robust image representation

• Efficient search strategy

Page 10: Combining efficient object localization  and image classification

Image representation

• Combination of 2 image representations

• Histogram Oriented Gradient– Gradient based features– Integral Histograms

• Bag of Features– SIFT features extracted densely + k-means clustering– Pyramidal representation of the sliding windows– One histogram per tile Histogram

Histogram

Histogram

Histogram

Histogram Histogram

Page 11: Combining efficient object localization  and image classification

Efficient search strategy

• Reduce search complexity– Sliding windows: huge number of candidate windows– Cascades: pros/cons

• Two stage cascade:– Filtering classifier with a linear SVM

• Low computational cost• Evaluation: capacity of rejecting negative windows

– Scoring classifier with a non-linear SVM• Χ2 kernel with a channel combination [Zhang07]• Significant increase of performance

Page 12: Combining efficient object localization  and image classification

Efficiency of the 2 stage localization• Performance w. resp. to nbr of windows selected by the linear SVM

(mAP on Pascal 2007)

• Sliding windows: 100k candidate windows• A small number of windows are enough after filtering

Page 13: Combining efficient object localization  and image classification

Localization performance: aeroplane

Method AP

X2, HOG+BOF 33.8

X2, BOF 29.8

X2, HOG 18.4

Linear, HOG 10.0

Page 14: Combining efficient object localization  and image classification

Localization performance: car

Method AP

X2, HOG+BOF 50.4

X2, BOF 42.3

X2, HOG 47.5

Linear, HOG 33.9

Page 15: Combining efficient object localization  and image classification

Localization performance

• Mean Average Precision on all 20 classes

• PASCAL 2007 dataset

Method mAP

Linear, HOG 14.6

Linear, BOF 15.0

Linear, HOG+BOF 17.6

X2, HOG 21.9

X2, BOF 23.1

X2, HOG+BOF 26.3

Page 16: Combining efficient object localization  and image classification

Localization examples: correct localizations

Car

Sofa

Bicycle

Horse

Page 17: Combining efficient object localization  and image classification

Localization examples: false positives

Car

Sofa

Bicycle

Horse

Page 18: Combining efficient object localization  and image classification

Localization examples: missed objects

Car

Sofa

Bicycle

Horse

Page 19: Combining efficient object localization  and image classification

Overview

• Related work and datasets

• Efficient object localization– Experimental results

• Combining image classification and localization– Experimental results

• Conclusion

Page 20: Combining efficient object localization  and image classification

• Image classification & localization use a different information

Combination: key points

• For many TP only one has a high score• Truncated objects: hard for the

detector• Small objects: ok for the detector but

not for the classifier using global information

Page 21: Combining efficient object localization  and image classification

• Input: classification ( Si ) and localization ( Sw ) scores

• Output: probability that object is present

• Suppose that classification and localization outputs are independent:

Combination model

Page 22: Combining efficient object localization  and image classification

• For each modality (classification/detection): notion of detectability P(Di) for classifier and P(Dw) for detector

• Encodes the ability to detect presence of the objects

• Assuming that the classifier/detector outputs conditional probabilities: P(O|Di,Si) and P(O|Dw,Sw)

Combination model

Page 23: Combining efficient object localization  and image classification

• P (O |Si) = P(Di) × P(O|Si, Di) + P(¬Di) × P(O|Si,¬Di)

• P (O |Sw) = P(Dw) × P(O|Sw, Dw) + P(¬Dw) × P(O|Si,¬Dw)

• Final probability:

• Handle both cases:– Object detectable by two modalities– Object detectable by only one modality

Combination model

Page 24: Combining efficient object localization  and image classification

• P(O|¬Di,Si) and P(O|¬Di,Si) : constant value

• Sw = classification by localization: highest localization score

• Priors P(Di) and P(Dw) class dependant

Combination model

Page 25: Combining efficient object localization  and image classification

Combination experimental setup

• Image classifier : INRIA_flat classifier– SVM classifier Χ2 kernel using multiple feature channels [Zhang07]– Excellent results in PASCAL 2008 challenge

• Detector : as described previously

• Experimental validation on PASCAL VOC 2007

• Comparison to the state of the art on PASCAL VOC 2008

Page 26: Combining efficient object localization  and image classification

Experimental results : gain obtained

• Classification

• Localization0

2

4

6

8

1 0

1 2

B o t t le P la n t T V C a r C o w

G a i n i n A P

0

1

2

3

4

5

6

7

8

C o w S h e e p B o t t le S o f a C a t

G a i n i n A P

Method mAP

Base Classifier 60.1

Our Combination 63.5

Method mAP

Base Detector 26.3

Our Combination 28.9

Page 27: Combining efficient object localization  and image classification

• Correct but low score localization• High classification score score increased after combination

Experimental results

Car localization

Page 28: Combining efficient object localization  and image classification

• High classification score• No localization score decreased after combination

Experimental results

Car classification

Page 29: Combining efficient object localization  and image classification

• Based on blind evaluation on PASCAL VOC 2008• Classification

– Best on 12 classes out of 20

• Localization

– Best on 11 classes out of 20

Comparison to the state of the art

Method mAP

Lear_flat 53.8

Lear_shotgun 54.5

SurreyUvA_SRKDA 54.9

UvA_TreeSFS 54.3

Our method (based on Lear_flat) 57.7

Method mAP

CASIA_Det 12.7

MPI_struct 10.4

UoCTTIUCI 22.8

Our method 22.7

Page 30: Combining efficient object localization  and image classification

Conclusion

• Efficient localization method

• Successful combination of classification and localization

• State of the art performance on both tasks

Page 31: Combining efficient object localization  and image classification

Thank you