Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus,...

20
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba
  • date post

    19-Dec-2015
  • Category

    Documents

  • view

    219
  • download

    0

Transcript of Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus,...

Page 1: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Beyond bags of features:Adding spatial information

Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba

Page 2: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Adding spatial information

• Computing bags of features on sub-windows of the whole image

• Using codebooks to vote for object position• Generative part-based models

Page 3: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Spatial pyramid representationSpatial pyramid representation• Extension of a bag of features• Locally orderless representation at several levels of resolution

level 0

Lazebnik, Schmid & Ponce (CVPR 2006)

Page 4: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Spatial pyramid representationSpatial pyramid representation• Extension of a bag of features• Locally orderless representation at several levels of resolution

level 0 level 1

Lazebnik, Schmid & Ponce (CVPR 2006)

Page 5: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Spatial pyramid representationSpatial pyramid representation

level 0 level 1 level 2

• Extension of a bag of features• Locally orderless representation at several levels of resolution

Lazebnik, Schmid & Ponce (CVPR 2006)

Page 6: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Scene category datasetScene category dataset

Multi-class classification results(100 training images per class)

Page 7: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Caltech101 datasetCaltech101 datasethttp://www.vision.caltech.edu/Image_Datasets/Caltech101/Caltech101.html

Multi-class classification results (30 training images per class)

Page 8: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Implicit shape models

• Visual codebook is used to index votes for object position

B. Leibe, A. Leonardis, and B. Schiele, Combined Object Categorization and Segmentation with an Implicit Shape Model, ECCV Workshop on Statistical Learning in Computer Vision 2004

training image annotated with object localization info

visual codeword withdisplacement vectors

Page 9: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Implicit shape models

• Visual codebook is used to index votes for object position

B. Leibe, A. Leonardis, and B. Schiele, Combined Object Categorization and Segmentation with an Implicit Shape Model, ECCV Workshop on Statistical Learning in Computer Vision 2004

test image

Page 10: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Implicit shape models: Details

B. Leibe, A. Leonardis, and B. Schiele, Combined Object Categorization and Segmentation with an Implicit Shape Model, ECCV Workshop on Statistical Learning in Computer Vision 2004

Page 11: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Generative part-based models

R. Fergus, P. Perona and A. Zisserman, Object Class Recognition by Unsupervised Scale-Invariant Learning, CVPR 2003

Page 12: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Probabilistic model

h: assignment of features to parts

)|(),|(),|(max

)|,()|(

objecthpobjecthshapepobjecthappearanceP

objectshapeappearancePobjectimageP

h

Partdescriptors

Partlocations

Candidate parts

Page 13: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Probabilistic model

h: assignment of features to parts

Part 2

Part 3

Part 1

)|(),|(),|(max

)|,()|(

objecthpobjecthshapepobjecthappearanceP

objectshapeappearancePobjectimageP

h

Page 14: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Probabilistic model

h: assignment of features to parts

Part 2

Part 3

Part 1

)|(),|(),|(max

)|,()|(

objecthpobjecthshapepobjecthappearanceP

objectshapeappearancePobjectimageP

h

Page 15: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Probabilistic model

)|(),|(),|(max

)|,()|(

objecthpobjecthshapepobjecthappearanceP

objectshapeappearancePobjectimageP

h

High-dimensional appearance space

Distribution over patchdescriptors

Page 16: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Probabilistic model

)|(),|(),|(max

)|,()|(

objecthpobjecthshapepobjecthappearanceP

objectshapeappearancePobjectimageP

h

2D image space

Distribution over jointpart positions

Page 17: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Results: Faces

Faceshapemodel

Patchappearancemodel

Recognitionresults

Page 18: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Results: Motorbikes and airplanes

Page 19: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Representing people

Page 20: Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Summary: Adding spatial information• Spatial pyramids

• Pro: simple extension of a bag of features, works very well• Con: no geometric invariance, no object localization

• Implicit shape models• Pro: can localize object, maintain translation and possibly scale

invariance• Con: need supervised training data (known object positions and possibly

segmentation masks)

• Generative part-based models• Pro: very nice conceptually, can be learned from unsegmented images• Con: combinatorial hypothesis search problem