1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and...

29
1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos Martial Hebert Computer Science Carnegie Mellon University October 31, 2008, Vancouver, Canada ACM MIR 2008
  • date post

    23-Jan-2016
  • Category

    Documents

  • view

    221
  • download

    0

Transcript of 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and...

Page 1: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

1

Unsupervised Modeling and Recognition of Object Categories with Combination of Visual

Contents and Geometric Similarity Links

Unsupervised Modeling and Recognition of Object Categories with Combination of Visual

Contents and Geometric Similarity Links

Gunhee KimChristos Faloutsos

Martial Hebert

Computer ScienceCarnegie Mellon University

October 31, 2008, Vancouver, CanadaACM MIR 2008

Page 2: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

2

OutlineOutline

• Problem Statement & Our Approach

• Word Histogram & Network Construction

• pLSA and LDA based Models

• Unsupervised Modeling & Recognition

• Experiments

• Discussion

Page 3: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

3

Unsupervised Modeling Unsupervised Modeling

• Category discovery + Ranking

Page 4: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

4

Recognition Recognition

Novel Images

Bicycle

Sheep Sign

• Classification + Localization

Page 5: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

5

IntuitionIntuition

• Combination of Topic contents and Link AnalysisLatent Topic: Bicycles

Word distributions

Same latent TopicDifferent latent Topic

(Sparse and irregular links)

link distributions

(Dense and consistent links)

link distributions

[1] Sivic, ICCV 2005[2] Fei Fei, ICCV 2005

Page 6: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

6

Intuition Intuition

• Combination of Topic contents and link analysis

• Samples of visual words based on Bag-of-Words

• Samples of links generated by image matching

• Two types of evidence into a single generative model

– Ex. Hierarchical Bayesian Models (pLSA, LDA)

Page 7: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

7

Our Previous WorkOur Previous Work

• Unsupervised Modeling using Link Analysis Techniques [Kim, CVPR08]

Large Scale Network

Link Analysis Techniques

(ex. PageRank)

- Only links- Only modeling

→ Visual content + Links→ Modeling + Recognition

Page 8: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

8

Pros over Conventional Models (1/2)Pros over Conventional Models (1/2)

• Easy Plug-in of geometric information

Indirect Formulation: Link generation with geometric consistency+ Independent of number of parts[Liu, ICCV 2008]

[Lazebnik, CVPR 2006] [Sudderth, ICCV 2005][Niebles, CVPR 2007]

[FeiFei CVPR07 Tutorials]

Page 9: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

9

Pros over Conventional Models (2/2)Pros over Conventional Models (2/2)

• Ambiguity in definition of visual words

Word A

Word B

Word C

Semantically similar Different

Different

+ Relaxed by similarity links between words

Page 10: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

10

OutlineOutline

• Problem Statement & Our Approach

• Word Histogram & Network Construction

• pLSA and LDA based Models

• Unsupervised Modeling & Recognition

• Experiments

• Discussion

Page 11: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

11

Visual Words HistogramVisual Words Histogram

• Follow Standard Bag-of-Words Approach

– Harris Affine + SIFT

– Dictionary Formation: K-mean clustering

Word ID

Freq

Word ID

Freq

Word ID

Freq

: Freq. of word w in the image j (Weighted by Links)

Page 12: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

12

Network GenerationNetwork Generation

• Pairwise Image Matching

– Spectral Matching [Leordeanu, ICCV 2005]

: Sum of weights of links from image a to image b

Page 13: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

13

OutlineOutline

• Problem Statement & Our Approach

• Word Histogram & Network Construction

• pLSA and LDA based Models

• Unsupervised Modeling & Recognition

• Experiments

• Discussion

Page 14: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

14

pLSA Based ModelpLSA Based Model

• Standard pLSA [Hofmann NIPS 1999]

• [Cohn NIPS 2001]

Page 15: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

15

LDA Based ModelLDA Based Model

• Standard LDA [Blei JMLR 2003]

wN

z

M

c

Lz

• Linked LDA [Erosheva PNAS 2004]

Page 16: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

16

OutlineOutline

• Problem Statement & Our Approach

• Word Histogram & Network Construction

• pLSA and LDA based Models

• Unsupervised Modeling & Recognition

• Experiments

• Discussion

Page 17: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

17

Unsupervised Modeling (1/2)Unsupervised Modeling (1/2)

• 1. Category Discovery

– Find out class memberships of all training images

– pLSA based model

– LDA based model

Page 18: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

18

Unsupervised Modeling (2/2)Unsupervised Modeling (2/2)

• 2. Ranking

– 1. For recognition, only fixed number of example images to be matched.

O(m) → O(K)

– 2. Highly probable mis-clustering in low ranked images

– pLSA based Model

– LDA based Model

: Most cited documents in topic i

Most matched image in class i

Page 19: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

19

New image

RecognitionRecognition

Word ID

Freq

30*K high ranked images

Page 20: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

20

RecognitionRecognition

• 3. Classification

– Similar formula to unsupervised clustering.

– pLSA based Model

– LDA based Model

• 4. Localization

– pLSA based Model

– LDA based Model [Sivic ICCV 2005]

Page 21: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

21

OutlineOutline

• Problem Statement & Our Approach

• Word Histogram & Network Construction

• pLSA and LDA based Models

• Unsupervised Modeling & Recognition

• Experiments

• Discussion

Page 22: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

22

Lea

rne

d ca

tego

ry a

ccu

racy

1.0

0.9

0.8

0.7

0.6

0.5

0.4

0.3

Standard Weighted Linked + Weighted

Comparison TestsComparison Tests

• 5 Objects in Caltech-101

– Similar experimental setup to [Kim CVPR 2008]

55.8% 62.1%

97.8%

Page 23: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

23

Unsupervised Category DiscoveryUnsupervised Category Discovery

• Experiment Setup

– MSRC: 6 Objects (75 training / testing)

– PASCAL/ETHZ: 4 objects

(40 training / testing)

85.4 %

90.3 %

PG PM

PC

MB MC

MD MS

MG

PP

Page 24: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

24

Classification of Unseen ImagesClassification of Unseen Images

• Experiment Setup

– MSRC: 6 Objects (75 training / testing)

– PASCAL/ETHZ: 4 objects (40 training /

testing)

85.4 %

90.3 %

PG PM

PC

MB MC

MD MS

MG

PP

80.5 %

82.16 %

Page 25: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

25

RankingRanking

1.0

0.8

0.6

0.4

0.2

Lea

rne

d ca

tego

ry a

ccu

racy

Number of selected examples per object5 10 15 20 25 30

77.5 % < 85.4 %

Only Link < Link+

Content

Page 26: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

26

LocalizationLocalization

• PASCAL/ETHZ dataset

• MSRC dataset

Motorbike Car Peoples Giraffe

Bike Car Sheep Door Sign

Page 27: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

27

OutlineOutline

• Problem Statement & Our Approach

• Word Histogram & Network Construction

• pLSA and LDA based Models

• Unsupervised Modeling & Recognition

• Experiments

• Discussion

Page 28: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

28

ConclusionConclusion

• Combination of Topic contents and Link Analysis

– Easy Plug-in of geometric information

– Relaxation of the ambiguous definition of visual words

– Integration between two object recognition approaches

• Unsupervised Modeling

+ Recognition

• Competitive performance

Page 29: 1 Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links Gunhee Kim Christos Faloutsos.

29

Comments?Comments?

Thank You

[email protected]