Unsupervised Modeling and Recognition of Object Categories with Combination of Visual Contents and Geometric Similarity Links

Gunhee Kim, Christos Faloutsos, Martial Hebert

Computer Science, Carnegie Mellon University

ACM MIR 2008, October 31, 2008, Vancouver, Canada

Outline

• Problem Statement & Our Approach

• Word Histogram & Network Construction

• pLSA and LDA based Models

• Unsupervised Modeling & Recognition

• Experiments

• Discussion

Unsupervised Modeling

• Category discovery + Ranking

Recognition

• Classification + Localization

[Figure: novel images labeled Bicycle, Sheep, Sign]

Intuition

• Combination of Topic contents and Link Analysis

[Figure: example latent topic "Bicycles", showing word distributions and link distributions. Same latent topic: dense and consistent links. Different latent topic: sparse and irregular links.]

[1] Sivic, ICCV 2005

[2] Fei-Fei, ICCV 2005

Intuition

• Combination of Topic contents and link analysis

• Samples of visual words based on Bag-of-Words

• Samples of links generated by image matching

• Two types of evidence combined into a single generative model

– Ex. Hierarchical Bayesian Models (pLSA, LDA)

Our Previous Work

• Unsupervised Modeling using Link Analysis Techniques [Kim, CVPR08]

– Large Scale Network + Link Analysis Techniques (e.g., PageRank)

– Only links → Visual content + Links

– Only modeling → Modeling + Recognition

Pros over Conventional Models (1/2)

• Easy Plug-in of geometric information

– Indirect Formulation: Link generation with geometric consistency

+ Independent of number of parts

[Liu, ICCV 2008] [Lazebnik, CVPR 2006] [Sudderth, ICCV 2005] [Niebles, CVPR 2007]

[FeiFei CVPR07 Tutorials]

Pros over Conventional Models (2/2)

• Ambiguity in definition of visual words

[Figure: example patches for Word A, Word B, and Word C, annotated "Semantically similar" and "Different"]

+ Relaxed by similarity links between words

Visual Words Histogram

• Follow Standard Bag-of-Words Approach

– Harris Affine + SIFT

– Dictionary Formation: K-means clustering

– Histogram entry: frequency of word w in image j (weighted by links)

[Figure: visual word histogram; x-axis: Word ID]
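
The histogram step can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes a dictionary of cluster centers is already available (for example from K-means), and the optional `weights` argument is a hypothetical stand-in for the link-based weighting, whose exact formula is not recoverable from the slide.

```python
from math import dist

def word_histogram(descriptors, dictionary, weights=None):
    """Count how often each visual word occurs in one image.

    descriptors: feature vectors (e.g. SIFT) extracted from one image.
    dictionary:  cluster centers, one per visual word.
    weights:     optional per-feature weights (hypothetical stand-in for
                 the link-based weighting); defaults to 1.0 per feature.
    """
    if weights is None:
        weights = [1.0] * len(descriptors)
    hist = [0.0] * len(dictionary)
    for desc, wt in zip(descriptors, weights):
        # Hard-assign the feature to its nearest dictionary entry.
        word_id = min(range(len(dictionary)),
                      key=lambda k: dist(desc, dictionary[k]))
        hist[word_id] += wt
    return hist

# Toy 2-D "descriptors" and a 3-word dictionary.
dictionary = [(0.0, 0.0), (10.0, 10.0), (0.0, 10.0)]
descriptors = [(0.5, 0.2), (9.0, 9.5), (0.1, 0.3), (1.0, 9.0)]

plain = word_histogram(descriptors, dictionary)
weighted = word_histogram(descriptors, dictionary, weights=[2.0, 1.0, 0.5, 1.0])
print(plain)     # [2.0, 1.0, 1.0]
print(weighted)  # [2.5, 1.0, 1.0]
```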

Network Generation

• Pairwise Image Matching

– Spectral Matching [Leordeanu, ICCV 2005]

– Edge weight: sum of the weights of links from image a to image b
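
As a rough sketch of the network construction, the snippet below aggregates feature-level links into image-level edge weights. It assumes the matcher has already produced `(image_a, image_b, weight)` triples; that input format and the function name `build_link_network` are illustrative assumptions, and the spectral matching step itself is not implemented here.

```python
from collections import defaultdict

def build_link_network(matches):
    """Aggregate feature-level links into a weighted, directed image graph.

    matches: iterable of (image_a, image_b, weight) triples, one per
             feature-level link produced by pairwise image matching.
    Returns a nested dict W where W[a][b] is the sum of the weights of
    all links from image a to image b.
    """
    W = defaultdict(lambda: defaultdict(float))
    for a, b, wt in matches:
        if a != b:  # ignore self-links
            W[a][b] += wt
    return {a: dict(row) for a, row in W.items()}

matches = [
    ("img1", "img2", 0.5),
    ("img1", "img2", 0.25),
    ("img2", "img1", 0.5),
    ("img1", "img3", 1.0),
]
network = build_link_network(matches)
print(network)
```

A nested dictionary keeps the graph sparse, which matters when most image pairs share no consistent matches.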

pLSA Based Model

• Standard pLSA [Hofmann NIPS 1999]

• [Cohn NIPS 2001]
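
To make the idea of a pLSA model over both words and links concrete, here is a small EM sketch in the spirit of the joint content and link formulation. It is an assumption-laden illustration, not the model from the talk: the mixing weight `alpha`, the per-document normalization of counts, and the toy data are all choices made for this example.

```python
import math
import random

def normalize(values):
    total = sum(values)
    return [v / total for v in values]

def joint_plsa(N, A, K, alpha=0.5, iters=30, seed=0):
    """EM for a pLSA-style model with two observation types per document.

    N[d][w]: count of word w in image d.  A[d][c]: link weight from d to c.
    alpha:   assumed mixing weight between word and link evidence.
    Every row of N and A must have a positive sum.
    Returns P(w|z), P(c|z), P(z|d) and the objective at each iteration.
    """
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must lie strictly between 0 and 1")
    rng = random.Random(seed)
    D, V, C = len(N), len(N[0]), len(A[0])
    # Per document, words get total weight alpha and links 1 - alpha.
    Nw = [[alpha * x / sum(row) for x in row] for row in N]
    Aw = [[(1.0 - alpha) * x / sum(row) for x in row] for row in A]
    p_w_z = [normalize([rng.random() + 0.1 for _ in range(V)]) for _ in range(K)]
    p_c_z = [normalize([rng.random() + 0.1 for _ in range(C)]) for _ in range(K)]
    p_z_d = [normalize([rng.random() + 0.1 for _ in range(K)]) for _ in range(D)]
    history = []
    for _ in range(iters):
        acc_w = [[0.0] * V for _ in range(K)]
        acc_c = [[0.0] * C for _ in range(K)]
        acc_zd = [[0.0] * K for _ in range(D)]
        objective = 0.0
        for d in range(D):
            for obs, p_x_z, acc_x in ((Nw[d], p_w_z, acc_w), (Aw[d], p_c_z, acc_c)):
                for x, n in enumerate(obs):
                    if n == 0.0:
                        continue
                    # E-step: posterior over topics for this observation.
                    joint = [p_x_z[z][x] * p_z_d[d][z] for z in range(K)]
                    total = sum(joint)
                    objective += n * math.log(total)
                    for z in range(K):
                        resp = n * joint[z] / total
                        acc_x[z][x] += resp
                        acc_zd[d][z] += resp
        # M-step: renormalize the accumulated responsibilities.
        p_w_z = [normalize(row) for row in acc_w]
        p_c_z = [normalize(row) for row in acc_c]
        p_z_d = [normalize(row) for row in acc_zd]
        history.append(objective)
    return p_w_z, p_c_z, p_z_d, history

# Toy data: images 0-1 and 2-3 share words and mostly link to each other.
N = [[5, 4, 0, 0], [4, 5, 1, 0], [0, 1, 5, 4], [0, 0, 4, 5]]
A = [[0, 3, 1, 0], [3, 0, 0, 0], [0, 0, 0, 3], [0, 1, 3, 0]]
p_w_z, p_c_z, p_z_d, history = joint_plsa(N, A, K=2)
print([[round(p, 3) for p in row] for row in p_z_d])
```

The weighted log-likelihood stored in `history` should not decrease from one iteration to the next, which is a convenient sanity check on the updates.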

LDA Based Model

• Standard LDA [Blei JMLR 2003]

• Linked LDA [Erosheva PNAS 2004]
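
A linked LDA-style model can be read as a generative process in which both the visual words and the links of an image are drawn from the same per-image topic proportions. The sketch below samples one image under that reading. The parameter names (`alpha`, `beta`, `omega`) and the toy values are assumptions for illustration, and no inference is performed.

```python
import random

def sample_linked_document(alpha, beta, omega, n_words, n_links, rng):
    """Sample one image from a linked LDA-style generative process.

    alpha: Dirichlet parameters, one per topic.
    beta:  beta[z][w]  = P(word w | topic z).
    omega: omega[z][c] = P(link to image c | topic z).
    """
    # theta ~ Dirichlet(alpha), drawn via normalized Gamma variates.
    gammas = [rng.gammavariate(a, 1.0) for a in alpha]
    theta = [g / sum(gammas) for g in gammas]
    topics = list(range(len(alpha)))

    def draw(table, count):
        out = []
        for _ in range(count):
            z = rng.choices(topics, weights=theta)[0]  # topic for this token
            ids = list(range(len(table[z])))
            out.append(rng.choices(ids, weights=table[z])[0])
        return out

    words = draw(beta, n_words)   # visual word IDs
    links = draw(omega, n_links)  # IDs of linked images
    return theta, words, links

rng = random.Random(0)
alpha = [0.5, 0.5]
beta = [[0.6, 0.3, 0.1], [0.1, 0.3, 0.6]]
omega = [[0.7, 0.2, 0.1, 0.0], [0.0, 0.1, 0.2, 0.7]]
theta, words, links = sample_linked_document(alpha, beta, omega, 20, 5, rng)
print(theta, words, links)
```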

Unsupervised Modeling (1/2)

• 1. Category Discovery

– Find the class membership of every training image

– pLSA based model

– LDA based model
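
One common way to read class memberships off a fitted topic model is to assign each image to its most probable topic. The snippet below sketches that rule. The per-image topic proportions are assumed to be given, and the exact formulas used in the talk are not recoverable from the slide.

```python
def discover_categories(topic_proportions):
    """Assign each image to the topic with the largest proportion.

    topic_proportions[d][z]: estimated weight of topic z in image d
    (for example P(z|d) in pLSA, or the topic posterior in LDA).
    """
    return [max(range(len(row)), key=row.__getitem__)
            for row in topic_proportions]

def group_by_category(labels):
    """Collect image indices per discovered category."""
    groups = {}
    for image_id, label in enumerate(labels):
        groups.setdefault(label, []).append(image_id)
    return groups

topic_proportions = [[0.9, 0.1], [0.7, 0.3], [0.2, 0.8], [0.4, 0.6]]
labels = discover_categories(topic_proportions)
groups = group_by_category(labels)
print(labels)  # [0, 0, 1, 1]
print(groups)  # {0: [0, 1], 1: [2, 3]}
```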

Unsupervised Modeling (2/2)

• 2. Ranking

– 1. For recognition, only a fixed number of example images needs to be matched: O(m) → O(K)

– 2. Mis-clustering is highly probable among low-ranked images

– pLSA based Model

– LDA based Model

– Analogy: the most cited documents in topic i correspond to the most matched images in class i
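
The ranking step can be sketched as picking, for each class, the images with the highest link-based score. Here the score is assumed to be a per-topic "citation" probability such as P(c|z); the function name and toy numbers are illustrative only.

```python
def rank_examples(link_scores, labels, top_k):
    """Select the top_k highest-scoring images of each class.

    link_scores[z][c]: score of image c under topic z (assumed here to
                       play the role of "how often c is cited in topic z").
    labels[c]:         discovered class of image c.
    """
    ranked = {}
    for z, scores in enumerate(link_scores):
        members = [c for c, lab in enumerate(labels) if lab == z]
        members.sort(key=lambda c: scores[c], reverse=True)
        ranked[z] = members[:top_k]
    return ranked

link_scores = [[0.5, 0.3, 0.15, 0.05], [0.05, 0.05, 0.3, 0.6]]
labels = [0, 0, 1, 1]
top1 = rank_examples(link_scores, labels, top_k=1)
top2 = rank_examples(link_scores, labels, top_k=2)
print(top1)  # {0: [0], 1: [3]}
print(top2)  # {0: [0, 1], 1: [3, 2]}
```

Keeping only the top-ranked images per class bounds the number of matches needed for a new image and leaves out the low-ranked images that are most likely to be mis-clustered.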

Recognition

[Figure: a new image is matched against the 30*K high ranked images; word histogram with x-axis: Word ID]

Recognition

• 3. Classification

– Similar formula to unsupervised clustering.

– LDA based Model

• 4. Localization

– LDA based Model [Sivic ICCV 2005]
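
A simple way to localize an object with a topic model is to keep only the features whose most likely topic equals the predicted class and take their bounding box. The sketch below follows that idea under stated assumptions: the word-topic table and the image's topic proportions are given, and the bounding-box rule is a simplification chosen for this example rather than the procedure from the talk.

```python
def localize(features, p_w_z, topic_proportions, target_topic):
    """Bounding box of the features attributed to target_topic.

    features:          list of (x, y, word_id) for one image.
    p_w_z[z][w]:       P(word w | topic z).
    topic_proportions: topic weights of this image.
    Returns (x_min, y_min, x_max, y_max), or None if no feature matches.
    """
    n_topics = len(p_w_z)
    kept = []
    for x, y, w in features:
        # Unnormalized posterior over topics for this feature.
        post = [p_w_z[z][w] * topic_proportions[z] for z in range(n_topics)]
        best = max(range(n_topics), key=post.__getitem__)
        if best == target_topic:
            kept.append((x, y))
    if not kept:
        return None
    xs, ys = zip(*kept)
    return (min(xs), min(ys), max(xs), max(ys))

p_w_z = [[0.7, 0.2, 0.1], [0.1, 0.2, 0.7]]
topic_proportions = [0.6, 0.4]
features = [(10, 20, 0), (30, 40, 0), (200, 5, 2), (25, 35, 1)]
box0 = localize(features, p_w_z, topic_proportions, target_topic=0)
box1 = localize(features, p_w_z, topic_proportions, target_topic=1)
print(box0)  # (10, 20, 30, 40)
print(box1)  # (200, 5, 200, 5)
```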

Comparison Tests

• 5 Objects in Caltech-101

– Similar experimental setup to [Kim CVPR 2008]

[Figure: results for the Standard, Weighted, and Linked + Weighted models; reported accuracies: 55.8%, 62.1%]

Unsupervised Category Discovery

• Experiment Setup

– MSRC: 6 Objects (75 training / testing)

– PASCAL/ETHZ: 4 objects (40 training / testing)

[Figure: category discovery results; reported accuracies: 85.4 %, 90.3 %]

Classification of Unseen Images

• Experiment Setup

– MSRC: 6 Objects (75 training / testing)

– PASCAL/ETHZ: 4 objects (40 training / testing)

[Figure: classification results; reported accuracies: 85.4 %, 90.3 %, 80.5 %, 82.16 %]

Ranking

[Figure: results vs. number of selected examples per object (5, 10, 15, 20, 25, 30)]

• Only Link (77.5 %) < Link + Content (85.4 %)

Localization

• PASCAL/ETHZ dataset: Motorbike, Car, People, Giraffe

• MSRC dataset: Bike, Car, Sheep, Door, Sign

Conclusion

• Combination of Topic contents and Link Analysis

– Easy Plug-in of geometric information

– Relaxation of the ambiguous definition of visual words

– Integration between two object recognition approaches

• Unsupervised Modeling + Recognition

• Competitive performance

Comments?

Thank You

gunhee@cs.cmu.edu
