CVPR 2010: Advanced Information Theory in CVPR "in a Nutshell", Part 5: Shape Matching and Divergences


Tutorial

Advanced Information Theory in CVPR “in a Nutshell”

CVPR, June 13-18, 2010, San Francisco, CA

Anand Rangarajan

Shape Matching with I-Divergences

Group-wise Point-set Pattern Registration

Given N point-sets, denoted {X^p, p ∈ {1, ..., N}}, the task of multiple point pattern matching, or point-set registration, is to recover the spatial transformations that yield the best alignment of all the shapes.

Problem Visualization

Group-wise Point-set Registration

Principal Technical Challenges

- Solving for nonrigid deformations between point-sets with unknown correspondences is a difficult problem.
- How do we align all the point-sets in a symmetric manner so that there is no bias toward any particular point-set?

From point-sets to density functions

Group-wise Point-set Registration

From point-sets to density functions

- Point-sets are represented by probability density functions.
- Intuitively, if these point-sets are aligned properly, the corresponding density functions should be similar.

Question: how do we measure the similarity between multiple density functions?

Divergence Measures

Kullback-Leibler divergence

D_KL(p ‖ q) = ∫ p(x) log [ p(x) / q(x) ] dx

where p(x) and q(x) are probability density functions.

J divergence

Given two probability density functions p and q, the symmetric KL divergence is defined as

J(p, q) = (1/2) [ D_KL(p ‖ q) + D_KL(q ‖ p) ]

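To make the two definitions concrete, here is a minimal Python sketch (not part of the original slides) that evaluates both divergences for densities discretized on a common grid; the Gaussian test densities, grid, and spacing are my own choices for illustration.

```python
import numpy as np

def kl_divergence(p, q, dx):
    """Discretized D_KL(p || q) = integral of p * log(p/q) dx on a common grid.
    Assumes q > 0 wherever p > 0."""
    m = p > 0
    return np.sum(p[m] * np.log(p[m] / q[m])) * dx

def j_divergence(p, q, dx):
    """Symmetrized KL: J(p, q) = (D_KL(p||q) + D_KL(q||p)) / 2."""
    return 0.5 * (kl_divergence(p, q, dx) + kl_divergence(q, p, dx))

x = np.linspace(-10.0, 10.0, 2001)
dx = x[1] - x[0]
p = np.exp(-0.5 * x**2) / np.sqrt(2.0 * np.pi)           # N(0, 1)
q = np.exp(-0.5 * (x - 1.0)**2) / np.sqrt(2.0 * np.pi)   # N(1, 1)

print(kl_divergence(p, q, dx))   # about 0.5 for unit Gaussians one sigma apart
print(j_divergence(p, q, dx))    # also about 0.5 in this symmetric case
```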

Motivating the JS divergence

Modeling two shapes, X and Y:

p(X | θ^(1)) = ∏_{i=1}^{N_1} (1/K_1) ∑_{a=1}^{K_1} p(X_i | θ_a^(1)),    p(Y | θ^(2)) = ∏_{j=1}^{N_2} (1/K_2) ∑_{b=1}^{K_2} p(Y_j | θ_b^(2))
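A minimal sketch of this modeling step, under the simplifying assumption of one isotropic Gaussian component per point (K equal to the number of points, equal weights) with a hand-picked bandwidth sigma; all names and values here are illustrative, not the tutorial's code.

```python
import numpy as np

def shape_density(points, sigma=0.05):
    """Density of a point-set modeled as an equal-weight Gaussian mixture
    with one isotropic component per point (K = number of points)."""
    pts = np.asarray(points, dtype=float)
    n, d = pts.shape
    norm = (2.0 * np.pi * sigma**2) ** (d / 2.0)
    def p(x):
        sq = np.sum((x[None, :] - pts) ** 2, axis=1)   # squared distances to each point
        return np.mean(np.exp(-sq / (2.0 * sigma**2))) / norm
    return p

X = np.array([[0.0, 0.0], [1.0, 0.0], [0.5, 0.5]])
p = shape_density(X)
print(p(np.array([0.5, 0.2])))   # density of the shape at one location
```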

Motivating the JS divergence

Modeling the overlay of two shapes with identity of origin, X ∪ Y:

p(X ∪ Y | θ^(1), θ^(2)) = p(X | θ^(1)) p(Y | θ^(2))

Motivating the JS divergence

Modeling the overlay of two shapes without identity of origin, Z:

p(Z | θ^(1), θ^(2)) = [N_1/(N_1+N_2)] p(Z | θ^(1)) + [N_2/(N_1+N_2)] p(Z | θ^(2))

Likelihood Ratio

- Which generative model do you prefer: the union of disparate shapes where identity of origin is preserved, or one combined shape where identity of origin is suppressed?
- Likelihood ratio:

log Λ = log [ p(Z | θ^(1), θ^(2)) / p(X ∪ Y | θ^(1), θ^(2)) ]
      = log [ ( [N_1/(N_1+N_2)] p(Z | θ^(1)) + [N_2/(N_1+N_2)] p(Z | θ^(2)) ) / ( p(X | θ^(1)) p(Y | θ^(2)) ) ]

- Z is understood to arise from a convex combination of the two mixture models p(Z | θ^(1)) and p(Z | θ^(2)), where the weights are proportional to the numbers of points N_1 and N_2 in each set.
- The weak law of large numbers leads to the Jensen-Shannon divergence.

JS Divergence for multiple shapes

JS-divergence of shape densities:

JS_π(P_1, P_2, ..., P_n) = H(∑_i π_i P_i) − ∑_i π_i H(P_i)    (1)

where π = {π_1, π_2, ..., π_n | π_i > 0, ∑_i π_i = 1} are the weights of the probability densities P_i, and H(P_i) is the Shannon entropy.
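A direct numerical transcription of Eq. (1), assuming (for the sketch only) that the densities are sampled on a common grid:

```python
import numpy as np

def js_divergence(densities, weights, dx):
    """JS_pi(P_1,...,P_n) = H(sum_i pi_i P_i) - sum_i pi_i H(P_i), Eq. (1),
    for densities sampled on a common grid with spacing dx."""
    def H(p):
        m = p > 0
        return -np.sum(p[m] * np.log(p[m])) * dx
    mix = sum(w * p for w, p in zip(weights, densities))
    return H(mix) - sum(w * H(p) for w, p in zip(weights, densities))

x = np.linspace(-8.0, 8.0, 1601)
dx = x[1] - x[0]
gauss = lambda m: np.exp(-0.5 * (x - m) ** 2) / np.sqrt(2.0 * np.pi)
print(js_divergence([gauss(0.0), gauss(1.0), gauss(2.0)], [1/3, 1/3, 1/3], dx))
```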

Atlas estimation

Formulation using JS-divergence:

JS_β(P_1, P_2, ..., P_N) + λ ∑_{i=1}^{N} ‖L f^i‖^2
  = H(∑_i β_i P_i) − ∑_i β_i H(P_i) + λ ∑_{i=1}^{N} ‖L f^i‖^2

where f^i is the deformation function corresponding to point-set X^i, and P_i = p(f^i(X^i)) is the probability density of the deformed point-set.
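The following toy sketch is not the tutorial's implementation: it restricts the deformations f^i to 1D translations, for which the bending-energy term λ‖Lf^i‖^2 vanishes, and minimizes the resulting JS objective over grid-based kernel density estimates with scipy. All names and parameter values are illustrative.

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(1)
base = rng.normal(0.0, 1.0, 60)
sets = [base + off for off in (0.0, 1.5, -2.0)]   # one shape, three shifted copies

grid = np.linspace(-8.0, 8.0, 1601)
dx = grid[1] - grid[0]

def kde(points, h=1.0):
    """Gaussian kernel density estimate evaluated on the global grid."""
    d = grid[None, :] - points[:, None]
    return np.exp(-0.5 * (d / h) ** 2).mean(axis=0) / (h * np.sqrt(2.0 * np.pi))

def entropy(p):
    m = p > 1e-300
    return -np.sum(p[m] * np.log(p[m])) * dx

def js_objective(t):
    t = t - t.mean()                       # fix the global-translation gauge
    dens = [kde(s + ti) for s, ti in zip(sets, t)]
    w = 1.0 / len(dens)
    mix = sum(w * d for d in dens)
    return entropy(mix) - w * sum(entropy(d) for d in dens)

res = minimize(js_objective, x0=np.zeros(len(sets)), method="Powell")
# Up to a common shift, the recovered t_i should undo the offsets above.
print("recovered translations:", res.x - res.x.mean())
```

Mean-centering the translations inside the objective removes the global-shift ambiguity; no point-set plays the role of a fixed reference, which is what keeps the formulation unbiased.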

Multiple shapes: JS divergence

JS divergence in a hypothesis-testing framework:

- Construct a likelihood ratio between i.i.d. samples drawn from a mixture (∑_a π_a P_a) and i.i.d. samples drawn from a heterogeneous collection of densities (P_1, P_2, ..., P_N).
- The likelihood ratio is then

Λ = [ ∏_{k=1}^{M} ∑_{a=1}^{N} π_a P_a(x_k) ] / [ ∏_{a=1}^{N} ∏_{k_a=1}^{N_a} P_a(x^a_{k_a}) ]

- The weak law of large numbers gives us the JS-divergence; a small simulation of this limit follows below.
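A small simulation (my own construction, for two densities) illustrating the claim: the normalized negative log-likelihood ratio computed from samples approaches the JS divergence obtained by numerical integration.

```python
import numpy as np

rng = np.random.default_rng(0)

def gauss_pdf(x, mu, s):
    return np.exp(-0.5 * ((x - mu) / s) ** 2) / (s * np.sqrt(2.0 * np.pi))

mu, s, pi = [0.0, 2.0], 1.0, np.array([0.5, 0.5])
M = 200_000
counts = (pi * M).astype(int)

# Samples with identity of origin preserved, and the pooled (suppressed) set.
samples = [rng.normal(mu[a], s, counts[a]) for a in range(2)]
pooled = np.concatenate(samples)

log_mix = np.log(pi[0] * gauss_pdf(pooled, mu[0], s)
                 + pi[1] * gauss_pdf(pooled, mu[1], s)).sum()
log_sep = sum(np.log(gauss_pdf(samples[a], mu[a], s)).sum() for a in range(2))

print("-log(Lambda)/M   =", (log_sep - log_mix) / M)

# Reference value: H(mixture) - sum_a pi_a H(P_a), by numerical integration.
x = np.linspace(-10.0, 12.0, 20001)
dxg = x[1] - x[0]
H = lambda p: -np.sum(p[p > 0] * np.log(p[p > 0])) * dxg
p0, p1 = gauss_pdf(x, mu[0], s), gauss_pdf(x, mu[1], s)
print("JS by quadrature =", H(pi[0]*p0 + pi[1]*p1) - pi[0]*H(p0) - pi[1]*H(p1))
```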

Group-wise Registration Results

Experimental results on four 3D hippocampus point-sets.

Shape matching via CDF I-divergences

- Model each point-set by a cumulative distribution function (CDF).
- Quantify the distance among CDFs via an information-theoretic measure, typically the cumulative residual entropy (CRE).
- Minimize the dissimilarity measure over the space of coordinate transformation parameters.

Havrda-Charvát CRE

HC-CRE: Let X be a random vector in R^d. The HC-CRE of X is defined by

E_H(X) = −∫_{R^d_+} (α − 1)^{-1} [ P^α(|X| > λ) − P(|X| > λ) ] dλ

where X = (x_1, x_2, ..., x_d), λ = (λ_1, λ_2, ..., λ_d), |X| > λ means |x_i| > λ_i for all i, and R^d_+ = {x ∈ R^d : x_i ≥ 0, i ∈ {1, 2, ..., d}}.
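A minimal 1D sketch of this definition, using the empirical survival function and a Riemann sum over the data range; the restriction to one dimension and the grid size are my simplifications.

```python
import numpy as np

def hc_cre(samples, alpha=2.0, n_grid=2000):
    """Havrda-Charvat CRE of a 1D sample, using the empirical survival
    function P(|X| > lambda) and a Riemann sum over [0, max|x|]."""
    a = np.abs(np.asarray(samples, dtype=float))
    lam = np.linspace(0.0, a.max(), n_grid)
    dlam = lam[1] - lam[0]
    surv = (a[None, :] > lam[:, None]).mean(axis=1)    # P(|X| > lambda) on the grid
    integrand = (surv**alpha - surv) / (alpha - 1.0)
    return -np.sum(integrand) * dlam

x = np.random.default_rng(2).normal(0.0, 1.0, 5000)
print(hc_cre(x, alpha=2.0))   # nonnegative for alpha = 2
```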

CDF-HC Divergence

CDF-HC Divergence: Given N cumulative distribution functions P_k, k ∈ {1, ..., N}, the CDF-HC divergence of the set {P_k} is defined as

HC(P_1, P_2, ..., P_N) = E_H(∑_k π_k P_k) − ∑_k π_k E_H(P_k)

where 0 ≤ π_k ≤ 1, ∑_k π_k = 1, and E_H is the HC-CRE.

CDF-HC Divergence

Let P̄ = ∑_k π_k P_k. Then

HC(P_1, P_2, ..., P_N)
  = −(α − 1)^{-1} [ ∫_{R^d_+} P̄^α(X > λ) dλ − ∑_k π_k ∫_{R^d_+} P_k^α(X_k > λ) dλ ]
  = ∑_k π_k ∫_{R^d_+} P_k^2(X_k > λ) dλ − ∫_{R^d_+} P̄^2(X > λ) dλ    (for α = 2)

Dirac Mixture Model

P_k(X_k > λ) = (1/D_k) ∑_{i=1}^{D_k} H(x, x_i)

where H(x, x_i) is the Heaviside function (equal to 1 if all components of x are greater than those of x_i).

[Figure: empirical CDF surface over a 2D point domain.]
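Combining the α = 2 form of the CDF-HC divergence with the Dirac-mixture CDFs above gives a directly computable criterion. The sketch below is 1D and integrates over the data range on the real line rather than R^d_+; both are my simplifications for illustration.

```python
import numpy as np

def survival(points, lam):
    """Empirical P(X > lambda) of a 1D Dirac mixture (mean of Heaviside terms)."""
    pts = np.asarray(points, dtype=float)
    return (pts[None, :] > lam[:, None]).mean(axis=1)

def cdf_hc(point_sets, weights=None, n_grid=2000):
    """CDF-HC divergence for alpha = 2 between 1D point-sets."""
    N = len(point_sets)
    w = np.full(N, 1.0 / N) if weights is None else np.asarray(weights)
    lo = min(float(np.min(p)) for p in point_sets)
    hi = max(float(np.max(p)) for p in point_sets)
    lam = np.linspace(lo, hi, n_grid)
    dlam = lam[1] - lam[0]
    S = np.stack([survival(p, lam) for p in point_sets])   # shape (N, n_grid)
    mix = w @ S                                            # survival of the mixture
    return (np.sum(w @ S**2) - np.sum(mix**2)) * dlam      # >= 0 by convexity

rng = np.random.default_rng(3)
A = rng.normal(0.0, 1.0, 300)
B = rng.normal(0.5, 1.0, 300)
print(cdf_hc([A, B]))   # positive; tends to 0 as the two sets align
```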

CDF-JS, PDF-JS & CDF-HC

[Figures: two test cases shown before registration and after CDF-JS, PDF-JS, and CDF-HC registration.]

2D Point-set Registration for CC

[Figures: seven 2D point-sets (Point Set 1 through Point Set 7), shown before registration and after registration.]

With outliers

[Figures: a point-set group with outliers, shown before registration, after PDF-JS registration, and after CDF-HC registration.]

With different α values

[Figures: the initial configuration and registration results for α = 1.1, 1.3, 1.5, 1.7, 1.9, 2, 3, 4, and 5.]

3D Point-set Registration for Duck

[Figures: four 3D duck point-sets (Point Set 1 through Point Set 4), shown before registration and after registration.]

3D Registration of Hippocampi

[Figures: four 3D hippocampus point-sets, shown before registration and after registration.]

Group-Wise Registration Assessment

The Kolmogorov-Smirnov (KS) statistic was computed to measure the difference between the CDFs.

- With ground truth:

  (1/N) ∑_{k=1}^{N} D(F_g, F_k)

- Without ground truth:

  K = (1/N^2) ∑_{k,s=1}^{N} D(F_k, F_s)
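A sketch of the "without ground truth" criterion. The slides compare d-dimensional CDFs; this sketch assumes 1D point-sets so that scipy's two-sample KS statistic applies directly.

```python
import numpy as np
from scipy.stats import ks_2samp

def pairwise_ks(point_sets):
    """K = (1/N^2) * sum_{k,s} D(F_k, F_s); the D(F_k, F_k) = 0 terms are skipped."""
    N = len(point_sets)
    total = 0.0
    for k in range(N):
        for s in range(N):
            if k != s:
                total += ks_2samp(point_sets[k], point_sets[s]).statistic
    return total / N**2

rng = np.random.default_rng(4)
aligned = [rng.normal(0.0, 1.0, 200) for _ in range(4)]
shifted = [rng.normal(m, 1.0, 200) for m in (0.0, 1.0, 2.0, 3.0)]
print(pairwise_ks(aligned))   # small: well-aligned sets
print(pairwise_ks(shifted))   # larger: misaligned sets
```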

KS statistic for comparison

Table: KS statistic

                        CDF-JS    PDF-JS    CDF-HC
  Olympic Logo          0.1103    0.1018    0.0324
  Fish with outliers    0.1314    0.1267    0.0722

Table: Average nearest-neighbor distance

                        CDF-JS    PDF-JS    CDF-HC
  Olympic Logo          0.0367    0.0307    0.0019
  Fish with outliers    0.0970    0.0610    0.0446

KS statistic for comparison (contd.)

Table: Non-rigid group-wise registration assessment without ground truth, using the KS statistic

                                  Before Registration    After Registration
  Corpus Callosum                 0.3226                 0.0635
  Corpus Callosum with outlier    0.3180                 0.0742
  Olympic Logo                    0.1559                 0.0308
  Fish                            0.1102                 0.0544
  Hippocampus                     0.2620                 0.0770
  Duck                            0.2287                 0.0160

KS statistic for comparison (contd.)

Table: Non-rigid group-wise registration assessment without ground truth, using average nearest-neighbor distance

                                  Before Registration    After Registration
  Corpus Callosum                 0.0291                 0.0029
  Corpus Callosum with outlier    0.0288                 0.0092
  Olympic Logo                    0.0825                 0.0022
  Fish                            0.1461                 0.0601
  Hippocampus                     13.7679                3.1779
  Duck                            15.4725                0.3280

Discussion

- I-divergences for shape matching avoid the correspondence problem.
- Symmetric, unbiased registration and atlas estimation.
- Shape densities are modeled as Gaussian mixtures; cumulatives are estimated directly.
- JS (pdf- and cdf-based) and HC divergences are used.
- The estimated atlas is useful in model-based segmentation.