Transcript of: Density Estimation with EM (PCU Teaching)
Page 1
Estimating the Probability Density Function with EM
Sources:
- Forsyth & Ponce, Chap. 7
- Stanford Vision & Modeling
Probability Density Estimation
• Parametric Representations
• Non-Parametric Representations
• Mixture Models
Page 2
Non-parametric Estimation Methods
• No assumptions at all about the distribution
• The estimate depends entirely on the DATA
• A simple way to do this: the histogram

Histograms
Discretize the data, then represent the counts as bars:
Page 3
Histograms
• Computationally demanding, but very commonly used
• Can be applied to a density of arbitrary shape

Histograms
Problems:
• Higher-dimensional spaces:
  - exponential number of bins
  - exponential amount of training data
  - the Curse of Dimensionality
• Bin size? Too few bins: too coarse. Too many bins: too smooth.
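As a minimal sketch of the idea (plain Python; function and variable names are my own, not from the slides), a histogram density estimate divides each bin count by N times the bin width so the bars integrate to roughly 1:

```python
import random

def histogram_density(samples, bins, low, high):
    """Histogram estimate of p(x): bin count / (N * bin width)."""
    width = (high - low) / bins
    counts = [0] * bins
    for s in samples:
        if low <= s < high:                      # samples outside the range are dropped
            counts[int((s - low) / width)] += 1
    return [c / (len(samples) * width) for c in counts]

random.seed(0)
x = [random.gauss(0.0, 1.0) for _ in range(10_000)]
p = histogram_density(x, bins=20, low=-4.0, high=4.0)
mass = sum(p) * (8.0 / 20)                       # total probability mass ~ 1
print(round(mass, 2))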
Page 4
A Principled Approach:
• x is drawn from an unknown density p(x)
• the probability that x falls in a region R of volume V is:

$P = \int_R p(x')\,dx' \approx p(x)\,V$
If K of the N samples fall inside R, then

$P \approx \frac{K}{N}$
Page 5
Combining the two approximations:

$p(x) \approx \frac{K}{NV}$
A Principled Approach:

$p(x) \approx \frac{K}{NV}$

• Fix V, determine K → Kernel-Based Methods
• Fix K, determine V → K-nearest-neighbor
Page 6
Kernel-Based Methods:

$p(x) \approx \frac{K}{NV}$

Parzen window:

$H(u) = \begin{cases} 1 & |u_j| < 1/2, \; j = 1, \ldots, d \\ 0 & \text{otherwise} \end{cases}$
Kernel-Based Methods:

The number of samples falling in a hypercube of side h centered on x:

$K = \sum_{n=1}^{N} H\!\left(\frac{x - x_n}{h}\right)$
Page 7
Kernel-Based Methods:

$p(x) = \frac{1}{N h^d} \sum_{n=1}^{N} H\!\left(\frac{x - x_n}{h}\right)$
Kernel-Based Methods:

Gaussian window:

$p(x) = \frac{1}{N} \sum_{n=1}^{N} \frac{1}{(2\pi h^2)^{d/2}} \exp\!\left(-\frac{\|x - x_n\|^2}{2h^2}\right)$
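A minimal sketch of the Gaussian-window estimator for d = 1 (plain Python; names are my own, not from the slides):

```python
import math
import random

def gaussian_kde(x, samples, h):
    """Parzen estimate with a Gaussian window of bandwidth h, d = 1:
    p(x) = (1/N) * sum_n N(x; x_n, h^2)."""
    norm = 1.0 / math.sqrt(2.0 * math.pi * h * h)
    return sum(norm * math.exp(-(x - s) ** 2 / (2.0 * h * h))
               for s in samples) / len(samples)

random.seed(0)
data = [random.gauss(0.0, 1.0) for _ in range(2_000)]
# the true N(0,1) density at its mode is 1/sqrt(2*pi), about 0.399
print(round(gaussian_kde(0.0, data, h=0.3), 2))
```

The bandwidth h plays the role of the bin size in a histogram: too small gives a spiky estimate, too large over-smooths.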
Page 8
K-nearest-neighbor:

$p(x) \approx \frac{K}{NV}$

Grow V until it captures K points.
Page 9
K-nearest-neighbor:

Bayesian classification:

$p(x \mid C_k) = \frac{K_k}{N_k V} \qquad p(x) = \frac{K}{NV} \qquad p(C_k) = \frac{N_k}{N}$
Page 10
K-nearest-neighbor:

Combining these by Bayes' rule gives

$p(C_k \mid x) = \frac{K_k}{K}$

— the "k-nearest-neighbour classification rule".
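The rule p(C_k|x) = K_k/K amounts to majority voting among the k nearest neighbors. A minimal 1-D sketch (plain Python; data and names are my own illustration):

```python
from collections import Counter

def knn_classify(x, samples, labels, k):
    """Majority vote among the k nearest neighbours of x:
    pick the class that maximizes p(C_k | x) = K_k / K."""
    nearest = sorted(range(len(samples)), key=lambda i: abs(samples[i] - x))[:k]
    votes = Counter(labels[i] for i in nearest)
    return votes.most_common(1)[0][0]

# toy 1-D data: class "a" clusters near 0, class "b" near 5
samples = [0.1, -0.2, 0.3, 4.9, 5.2, 5.0]
labels = ["a", "a", "a", "b", "b", "b"]
print(knn_classify(0.0, samples, labels, k=3))  # -> a
print(knn_classify(5.1, samples, labels, k=3))  # -> b
```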
Probability Density Estimation
• Parametric Representations
• Non-Parametric Representations
• Mixture Models
Page 11
Mixture Models:

Gaussians:
- simple
- low memory
- fast
- good properties

Non-parametric:
- general
- memory intensive
- slow

Mixture Models combine the strengths of both.
Mixture of Gaussians:

[Figure: p(x) over x — the sum of individual Gaussians]
Page 12
Mixture of Gaussians:

[Figure: p(x) over x — the sum of individual Gaussians]

Advantage: can approximate a density of arbitrary shape.
Mixture of Gaussians:

Generative model: first draw a component z ∈ {1, 2, 3} with probability P(j), then draw x from the component density p(x|j).

[Figure: p(x) over x; graphical model z → x with priors P(j) and component densities p(x|j)]
Page 13
Mixture of Gaussians:

$p(x) = \sum_{j=1}^{M} p(x \mid j)\,P(j)$

$p(x \mid j) = \frac{1}{(2\pi\sigma_j^2)^{d/2}} \exp\!\left(-\frac{\|x - \mu_j\|^2}{2\sigma_j^2}\right)$
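The mixture density above can be evaluated directly. A minimal 1-D sketch (plain Python; function name and parameter values are my own illustration):

```python
import math

def mixture_pdf(x, priors, means, sigmas):
    """p(x) = sum_j P(j) * N(x; mu_j, sigma_j^2), for d = 1."""
    total = 0.0
    for pj, mu, sg in zip(priors, means, sigmas):
        norm = 1.0 / math.sqrt(2.0 * math.pi * sg * sg)
        total += pj * norm * math.exp(-(x - mu) ** 2 / (2.0 * sg * sg))
    return total

# two equally weighted unit-variance components at 0 and 4
print(round(mixture_pdf(0.0, [0.5, 0.5], [0.0, 4.0], [1.0, 1.0]), 3))  # -> 0.2
```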
Mixture of Gaussians:

Maximum Likelihood:

$E = -\ln L = -\sum_{n=1}^{N} \ln p(x_n)$
Page 14
Mixture of Gaussians:

Maximum Likelihood: minimize E by setting

$\frac{\partial E}{\partial \mu_k} = 0$
Mixture of Gaussians:

This yields

$\mu_j = \frac{\sum_{n=1}^{N} P(j \mid x_n)\, x_n}{\sum_{n=1}^{N} P(j \mid x_n)}$
Page 15
Mixture of Gaussians:

$\mu_j = \frac{\sum_{n=1}^{N} P(j \mid x_n)\, x_n}{\sum_{n=1}^{N} P(j \mid x_n)}$

with the posterior

$P(j \mid x_n) = \frac{p(x_n \mid j)\,P(j)}{\sum_{k=1}^{M} p(x_n \mid k)\,P(k)}$
Page 16
Mixture of Gaussians:

These equations are coupled: the mean update needs the posteriors, and the posteriors need all the parameters:

$\mu_j = \frac{\sum_{n=1}^{N} P(j \mid x_n)\, x_n}{\sum_{n=1}^{N} P(j \mid x_n)}$

$P(j \mid x_n) = \frac{p(x_n \mid j)\,P(j)}{\sum_{k=1}^{M} p(x_n \mid k)\,P(k)}$

$p(x_n \mid j) = \frac{1}{(2\pi\sigma_j^2)^{d/2}} \exp\!\left(-\frac{\|x_n - \mu_j\|^2}{2\sigma_j^2}\right)$
Page 17
Mixture of Gaussians:

Maximum Likelihood:

$E = -\ln L = -\sum_{n=1}^{N} \ln p(x_n), \qquad \frac{\partial E}{\partial \mu_k} = 0$

No closed-form solution!
Mixture of Gaussians:

Maximum Likelihood: minimize

$E = -\ln L = -\sum_{n=1}^{N} \ln p(x_n)$

by gradient descent.
Page 18
Mixture of Gaussians:

Maximum Likelihood: the gradient couples all the parameters:

$\frac{\partial E}{\partial \mu_k} = f(\mu_1, \ldots, \mu_M,\; \sigma_1, \ldots, \sigma_M,\; \alpha_1, \ldots, \alpha_M)$
Mixture of Gaussians:

Gradient-descent optimization:
• complex gradient function (highly nonlinear coupled equations)
• the optimization of one Gaussian depends on the whole rest of the mixture
Page 19
Mixture of Gaussians:

→ A different strategy:

[Figure: observed data points along x]

Mixture of Gaussians:

[Figure: observed data and the resulting density p(x)]
Page 20
Mixture of Gaussians:

Introduce a hidden variable y ∈ {1, 2} indicating which component generated each point.

[Figure: observed data along x; the labels y (1 1 1 … / 2 2 2 …) are unobserved]
Page 21
A Popular Example of a Chicken-and-Egg Problem:

Suppose we knew the labels y: then we could fit Gaussian #1 and Gaussian #2 separately by maximum likelihood.

[Figure: data labeled 1 … 2 …; maximum-likelihood fits for Gaussian #1 and Gaussian #2]

Chicken+Egg Problem:

Suppose instead we knew the posteriors P(y=1|x) and P(y=2|x).

[Figure: data with posterior probabilities]
Page 22
Chicken+Egg Problem:

But we know neither of these at all!

Chicken+Egg Problem:

Idea: just pretend we know the labels.

[Figure: data with guessed labels y]
Page 23
Clustering:

Was the guess right?

→ K-means clustering / Basic Isodata
Clustering:

Procedure: Basic Isodata
1. Choose some initial values for the means $\mu_1, \ldots, \mu_M$.
Loop:
2. Classify the n samples by assigning each to the class of the closest mean.
3. Recompute the means as the average of the samples in their class.
4. If any mean changed value, go to Loop; otherwise, stop.
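The Basic Isodata procedure can be sketched in plain Python for 1-D data (names and toy data are my own illustration, not from the slides):

```python
import random

def isodata(samples, means, iters=100):
    """Basic Isodata / k-means: assign each sample to the nearest
    mean (step 2), then recompute each mean as the class average
    (step 3), until no mean changes (step 4)."""
    for _ in range(iters):
        clusters = [[] for _ in means]
        for s in samples:                                   # step 2: classify
            j = min(range(len(means)), key=lambda i: abs(s - means[i]))
            clusters[j].append(s)
        new = [sum(c) / len(c) if c else means[i]           # step 3: recompute
               for i, c in enumerate(clusters)]
        if new == means:                                    # step 4: converged
            return new
        means = new
    return means

random.seed(0)
data = [random.gauss(0.0, 0.5) for _ in range(200)] + \
       [random.gauss(5.0, 0.5) for _ in range(200)]
m = isodata(data, means=[-1.0, 1.0])
print([round(v, 1) for v in m])  # the means land near 0 and 5
```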
Page 24
Isodata: Initialization

[Figure: data with initial means $\mu_1$, $\mu_2$]

Isodata: Convergence

[Figure: data with converged means $\mu_1$, $\mu_2$]
Page 25
Isodata: Some Problems

Guessed Eggs / Computed Chickens:

[Figure: current labeling of the data — this is where we stand; maximum-likelihood fits for Gaussian #1 and Gaussian #2]
Page 26
A Good Gaussian Approximation

[Figure: fitted density p(x)]

• But not optimal!
• Problem: highly overlapping Gaussians
Expectation Maximization (EM)
• EM is a general formulation of "Chicken+Egg" problems (Mix. Gaussians, Mix. Experts, Neural Nets, HMMs, Bayes Nets, …)
• Isodata is a specific instance of EM
• General EM for a mixture of Gaussians is called Soft Clustering
• It can converge to the maximum-likelihood solution
Page 27
Remember these formulas?

$\mu_j = \frac{\sum_{n=1}^{N} P(j \mid x_n)\, x_n}{\sum_{n=1}^{N} P(j \mid x_n)}$

$P(j \mid x_n) = \frac{p(x_n \mid j)\,P(j)}{\sum_{k=1}^{M} p(x_n \mid k)\,P(k)}$

$p(x_n \mid j) = \frac{1}{(2\pi\sigma_j^2)^{d/2}} \exp\!\left(-\frac{\|x_n - \mu_j\|^2}{2\sigma_j^2}\right)$
Soft Chicken and Egg Problem:

[Figure: data points with soft posteriors, e.g. P(1|x) = 0.1, 0.3, 0.7, 0.1, 0.01, 0.0001 and 0.99, 0.99, 0.99, 0.5, 0.001, 0.00001]

$\mu_j = \frac{\sum_{n=1}^{N} P(j \mid x_n)\, x_n}{\sum_{n=1}^{N} P(j \mid x_n)}$
Page 28
Soft Chicken and Egg Problem:

Step 1: suppose we know the posteriors P(1|x). Then each mean becomes a weighted mean of the data:

$\mu_j = \frac{\sum_{n=1}^{N} P(j \mid x_n)\, x_n}{\sum_{n=1}^{N} P(j \mid x_n)}$
Soft Chicken and Egg Problem:

Step 2: recompute the posteriors using the updated means.
Page 29
The EM Procedure:

Procedure: EM
1. Choose some initial values for the means $\mu_1, \ldots, \mu_M$.
E-Step:
2. Compute the posteriors $P(j \mid x_n)$ for each class and each sample.
M-Step:
3. Recompute the means as the weighted average of their class:

$\mu_j = \frac{\sum_{n=1}^{N} P(j \mid x_n)\, x_n}{\sum_{n=1}^{N} P(j \mid x_n)}$

4. If any mean changed value, go to the E-Step; otherwise, stop.
EM and Gaussian Mixtures

$\theta^{(i)} = \arg\max_{\theta} Q(\theta, \theta^{(i-1)})$

$\mu_j^{(i)} = \frac{\sum_{n=1}^{N} p(j \mid x_n, \theta^{(i-1)})\, x_n}{\sum_{n=1}^{N} p(j \mid x_n, \theta^{(i-1)})}$
Page 30
EM and Gaussian Mixtures

$\Sigma_j^{(i)} = \frac{\sum_{n=1}^{N} p(j \mid x_n, \theta^{(i-1)})\,(x_n - \mu_j^{(i)})(x_n - \mu_j^{(i)})^T}{\sum_{n=1}^{N} p(j \mid x_n, \theta^{(i-1)})}$
EM and Gaussian Mixtures

$\alpha_j^{(i)} = \frac{1}{N} \sum_{n=1}^{N} p(j \mid x_n, \theta^{(i-1)})$
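The three EM updates (means, variances, mixing weights) can be sketched together for the 1-D case in plain Python (names, toy data, and the fixed iteration count are my own illustration, not from the slides):

```python
import math
import random

def em_gmm(data, means, sigmas, alphas, iters=50):
    """1-D EM for a mixture of Gaussians. E-step: posteriors
    p(j|x_n). M-step: weighted updates of mu_j, sigma_j, alpha_j."""
    n, m = len(data), len(means)
    for _ in range(iters):
        # E-step: responsibilities r[i][j] = p(j | x_i)
        r = []
        for x in data:
            lik = [alphas[j] / (math.sqrt(2 * math.pi) * sigmas[j])
                   * math.exp(-(x - means[j]) ** 2 / (2 * sigmas[j] ** 2))
                   for j in range(m)]
            z = sum(lik)
            r.append([l / z for l in lik])
        # M-step: weighted re-estimation for each component j
        for j in range(m):
            nj = sum(r[i][j] for i in range(n))
            means[j] = sum(r[i][j] * data[i] for i in range(n)) / nj
            var = sum(r[i][j] * (data[i] - means[j]) ** 2 for i in range(n)) / nj
            sigmas[j] = math.sqrt(max(var, 1e-9))
            alphas[j] = nj / n
    return means, sigmas, alphas

random.seed(0)
data = [random.gauss(0.0, 1.0) for _ in range(300)] + \
       [random.gauss(6.0, 1.0) for _ in range(300)]
means, sigmas, alphas = em_gmm(data, [1.0, 5.0], [1.0, 1.0], [0.5, 0.5])
print([round(v, 1) for v in means])  # components land near 0 and 6
```

Unlike Isodata's hard assignments, every sample contributes to every component, weighted by its responsibility — this is the "soft clustering" named above.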
Page 31
EM Examples:

[Figure: training samples]

EM Examples:

[Figure: training samples and initialization]
Page 32
EM Examples:

[Figure: training samples and the end result of EM]

EM Examples:

[Figure: training samples and density isocontours]