Example: Feature selection

16

description

Y “Sick”. X 1 “Fever”. X 2 “Rash”. X 3 “Male”. Uncertainty before knowing X A. Uncertainty after knowing X A. Example: Feature selection. Given random variables Y, X 1 , … X n Want to predict Y from subset X A = (X i 1 ,…,X i k ) Want k most informative features: - PowerPoint PPT Presentation

Transcript of Example: Feature selection

Page 1: Example: Feature selection
Page 2: Example: Feature selection
Page 3: Example: Feature selection
Page 4: Example: Feature selection

4

Example: Feature selection• Given random variables Y, X1, … Xn

• Want to predict Y from subset XA = (Xi1,…,Xik

)

Want k most informative features:

A* = argmax IG(XA; Y) s.t. |A| · k

where IG(XA; Y) = H(Y) - H(Y | XA)

Problem inherently combinatorial!

Y“Sick”

X1

“Fever”X2

“Rash”X3

“Male”

Naïve BayesModel

Uncertaintybefore knowing XA

Uncertaintyafter knowing XA

Page 5: Example: Feature selection

5

s

Key property: Diminishing returnsSelection A = {} Selection B = {X2,X3}

Adding X1 will help a lot!

Adding X1 doesn’t help much

New feature X1

B A

s

+

+

Large improvement

Small improvement

For Aµ B, z(A [ {s}) – z(A) ¸ z(B [ {s}) – z(B)

Submodularity:

Y“Sick”

X1

“Fever”

X2

“Rash”X3

“Male”

Y“Sick”

Page 6: Example: Feature selection

6

Submodular set functions• Set function z on V is called submodular if

For all A,B µ V: z(A)+z(B) ¸ z(A[B)+z(AÅB)

• Equivalent diminishing returns characterization: SB A

S

+

+

Large improvement

Small improvement

For AµB, sB, z(A [ {s}) – z(A) ¸ z(B [ {s}) – z(B)

Submodularity:

BA A [ B

AÅB

++ ¸

Page 7: Example: Feature selection

7

Example: Set cover

Node predictsvalues of positionswith some radius

SERVER

LAB

KITCHEN

COPYELEC

PHONEQUIET

STORAGE

CONFERENCE

OFFICEOFFICE

For A µ V: z(A) = “area covered by sensors placed at A”

Formally: W finite set, collection of n subsets Si

µ WFor A µ V={1,…,n} define z(A) = |i2 A Si|

Want to cover floorplan with discsPlace sensorsin building

Possiblelocations

V

Page 8: Example: Feature selection

8

Set cover is submodular

SERVER

LAB

KITCHEN

COPYELEC

PHONEQUIET

STORAGE

CONFERENCE

OFFICEOFFICE

SERVER

LAB

KITCHEN

COPYELEC

PHONEQUIET

STORAGE

CONFERENCE

OFFICEOFFICE

S1 S2

S1 S2

S3

S4 S’

S’

A={S1,S2}

B = {S1,S2,S3,S4}

z(A[{S’})-z(A)

z(B[{S’})-z(B)

¸

Page 9: Example: Feature selection
Page 10: Example: Feature selection
Page 11: Example: Feature selection
Page 12: Example: Feature selection
Page 13: Example: Feature selection
Page 14: Example: Feature selection
Page 15: Example: Feature selection
Page 16: Example: Feature selection