Support Vector Machines Joseph Gonzalez

  • Date posted: 19-Dec-2015

Page 1:

Support Vector Machines

Joseph Gonzalez

Page 2:

From a linear classifier to ...

*One of the most famous slides you will see, ever!

Page 3:

The Big Idea

[Figure: scatter plot of two classes of training points, drawn as X and O]

Page 4:

Maximum margin

Maximum possible separation between positive and negative training examples

*One of the most famous slides you will see, ever!

Page 5:

Geometric Intuition

[Figure: X and O training points, with an annotation reading "SUPPORT VECTORS"]

Page 6:

Geometric Intuition

[Figure: X and O training points, now with one more X, with an annotation reading "SUPPORT VECTORS"]

Page 7:

Geometric Intuition

[Figure: X and O training points, with an X drawn next to an O, and an annotation reading "SUPPORT VECTORS"]

Page 8:

Primal Version

$$\min_{w,\,b,\,\xi} \; \|w\|^2 + C \sum_i \xi_i$$

$$\text{s.t.} \quad (w \cdot x_i + b)\, y_i \ge 1 - \xi_i, \qquad \xi_i \ge 0$$
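To make the primal concrete, here is a minimal sketch (not from the slides) that evaluates the objective for a given w and b. It assumes each slack takes the smallest value that satisfies both constraints, ξ_i = max(0, 1 − y_i (w·x_i + b)). The third data point and C = 1.0 are made up for illustration; w and b are the values from the worked example on a later slide.

```python
def dot(u, v):
    """Plain inner product of two equal-length sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))


def primal_objective(w, b, X, y, C):
    """Return (objective, slacks) for min ||w||^2 + C * sum(xi).

    Each slack is the smallest value allowed by the two constraints:
    xi_i = max(0, 1 - y_i * (w . x_i + b)).
    """
    slacks = [max(0.0, 1.0 - yi * (dot(w, xi) + b)) for xi, yi in zip(X, y)]
    return dot(w, w) + C * sum(slacks), slacks


# Two points from the worked example plus one hypothetical point
# that sits on the decision boundary and therefore needs slack 1.
X = [(0.0, 1.0), (2.0, 2.0), (1.0, 1.5)]
y = [1, -1, 1]
objective, slacks = primal_objective(w=(-0.8, -0.4), b=1.4, X=X, y=y, C=1.0)
print(objective, slacks)
```

Points on or outside the margin contribute (numerically almost) zero slack, so only margin violations add to the C-weighted penalty term.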

Page 9:

DUAL Version

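Where did this come from?

Remember Lagrange multipliers.

Let us “incorporate” the constraints into the objective.

Then solve the problem in the “dual” space of Lagrange multipliers.

$$\max_{\alpha} \; \sum_i \alpha_i - \frac{1}{2} \sum_{i,j} \alpha_i \alpha_j y_i y_j \, x_i \cdot x_j$$

$$\text{s.t.} \quad \sum_i \alpha_i y_i = 0, \qquad C \ge \alpha_i \ge 0$$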
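As an illustration (a sketch, not part of the original deck), the dual objective and its constraints can be evaluated directly from the training points. The function names and the tolerance are assumptions chosen here; the data and multipliers are those of the two-point example worked later in the slides, with C = 1.0 assumed.

```python
def dot(u, v):
    """Plain inner product of two equal-length sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))


def dual_objective(alpha, X, y):
    """sum_i alpha_i - 1/2 * sum_ij alpha_i alpha_j y_i y_j (x_i . x_j)."""
    n = len(X)
    quad = sum(
        alpha[i] * alpha[j] * y[i] * y[j] * dot(X[i], X[j])
        for i in range(n)
        for j in range(n)
    )
    return sum(alpha) - 0.5 * quad


def is_feasible(alpha, y, C, tol=1e-9):
    """Check sum_i alpha_i y_i = 0 and 0 <= alpha_i <= C (up to tol)."""
    balanced = abs(sum(a * yi for a, yi in zip(alpha, y))) <= tol
    boxed = all(-tol <= a <= C + tol for a in alpha)
    return balanced and boxed


X = [(0.0, 1.0), (2.0, 2.0)]
y = [1, -1]
alpha = [0.4, 0.4]
value = dual_objective(alpha, X, y)
feasible = is_feasible(alpha, y, C=1.0)
print(value, feasible)
```

Note that the data enter only through the inner products x_i · x_j, which is what later makes the kernel substitution possible.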

Page 10:

Primal vs Dual

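Number of parameters?

• large # features?
• large # examples?

For a large # of features, the DUAL is preferred: many αi can go to zero!

Dual:

$$\max_{\alpha} \; \sum_i \alpha_i - \frac{1}{2} \sum_{i,j} \alpha_i \alpha_j y_i y_j \, x_i \cdot x_j$$

$$\text{s.t.} \quad \sum_i \alpha_i y_i = 0, \qquad C \ge \alpha_i \ge 0$$

Primal:

$$\min_{w,\,b,\,\xi} \; \|w\|^2 + C \sum_i \xi_i$$

$$\text{s.t.} \quad (w \cdot x_i + b)\, y_i \ge 1 - \xi_i, \qquad \xi_i \ge 0$$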

Page 11:

DUAL: the “Support vector” version

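$$\max_{\alpha} \; \sum_i \alpha_i - \frac{1}{2} \sum_{i,j} \alpha_i \alpha_j y_i y_j \, x_i \cdot x_j \qquad \text{s.t.} \quad \sum_i \alpha_i y_i = 0, \quad C \ge \alpha_i \ge 0$$

How do we find α?

Quadratic programming

How do we find C?

Cross-validation!

Wait... how do we predict y for a new point x?

$$y = \operatorname{sign}(w \cdot x + b)$$

How do we find w?

$$w = \sum_i \alpha_i y_i x_i$$

How do we find b?

Substituting w into the prediction rule gives a form that uses only inner products between the training points x_i and the new point x:

$$y = \operatorname{sign}\Big(\sum_i \alpha_i y_i \, x_i \cdot x + b\Big)$$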
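The two prediction rules above can be sketched in a few lines. This is an illustrative sketch, not the author's code: the helper names are hypothetical, b is recovered from a support vector via b = y − w·x (the rule used in the worked example on the next slide), and returning +1 when the score is exactly zero is an arbitrary tie-breaking assumption.

```python
def dot(u, v):
    """Plain inner product of two equal-length sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))


def weights_from_dual(alpha, X, y):
    """w = sum_i alpha_i * y_i * x_i, computed coordinate by coordinate."""
    dim = len(X[0])
    return [sum(a * yi * xi[d] for a, yi, xi in zip(alpha, y, X))
            for d in range(dim)]


def bias_from_support_vector(w, x_sv, y_sv):
    """For a support vector on the margin, y = w . x + b, so b = y - w . x."""
    return y_sv - dot(w, x_sv)


def predict_dual(x_new, alpha, X, y, b):
    """sign(sum_i alpha_i y_i (x_i . x_new) + b); a zero score maps to +1."""
    score = sum(a * yi * dot(xi, x_new) for a, yi, xi in zip(alpha, y, X)) + b
    return 1 if score >= 0 else -1


# Data and multipliers from the two-point example on the next slide.
X = [(0.0, 1.0), (2.0, 2.0)]
y = [1, -1]
alpha = [0.4, 0.4]

w = weights_from_dual(alpha, X, y)
b = bias_from_support_vector(w, X[0], y[0])
print(w, b, predict_dual((0.0, 0.0), alpha, X, y, b))
```

Only training points with αi > 0 contribute to the sum in `predict_dual`, which is why those points are called support vectors.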

Page 12:

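“Support Vector”s?

[Figure: two training points, an O and an X, at (0,1) and (2,2), with multipliers α1 and α2]

$$\max_{\alpha} \; \sum_i \alpha_i - \frac{1}{2} \sum_{i,j} \alpha_i \alpha_j y_i y_j \, x_i \cdot x_j \qquad \text{s.t.} \quad \sum_i \alpha_i y_i = 0, \quad C \ge \alpha_i \ge 0$$

With x_1 = (0,1), y_1 = +1 and x_2 = (2,2), y_2 = −1:

$$\max \; \sum_i \alpha_i - \alpha_1 \alpha_2 (-1)(0+2) - \frac{1}{2} \alpha_1^2 (1)(0+1) - \frac{1}{2} \alpha_2^2 (1)(4+4)$$

$$\max \; \alpha_1 + \alpha_2 + 2\alpha_1\alpha_2 - \frac{\alpha_1^2}{2} - 4\alpha_2^2 \qquad \text{s.t.} \quad \alpha_1 - \alpha_2 = 0, \quad C \ge \alpha_i \ge 0$$

The equality constraint gives α1 = α2 = α:

$$\max \; 2\alpha - \frac{5}{2}\alpha^2 \;=\; \max \; \frac{5}{2}\,\alpha\left(\frac{4}{5} - \alpha\right)$$

This parabola is zero at α = 0 and α = 4/5 and peaks halfway between, at α = 2/5, so

$$\alpha_1 = \alpha_2 = \frac{2}{5}$$

$$w = \sum_i \alpha_i y_i x_i = 0.4\,([0 \;\; 1] - [2 \;\; 2]) = 0.4\,[-2 \;\; -1]$$

For a support vector, y = w·x + b, so b = y − w·x. Using x_1:

$$b = 1 - 0.4\,[-2 \;\; -1] \cdot [0 \;\; 1] = 1 + 0.4 = 1.4$$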
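The one-dimensional reduction above is easy to double-check numerically. The sketch below is only a sanity check, not a general QP solver: it compares the closed-form maximizer with a coarse grid search over [0, 4/5], and it assumes C ≥ 2/5 so that the box constraint is inactive. The grid resolution is an arbitrary choice.

```python
def reduced_dual(a):
    """Dual objective of the two-point example after setting alpha1 = alpha2 = a."""
    return 2.0 * a - 2.5 * a * a


# Closed form: the derivative 2 - 5a vanishes at a = 2/5.
a_closed = 2.0 / 5.0

# Coarse numerical check on a grid over [0, 0.8] with step 0.001.
grid = [k / 1000.0 for k in range(801)]
a_grid = max(grid, key=reduced_dual)

print(a_closed, a_grid, reduced_dual(a_closed))
```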

Page 13:

“Support Vector”s?

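[Figure: the same two points, an O and an X, at (0,1) and (2,2) with multipliers α1 and α2, plus an additional O with multiplier α3]

$$\max_{\alpha} \; \sum_i \alpha_i - \frac{1}{2} \sum_{i,j} \alpha_i \alpha_j y_i y_j \, x_i \cdot x_j \qquad \text{s.t.} \quad \sum_i \alpha_i y_i = 0, \quad C \ge \alpha_i \ge 0$$

What is α3? Try this at home.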

Page 14:

Playing With SVMs

• http://www.csie.ntu.edu.tw/~cjlin/libsvm/

Page 15:

More on Kernels

• Kernels represent inner products
  – K(a,b) = a·b
  – K(a,b) = φ(a)·φ(b)

• The kernel trick allows an extremely complex φ( ) while keeping K(a,b) simple

• Goal: Avoid having to directly construct φ( ) at any point in the algorithm
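A small sketch of the idea, using a kernel chosen here for illustration rather than one named on the slide: for 2-D inputs, the quadratic kernel K(a,b) = (a·b)² equals the inner product under the explicit map φ(a) = (a1², √2·a1·a2, a2²). The kernel needs one inner product and a square, while φ has to build a 3-D vector first.

```python
import math


def dot(u, v):
    """Plain inner product of two equal-length sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))


def quad_kernel(a, b):
    """K(a, b) = (a . b)^2, computed without ever forming phi."""
    return dot(a, b) ** 2


def phi(a):
    """Explicit feature map for the quadratic kernel on 2-D inputs."""
    a1, a2 = a
    return (a1 * a1, math.sqrt(2.0) * a1 * a2, a2 * a2)


a, b = (1.0, 2.0), (3.0, -1.0)
k_direct = quad_kernel(a, b)
k_explicit = dot(phi(a), phi(b))
print(k_direct, k_explicit)
```

For higher-degree polynomial kernels the explicit feature space grows quickly, and for some kernels (such as the Gaussian) it is infinite-dimensional, while the cost of evaluating K stays about the same.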

Page 16:

Kernels

The complexity of the optimization problem depends only on the dimensionality of the input space, not on the dimensionality of the feature space!

Page 17:

Can we use Kernels to Measure Distances?

• Can we measure distance between φ(a) and φ(b) using K(a,b)?
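One standard way to answer this, sketched here because the derivation itself did not survive in the transcript, is to expand the squared distance into inner products: ||φ(a) − φ(b)||² = K(a,a) − 2K(a,b) + K(b,b). The quadratic kernel and its explicit map φ below are illustrative assumptions, used only to check the identity.

```python
import math


def dot(u, v):
    """Plain inner product of two equal-length sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))


def quad_kernel(a, b):
    """Illustrative kernel: K(a, b) = (a . b)^2."""
    return dot(a, b) ** 2


def phi(a):
    """Explicit feature map for quad_kernel on 2-D inputs (for checking only)."""
    a1, a2 = a
    return (a1 * a1, math.sqrt(2.0) * a1 * a2, a2 * a2)


def kernel_distance(a, b, K):
    """Distance between phi(a) and phi(b), using only kernel evaluations."""
    squared = K(a, a) - 2.0 * K(a, b) + K(b, b)
    # Guard against tiny negative values caused by rounding.
    return math.sqrt(max(squared, 0.0))


a, b = (1.0, 2.0), (3.0, -1.0)
d_kernel = kernel_distance(a, b, quad_kernel)
d_explicit = math.sqrt(sum((p - q) ** 2 for p, q in zip(phi(a), phi(b))))
print(d_kernel, d_explicit)
```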

Page 18:

Continued:

Page 19:

Popular Kernel Methods

• Gaussian Processes
• Kernel Regression (Smoothing)
  – Nadaraya-Watson Kernel Regression
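For a feel of the last item, here is a minimal sketch of Nadaraya-Watson regression in one dimension: the prediction at a query point is a kernel-weighted average of the training targets. The Gaussian kernel, the bandwidth of 0.5, and the toy data are all assumptions made for illustration.

```python
import math


def gaussian_kernel(u, bandwidth):
    """Unnormalized Gaussian weight for a distance u."""
    return math.exp(-0.5 * (u / bandwidth) ** 2)


def nadaraya_watson(x_query, xs, ys, bandwidth=1.0):
    """Kernel-weighted average of ys, with weights centered at x_query.

    Note: if x_query is very far from every training point, all weights
    can underflow to zero and the division below will fail.
    """
    weights = [gaussian_kernel(x_query - xi, bandwidth) for xi in xs]
    total = sum(weights)
    return sum(w * yi for w, yi in zip(weights, ys)) / total


# Toy data: y = x^2 sampled at four points.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [0.0, 1.0, 4.0, 9.0]
estimate = nadaraya_watson(1.5, xs, ys, bandwidth=0.5)
print(estimate)
```

A smaller bandwidth makes the estimate follow nearby points more closely; a larger one smooths more aggressively toward the overall mean.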