Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview...

50
Introduction Frank Wood July 6, 2010

Transcript of Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview...

Page 1: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

Introduction

Frank Wood

July 6, 2010

Page 2: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

IntroductionOverview of Topics

Page 3: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

Introduction

I Data mining is the search for patterns in large collections ofdata

I Pattern recognition is concerned with automatically findingpatterns in data

I Machine learning is pattern recognition with concern forcomputational tractability

Page 4: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

Example : Handwritten Digit Identification

Figure: Hand written digits from the USPS

Page 5: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

Approaches to Identifying Digits

Goal

I Build a machine that can identify digits automatically

Approaches

I Hand craft a set of rules that separate each digit from the next

I Set of rules invariably grows large and unwieldy and requiresmany “exceptions”

I “Learn” a set of models for each digit automatically fromlabeled training data.

Formalism

I Each digit is 28x28 pixel image

I Vectorized into a 784 entry vector x

Page 6: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

Machine learning approach

I Obtain a of N digits {x1, . . . , xN} called the training set.

I Label (by hand) the training set to produce a label or “target”t for each digit image x

I Learn a function y(x) which takes an image x as input andreturns an output in the same “format” as the target vector.

Page 7: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

0 1

−1

0

1

Figure: Chapter 1, Figure 2

Page 8: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

t

x

y(xn,w)

tn

xn

Figure: Chapter 1, Figure 3

Page 9: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

�����

0 1

−1

0

1

Figure: Chapter 1, Figure 4a

Page 10: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

�����

0 1

−1

0

1

Figure: Chapter 1, Figure 4b

Page 11: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

�����

0 1

−1

0

1

Figure: Chapter 1, Figure 4c

Page 12: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

�����

0 1

−1

0

1

Figure: Chapter 1, Figure 4d

Page 13: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

�����

0 3 6 90

0.5

1TrainingTest

Figure: Chapter 1, Figure 5

Page 14: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

�������

0 1

−1

0

1

Figure: Chapter 1, Figure 6a

Page 15: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

���������

0 1

−1

0

1

Figure: Chapter 1, Figure 6b

Page 16: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

� ������� �

0 1

−1

0

1

Figure: Chapter 1, Figure 7a

Page 17: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

� ������

0 1

−1

0

1

Figure: Chapter 1, Figure 7b

Page 18: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

�����

� ���−35 −30 −25 −200

0.5

1TrainingTest

Figure: Chapter 1, Figure 8

Page 19: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

Figure: Chapter 1, Figure 9

Page 20: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

}

}ci

rjyj

xi

nij

Figure: Chapter 1, Figure 10

Page 21: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

���������

����

�����

Figure: Chapter 1, Figure 11a

Page 22: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

�������

Figure: Chapter 1, Figure 11b

Page 23: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

�������

Figure: Chapter 1, Figure 11c

Page 24: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

��� ��� �����

Figure: Chapter 1, Figure 11d

Page 25: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

xδx

p(x) P (x)

Figure: Chapter 1, Figure 12

Page 26: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

N (x|µ, σ2)

x

µ

Figure: Chapter 1, Figure 13

Page 27: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

x

p(x)

xn

N (xn|µ, σ2)

Figure: Chapter 1, Figure 14

Page 28: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

(a)

(b)

(c)

Figure: Chapter 1, Figure 15

Page 29: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

t

xx0

2σy(x0,w)

y(x,w)

p(t|x0,w, β)

Figure: Chapter 1, Figure 16

Page 30: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

0 1

−1

0

1

Figure: Chapter 1, Figure 17

Page 31: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

run 1

run 2

run 3

run 4

Figure: Chapter 1, Figure 18

Page 32: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

���

���

0 0.25 0.5 0.75 10

0.5

1

1.5

2

Figure: Chapter 1, Figure 19

Page 33: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

���

���

0 0.25 0.5 0.75 10

0.5

1

1.5

2

Figure: Chapter 1, Figure 20

Page 34: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

x1

D = 1

Figure: Chapter 1, Figure 21a

Page 35: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

x1

x2

D = 2

Figure: Chapter 1, Figure 21b

Page 36: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

x1

x2

x3

D = 3

Figure: Chapter 1, Figure 21c

Page 37: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

volu

me

frac

tion

�����

�����

����

�����

0 0.2 0.4 0.6 0.8 10

0.2

0.4

0.6

0.8

1

Figure: Chapter 1, Figure 22

Page 38: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

�����

�����

������

��� �

0 2 40

1

2

Figure: Chapter 1, Figure 23

Page 39: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

R1 R2

x0 x̂

p(x, C1)

p(x, C2)

x

Figure: Chapter 1, Figure 24

Page 40: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

x

p(C1|x) p(C2|x)

0.0

1.0θ

reject region

Figure: Chapter 1, Figure 26

Page 41: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

������� ���

����� � ����

clas

s de

nsiti

es

0 0.2 0.4 0.6 0.8 10

1

2

3

4

5

Figure: Chapter 1, Figure 27a

Page 42: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

�������� �� ����� ��� ��

0 0.2 0.4 0.6 0.8 10

0.2

0.4

0.6

0.8

1

1.2

Figure: Chapter 1, Figure 27b

Page 43: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

t

xx0

y(x0)

y(x)

p(t|x0)

Figure: Chapter 1, Figure 28

Page 44: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

�����

�����

� ����� �

−2 −1 0 1 20

1

2

Figure: Chapter 1, Figure 29a

Page 45: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

�����

���

��

� ���

−2 −1 0 1 20

1

2

Figure: Chapter 1, Figure 29b

Page 46: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

�����

���

��

� ���

−2 −1 0 1 20

1

2

Figure: Chapter 1, Figure 29c

Page 47: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

�����

���

��

� �����

−2 −1 0 1 20

1

2

Figure: Chapter 1, Figure 29d

Page 48: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

prob

abili

ties

������� ���

0

0.25

0.5

Figure: Chapter 1, Figure 30a

Page 49: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

prob

abili

ties

������� ���

0

0.25

0.5

Figure: Chapter 1, Figure 30b

Page 50: Introduction - Columbia University · Introduction Frank Wood July 6, 2010. Introduction Overview of Topics. Introduction I Data mining is the search for patterns in large collections

xa bxλ

chord

f(x)

Figure: Chapter 1, Figure 31