Introduction to Pattern Recognition Chapter 1 ( Duda et al.)

Introduction to Pattern Recognition

Chapter 1 (Duda et al.)

CS479/679 Pattern RecognitionDr. George Bebis

What is Pattern Recognition?

• Assign an unknown pattern to one of several known categories (or classes).

What is a Pattern?

• A pattern could be an object or event.

biometric patterns hand gesture patterns

What is a Pattern? (con’t)

• Loan/Credit card applications – Income, # of dependents, mortgage amount credit worthiness

classification.

• Dating services– Age, hobbies, income “desirability” classification

• Web documents– Key-word based descriptions (e.g., documents containing

“football”, “NFL”) document classification.

Pattern Class

• A collection of “similar” objects.

Female Male

How do we model a Pattern Class?

• Typically, using a statistical model.– probability density function (e.g., Gaussian)

Gender Classification malefemale

How do we model a Pattern Class? (cont’d)

• Key challenges:

– Intra-class variability

– Inter-class variability

7Letters/Numbers that look similar

The letter “T” in different typefaces

Pattern Recognition:Main Objectives

• Hypothesize the models that describe each pattern class (e.g., recover the process that generated the patterns).

• Given a novel pattern, choose the best-fitting model for it and then assign it to the pattern class associated with the model.

Classification vs Clustering

– Classification (known categories) – Clustering (unknown categories)

Category “A”

Category “B”

Classification (Recognition) (Supervised Classification)

Clustering(Unsupervised Classification)

Pattern Recognition Applications

Handwriting Recognition

Handwriting Recognition (cont’d)

License Plate Recognition

Biometric Recognition

Fingerprint Classification

Face Detection

Gender Classification

Balanced classes (i.e., male vs female)

Autonomous Systems

Medical Applications

Skin Cancer Detection Breast Cancer Detection

Land Classification(from aerial or satellite images)

“Hot” Applications

• Recommendation systems– Amazon, Netflix

• Targeted advertising

The Netflix Prize

• Predict how much someone is going to enjoy a movie based on their movie preferences.– $1M awarded in Sept. 2009

• Can software recommend movies to customers?– Not Rambo to Woody Allen fans– Not Saw VI if you’ve seen all previous Saw movies

Main Classification Approaches

x: input vector (pattern)

y: class label (class)

•Generative – Model the joint probability, p(x, y)– Make predictions by using Bayes rules to calculate p(ylx)– Pick the most likely label y

•Discriminative – Estimate p(ylx) directly (e.g., learn a direct map from inputs x to

the class labels y)– Pick the most likely label y

“Syntactic” Pattern Recognition Approach

• Represent patterns in terms of simple primitives.

• Describe patterns using deterministic grammars or formal languages.

Complexity of PR – An Example

Problem: Sorting incoming fish on a conveyor belt.

Assumption: Two kind of fish: (1) sea bass (2) salmon

Pre-processing Step

Example

(1) Image enhancement

(2) Separate touchingor occluding fish

(3) Find the boundary of each fish

Feature Extraction

• Assume a fisherman told us that a sea bass is generally longer than a salmon.

• We can use length as a feature and decide between sea bass and salmon according to a threshold on length.

• How should we choose the threshold?

“Length” Histograms

• Even though sea bass is longer than salmon on the average, there are many examples of fish where this observation does not hold.

threshold l*

“Average Lightness” Histograms• Consider a different feature such as “average

lightness”

• It seems easier to choose the threshold x* but we still cannot make a perfect decision.

threshold x*

Multiple Features• To improve recognition accuracy, we might have to

use more than one features at a time.– Single features might not yield the best performance.– Using combinations of features might yield better

performance.

• How many features should we choose?

x lightness

x width

Classification• Partition the feature space into two regions by

finding the decision boundary that minimizes the error.

• How should we find the optimal decision boundary?

PR System – Two Phases

Training PhaseTest Phase

Sensors & Preprocessing• Sensing:

– Use a sensor (camera or microphone) for data capture.– PR depends on bandwidth, resolution, sensitivity,

distortion of the sensor.

• Pre-processing:– Removal of noise in data.– Segmentation (i.e., isolation of patterns of interest from

background).

Training/Test data

• How do we know that we have collected an adequately large and representative set of examples for training/testing the system?

Training Set Test Set ?

Feature Extraction• How to choose a good set of features?

– Discriminative features

– Invariant features (e.g., translation, rotation and scale)

• Are there ways to automatically learn which features are best ?

How Many Features?

• Does adding more features always improve performance?– It might be difficult and computationally

expensive to extract certain features.– Correlated features might not improve

performance.– “Curse” of dimensionality.

Curse of Dimensionality• Adding too many features can, paradoxically, lead to a

worsening of performance.– Divide each of the input features into a number of intervals, so

that the value of a feature can be specified approximately by saying in which interval it lies.

– If each input feature is divided into M divisions, then the total number of cells is Md (d: # of features).

– Since each cell must contain at least one point, the number of training data grows exponentially with d.

Missing Features

• Certain features might be missing (e.g., due to occlusion).

• How should we train the classifier with missing features ?

• How should the classifier make the best decision with missing features ?

Complexity

• We can get perfect classification performance on the training data by choosing complex models.

• Complex models are tuned to the particular training samples, rather than on the characteristics of the true model.

How well can the model generalize to unknown samples?

overfitting

Generalization• Generalization is defined as the ability of a classifier to

produce correct results on novel patterns.• How can we improve generalization performance ?

– More training examples (i.e., better model estimates).– Simpler models usually yield better performance.

complex model simpler model

Introduction to Pattern Recognition Chapter 1 ( Duda et al.)

Documents

Transcript of Introduction to Pattern Recognition Chapter 1 ( Duda et al.)

1 Chapter 1 INTRODUCTION. 2 What is Pattern Recognition? Pattern Recognition by Human perceptual specialized – decision making Pattern Recognition by.

Deep Learning For Sequential Pattern Recognition...Pattern or Pattern Recognition is the process of taking in raw data and taking an action based on the category of the pattern [Duda

Parameter Estimation: Maximum Likelihood Estimation Chapter 3 (Duda et al.) – Sections 3.1-3.2 CS479/679 Pattern Recognition Dr. George Bebis.

Pattern Recognition

Expectation-Maximization (EM) Chapter 3 (Duda et al.) – Section 3.9 CS479/679 Pattern Recognition Dr. George Bebis.

1 Perceptual Processes Introduction Pattern Recognition Pattern Recognition Top-down Processing & Pattern Recognition Top-down Processing & Pattern Recognition.

Dimensionality Reduction Chapter 3 (Duda et al.) – Section 3.8 CS479/679 Pattern Recognition Dr. George Bebis.

Introduction to Pattern Recognition. Pattern Recognition.

Pattern Recognition Techniques for Boson Sampling Validationqtml2017.di.univr.it/resources/Slides/Pattern-recognition...Pattern Recognition Techniques for Boson Sampling Validation

Support Vector Machines (SVMs) Chapter 5 (Duda et al.) CS479/679 Pattern Recognition Dr. George Bebis.

Introduction to Pattern Recognition Chapter 1 (Duda et al.) CS479/679 Pattern Recognition Dr. George Bebis 1.

Dr. Ng Teck Khim CS4243 Computer Vision and Pattern Recognition Pattern Recognition Basics 1 Pattern Recognition Features Principle Components Analysis.

Statistical Pattern Recognition Techniques for Intrusion ... · Statistical Pattern Recognition Techniques for Intrusion Detection in Computer ... Pattern recognition techniques have

PATTERN RECOGNITION · 1. Mapping various real world problem into a pattern recognition framework 2. Studying Statistical Pattern Recognition Approaches (Ch1-Ch6 of Duda et al. 2001)

Pattern Recognition & Matlab Intro: Pattern Recognition, Fourth Edition

Pattern Recognition •Why is pattern recognition …Pattern Recognition •Why is pattern recognition important? –Humans’ ability to recognize patterns is what separates us most

Pattern Recognition: Statistics to Deep Learningbiometrics.cse.msu.edu/Presentations/Anil Jain_BAAI_June21.pdf · Pattern Recognition By pattern recognition we mean the extraction

MULTIVARIATE PATTERN RECOGNITION FOR …mobil.bfr.bund.de/cm/349/multivariate-pattern-recognition-for... · Pattern Recognition Many definitions Most modern definitions involve classification

Linear Discriminant Functions Chapter 5 (Duda et al.) CS479/679 Pattern Recognition Dr. George Bebis.

Syntactic Pattern Recognition - User page server for CoEuser.engineering.uiowa.edu/~image/READING/Duda-PR-transparencies...Sonka: Pattern Recognition Class 1 Syntactic Pattern Recognition