Face Recognition Using Neural Networks
Presented by: Hadis Mohseni, Leila Taghavi, Atefeh Mirsafian
Outline
- Overview
- Scaling Invariance
- Rotation Invariance
- Face Recognition Methods
  - Multi-Layer Perceptron
  - Hybrid NN (SOM + Convolutional NN)
- Conclusion
Overview
Scaling Invariance
Magnifying an image while minimizing the loss of perceptual quality. Approaches:
- Interpolation methods: weighted sum of neighboring pixels.
- Content-adaptive methods: edge-directed, classification-based.
- Multilayer neural networks.
Proposed method: content-adaptive neural filters using pixel classification.
Scaling Invariance (Cont.)
Pixel classification with Adaptive Dynamic Range Coding (ADRC):

ADRC(x) = 0 if x < x_av, 1 otherwise (x_av is the average of the window)

Concatenating ADRC(x) over all pixels in the window gives the class code. If we invert the picture data, the coefficients for the filter should remain the same ⇒ the number of classes can be halved: 2^(N−1) classes for a window with N pixels.
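As an illustration (not code from the slides), a minimal sketch of 1-bit ADRC classification, assuming the window arrives as a NumPy array; the bitwise-complement trick implements the class halving noted above:

```python
import numpy as np

def adrc_class_code(window):
    """1-bit ADRC: each pixel maps to 0 if below the window average, else 1.
    Concatenating the bits gives the class code for the window."""
    bits = (window.flatten() >= window.mean()).astype(int)
    code = int("".join(map(str, bits)), 2)
    n = bits.size
    # Inverting the image flips every bit but should use the same filter,
    # so a code and its bitwise complement share one class: keep the smaller.
    complement = (~code) & ((1 << n) - 1)
    return min(code, complement)  # at most 2**(n-1) distinct classes
```

Inverting the window (e.g. `max + min - window`) flips every bit, so both versions land in the same class.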
Scaling Invariance (Cont.)
Content-adaptive neural filters: the original high-resolution image y and its downscaled version x are employed as the training set. Each pair (x, y) is classified by applying ADRC to the input vector x. The optimal coefficients are obtained for each class and stored at the corresponding index of a look-up table (LUT).
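The training step can be sketched as follows. This is a hedged stand-in, not the slides' method: it fits a plain linear filter per class by least squares, and the name `train_class_filters` and the array shapes are assumptions:

```python
import numpy as np

def train_class_filters(patches, targets, classes, n_classes):
    """Fit one linear filter per ADRC class and store the coefficients
    in a look-up table (LUT), indexed by class.
    patches: (M, N) low-res input windows; targets: (M,) high-res pixels;
    classes: (M,) ADRC class index of each window."""
    lut = np.zeros((n_classes, patches.shape[1]))
    for c in range(n_classes):
        mask = classes == c
        if mask.any():
            # least-squares coefficients for this class's (x, y) pairs
            coeffs, *_ = np.linalg.lstsq(patches[mask], targets[mask], rcond=None)
            lut[c] = coeffs
    return lut
```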
Scaling Invariance (Cont.)
A simple 3-layer feedforward architecture with few neurons in the hidden layer; the activation function in the hidden layer is tanh. The neural network can be described as:

y_1 = Σ_{n=1}^{N_h} w_n · tanh(u_n · x + b_{n,0}) + b_1

y_2, y_3 and y_4 can be calculated in the same way by flipping the window symmetrically.
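A minimal forward pass matching that description (names and array shapes are assumptions, not from the slides):

```python
import numpy as np

def neural_filter(x, U, b0, w, b1):
    """Forward pass of the 3-layer filter described above:
    y = sum_n w[n] * tanh(U[n] @ x + b0[n]) + b1.
    U: (Nh, N) input-to-hidden weights; w: (Nh,) hidden-to-output weights."""
    return float(w @ np.tanh(U @ x + b0) + b1)
```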
Scaling Invariance (Cont.)
Pixel classification set reduction:
1. Calculate the (squared) Euclidean distance between the normalized coefficient vectors of each pair of classes:

   D = Σ_i (a_i − b_i)²

   where a and b are the normalized coefficient vectors of the two classes.
2. If the distance is below the threshold, combine the classes. The coefficients are then obtained by training on the combined data of the corresponding classes.
3. Repeat step 1 on the new class set until no distance falls below the threshold.
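The reduction procedure can be sketched roughly as follows. This is a simplified stand-in: it averages the merged coefficient vectors instead of retraining on the combined data, and all names are hypothetical:

```python
import numpy as np

def merge_classes(lut, threshold):
    """Greedily merge classes whose normalized coefficient vectors are
    closer than `threshold` (squared Euclidean distance), repeating until
    no pair is below the threshold. Returns an old->new class index map.
    NOTE: the slides retrain coefficients on the combined data after a
    merge; here the merged vector is just the re-normalized mean."""
    coeffs = {i: v / np.linalg.norm(v) for i, v in enumerate(lut)}
    mapping = {i: i for i in range(len(lut))}
    merged = True
    while merged:
        merged = False
        keys = sorted(coeffs)
        for i in keys:
            for j in keys:
                if j <= i or i not in coeffs or j not in coeffs:
                    continue
                if np.sum((coeffs[i] - coeffs[j]) ** 2) < threshold:
                    v = (coeffs[i] + coeffs[j]) / 2   # merge j into i
                    coeffs[i] = v / np.linalg.norm(v)
                    del coeffs[j]
                    for k, tgt in mapping.items():
                        if tgt == j:
                            mapping[k] = i
                    merged = True
    return mapping
```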
Scaling Invariance (Cont.)
Rotation Invariance
Handling in-plane rotation of faces, using a neural network called the router. The router's input is the same region that the detector network will receive as input; the router returns the angle of the face.
Rotation Invariance (Cont.)
The output angle can be represented by: a single unit, 1-of-N encoding, or Gaussian output encoding. An array of 72 output units is used in the proposed method: for a face at angle θ, output i is trained to have a value of cos(θ − i×5°).
The angle of an input face is then computed as:

θ = atan2( Σ_{i=0}^{71} output_i · sin(i×5°), Σ_{i=0}^{71} output_i · cos(i×5°) )
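The encoding and decoding above can be sketched as follows (an illustration; the function names are assumptions):

```python
import math

def encode_angle(theta_deg):
    """Ideal training targets for a face at angle theta:
    output i aims for cos(theta - i*5 degrees)."""
    return [math.cos(math.radians(theta_deg - 5 * i)) for i in range(72)]

def decode_angle(outputs):
    """Decode the face angle from the 72 router outputs by taking the
    atan2 of the sine- and cosine-weighted output sums."""
    s = sum(o * math.sin(math.radians(5 * i)) for i, o in enumerate(outputs))
    c = sum(o * math.cos(math.radians(5 * i)) for i, o in enumerate(outputs))
    return math.degrees(math.atan2(s, c)) % 360
```

Decoding the ideal encoding recovers the original angle, since the 72 units sample a full 360° in 5° steps.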
Rotation Invariance (Cont.)
Router architecture:
- Input is a 20×20 window of the scaled image.
- The router has a single hidden layer with a total of 100 units, arranged in 4 sets of 25 units.
- Each unit connects to a 4×4 region of the input; each set of 25 units covers the entire input without overlap.
- The activation function for the hidden layer is tanh.
- The network is trained using the standard error backpropagation algorithm.
Rotation Invariance (Cont.)
Generating a set of manually labeled example images, then aligning the labeled faces:
1. Initialize F, a vector that will hold the average position of each labeled feature over all the training faces.
2. Align each face with F by computing a rotation and scaling.
3. Since the transformation can be written as linear functions, it can be solved for the best alignment.
4. After iterating these steps a small number of times, the alignments converge.
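Steps 2 and 3 amount to a linear least-squares problem in the parameters of a similarity transform. A sketch under that reading (names are hypothetical; translation is included alongside rotation and scale):

```python
import numpy as np

def align_to_mean(points, mean_points):
    """Best similarity transform (rotation + scale + translation) mapping
    `points` onto `mean_points`, solved as linear least squares in the
    parameters (a, b, tx, ty) of:
        x' = a*x - b*y + tx,   y' = b*x + a*y + ty."""
    A, rhs = [], []
    for (x, y), (mx, my) in zip(points, mean_points):
        A.append([x, -y, 1, 0]); rhs.append(mx)
        A.append([y,  x, 0, 1]); rhs.append(my)
    params, *_ = np.linalg.lstsq(np.array(A, float), np.array(rhs, float),
                                 rcond=None)
    return params  # (a, b, tx, ty)
```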
Rotation Invariance (Cont.)
To generate the training set, the faces are rotated to a random orientation.
Rotation Invariance (Cont.)
Empirical results:
Rotation Invariance (Cont.)
Face Recognition Methods
Database: ORL (Olivetti Research Laboratory). The database consists of 10 different 92×112 images of each of 40 distinct subjects: 5 images per person for the training set and 5 for the test set. There are variations in facial expression and facial detail.
Face Recognition Methods
Multi-Layer Perceptron: The training set faces are run through a PCA, and the 200 corresponding eigenvectors (principal components) are found, which can be displayed as eigenfaces. Each face in the training set can be reconstructed by a linear combination of all the principal components. By projecting the test set images onto the eigenvector basis, the eigenvector expansion coefficients can be found (a dimensionality reduction!).
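A compact sketch of the eigenface projection described above (the slides give no code; the SVD route, shapes, and names are assumptions):

```python
import numpy as np

def eigenface_coefficients(train_faces, test_faces, n_components=200):
    """PCA / eigenface projection: find the top principal components of
    the training faces and project test faces onto them, reducing each
    image to `n_components` expansion coefficients."""
    mean = train_faces.mean(axis=0)
    X = train_faces - mean
    # SVD of the centered data yields the covariance eigenvectors as rows of Vt
    _, _, Vt = np.linalg.svd(X, full_matrices=False)
    basis = Vt[:n_components]             # eigenfaces, one per row
    return (test_faces - mean) @ basis.T  # coefficients fed to the classifier
```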
Face Recognition Methods (Cont.) MLP
- The classifier is trained using the coefficients of the training-set images.
- A variable number of principal components, ranging from 25 to 200, is used in different simulations.
- Each simulation is repeated 5 times with random initialization of all MLP parameters, and the results for that number are averaged.
- The error backpropagation learning algorithm was applied with a small constant learning rate (normally < 0.01).
Face Recognition Methods (Cont.) MLP
Results:
Face Recognition Methods (Cont.)
Hybrid NN
Face Recognition Methods (Cont.) Hybrid NN
1. Local Image Sampling. Two methods were evaluated:
- A vector of the intensities of the pixels in a local window centered at (i, j):
  x = [x_{i−W,j−W}, ..., x_{i,j}, ..., x_{i+W,j+W}]
- A vector of the differences between the center pixel and the surrounding pixels, with the center itself weighted by w_{ij}:
  x = [x_{i,j} − x_{i−W,j−W}, ..., w_{ij}·x_{i,j}, ..., x_{i,j} − x_{i+W,j+W}]
Face Recognition Methods (Cont.) Hybrid NN
2. Self-Organizing Map
Face Recognition Methods (Cont.) Hybrid NN
The SOM update rule:

m_i(t+1) = m_i(t) + h_{ci}(t) · [x(t) − m_i(t)]

where:
- m_i is a reference vector in the input space assigned to each node in the SOM,
- c is the node with the weight vector closest to the input x = [x_1, x_2, ..., x_n]^T,
- h_{ci}(t) is a neighborhood function, e.g. h_{ci}(t) = α(t) · exp(−‖r_c − r_i‖² / σ²(t)), with h_{ci}(t) → 0 as t → ∞,
- α(t) is a scalar-valued learning rate,
- σ(t) defines the width of the kernel.
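One update of the rule above, as a hedged sketch (the exponential decay schedules for α and σ are assumptions, not from the slides):

```python
import numpy as np

def som_step(nodes, grid, x, t, alpha0=0.5, sigma0=2.0, tau=100.0):
    """One self-organizing-map update.
    nodes: (K, n) reference vectors m_i; grid: (K, 2) node positions r_i.
    Finds the best-matching node c, then moves every m_i toward the input
    x weighted by the Gaussian neighborhood h_ci(t)."""
    c = np.argmin(np.linalg.norm(nodes - x, axis=1))   # winning node
    alpha = alpha0 * np.exp(-t / tau)                  # decaying learning rate
    sigma = sigma0 * np.exp(-t / tau)                  # shrinking kernel width
    d2 = np.sum((grid - grid[c]) ** 2, axis=1)
    h = alpha * np.exp(-d2 / sigma ** 2)               # neighborhood function
    return nodes + h[:, None] * (x - nodes)
```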
Face Recognition Methods (Cont.) Hybrid NN
SOM image samples corresponding to each node before training and after training
Face Recognition Methods (Cont.) Hybrid NN
3. Convolutional NNs. Invariant to some degree of shift and deformation, using these 3 ideas:
- Local receptive fields
- Shared weights (aiding generalization)
- Spatial subsampling
Face Recognition Methods (Cont.) Hybrid NN
Face Recognition Methods (Cont.) Hybrid NN
Network layers:
- Convolutional layers: each layer contains one or more planes. Each plane can be considered a feature map with a fixed feature detector that is convolved with a local window scanned over the planes of the previous layer.
- Subsampling layers: a local averaging and subsampling operation.
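Hedged sketches of the two layer types (the tanh placement, stride, and block size are assumptions, not taken from the slides):

```python
import numpy as np

def convolve_plane(plane, kernel, bias=0.0):
    """Feature map: a fixed detector (kernel) convolved over the input
    plane (valid mode, stride 1), followed by a tanh nonlinearity."""
    kh, kw = kernel.shape
    H, W = plane.shape
    out = np.empty((H - kh + 1, W - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(plane[i:i + kh, j:j + kw] * kernel) + bias
    return np.tanh(out)

def subsample(plane, k=2):
    """Subsampling layer: local averaging over non-overlapping k x k blocks."""
    H, W = plane.shape
    return plane[:H - H % k, :W - W % k].reshape(H // k, k, W // k, k).mean(axis=(1, 3))
```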
Face Recognition Methods (Cont.) Hybrid NN
Convolutional and Sampling relations:
Face Recognition Methods (Cont.) Hybrid NN
Simulation details:
- Initial weights are uniformly distributed random numbers in the range [−2.4/F_i, 2.4/F_i], where F_i is the fan-in of neuron i.
- Target outputs are −0.8 and 0.8, using the tanh output activation function.
- Weights are updated after each pattern presentation.
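The initialization rule can be written directly (the function name is hypothetical):

```python
import numpy as np

def init_weights(fan_in, n_weights, rng=None):
    """Fan-in scaled uniform initialization: weights drawn uniformly
    from [-2.4/fan_in, 2.4/fan_in]."""
    if rng is None:
        rng = np.random.default_rng()
    limit = 2.4 / fan_in
    return rng.uniform(-limit, limit, size=n_weights)
```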
Face Recognition Methods (Cont.) Hybrid NN
Experimental results. Experiment #1:
Variation of the number of output classes
Face Recognition Methods (Cont.) Hybrid NN
Variation of the dimensionality of the SOM
Face Recognition Methods (Cont.) Hybrid NN
Substituting the SOM with the KLT
Replacing the CN with an MLP
The tradeoff between rejection threshold and recognition accuracy
Face Recognition Methods (Cont.) Hybrid NN
Face Recognition Methods (Cont.) Hybrid NN
Comparison with other known results on the same database
Face Recognition Methods (Cont.) Hybrid NN
Variation of the number of training images per person
Face Recognition Methods (Cont.) Hybrid NN
Face Recognition Methods (Cont.)
Experiment #2:
Face Recognition Methods (Cont.)
Conclusion
The results of the face recognition experiments are greatly influenced by:
- the training data,
- the preprocessing function,
- the type of network selected,
- the activation functions.
A fast, automatic system for face recognition has been presented, combining a SOM and a convolutional network. This network is partially invariant to translation, rotation, scale, and deformation.