MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with...

21
MATH6380O Mini-Project 1 Image Classification with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong University of Science and Technology March 13, 2018

Transcript of MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with...

Page 1: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

MATH6380O Mini-Project 1Image Classification with Extracted Feature

ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min

Hong Kong University of Science and Technology

March 13, 2018

Page 2: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Outline

1 Preprocessing data

2 Feature ExtractionScattering NetResNet-50

3 Classification

4 Discussion

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 2 / 21

Page 3: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Preprocessing data

Outline

1 Preprocessing data

2 Feature ExtractionScattering NetResNet-50

3 Classification

4 Discussion

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 3 / 21

Page 4: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Preprocessing data

More Samples from Limited Paintins

Original data: 28 paintings totally, 12 genuine, 9 fake, 7 unknown.

Crop more samples from one single painting, that is, we crop 200 sampleswith 224*224 size.

Random cropping does not work well.

Figure: Painting No.9

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 4 / 21

Page 5: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Preprocessing data

Variance Threshold

We randomly crop 5000 samples from each paintings and compute theirvariance, whereby we get empirical distribution of variances.

Small variance may represent empty sample. We could set a properthreshold to distinguish meaningful samples from empty ones.

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 5 / 21

Page 6: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Preprocessing data

Variance Threshold

First, for every painting we set 90th percentile as variance threshold toselect samples cropped randomly. But it does not work well. Then wepre-crop paintings with edges like No. 18 and then crop them randomlywith variance threshold.

Figure: Samples from No. 18

Figure: Samples from pre-cropped No. 18Figure: Painting No. 18

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 6 / 21

Page 7: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Preprocessing data

Choose a Proper Variance Threshold

If we choose large variancethreshold, we could only cropsamples from small area. Howto choose a proper variancethreshold? In project, wesimply choose 45th percentileas variance threshold.

Figure: Samples with 45th

percentile from No. 18

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 7 / 21

Page 8: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Feature Extraction

Outline

1 Preprocessing data

2 Feature ExtractionScattering NetResNet-50

3 Classification

4 Discussion

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 8 / 21

Page 9: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Feature Extraction Scattering Net

Parameters of Scattering Net

We use the package ScatNet 2.0 from ENS.

Parameter setting

filt opt.J = 5, the number of scale of wavelets (high pass filters)

filt opt.L = 6, the number of orientations

scat opt.M = 3, the maximum scattering order (layers of scatter net)

Samples are RGB small images. We implment scattering net on eachchannel and then concatenate transformed feature together as a singlevecter.

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 9 / 21

Page 10: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Feature Extraction ResNet-50

RestNet 50

We use a pre-trained ResNet-50 model on Image Net trained byTensor flow.

Data pre-processing.

Remove last layer and use the output as feature.

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 10 / 21

Page 11: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Classification

Outline

1 Preprocessing data

2 Feature ExtractionScattering NetResNet-50

3 Classification

4 Discussion

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 11 / 21

Page 12: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Classification

Classification Methods

Use features extracted by CNN and Scatter Network

Linear Regression

SVM

KNN

Fine tune ResNet 50

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 12 / 21

Page 13: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Classification

Results on extracted features

Leave one out scheme for testing

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 13 / 21

Page 14: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Classification

Data splitting for ResNet 50

Pre-select samples of 7 paintings as test set.

Pre-select 20% in training set as validation setOne concern is that features of samples from the same painting aresimilar.

Trained on a 1080ti GPU.

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 14 / 21

Page 15: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Classification

Results

Accuracy on samples: 84.28%

Accuracy on paintings: 85.71% (6/7)

Voting result:

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 15 / 21

Page 16: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Discussion

Outline

1 Preprocessing data

2 Feature ExtractionScattering NetResNet-50

3 Classification

4 Discussion

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 16 / 21

Page 17: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Discussion

Training Losses

During the first epoch, the training acc is about 100%.

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 17 / 21

Page 18: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Discussion

Re-choose training and validation set

The validation set are all non-Raphael paintings while the training setis consist of mostly Raphael paintings.

Split the data again, and use most of the non-Raphael paintings astraining data.

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 18 / 21

Page 19: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Discussion

Result

Accuracy on samples: 31.9%.

Accuracy on paintings: 28.6%.

Voting result:

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 19 / 21

Page 20: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Discussion

Conclusion

The dominant factor is which painting the sample belongs to.

The network draw a boundary for Rapheals paintings out, andhopefully they are close to each others. So we have many truepositive cases.

But the boundary is not accurate, so it leads to many false negativecases.

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 20 / 21

Page 21: MATH6380O Mini-Project 1 Image Classi cation …...MATH6380O Mini-Project 1 Image Classi cation with Extracted Feature ZHANG Jianhui, ZHANG Hongming, ZHU Weizhi, FAN Min Hong Kong

Discussion

Q&A

Thank you for listening!

Q&A

Zhang, Zhang, Zhu, Fan (HKUST) MATH6380O Project 1 March 13, 2018 21 / 21