CS229 MachineLearning BeAlert!AccidentsHurt! Team #924...

Be Alert! Accidents Hurt! Liuming Zhao Taiming Zhang Liz Guo Ø Introduction Ø Models Fully Connected(FC) Ø Results & Analysis Confusion Matrix Models Epoch Batch Size Validation Accuracy Validation Loss Kaggle Score VGG-16 15 16 85.01% 0.4954 0.64 Basic CNN 20 64 74.38% 1.2081 1.32 Fully Connected 20 64 11.4% 6.1 6.8 VGG-16 + KNN 15 16 86.23% 0.4523 0.58 Ø Data • Driving distraction has always been a driving safety issue. • With the goal of detecting driver distractions, we want to design a driver posture classification system—classify the input image into 10 classes, such as texting, drinking, etc. • Fully Connected Neural Network (FC), Basic Convolutional Neural Network (CNN)，Transfer learning using VGG-16 and Inception-v4 are compared in this classification problem. • Use the dataset provided by the State Farm in the Kaggle Challenge. Consists of mages (640x480 pixels RGB) with different drivers’ behaviors. • Train set contains 22,400 labeled images, test set contains 79,727 unlabeled images. • Use K-folds cross-validation method to split training data into training set and validation set. • Resized the images from size 640x480x3 into 150x150x3. Fig.3 VGG-16 architecture Fig.4 Inception-v4 architecture[3] Fig.2 Basic CNN architecture Table 1 Results comparison of different models Fig.5 Train & validation accuracy using basic CNN and VGG-16 Fig.6 Confusion matrix using basic CNN(left) and VGG-16(right) Ø Summary CS229 Machine Learning Team #924 VGG-16 Inception-v4 Ø Method • We use the log loss function given as follow: Basic CNN Fig.1 FC neural network architecture • Fully Connected Neural Network: High bias with the lowest accuracy and Kaggle score. Deeper and more complicated model should be proposed. • Basic CNN: Improve performance dramatically but still have serious overfitting problem. We use adam optimizer with learning rate 0.001and trained the model with 20 epochs. • VGG-16: Relatively low bias with high overfitting problem. We use adam optimizer with learning rate 0.001 and trained the model with 15 epochs. • VGG-16 + KNN: Improve VGG-16 model a little bit but still need future improvement. • Perform the data augmentation • Currently training and debugging Inception-v4 model. • Pseudo Labeling Ø Ongoing & Future Work Reference [1] cs 229 http://cs229.stanford.edu/proj2016/report/SamCenLuo-ClassificationOfDriverDistraction-report.pdf [2] Kaggle. State Farm Distracted Driver Detection. https://www.kaggle.com/c/state-farm-distracted- driverdetection/data [3] https://github.com/jonas-pettersson/fast-ai/blob/master/statefarm.ipynb

Upload
others
Category

Documents
view
4
download
0

Embed Size (px):

Transcript of CS229 MachineLearning BeAlert!AccidentsHurt! Team #924...

Page 1: CS229 MachineLearning BeAlert!AccidentsHurt! Team #924 …cs229.stanford.edu/proj2017/final-posters/5145465.pdf · 2018. 1. 4. · BeAlert!AccidentsHurt! LiumingZhao Taiming Zhang

Be Alert! Accidents Hurt!Liuming Zhao Taiming Zhang Liz Guo

Ø Introduction Ø ModelsFully Connected(FC)

Ø Results & Analysis Confusion Matrix

Models Epoch BatchSize

Validation Accuracy

Validation Loss

KaggleScore

VGG-16 15 16 85.01% 0.4954 0.64

Basic CNN 20 64 74.38% 1.2081 1.32

FullyConnected

20 64 11.4% 6.1 6.8

VGG-16 +KNN

15 16 86.23% 0.4523 0.58

Ø Data

• Driving distraction has alwaysbeen a driving safety issue.

• With the goal of detecting driverdistractions, we want to designa driver posture classificationsystem—classify the inputimage into 10 classes, such astexting, drinking, etc.

• Fully Connected Neural Network(FC), Basic ConvolutionalNeural Network (CNN)，Transferlearning using VGG-16 andInception-v4 are compared inthis classification problem.

• Use the dataset provided by the State Farm in the KaggleChallenge. Consists of mages (640x480 pixels RGB) withdifferent drivers’ behaviors.

• Train set contains 22,400 labeled images, test setcontains 79,727 unlabeled images.

• Use K-folds cross-validation method to split training datainto training set and validation set.

• Resized the images from size 640x480x3 into 150x150x3.

Fig.3 VGG-16 architecture

Fig.4 Inception-v4 architecture[3]Fig.2 Basic CNN architecture

Table 1 Results comparison of different models

Fig.5 Train & validation accuracy using basic CNN and VGG-16

Fig.6 Confusion matrix using basic CNN(left) and VGG-16(right)

Ø Summary

CS229Machine LearningTeam #924

VGG-16

Inception-v4

Ø Method• We use the log loss function given as follow:

Basic CNNFig.1 FC neural network architecture

• Fully Connected Neural Network: High bias withthe lowest accuracy and Kaggle score. Deeperand more complicated model should beproposed.

• Basic CNN: Improve performance dramaticallybut still have serious overfitting problem. Weuse adam optimizer with learning rate 0.001andtrained the model with 20 epochs.

• VGG-16: Relatively low bias with high overfittingproblem. We use adam optimizer with learningrate 0.001 and trained the model with 15 epochs.

• VGG-16 + KNN: Improve VGG-16 model a littlebit but still need future improvement.

• Perform the data augmentation• Currently training and debugging Inception-v4

model.• Pseudo Labeling

Ø Ongoing & Future Work

Reference[1] cs 229 http://cs229.stanford.edu/proj2016/report/SamCenLuo-ClassificationOfDriverDistraction-report.pdf[2] Kaggle. State Farm Distracted Driver Detection. https://www.kaggle.com/c/state-farm-distracted-driverdetection/data [3] https://github.com/jonas-pettersson/fast-ai/blob/master/statefarm.ipynb