CS229 MachineLearning BeAlert!AccidentsHurt! Team #924...

1
Be Alert! Accidents Hurt! Liuming Zhao Taiming Zhang Liz Guo Ø Introduction Ø Models Fully Connected(FC) Ø Results & Analysis Confusion Matrix Models Epoch Batch Size Validation Accuracy Validation Loss Kaggle Score VGG-16 15 16 85.01% 0.4954 0.64 Basic CNN 20 64 74.38% 1.2081 1.32 Fully Connected 20 64 11.4% 6.1 6.8 VGG-16 + KNN 15 16 86.23% 0.4523 0.58 Ø Data Driving distraction has always been a driving safety issue. With the goal of detecting driver distractions, we want to design a driver posture classification system—classify the input image into 10 classes, such as texting, drinking, etc. Fully Connected Neural Network (FC), Basic Convolutional Neural Network (CNN)Transfer learning using VGG-16 and Inception-v4 are compared in this classification problem. Use the dataset provided by the State Farm in the Kaggle Challenge. Consists of mages (640x480 pixels RGB) with different drivers’ behaviors. Train set contains 22,400 labeled images, test set contains 79,727 unlabeled images. Use K-folds cross-validation method to split training data into training set and validation set. Resized the images from size 640x480x3 into 150x150x3. Fig.3 VGG-16 architecture Fig.4 Inception-v4 architecture[3] Fig.2 Basic CNN architecture Table 1 Results comparison of different models Fig.5 Train & validation accuracy using basic CNN and VGG-16 Fig.6 Confusion matrix using basic CNN(left) and VGG-16(right) Ø Summary CS229 Machine Learning Team #924 VGG-16 Inception-v4 Ø Method We use the log loss function given as follow: Basic CNN Fig.1 FC neural network architecture Fully Connected Neural Network: High bias with the lowest accuracy and Kaggle score. Deeper and more complicated model should be proposed. Basic CNN: Improve performance dramatically but still have serious overfitting problem. We use adam optimizer with learning rate 0.001and trained the model with 20 epochs. VGG-16: Relatively low bias with high overfitting problem. We use adam optimizer with learning rate 0.001 and trained the model with 15 epochs. VGG-16 + KNN: Improve VGG-16 model a little bit but still need future improvement. Perform the data augmentation Currently training and debugging Inception-v4 model. Pseudo Labeling Ø Ongoing & Future Work Reference [1] cs 229 http://cs229.stanford.edu/proj2016/report/SamCenLuo-ClassificationOfDriverDistraction-report.pdf [2] Kaggle. State Farm Distracted Driver Detection. https://www.kaggle.com/c/state-farm-distracted- driverdetection/data [3] https://github.com/jonas-pettersson/fast-ai/blob/master/statefarm.ipynb

Transcript of CS229 MachineLearning BeAlert!AccidentsHurt! Team #924...

Page 1: CS229 MachineLearning BeAlert!AccidentsHurt! Team #924 …cs229.stanford.edu/proj2017/final-posters/5145465.pdf · 2018. 1. 4. · BeAlert!AccidentsHurt! LiumingZhao Taiming Zhang

Be Alert! Accidents Hurt!Liuming Zhao Taiming Zhang Liz Guo

Ø Introduction Ø ModelsFully Connected(FC)

Ø Results & Analysis Confusion Matrix

Models Epoch BatchSize

Validation Accuracy

Validation Loss

KaggleScore

VGG-16 15 16 85.01% 0.4954 0.64

Basic CNN 20 64 74.38% 1.2081 1.32

FullyConnected

20 64 11.4% 6.1 6.8

VGG-16 +KNN

15 16 86.23% 0.4523 0.58

Ø Data

• Driving distraction has alwaysbeen a driving safety issue.

• With the goal of detecting driverdistractions, we want to designa driver posture classificationsystem—classify the inputimage into 10 classes, such astexting, drinking, etc.

• Fully Connected Neural Network(FC), Basic ConvolutionalNeural Network (CNN),Transferlearning using VGG-16 andInception-v4 are compared inthis classification problem.

• Use the dataset provided by the State Farm in the KaggleChallenge. Consists of mages (640x480 pixels RGB) withdifferent drivers’ behaviors.

• Train set contains 22,400 labeled images, test setcontains 79,727 unlabeled images.

• Use K-folds cross-validation method to split training datainto training set and validation set.

• Resized the images from size 640x480x3 into 150x150x3.

Fig.3 VGG-16 architecture

Fig.4 Inception-v4 architecture[3]Fig.2 Basic CNN architecture

Table 1 Results comparison of different models

Fig.5 Train & validation accuracy using basic CNN and VGG-16

Fig.6 Confusion matrix using basic CNN(left) and VGG-16(right)

Ø Summary

CS229Machine LearningTeam #924

VGG-16

Inception-v4

Ø Method• We use the log loss function given as follow:

Basic CNNFig.1 FC neural network architecture

• Fully Connected Neural Network: High bias withthe lowest accuracy and Kaggle score. Deeperand more complicated model should beproposed.

• Basic CNN: Improve performance dramaticallybut still have serious overfitting problem. Weuse adam optimizer with learning rate 0.001andtrained the model with 20 epochs.

• VGG-16: Relatively low bias with high overfittingproblem. We use adam optimizer with learningrate 0.001 and trained the model with 15 epochs.

• VGG-16 + KNN: Improve VGG-16 model a littlebit but still need future improvement.

• Perform the data augmentation• Currently training and debugging Inception-v4

model.• Pseudo Labeling

Ø Ongoing & Future Work

Reference[1] cs 229 http://cs229.stanford.edu/proj2016/report/SamCenLuo-ClassificationOfDriverDistraction-report.pdf[2] Kaggle. State Farm Distracted Driver Detection. https://www.kaggle.com/c/state-farm-distracted-driverdetection/data [3] https://github.com/jonas-pettersson/fast-ai/blob/master/statefarm.ipynb