Learning Spatiotemporal Features with 3D Convolutional Networks

Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, Manohar Paluri

Effective Video Descriptor

• Generic – Can represent different types

• Compact – Processing, storage

• Efficient – Computation

• Simple – Implementation

3D Convolution and Pooling

• 3D convolution is better than 2D convolution at modeling temporal information.
  – 2D CONV: performed only spatially, loses temporal information.
  – 3D CONV: performed spatio-temporally, preserves temporal information.

• The same holds for pooling.
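The pooling point can be illustrated with a minimal pure-Python sketch of non-overlapping 3D max pooling. The function names, the toy clip, and the window sizes are assumptions chosen for illustration and do not come from the slides. The sketch only shows that the temporal extent of the pooling window decides how many time steps survive.

```python
def max_pool3d(volume, pool):
    """Non-overlapping 3D max pooling (illustrative sketch).

    volume: nested lists indexed [t][y][x] (frames, rows, columns).
    pool:   (pt, ph, pw) window size; the stride equals the window size.
    Positions that do not fill a complete window are dropped.
    """
    pt, ph, pw = pool
    T, H, W = len(volume), len(volume[0]), len(volume[0][0])
    out = []
    for t in range(0, T - pt + 1, pt):
        plane = []
        for y in range(0, H - ph + 1, ph):
            row = []
            for x in range(0, W - pw + 1, pw):
                # Maximum over the pt x ph x pw window anchored at (t, y, x).
                row.append(max(
                    volume[t + dt][y + dy][x + dx]
                    for dt in range(pt)
                    for dy in range(ph)
                    for dx in range(pw)
                ))
            plane.append(row)
        out.append(plane)
    return out


def shape(v):
    """Return (frames, rows, columns) of a nested-list volume."""
    return (len(v), len(v[0]), len(v[0][0]))


# Toy clip: 4 frames of 4x4 pixels; every pixel stores its frame index.
clip = [[[t for _ in range(4)] for _ in range(4)] for t in range(4)]

spatial_only = max_pool3d(clip, (1, 2, 2))    # pools inside each frame only
spatiotemporal = max_pool3d(clip, (2, 2, 2))  # also pools across adjacent frames
collapsed = max_pool3d(clip, (4, 2, 2))       # window spans the whole clip

print(shape(spatial_only), shape(spatiotemporal), shape(collapsed))
```

With a temporal window of 1 all four time steps are kept, with a window of 2 the temporal length is halved, and a window as long as the clip leaves a single 2D map.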

2D Convolution on 1-ch Input

• Result: 2D image.

2D Convolution on n-ch Input

• Result: 2D image.

3D Convolution on n-ch Input

• Result: Volume.
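The three cases above can be reproduced with a small pure-Python sketch. It assumes "valid" convolution (no padding, stride 1) and a single output filter, and it treats the frames of a clip as the n input channels in the 2D case. All function names and the toy data are hypothetical. As in most deep learning code, the operation is technically a cross-correlation.

```python
def conv3d_valid(volume, kernel):
    """Naive valid 3D convolution (cross-correlation) with one filter.

    volume: nested lists [L][H][W]; kernel: nested lists [d][kh][kw].
    Returns a volume of shape [L-d+1][H-kh+1][W-kw+1].
    """
    L, H, W = len(volume), len(volume[0]), len(volume[0][0])
    d, kh, kw = len(kernel), len(kernel[0]), len(kernel[0][0])
    out = []
    for t in range(L - d + 1):
        plane = []
        for y in range(H - kh + 1):
            row = []
            for x in range(W - kw + 1):
                acc = 0.0
                for dt in range(d):
                    for dy in range(kh):
                        for dx in range(kw):
                            acc += volume[t + dt][y + dy][x + dx] * kernel[dt][dy][dx]
                row.append(acc)
            plane.append(row)
        out.append(plane)
    return out


def conv2d_multichannel(channels, kernel):
    """2D convolution on n-channel input: the kernel spans all n channels,
    so the channel axis is summed away and a single 2D map remains."""
    if len(kernel) != len(channels):
        raise ValueError("kernel depth must equal the number of input channels")
    return conv3d_valid(channels, kernel)[0]


# Toy clip: 4 frames of 5x5 pixels, all ones.
clip = [[[1.0] * 5 for _ in range(5)] for _ in range(4)]

k2d = [[[1.0] * 3 for _ in range(3)] for _ in range(4)]  # spans all 4 frames
k3d = [[[1.0] * 3 for _ in range(3)] for _ in range(3)]  # 3x3x3, slides in time

image = conv2d_multichannel(clip, k2d)  # 3x3 image, temporal axis is gone
volume = conv3d_valid(clip, k3d)        # 2x3x3 volume, temporal axis kept

print(len(image), len(image[0]))
print(len(volume), len(volume[0]), len(volume[0][0]))
```

Because the 3D kernel is shallower than the clip, it slides along time and the output keeps a temporal axis, whereas the 2D kernel consumes every frame at once.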

Identify Best Architecture for 3D ConvNets (on UCF101)

• Common network settings
  – All video frames resized to 128x171.
  – Videos are split into non-overlapping 16-frame clips.
  – Input: 3x16x128x171.
  – 5 convolution and pooling layers
  – 2 fully connected layers
  – Softmax loss layer to predict action labels
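The clip-splitting step in the settings above can be sketched as follows. This is a minimal sketch with hypothetical helper names, and it assumes that trailing frames which do not fill a complete 16-frame clip are dropped, which the slides do not specify.

```python
def split_into_clips(num_frames, clip_len=16):
    """Return (start, end) frame-index pairs for non-overlapping clips.

    `end` is exclusive. Assumption: leftover frames at the end of the
    video that do not fill a whole clip are discarded.
    """
    return [(s, s + clip_len)
            for s in range(0, num_frames - clip_len + 1, clip_len)]


def clip_input_shape(channels=3, clip_len=16, height=128, width=171):
    """Shape of one network input: channels x frames x height x width."""
    return (channels, clip_len, height, width)


clips = split_into_clips(100)  # a hypothetical 100-frame video
c, l, h, w = clip_input_shape()

print(clips)
print(clip_input_shape(), "values per clip:", c * l * h * w)
```

For a 100-frame video this yields six clips and leaves the last four frames unused under the stated assumption.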

Identify Best Architecture for 3D ConvNets (on UCF101)

• Varying network architecture
  – Homogeneous temporal depth.
    • Depth-d for d = 1, 3, 5, 7
  – Varying temporal depth.
    • Increasing: 3-3-5-5-7
    • Decreasing: 7-5-5-3-3
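The candidate architectures can be written down as one kernel temporal depth per convolution layer. The sketch below is illustrative only: it assumes five convolution layers and a 3x3 spatial kernel size, and it writes the decreasing schedule as 7-5-5-3-3 (one depth per layer), as in the C3D paper. The variable and function names are hypothetical.

```python
NUM_CONV_LAYERS = 5  # from the common network settings


def homogeneous(depth, num_layers=NUM_CONV_LAYERS):
    """Same kernel temporal depth in every convolution layer."""
    return [depth] * num_layers


# Depth-1 kernels have no temporal extent, so that net acts like 2D
# convolution applied to each frame separately.
configs = {f"depth-{d}": homogeneous(d) for d in (1, 3, 5, 7)}
configs["increasing"] = [3, 3, 5, 5, 7]
configs["decreasing"] = [7, 5, 5, 3, 3]


def weights_per_kernel(depth, spatial=3):
    """Weights in one d x k x k kernel slice for a single input/output
    channel pair (spatial size 3 is an assumption)."""
    return depth * spatial * spatial


for name, depths in configs.items():
    per_layer = [weights_per_kernel(d) for d in depths]
    print(f"{name:>10}: depths={depths} weights/kernel={per_layer}")
```

Listing the configurations this way makes it easy to see that the variants differ only in the temporal extent of their kernels, and therefore in parameter count.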

3D Convolution Kernel Temporal Depth Search

Spatiotemporal Feature Learning

• Best network architecture
  – With 3x3x3 kernels

Spatiotemporal Feature Learning

• Dataset for training
  – Sports-1M dataset
    • Largest video classification benchmark
    • 1.1 million sports videos
    • 487 categories

Sports-1M Classification Results

C3D Video Descriptor

• The C3D model can be used as a feature extractor for various video analysis tasks.
  – Action recognition
  – Action similarity
  – Scene and object recognition

• Uses fc6 activations as features
  – 4096 dimensions
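One way to turn per-clip fc6 activations into a single video descriptor is to average them and then L2-normalize, which is the procedure described in the C3D paper. The sketch below is a pure-Python illustration under that assumption. The function name and the stand-in activation values are hypothetical, and no real model is run.

```python
import math


def video_descriptor(clip_features):
    """Average per-clip feature vectors, then L2-normalize the mean."""
    if not clip_features:
        raise ValueError("need at least one clip feature vector")
    dim = len(clip_features[0])
    if any(len(f) != dim for f in clip_features):
        raise ValueError("all clip features must have the same length")
    n = len(clip_features)
    mean = [sum(f[i] for f in clip_features) / n for i in range(dim)]
    norm = math.sqrt(sum(v * v for v in mean))
    # Leave an all-zero vector unchanged to avoid dividing by zero.
    return [v / norm for v in mean] if norm > 0 else mean


FC6_DIM = 4096

# Stand-in activations with made-up values; a real pipeline would read
# these from the fc6 layer of a trained C3D model, one vector per clip.
clip_a = [1.0] * FC6_DIM
clip_b = [3.0] * FC6_DIM

desc = video_descriptor([clip_a, clip_b])
print(len(desc), math.sqrt(sum(v * v for v in desc)))
```

The resulting vector has the same 4096 dimensions regardless of how many clips the video contains, so it can be passed to a simple classifier such as a linear SVM.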

Action Recognition

• Dataset: UCF101
  – 13,320 videos
  – 101 human action classes

Action Similarity Labeling

• Dataset: ASLAN
  – 3,631 videos
  – 432 action classes

Scene and Object Recognition

• Dataset: YUPENN
  – 420 videos
  – 14 scenes

• Dataset: Maryland
  – 130 videos
  – 13 scenes

Why C3D Features?

• Generic
• Compact
• Efficient
• Simple

Visualisation using the t-SNE method:

L. van der Maaten and G. Hinton. Visualizing data using t-SNE. JMLR

What Does C3D Learn?

Using the deconvolution method of M. Zeiler and R. Fergus. Visualizing and understanding convolutional networks. In ECCV, 2014

Useful Links

• http://vlg.cs.dartmouth.edu/c3d/
• https://github.com/facebook/C3D

Tools and software required:

- keras
- tensorflow
- ffmpeg (compiled from source)
- opencv (compiled from source)

Thank you
