Learning Spatiotemporal Features with 3D Convolutional Networks

Du Tran; Lubomir Bourdev; Rob Fergus; Lorenzo Torresani; Manohar Paluri

Abstract

We propose a simple yet effective approach for spatiotemporal feature learning using deep 3-dimensional convolutional networks (3D ConvNets) trained on a large-scale supervised video dataset. Our findings are three-fold: 1) 3D ConvNets are more suitable for spatiotemporal feature learning than 2D ConvNets; 2) a homogeneous architecture with small 3x3x3 convolution kernels in all layers is among the best-performing architectures for 3D ConvNets; and 3) our learned features, namely C3D (Convolutional 3D), with a simple linear classifier outperform state-of-the-art methods on 4 different benchmarks and are comparable with the current best methods on the other 2 benchmarks. In addition, the features are compact, achieving 52.8% accuracy on the UCF101 dataset with only 10 dimensions, and are very efficient to compute due to the fast inference of ConvNets. Finally, they are conceptually very simple and easy to train and use.
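
To make the 3x3x3 kernel in the abstract concrete, the following is a minimal sketch of a single 3D convolution over a video clip, written in plain Python for readability. It is not the authors' implementation: the function name `conv3d_same`, the zero padding of 1, the toy clip, and the hand-written temporal-difference kernel are all assumptions chosen for illustration (a trained C3D network learns its kernels and stacks many such layers).

```python
def conv3d_same(clip, kernel):
    """Naive 3D convolution (cross-correlation) with zero padding of 1.

    clip:   nested list indexed [t][y][x] (frames, height, width)
    kernel: nested list of shape 3x3x3 indexed [dt][dy][dx]
    Returns a nested list with the same shape as clip.
    """
    T, H, W = len(clip), len(clip[0]), len(clip[0][0])
    out = [[[0.0] * W for _ in range(H)] for _ in range(T)]
    for t in range(T):
        for y in range(H):
            for x in range(W):
                acc = 0.0
                for dt in range(3):
                    for dy in range(3):
                        for dx in range(3):
                            tt, yy, xx = t + dt - 1, y + dy - 1, x + dx - 1
                            # Positions outside the clip count as zero padding.
                            if 0 <= tt < T and 0 <= yy < H and 0 <= xx < W:
                                acc += clip[tt][yy][xx] * kernel[dt][dy][dx]
                out[t][y][x] = acc
    return out


# Hypothetical kernel: responds to change between frames t-1 and t+1.
temporal_diff = [[[0.0] * 3 for _ in range(3)] for _ in range(3)]
temporal_diff[0][1][1] = -1.0
temporal_diff[2][1][1] = 1.0

# Toy 4-frame, 3x3 clip whose brightness increases by 1 per frame.
clip = [[[float(t)] * 3 for _ in range(3)] for t in range(4)]
response = conv3d_same(clip, temporal_diff)
```

Because the kernel spans three consecutive frames, its response depends on how pixel values change over time. A 2D kernel applied to one frame at a time has no access to that information, which is the intuition behind the first finding in the abstract.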

Code Repositories

Repository	Framework	Status
AKASH2907/Content-based-Video-Recommendation	-	Mentioned in GitHub
facebookarchive/C3D	caffe2	Official
scouTT1/C3D	mindspore	-
aj9011/Car-Speed-Prediction	pytorch	Mentioned in GitHub
labs12/Action-Recgontion-	pytorch	Mentioned in GitHub
ashu5711/Neural_Network_Hand_Gesture_Recognition	-	Mentioned in GitHub
code-implementation1/Code2/tree/main/3dcnn	mindspore	-
HardyYoungX/C3D	mindspore	-
VEDANTGHODKE/Hand-Gesture-Recognition-Using-Neural-Networks	-	Mentioned in GitHub
MekkaSiekka/C3D-UCF11-Tensorflow	-	Mentioned in GitHub
mindspore-ai/models/tree/master/official/cv/C3D/src	mindspore	-
mamtajha-ts/gesture-recognition	-	Mentioned in GitHub
waynshang/Gesture-Recognition-with-3DCNN	-	Mentioned in GitHub
2024-MindSpore-1/Code6/tree/main/C3D	mindspore	-
MichiganCOG/M-PACT	-	Mentioned in GitHub
2024-MindSpore-1/Code6/tree/main/3dcnn	mindspore	-
MarkoLewis-Projects/Sign_language_detection	-	Mentioned in GitHub
aim3-ruc/youmakeup_challenge2022	pytorch	Mentioned in GitHub
xiuyu0000/vision/blob/main/mindvision/msvideo/models/c3d.py	mindspore	-
leftthomas/r2plus1d-c3d	pytorch	Mentioned in GitHub
coderSkyChen/Action_Recognition_Zoo	-	Mentioned in GitHub
ZJUT-ERCISS/c3d_mindspore	mindspore	-
open-mmlab/mmaction2	pytorch	-
AKASH2907/Content-based-Video-Relevance-Prediction	-	Mentioned in GitHub
2024-MindSpore-1/Code5/tree/main/3dcnn	mindspore	-
santhoshpkumar/Hand-gesture-recognition-using-neural-networks	-	Mentioned in GitHub
myaldiz/deep_violence_detection	-	Mentioned in GitHub
axon-research/c3d-keras	caffe2	Mentioned in GitHub
2023-MindSpore-1/ms-code-6/tree/main/C3D	mindspore	-

Benchmarks

Benchmark	Methodology	Metrics
action-recognition-in-videos-on-hmdb-51	C3D	Average accuracy of 3 splits: 51.6
action-recognition-in-videos-on-sports-1m	C3D	Clip hit@1: 46.1; Video hit@1: 61.1; Video hit@5: 85.5
action-recognition-in-videos-on-ucf101	C3D	3-fold Accuracy: 82.3
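
The hit@k metrics in the Sports-1M row can be read as top-k accuracy: a prediction counts as a hit when the true class is among the k highest-scoring classes. The sketch below illustrates this at both clip level and video level. The helper names, the toy scores, and the choice to average clip scores into a video score are assumptions for illustration; the table itself does not specify how clip predictions are aggregated.

```python
def hit_at_k(scores, label, k):
    """Return True if `label` is among the k highest-scoring class indices."""
    ranked = sorted(range(len(scores)), key=lambda c: scores[c], reverse=True)
    return label in ranked[:k]


def video_scores(clip_scores):
    """Average per-clip class scores into one score vector for the video.

    Averaging is an assumed aggregation rule, used here only as an example.
    """
    n = len(clip_scores)
    return [sum(col) / n for col in zip(*clip_scores)]


# Hypothetical scores for one video: 3 clips x 6 classes, true class = 2.
clips = [
    [0.05, 0.40, 0.30, 0.10, 0.10, 0.05],  # top class is 1: a clip-level miss
    [0.05, 0.15, 0.55, 0.10, 0.10, 0.05],
    [0.10, 0.10, 0.50, 0.10, 0.10, 0.10],
]
label = 2

# Clip hit@1: fraction of clips whose top-scoring class is the true class.
clip_hit1 = sum(hit_at_k(s, label, 1) for s in clips) / len(clips)

# Video hit@1 / hit@5: evaluate the aggregated score vector once per video.
avg = video_scores(clips)
video_hit1 = hit_at_k(avg, label, 1)
video_hit5 = hit_at_k(avg, label, 5)
```

In this toy case one of the three clips is misclassified, yet the aggregated video prediction is correct. That pattern is consistent with video-level numbers in the table being higher than the clip-level number.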
