HyperAIHyperAI

Command Palette

Search for a command to run...

a month ago

Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

Temporal Segment Networks: Towards Good Practices for Deep Action
  Recognition

Abstract

Deep convolutional networks have achieved great success for visualrecognition in still images. However, for action recognition in videos, theadvantage over traditional methods is not so evident. This paper aims todiscover the principles to design effective ConvNet architectures for actionrecognition in videos and learn these models given limited training samples.Our first contribution is temporal segment network (TSN), a novel framework forvideo-based action recognition. which is based on the idea of long-rangetemporal structure modeling. It combines a sparse temporal sampling strategyand video-level supervision to enable efficient and effective learning usingthe whole action video. The other contribution is our study on a series of goodpractices in learning ConvNets on video data with the help of temporal segmentnetwork. Our approach obtains the state-the-of-art performance on the datasetsof HMDB51 ( $ 69.4\% $) and UCF101 ($ 94.2\% $). We also visualize the learnedConvNet models, which qualitatively demonstrates the effectiveness of temporalsegment network and the proposed good practices.

Code Repositories

WavesUR/embedded_TSM
pytorch
Mentioned in GitHub
yjxiong/temporal-segment-networks
Official
pytorch
Mentioned in GitHub
sunutf/TSM
pytorch
Mentioned in GitHub
iamhoushiyou/tsn
pytorch
Mentioned in GitHub
ZJCV/TSN
pytorch
Mentioned in GitHub
ZJCV/Non-local
pytorch
Mentioned in GitHub
Nortinwell/TSN
pytorch
Mentioned in GitHub
rijuldhir/TSM
pytorch
Mentioned in GitHub
MichiganCOG/M-PACT
tf
Mentioned in GitHub
MIT-HAN-LAB/temporal-shift-module
pytorch
Mentioned in GitHub
CrazySherman/goodlife
pytorch
Mentioned in GitHub
mtlouie-unm/alome-tsn
pytorch
Mentioned in GitHub
nhannguyen2709/video_recognition
pytorch
Mentioned in GitHub
yjxiong/caffe
Official
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
action-classification-on-kinetics-400TSN
Acc@1: 73.9
Acc@5: 91.1
action-recognition-in-videos-on-hmdb-51Temporal Segment Networks
Average accuracy of 3 splits: 69.4
action-recognition-in-videos-on-ucf101Temporal Segment Networks
3-fold Accuracy: 94.2
multimodal-activity-recognition-on-ev-actionTSN (RGB)
Accuracy: 73.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp