HyperAIHyperAI

Command Palette

Search for a command to run...

a month ago

Segmental Spatiotemporal CNNs for Fine-grained Action Segmentation

Lea Colin Reiter Austin Vidal Rene Hager Gregory D.

Segmental Spatiotemporal CNNs for Fine-grained Action Segmentation

Abstract

Joint segmentation and classification of fine-grained actions is importantfor applications of human-robot interaction, video surveillance, and humanskill evaluation. However, despite substantial recent progress in large-scaleaction classification, the performance of state-of-the-art fine-grained actionrecognition approaches remains low. We propose a model for action segmentationwhich combines low-level spatiotemporal features with a high-level segmentalclassifier. Our spatiotemporal CNN is comprised of a spatial component thatuses convolutional filters to capture information about objects and theirrelationships, and a temporal component that uses large 1D convolutionalfilters to capture information about how object relationships change acrosstime. These features are used in tandem with a semi-Markov model that modelstransitions from one action to another. We introduce an efficient constrainedsegmental inference algorithm for this model that is orders of magnitude fasterthan the current approach. We highlight the effectiveness of our SegmentalSpatiotemporal CNN on cooking and surgical action datasets for which we observesubstantially improved performance relative to recent baseline methods.

Benchmarks

BenchmarkMethodologyMetrics
action-segmentation-on-gtea-1ST-CNN
Acc: 60.6
Edit: -
F1@10%: 58.7
F1@25%: 54.4
F1@50%: 41.9
action-segmentation-on-jigsawsST-CNN+Seg
Accuracy: 74.22
Edit Distance: 66.56

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Segmental Spatiotemporal CNNs for Fine-grained Action Segmentation | Papers | HyperAI