HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

G-TAD: Sub-Graph Localization for Temporal Action Detection

Mengmeng Xu Chen Zhao David S. Rojas Ali Thabet Bernard Ghanem

G-TAD: Sub-Graph Localization for Temporal Action Detection

Abstract

Temporal action detection is a fundamental yet challenging task in video understanding. Video context is a critical cue to effectively detect actions, but current works mainly focus on temporal context, while neglecting semantic context as well as other important context properties. In this work, we propose a graph convolutional network (GCN) model to adaptively incorporate multi-level semantic context into video features and cast temporal action detection as a sub-graph localization problem. Specifically, we formulate video snippets as graph nodes, snippet-snippet correlations as edges, and actions associated with context as target sub-graphs. With graph convolution as the basic operation, we design a GCN block called GCNeXt, which learns the features of each node by aggregating its context and dynamically updates the edges in the graph. To localize each sub-graph, we also design an SGAlign layer to embed each sub-graph into the Euclidean space. Extensive experiments show that G-TAD is capable of finding effective video context without extra supervision and achieves state-of-the-art performance on two detection benchmarks. On ActivityNet-1.3, it obtains an average mAP of 34.09%; on THUMOS14, it reaches 51.6% at IoU@0.5 when combined with a proposal processing method. G-TAD code is publicly available at https://github.com/frostinassiky/gtad.

Code Repositories

812618101/TAL-Demo
Mentioned in GitHub
coolbay/VSGN
pytorch
Mentioned in GitHub
handhand123/prsa-net
pytorch
Mentioned in GitHub
musicalOffering/sola
pytorch
Mentioned in GitHub
carpedkm/G_TAD_customizing
pytorch
Mentioned in GitHub
Frostinassiky/gtad
Official
pytorch
Mentioned in GitHub
sauradip/fewshotQAT
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
temporal-action-localization-on-activitynetG-TAD
mAP: 34.09
mAP IOU@0.5: 50.36
mAP IOU@0.75: 34.60
mAP IOU@0.95: 9.02
temporal-action-localization-on-epic-kitchensG-TAD (verb)
Avg mAP (0.1-0.5): 9.4
mAP IOU@0.1: 12.1
mAP IOU@0.2: 11.0
mAP IOU@0.3: 9.4
mAP IOU@0.4: 8.1
mAP IOU@0.5: 6.5
temporal-action-localization-on-fineactionG-TAD (i3d feature)
mAP: 9.06
mAP IOU@0.5: 13.74
mAP IOU@0.75: 8.83
mAP IOU@0.95: 3.06
temporal-action-localization-on-thumos14G-TAD
mAP IOU@0.5: 40.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp