HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization

{Mei Chen Mubarak Shah Sandra Sajeev Matthew Hall Ye Yu Gaurav Mittal Mamshad Nayeem Rizve}

PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization

Abstract

Weakly-supervised Temporal Action Localization (WTAL) attempts to localize the actions in untrimmed videos using only video-level supervision. Most recent works approach WTAL from a localization-by-classification perspective where these methods try to classify each video frame followed by a manually-designed post-processing pipeline to aggregate these per-frame action predictions into action snippets. Due to this perspective, the model lacks any explicit understanding of action boundaries and tends to focus only on the most discriminative parts of the video resulting in incomplete action localization. To address this, we present PivoTAL, Prior-driven Supervision for Weakly-supervised Temporal Action Localization, to approach WTAL from a localization-by-localization perspective by learning to localize the action snippets directly. To this end, PivoTAL leverages the underlying spatio-temporal regularities in videos in the form of action-specific scene prior, action snippet generation prior, and learnable Gaussian prior to supervise the localization-based training. PivoTAL shows significant improvement (of at least 3% avg mAP) over all existing methods on the benchmark datasets, THUMOS-14 and ActivitNet-v1.3.

Benchmarks

BenchmarkMethodologyMetrics
weakly-supervised-action-localization-onPivoTAL
mAP@0.1:0.5: 60.1
mAP@0.1:0.7: 49.6
mAP@0.5: 42.8
weakly-supervised-action-localization-on-1PivoTAL
mAP@0.5: 45.1
mAP@0.5:0.95: 28.1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization | Papers | HyperAI