Temporal Structure Mining for Weakly Supervised Action Detection

Junsong Yuan, Ning Xu, Enxu Yan, Yuncheng Li, Zhou Ren, Tan Yu

Abstract

Unlike fully supervised action detection, which depends on expensive frame-level annotations, weakly supervised action detection (WSAD) needs only video-level annotations, making it more practical for real-world applications. Existing WSAD methods detect action instances by scoring each video segment (a stack of frames) individually. Most of them fail to model the temporal relations among video segments and cannot effectively characterize action instances that possess latent temporal structure. To alleviate this problem in WSAD, we propose the temporal structure mining (TSM) approach. In TSM, each action instance is modeled as a multi-phase process, and the evolution of phases within an action instance, i.e., its temporal structure, is exploited. Meanwhile, the video background is modeled by a background phase, which separates different action instances in an untrimmed video. In this framework, phase filters are used to calculate confidence scores for the presence of an action's phases in each segment. Because frame-level annotations are not available in the WSAD task, phase filters cannot be trained directly. To tackle this challenge, we treat each segment's phase as a hidden variable. We use the segments' confidence scores from each phase filter to construct a table and determine the hidden variables, i.e., the phases of the segments, by maximal circulant path discovery along the table. Experiments conducted on three benchmark datasets demonstrate the state-of-the-art performance of the proposed TSM.
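
The abstract does not spell out the path-discovery step, so the following is only a minimal sketch of one way such a search could work, not the authors' implementation. It assumes a table `scores[t][p]` holding the confidence of phase `p` in segment `t`, with column 0 as the background phase, and it assumes that between consecutive segments the path either stays in its phase or advances to the next one, wrapping from the last action phase back to background. The function names `max_cyclic_path` and `split_instances` and the transition rule are hypothetical choices made for illustration.

```python
from typing import List, Sequence, Tuple


def max_cyclic_path(scores: Sequence[Sequence[float]]) -> List[int]:
    """Pick one phase per segment so that the summed score is maximal.

    Assumed rule (not taken from the paper): from one segment to the next
    the phase may stay the same or advance by one, wrapping from the last
    phase back to phase 0 (background).
    """
    num_segments = len(scores)
    num_phases = len(scores[0])
    # best[t][p]: best total score of any valid path ending in phase p at t.
    best = [list(scores[0])]
    # back[t][p]: phase chosen at segment t-1 on that best path.
    back = [[-1] * num_phases]
    for t in range(1, num_segments):
        row, ptr = [], []
        for p in range(num_phases):
            prev = (p - 1) % num_phases  # the only phase that can advance into p
            src = p if best[t - 1][p] >= best[t - 1][prev] else prev
            row.append(best[t - 1][src] + scores[t][p])
            ptr.append(src)
        best.append(row)
        back.append(ptr)
    # Backtrack from the best-scoring final phase.
    p = max(range(num_phases), key=lambda q: best[-1][q])
    path = [p]
    for t in range(num_segments - 1, 0, -1):
        p = back[t][p]
        path.append(p)
    path.reverse()
    return path


def split_instances(path: Sequence[int]) -> List[Tuple[int, int]]:
    """Return (start, end) segment indices of maximal non-background runs."""
    instances = []
    start = None
    for t, p in enumerate(path):
        if p != 0 and start is None:
            start = t
        elif p == 0 and start is not None:
            instances.append((start, t - 1))
            start = None
    if start is not None:
        instances.append((start, len(path) - 1))
    return instances


# Toy table: 4 segments, 3 phases (0 = background, 1 and 2 = action phases).
table = [
    [0.9, 0.1, 0.0],
    [0.2, 0.8, 0.1],
    [0.1, 0.3, 0.9],
    [0.8, 0.1, 0.2],
]
phases = max_cyclic_path(table)
print(phases)                   # [0, 1, 2, 0]
print(split_instances(phases))  # [(1, 2)]
```

Under these assumptions, the background phase naturally separates instances: a path can only leave the last action phase by returning to background, so consecutive non-background runs correspond to distinct detected actions.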

Benchmarks

| Benchmark | Methodology | Metrics |
| --- | --- | --- |
| weakly-supervised-action-localization-on | TSM | mAP@0.1:0.7: -; mAP@0.5: 24.5 |
| weakly-supervised-action-localization-on-1 | TSM | mAP@0.5: 30.3 |
| weakly-supervised-action-localization-on-2 | TSM | mAP@0.5: 28.3 |
