HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

TriDet: Temporal Action Detection with Relative Boundary Modeling

Dingfeng Shi Yujie Zhong Qiong Cao Lin Ma Jia Li Dacheng Tao

TriDet: Temporal Action Detection with Relative Boundary Modeling

Abstract

In this paper, we present a one-stage framework TriDet for temporal action detection. Existing methods often suffer from imprecise boundary predictions due to the ambiguous action boundaries in videos. To alleviate this problem, we propose a novel Trident-head to model the action boundary via an estimated relative probability distribution around the boundary. In the feature pyramid of TriDet, we propose an efficient Scalable-Granularity Perception (SGP) layer to mitigate the rank loss problem of self-attention that takes place in the video features and aggregate information across different temporal granularities. Benefiting from the Trident-head and the SGP-based feature pyramid, TriDet achieves state-of-the-art performance on three challenging benchmarks: THUMOS14, HACS and EPIC-KITCHEN 100, with lower computational costs, compared to previous methods. For example, TriDet hits an average mAP of $69.3\%$ on THUMOS14, outperforming the previous best by $2.5\%$, but with only $74.6\%$ of its latency. The code is released to https://github.com/sssste/TriDet.

Code Repositories

dingfengshi/tridet
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
temporal-action-localization-on-activitynetTriDet (TSP features)
mAP: 36.8
mAP IOU@0.5: 54.7
mAP IOU@0.75: 38.0
mAP IOU@0.95: 8.4
temporal-action-localization-on-epic-kitchensTriDet (verb)
Avg mAP (0.1-0.5): 25.4
mAP IOU@0.1: 28.6
mAP IOU@0.2: 27.4
mAP IOU@0.3: 26.1
mAP IOU@0.4: 24.2
mAP IOU@0.5: 20.8
temporal-action-localization-on-hacsTriDet (SlowFast)
Average-mAP: 38.6
mAP@0.5: 56.7
mAP@0.75: 39.3
mAP@0.95: 11.7
temporal-action-localization-on-hacsTriDet (I3D RGB)
Average-mAP: 36.8
mAP@0.5: 54.5
mAP@0.75: 36.8
mAP@0.95: 11.5
temporal-action-localization-on-thumos14TriDet (I3D features)
Avg mAP (0.3:0.7): 69.3
mAP IOU@0.3: 83.6
mAP IOU@0.4: 80.1
mAP IOU@0.5: 72.9
mAP IOU@0.6: 62.4
mAP IOU@0.7: 47.4

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp