HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Proposal Relation Network for Temporal Action Detection

Xiang Wang Zhiwu Qing Ziyuan Huang Yutong Feng Shiwei Zhang Jianwen Jiang Mingqian Tang Changxin Gao Nong Sang

Proposal Relation Network for Temporal Action Detection

Abstract

This technical report presents our solution for temporal action detection task in AcitivityNet Challenge 2021. The purpose of this task is to locate and identify actions of interest in long untrimmed videos. The crucial challenge of the task comes from that the temporal duration of action varies dramatically, and the target actions are typically embedded in a background of irrelevant activities. Our solution builds on BMN, and mainly contains three steps: 1) action classification and feature encoding by Slowfast, CSN and ViViT; 2) proposal generation. We improve BMN by embedding the proposed Proposal Relation Network (PRN), by which we can generate proposals of high quality; 3) action detection. We calculate the detection results by assigning the proposals with corresponding classification results. Finally, we ensemble the results under different settings and achieve 44.7% on the test set, which improves the champion result in ActivityNet 2020 by 1.9% in terms of average mAP.

Code Repositories

wangxiang1230/SSTAP
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
temporal-action-localization-on-activitynetPRN (CSN)
mAP: 39.4
mAP IOU@0.5: 57.9
temporal-action-localization-on-activitynetPRN+BMN (ensemble)
mAP: 42.0
mAP IOU@0.5: 59.7
temporal-action-localization-on-activitynetPRN (ViViT)
mAP: 37.5
mAP IOU@0.5: 55.5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Proposal Relation Network for Temporal Action Detection | Papers | HyperAI