HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Efficient Temporal Action Segmentation via Boundary-aware Query Voting

Peiyao Wang Yuewei Lin Erik Blasch Jie Wei Haibin Ling

Efficient Temporal Action Segmentation via Boundary-aware Query Voting

Abstract

Although the performance of Temporal Action Segmentation (TAS) has improved in recent years, achieving promising results often comes with a high computational cost due to dense inputs, complex model structures, and resource-intensive post-processing requirements. To improve the efficiency while keeping the performance, we present a novel perspective centered on per-segment classification. By harnessing the capabilities of Transformers, we tokenize each video segment as an instance token, endowed with intrinsic instance segmentation. To realize efficient action segmentation, we introduce BaFormer, a boundary-aware Transformer network. It employs instance queries for instance segmentation and a global query for class-agnostic boundary prediction, yielding continuous segment proposals. During inference, BaFormer employs a simple yet effective voting strategy to classify boundary-wise segments based on instance segmentation. Remarkably, as a single-stage approach, BaFormer significantly reduces the computational costs, utilizing only 6% of the running time compared to state-of-the-art method DiffAct, while producing better or comparable accuracy over several popular benchmarks. The code for this project is publicly available at https://github.com/peiyao-w/BaFormer.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
action-segmentation-on-50-salads-1BaFormer
Acc: 89.5
Edit: 84.2
F1@10%: 89.3
F1@25%: 88.4
F1@50%: 83.9
action-segmentation-on-breakfast-1BaFormer
Acc: 76.6
Average F1: 72.4
Edit: 77.3
F1@10%: 79.2
F1@25%: 74.9
F1@50%: 63.2
action-segmentation-on-gtea-1BaFormer
Acc: 83.0
Edit: 88.7
F1@10%: 92.0
F1@25%: 91.3
F1@50%: 83.5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp