HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Alleviating Over-segmentation Errors by Detecting Action Boundaries

Yuchi Ishikawa Seito Kasai Yoshimitsu Aoki Hirokatsu Kataoka

Alleviating Over-segmentation Errors by Detecting Action Boundaries

Abstract

We propose an effective framework for the temporal action segmentation task, namely an Action Segment Refinement Framework (ASRF). Our model architecture consists of a long-term feature extractor and two branches: the Action Segmentation Branch (ASB) and the Boundary Regression Branch (BRB). The long-term feature extractor provides shared features for the two branches with a wide temporal receptive field. The ASB classifies video frames with action classes, while the BRB regresses the action boundary probabilities. The action boundaries predicted by the BRB refine the output from the ASB, which results in a significant performance improvement. Our contributions are three-fold: (i) We propose a framework for temporal action segmentation, the ASRF, which divides temporal action segmentation into frame-wise action classification and action boundary regression. Our framework refines frame-level hypotheses of action classes using predicted action boundaries. (ii) We propose a loss function for smoothing the transition of action probabilities, and analyze combinations of various loss functions for temporal action segmentation. (iii) Our framework outperforms state-of-the-art methods on three challenging datasets, offering an improvement of up to 13.7% in terms of segmental edit distance and up to 16.1% in terms of segmental F1 score. Our code will be publicly available soon.

Code Repositories

yiskw713/asrf
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
action-segmentation-on-50-salads-1ASRF
Acc: 84.5
Edit: 79.3
F1@10%: 84.9
F1@25%: 83.5
F1@50%: 77.3
action-segmentation-on-breakfast-1ASRF
Acc: 67.6
Average F1: 66.4
Edit: 72.4
F1@10%: 74.3
F1@25%: 68.9
F1@50%: 56.1
action-segmentation-on-gtea-1ASRF
Acc: 77.3
Edit: 83.7
F1@10%: 89.4
F1@25%: 87.8
F1@50%: 79.8

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp