ASFormer: Transformer for Action Segmentation

Fangqiu Yi Hongyu Wen Tingting Jiang

Abstract

Algorithms for the action segmentation task typically use temporal models to predict what action is occurring at each frame of a minute-long daily activity. Recent studies have shown the potential of the Transformer in modeling the relations among elements in sequential data. However, there are several major concerns when directly applying the Transformer to the action segmentation task, such as the lack of inductive biases with small training sets, the deficit in processing long input sequences, and the limitation of the decoder architecture in utilizing temporal relations among multiple action segments to refine the initial predictions. To address these concerns, we design an efficient Transformer-based model for the action segmentation task, named ASFormer, with three distinctive characteristics: (i) We explicitly bring in local connectivity inductive priors because of the high locality of features. This constrains the hypothesis space within a reliable scope and helps the model learn a proper target function with small training sets. (ii) We apply a pre-defined hierarchical representation pattern that efficiently handles long input sequences. (iii) We carefully design the decoder to refine the initial predictions from the encoder. Extensive experiments on three public datasets demonstrate the effectiveness of our method. Code is available at https://github.com/ChinaYi/ASFormer.
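
Characteristics (i) and (ii) can be illustrated with a minimal sketch, which is not the authors' implementation. It restricts each frame's self-attention to a local temporal window and stacks such layers with a growing window. The function names `local_attention` and `hierarchical_encoder`, the doubling window schedule, and the omission of learned projections are all assumptions made for illustration; the abstract does not specify them.

```python
import math


def local_attention(features, window):
    """Self-attention in which frame t attends only to frames within window // 2 of t.

    features: list of T feature vectors (lists of floats). For brevity they
    serve as queries, keys and values alike (no learned projections).
    """
    num_frames = len(features)
    dim = len(features[0])
    half = window // 2
    output = []
    for t in range(num_frames):
        lo, hi = max(0, t - half), min(num_frames, t + half + 1)
        # Scaled dot-product scores against the local neighbourhood only.
        scores = [
            sum(q * k for q, k in zip(features[t], features[s])) / math.sqrt(dim)
            for s in range(lo, hi)
        ]
        # Numerically stable softmax over the window.
        top = max(scores)
        weights = [math.exp(score - top) for score in scores]
        total = sum(weights)
        output.append([
            sum(w / total * features[s][d] for w, s in zip(weights, range(lo, hi)))
            for d in range(dim)
        ])
    return output


def hierarchical_encoder(features, num_layers=4):
    """Stack local-attention layers whose window grows with depth.

    The doubling schedule (2, 4, 8, ...) is an illustrative assumption.
    """
    x = features
    for layer in range(num_layers):
        window = 2 ** (layer + 1)
        attended = local_attention(x, window)
        # Residual connection keeps each frame's own features in the mix.
        x = [[a + b for a, b in zip(row, att)] for row, att in zip(x, attended)]
    return x
```

Under these assumptions each layer costs on the order of T times the window size rather than T squared, which is one way a fixed hierarchical pattern can keep long sequences tractable while early layers stay strictly local.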

Code Repositories

chinayi/asformer (official, PyTorch)

Benchmarks

Benchmark | Methodology | Metrics
action-segmentation-on-50-salads-1 | ASFormer+ASRF | Acc: 85.9; Edit: 81.9; F1@10%: 85.1; F1@25%: 85.4; F1@50%: 79.3
action-segmentation-on-50-salads-1 | ASFormer | Acc: 85.6; Edit: 79.6; F1@10%: 85.1; F1@25%: 83.4; F1@50%: 76.0
action-segmentation-on-assembly101 | ASFormer | Edit: 30.5; F1@10%: 33.4; F1@25%: 29.2; F1@50%: 21.4; MoF: 38.8
action-segmentation-on-breakfast-1 | ASFormer | Acc: 73.5; Average F1: 68.0; Edit: 75.0; F1@10%: 76.0; F1@25%: 70.6; F1@50%: 57.4
action-segmentation-on-gtea-1 | ASFormer | Acc: 79.7; Edit: 84.6; F1@10%: 90.1; F1@25%: 88.8; F1@50%: 79.2
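
For readers unfamiliar with the metric names above, the following sketch shows how frame-wise accuracy (Acc), the segmental edit score (Edit), and the segmental F1 score at an IoU threshold (F1@k) are commonly computed in the action segmentation literature. It is an illustrative single-video version under that assumption, not the evaluation script behind the numbers listed here; the helper names are hypothetical, and benchmark results usually accumulate true/false positives over all test videos before computing F1.

```python
def segments(labels):
    """Collapse frame-wise labels into (label, start, end) runs; end is exclusive."""
    runs = []
    start = 0
    for i in range(1, len(labels) + 1):
        if i == len(labels) or labels[i] != labels[start]:
            runs.append((labels[start], start, i))
            start = i
    return runs


def frame_accuracy(pred, gt):
    """Percentage of frames whose predicted label matches the ground truth."""
    return 100.0 * sum(p == g for p, g in zip(pred, gt)) / len(gt)


def edit_score(pred, gt):
    """Normalized Levenshtein similarity between the two segment label sequences."""
    p = [seg[0] for seg in segments(pred)]
    g = [seg[0] for seg in segments(gt)]
    prev = list(range(len(g) + 1))
    for i in range(1, len(p) + 1):
        cur = [i] + [0] * len(g)
        for j in range(1, len(g) + 1):
            cost = 0 if p[i - 1] == g[j - 1] else 1
            cur[j] = min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + cost)
        prev = cur
    return 100.0 * (1 - prev[-1] / max(len(p), len(g)))


def f1_at_k(pred, gt, threshold):
    """Segmental F1: a predicted segment is a true positive if its IoU with an
    unmatched ground-truth segment of the same label reaches `threshold`."""
    gt_segs = segments(gt)
    matched = [False] * len(gt_segs)
    tp = fp = 0
    for label, ps, pe in segments(pred):
        best_iou, best_j = 0.0, -1
        for j, (g_label, gs, ge) in enumerate(gt_segs):
            if g_label != label:
                continue
            inter = max(0, min(pe, ge) - max(ps, gs))
            union = max(pe, ge) - min(ps, gs)
            iou = inter / union
            if iou > best_iou:
                best_iou, best_j = iou, j
        if best_j >= 0 and best_iou >= threshold and not matched[best_j]:
            tp += 1
            matched[best_j] = True
        else:
            fp += 1
    fn = matched.count(False)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 100.0 * 2 * precision * recall / (precision + recall)


gt = [0, 0, 0, 0, 1, 1, 1, 1]
over_segmented = [0, 1, 0, 0, 1, 1, 1, 1]
print(frame_accuracy(over_segmented, gt))  # 87.5
print(edit_score(over_segmented, gt))      # 50.0
```

The toy example shows why segmental metrics are reported alongside accuracy: a single mislabeled frame barely affects Acc but splits one segment into three, which halves the edit score.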
