5 months ago

On the Utility of 3D Hand Poses for Action Recognition

Md Salman Shamil, Dibyadip Chatterjee, Fadime Sener, Shugao Ma, Angela Yao

Abstract

3D hand pose is an underexplored modality for action recognition. Poses are compact yet informative and can greatly benefit applications with limited compute budgets. However, poses alone offer an incomplete understanding of actions, as they cannot fully capture objects and environments with which humans interact. We propose HandFormer, a novel multimodal transformer, to efficiently model hand-object interactions. HandFormer combines 3D hand poses at a high temporal resolution for fine-grained motion modeling with sparsely sampled RGB frames for encoding scene semantics. Observing the unique characteristics of hand poses, we temporally factorize hand modeling and represent each joint by its short-term trajectories. This factorized pose representation combined with sparse RGB samples is remarkably efficient and highly accurate. Unimodal HandFormer with only hand poses outperforms existing skeleton-based methods at 5x fewer FLOPs. With RGB, we achieve new state-of-the-art performance on Assembly101 and H2O with significant improvements in egocentric action recognition.
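
The temporal factorization described in the abstract can be pictured with a small sketch. This is not the authors' implementation: the segment length, the choice of the middle frame as the RGB sample, and every function name below are assumptions made for illustration only. It shows a dense pose sequence being cut into short segments, each segment being regrouped into one short-term trajectory per joint, and a single RGB frame index being picked per segment.

```python
from typing import List, Tuple

Point = Tuple[float, float, float]
Frame = List[Point]  # one 3D point per hand joint


def split_into_segments(num_frames: int, segment_len: int) -> List[range]:
    """Return non-overlapping frame ranges; a trailing partial segment is dropped."""
    return [range(start, start + segment_len)
            for start in range(0, num_frames - segment_len + 1, segment_len)]


def joint_trajectories(poses: List[Frame], segment: range) -> List[List[Point]]:
    """Regroup one segment from frame-major to joint-major order.

    The result has one entry per joint; each entry is that joint's
    short-term trajectory, i.e. its 3D position at every frame of the segment.
    """
    num_joints = len(poses[segment.start])
    return [[poses[t][j] for t in segment] for j in range(num_joints)]


def sparse_rgb_indices(segments: List[range]) -> List[int]:
    """Pick one RGB frame index per segment (assumption: the middle frame)."""
    return [seg.start + len(seg) // 2 for seg in segments]


# Toy input: 10 frames, 2 joints; joint j at frame t sits at (t, j, 0).
poses = [[(float(t), float(j), 0.0) for j in range(2)] for t in range(10)]

segments = split_into_segments(len(poses), segment_len=4)
trajectories = [joint_trajectories(poses, seg) for seg in segments]
rgb_frames = sparse_rgb_indices(segments)
```

With these toy values, the 10 frames yield two segments of 4 frames each (the last 2 frames are dropped), each segment holds one 4-point trajectory per joint, and only 2 of the 10 RGB frames would be passed on to an image encoder. That gap between dense pose input and sparse RGB input is where the efficiency claimed in the abstract would come from.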

Code Repositories

s-shamil/HandFormer (official, PyTorch)

Benchmarks

Benchmark: 3d-action-recognition-on-assembly101
Methodology: HandFormer-B/21
Metrics:
Actions Top-1: 41.06
Object Top-1: 51.17
Verbs Top-1: 69.23

Benchmark: action-recognition-on-h2o-2-hands-and-objects
Methodology: HandFormer-B/21x8
Metrics:
Actions Top-1: 93.39
Hand Pose: 3D
Object Label: No
Object Pose: No
RGB: Yes
