HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

3D Human Pose Perception from Egocentric Stereo Videos

Akada Hiroyasu ; Wang Jian ; Golyanik Vladislav ; Theobalt Christian

3D Human Pose Perception from Egocentric Stereo Videos

Abstract

While head-mounted devices are becoming more compact, they provide egocentricviews with significant self-occlusions of the device user. Hence, existingmethods often fail to accurately estimate complex 3D poses from egocentricviews. In this work, we propose a new transformer-based framework to improveegocentric stereo 3D human pose estimation, which leverages the sceneinformation and temporal context of egocentric stereo videos. Specifically, weutilize 1) depth features from our 3D scene reconstruction module withuniformly sampled windows of egocentric stereo frames, and 2) human jointqueries enhanced by temporal features of the video inputs. Our method is ableto accurately estimate human poses even in challenging scenarios, such ascrouching and sitting. Furthermore, we introduce two new benchmark datasets,i.e., UnrealEgo2 and UnrealEgo-RW (RealWorld). The proposed datasets offer amuch larger number of egocentric stereo views with a wider variety of humanmotions than the existing datasets, allowing comprehensive evaluation ofexisting and upcoming methods. Our extensive experiments show that the proposedapproach significantly outperforms previous methods. We will releaseUnrealEgo2, UnrealEgo-RW, and trained models on our project page.

Benchmarks

BenchmarkMethodologyMetrics
egocentric-pose-estimation-on-unrealegoUnrealEgo2
Average MPJPE (mm): 50.0
PA-MPJPE: 40.5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
3D Human Pose Perception from Egocentric Stereo Videos | Papers | HyperAI