HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

RepNet: Weakly Supervised Training of an Adversarial Reprojection Network for 3D Human Pose Estimation

Bastian Wandt; Bodo Rosenhahn

RepNet: Weakly Supervised Training of an Adversarial Reprojection Network for 3D Human Pose Estimation

Abstract

This paper addresses the problem of 3D human pose estimation from single images. While for a long time human skeletons were parameterized and fitted to the observation by satisfying a reprojection error, nowadays researchers directly use neural networks to infer the 3D pose from the observations. However, most of these approaches ignore the fact that a reprojection constraint has to be satisfied and are sensitive to overfitting. We tackle the overfitting problem by ignoring 2D to 3D correspondences. This efficiently avoids a simple memorization of the training data and allows for a weakly supervised training. One part of the proposed reprojection network (RepNet) learns a mapping from a distribution of 2D poses to a distribution of 3D poses using an adversarial training approach. Another part of the network estimates the camera. This allows for the definition of a network layer that performs the reprojection of the estimated 3D pose back to 2D which results in a reprojection loss function. Our experiments show that RepNet generalizes well to unknown data and outperforms state-of-the-art methods when applied to unseen data. Moreover, our implementation runs in real-time on a standard desktop PC.

Code Repositories

bastianwandt/RepNet
tf
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-human-pose-estimation-on-human36mRepNet (GTi)
Average MPJPE (mm): 50.9
3d-human-pose-estimation-on-human36mRepNet
Average MPJPE (mm): 89.9
3d-human-pose-estimation-on-mpi-inf-3dhpRepNet (H36M)
AUC: 54.8
MPJPE: 92.5
PCK: 81.8
3d-human-pose-estimation-on-mpi-inf-3dhpRepNet (3DHP)
AUC: 58.5
MPJPE: 97.8
PCK: 82.5
monocular-3d-human-pose-estimation-on-human3RepNet
Average MPJPE (mm): 89.9
Frames Needed: 1
Need Ground Truth 2D Pose: No
Use Video Sequence: No
weakly-supervised-3d-human-pose-estimation-onRepNet
3D Annotations: No
Average MPJPE (mm): 89.9
Number of Frames Per View: 1
Number of Views: 1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp