Command Palette
Search for a command to run...
RepNet: Weakly Supervised Training of an Adversarial Reprojection Network for 3D Human Pose Estimation
Bastian Wandt; Bodo Rosenhahn

Abstract
This paper addresses the problem of 3D human pose estimation from single images. While for a long time human skeletons were parameterized and fitted to the observation by satisfying a reprojection error, nowadays researchers directly use neural networks to infer the 3D pose from the observations. However, most of these approaches ignore the fact that a reprojection constraint has to be satisfied and are sensitive to overfitting. We tackle the overfitting problem by ignoring 2D to 3D correspondences. This efficiently avoids a simple memorization of the training data and allows for a weakly supervised training. One part of the proposed reprojection network (RepNet) learns a mapping from a distribution of 2D poses to a distribution of 3D poses using an adversarial training approach. Another part of the network estimates the camera. This allows for the definition of a network layer that performs the reprojection of the estimated 3D pose back to 2D which results in a reprojection loss function. Our experiments show that RepNet generalizes well to unknown data and outperforms state-of-the-art methods when applied to unseen data. Moreover, our implementation runs in real-time on a standard desktop PC.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-human-pose-estimation-on-human36m | RepNet (GTi) | Average MPJPE (mm): 50.9 |
| 3d-human-pose-estimation-on-human36m | RepNet | Average MPJPE (mm): 89.9 |
| 3d-human-pose-estimation-on-mpi-inf-3dhp | RepNet (H36M) | AUC: 54.8 MPJPE: 92.5 PCK: 81.8 |
| 3d-human-pose-estimation-on-mpi-inf-3dhp | RepNet (3DHP) | AUC: 58.5 MPJPE: 97.8 PCK: 82.5 |
| monocular-3d-human-pose-estimation-on-human3 | RepNet | Average MPJPE (mm): 89.9 Frames Needed: 1 Need Ground Truth 2D Pose: No Use Video Sequence: No |
| weakly-supervised-3d-human-pose-estimation-on | RepNet | 3D Annotations: No Average MPJPE (mm): 89.9 Number of Frames Per View: 1 Number of Views: 1 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.