Command Palette
Search for a command to run...
Sharma Saurabh ; Varigonda Pavan Teja ; Bindal Prashast ; Sharma Abhishek ; Jain Arjun

Abstract
Monocular 3D human-pose estimation from static images is a challengingproblem, due to the curse of dimensionality and the ill-posed nature of lifting2D-to-3D. In this paper, we propose a Deep Conditional Variational Autoencoderbased model that synthesizes diverse anatomically plausible 3D-pose samplesconditioned on the estimated 2D-pose. We show that CVAE-based 3D-pose sampleset is consistent with the 2D-pose and helps tackling the inherent ambiguity in2D-to-3D lifting. We propose two strategies for obtaining the final 3D pose-(a) depth-ordering/ordinal relations to score and weight-average the candidate3D-poses, referred to as OrdinalScore, and (b) with supervision from an Oracle.We report close to state of-the-art results on two benchmark datasets usingOrdinalScore, and state-of-the-art results using the Oracle. We also show thatour pipeline yields competitive results without paired image-to-3D annotations.The training and evaluation code is available athttps://github.com/ssfootball04/generative_pose.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-human-pose-estimation-on-human36m | MultiPoseNet with Oracle | Average MPJPE (mm): 46.8 |
| 3d-human-pose-estimation-on-humaneva-i | Ours (Oracle) | Mean Reconstruction Error (mm): 23.9 |
| monocular-3d-human-pose-estimation-on-human3 | MultiPoseNet | Average MPJPE (mm): 58.0 Frames Needed: 1 Need Ground Truth 2D Pose: No Use Video Sequence: No |
| multi-hypotheses-3d-human-pose-estimation-on | Sharma et al. | Average MPJPE (mm): 46.8 Average PMPJPE (mm): 37.3 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.