Generating Multiple Hypotheses for 3D Human Pose Estimation with Mixture Density Network
Li Chen ; Lee Gim Hee

Abstract
3D human pose estimation from a monocular image or 2D joints is an ill-posed problem because of depth ambiguity and occluded joints. We argue that 3D human pose estimation from a monocular input is an inverse problem where multiple feasible solutions can exist. In this paper, we propose a novel approach to generate multiple feasible hypotheses of the 3D pose from 2D joints. In contrast to existing deep learning approaches, which minimize a mean square error based on a unimodal Gaussian distribution, our method is able to generate multiple feasible hypotheses of 3D pose based on a multimodal mixture density network. Our experiments show that the 3D poses estimated by our approach from an input of 2D joints are consistent in 2D reprojections, which supports our argument that multiple solutions exist for the 2D-to-3D inverse problem. Furthermore, we show state-of-the-art performance on the Human3.6M dataset in both best hypothesis and multi-view settings, and we demonstrate the generalization capacity of our model by testing on the MPII and MPI-INF-3DHP datasets. Our code is available at the project website.
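To make the contrast with a mean-square-error objective concrete, the following is a minimal sketch of a mixture density negative log-likelihood for a single training sample. The abstract does not specify the kernel form, so the isotropic Gaussian kernels, the function names, and the argument layout are illustrative assumptions rather than the authors' released implementation.

```python
import numpy as np


def _logsumexp(x):
    # Numerically stable log(sum(exp(x))) for a 1-D array.
    m = np.max(x)
    return m + np.log(np.sum(np.exp(x - m)))


def mdn_negative_log_likelihood(target, mix_logits, means, sigmas):
    """Negative log-likelihood of `target` under a Gaussian mixture.

    Assumed (hypothetical) layout of the network outputs:
      target:     (D,)   flattened 3D pose, e.g. D = 3 * number of joints
      mix_logits: (M,)   unnormalized mixing coefficients
      means:      (M, D) one candidate 3D pose (hypothesis) per kernel
      sigmas:     (M,)   positive standard deviation per isotropic kernel
    """
    d = target.shape[0]
    # Log of the softmax-normalized mixing coefficients.
    log_alpha = mix_logits - _logsumexp(mix_logits)
    # Squared distance from the target to each kernel mean.
    sq_dist = np.sum((means - target) ** 2, axis=1)
    # Log-density of an isotropic Gaussian for each kernel.
    log_gauss = (
        -0.5 * sq_dist / sigmas**2
        - d * np.log(sigmas)
        - 0.5 * d * np.log(2.0 * np.pi)
    )
    # Mixture likelihood is a weighted sum of kernels; combine in log space.
    return -_logsumexp(log_alpha + log_gauss)
```

With a single kernel and a fixed standard deviation, this loss reduces to a scaled squared error plus a constant, which is the unimodal case the abstract contrasts against. With several kernels, each mean can settle on a different plausible 3D pose for the same 2D input.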
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-human-pose-estimation-on-human36m | MDN (Multi-View) | Average MPJPE (mm): 49.6; Multi-View or Monocular: Multi-View; Using 2D ground-truth joints: No |
| 3d-human-pose-estimation-on-human36m | MDN | Average MPJPE (mm): 52.7; Multi-View or Monocular: Monocular; PA-MPJPE: 42.6; Using 2D ground-truth joints: No |
| 3d-human-pose-estimation-on-mpi-inf-3dhp | MDN | PCK: 67.9 |
| monocular-3d-human-pose-estimation-on-human3 | Multimodal Mixture Density Networks | Average MPJPE (mm): 52.7; Frames Needed: 1; Need Ground Truth 2D Pose: No; Use Video Sequence: No |
| multi-hypotheses-3d-human-pose-estimation-on | MDN | Average MPJPE (mm): 52.7; Average PMPJPE (mm): 42.6 |
| multi-hypotheses-3d-human-pose-estimation-on-2 | SMPL-MDN (by 3D Multi-bodies) | Best-Hypothesis MPJPE (n = 25): 91.5; Best-Hypothesis PMPJPE (n = 25): 69.5; H36M PMPJPE (n = 1): 44.8; H36M PMPJPE (n = 25): 42.7; Most-Likely Hypothesis PMPJPE (n = 1): 74.7 |
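
The MPJPE and best-hypothesis figures in the table can be read with the following sketch in mind. It uses the standard definition of mean per-joint position error and scores a set of hypotheses by the one closest to the ground truth. The function names and array shapes are assumptions for illustration, not the benchmark's evaluation code, and the Procrustes alignment used for PA-MPJPE/PMPJPE is not shown.

```python
import numpy as np


def mpjpe(pred, gt):
    """Mean per-joint position error.

    pred, gt: (J, 3) arrays of joint positions in the same units
    (millimetres for the Human3.6M numbers above).
    """
    return float(np.mean(np.linalg.norm(pred - gt, axis=-1)))


def best_hypothesis_mpjpe(hypotheses, gt):
    """Score N hypotheses by the one closest to the ground truth.

    hypotheses: (N, J, 3) array of candidate poses.
    Returns (lowest MPJPE, index of that hypothesis).
    """
    # Per-hypothesis MPJPE: joint-wise distances averaged over joints.
    errors = np.linalg.norm(hypotheses - gt[None], axis=-1).mean(axis=-1)
    best = int(np.argmin(errors))
    return float(errors[best]), best


# Tiny example with two joints and two hypotheses.
gt = np.zeros((2, 3))
hyps = np.stack([
    gt + np.array([3.0, 4.0, 0.0]),  # every joint is 5 units off
    gt + np.array([0.0, 0.0, 1.0]),  # every joint is 1 unit off
])
print(best_hypothesis_mpjpe(hyps, gt))  # (1.0, 1)
```

Because the best hypothesis is chosen using the ground truth, this is an oracle-style metric: it measures whether any generated hypothesis is close to the true pose, not whether the model can pick it on its own.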