HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Human Pose Regression with Residual Log-likelihood Estimation

Jiefeng Li Siyuan Bian Ailing Zeng Can Wang Bo Pang Wentao Liu Cewu Lu

Human Pose Regression with Residual Log-likelihood Estimation

Abstract

Heatmap-based methods dominate in the field of human pose estimation by modelling the output distribution through likelihood heatmaps. In contrast, regression-based methods are more efficient but suffer from inferior performance. In this work, we explore maximum likelihood estimation (MLE) to develop an efficient and effective regression-based methods. From the perspective of MLE, adopting different regression losses is making different assumptions about the output density function. A density function closer to the true distribution leads to a better regression performance. In light of this, we propose a novel regression paradigm with Residual Log-likelihood Estimation (RLE) to capture the underlying output distribution. Concretely, RLE learns the change of the distribution instead of the unreferenced underlying distribution to facilitate the training process. With the proposed reparameterization design, our method is compatible with off-the-shelf flow models. The proposed method is effective, efficient and flexible. We show its potential in various human pose estimation tasks with comprehensive experiments. Compared to the conventional regression paradigm, regression with RLE bring 12.4 mAP improvement on MSCOCO without any test-time overhead. Moreover, for the first time, especially on multi-person pose estimation, our regression method is superior to the heatmap-based methods. Our code is available at https://github.com/Jeff-sjtu/res-loglikelihood-regression

Code Repositories

open-mmlab/mmpose
pytorch
Mentioned in GitHub
Jeff-sjtu/res-loglikelihood-regression
Official
pytorch
Mentioned in GitHub
sithu31296/pose-estimation
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-human-pose-estimation-on-human36mRLE (Two-stage, based on SRNet)
Average MPJPE (mm): 36.3
pose-estimation-on-coco-val2017RLE (256x192)
AP: 71.3
AP50: 88.9
AP75: 78.3
AR: -

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp