3 months ago

Deformable Mesh Transformer for 3D Human Mesh Recovery

{Yusuke Yoshiyasu}

Abstract

We present Deformable mesh transFormer (DeFormer), a novel vertex-based approach to monocular 3D human mesh recovery. DeFormer iteratively fits a body mesh model to an input image via a mesh alignment feedback loop formed within a transformer decoder that is equipped with efficient body mesh driven attention modules: 1) body sparse self-attention and 2) deformable mesh cross attention. As a result, DeFormer can effectively exploit high-resolution image feature maps and a dense mesh model which were computationally expensive to deal with in previous approaches using the standard transformer attention. Experimental results show that DeFormer achieves state-of-the-art performances on the Human3.6M and 3DPW benchmarks. Ablation study is also conducted to show the effectiveness of the DeFormer model designs for leveraging multi-scale feature maps. Code is available at https://github.com/yusukey03012/DeFormer.

Benchmarks

Benchmark	Methodology	Metrics
3d-hand-pose-estimation-on-freihand	Deformer	PA-F@15mm: 0.984 PA-F@5mm: 0.743 PA-MPJPE: 6.2 PA-MPVPE: 6.4
3d-human-pose-estimation-on-3dpw	DeFormer	MPJPE: 72.9 MPVPE: 82.6 PA-MPJPE: 44.3

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning