HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation

Vítor Albiero; Xingyu Chen; Xi Yin; Guan Pang; Tal Hassner

img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation

Abstract

We propose real-time, six degrees of freedom (6DoF), 3D face pose estimation without face detection or landmark localization. We observe that estimating the 6DoF rigid transformation of a face is a simpler problem than facial landmark detection, often used for 3D face alignment. In addition, 6DoF offers more information than face bounding box labels. We leverage these observations to make multiple contributions: (a) We describe an easily trained, efficient, Faster R-CNN--based model which regresses 6DoF pose for all faces in the photo, without preliminary face detection. (b) We explain how pose is converted and kept consistent between the input photo and arbitrary crops created while training and evaluating our model. (c) Finally, we show how face poses can replace detection bounding box training labels. Tests on AFLW2000-3D and BIWI show that our method runs at real-time and outperforms state of the art (SotA) face pose estimators. Remarkably, our method also surpasses SotA models of comparable complexity on the WIDER FACE detection benchmark, despite not been optimized on bounding box labels.

Code Repositories

vitoralbiero/img2pose
Official
pytorch
Mentioned in GitHub
nilseuropa/ros_img2pose
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
face-detection-on-wider-face-easyimg2pose
AP: 0.9
face-detection-on-wider-face-hardimg2pose
AP: 0.839
face-detection-on-wider-face-mediumimg2pose
AP: 0.890
head-pose-estimation-on-aflw2000RetinaFace R-50 (5 points)
MAE: 4.839
MAE_t: 0.114
head-pose-estimation-on-aflw2000img2pose
Geodesic Error (GE): 6.41
MAE: 3.913
MAE_t: 0.099
head-pose-estimation-on-biwiimg2pose
Geodesic Error (GE): 7.10
Geodesic Error - aligned (GE): 6.23
MAE (trained with other data): 3.786
MAE-aligned (trained with other data): 3.4
head-pose-estimation-on-biwiRetinaFace R-50 (5 points)
MAE (trained with other data): 4.578

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp