Face Alignment using a 3D Deeply-initialized Ensemble of Regression Trees
Roberto Valle; José M. Buenaposada; Antonio Valdés; Luis Baumela

Abstract
Face alignment algorithms locate a set of landmark points in images of faces taken in unrestricted situations. State-of-the-art approaches typically fail or lose accuracy in the presence of occlusions, strong deformations, large pose variations and ambiguous configurations. In this paper we present 3DDE, a robust and efficient face alignment algorithm based on a coarse-to-fine cascade of ensembles of regression trees. It is initialized by robustly fitting a 3D face model to the probability maps produced by a convolutional neural network. With this initialization we address self-occlusions and large face rotations. Further, the regressor implicitly imposes a prior face shape on the solution, addressing occlusions and ambiguous face configurations. Its coarse-to-fine structure tackles the combinatorial explosion of part deformations. In the experiments performed, 3DDE improves on the state of the art on the 300W, COFW, AFLW and WFLW data sets. Finally, we perform cross-dataset experiments that reveal a significant data set bias in these benchmarks.
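
The cascade described in the abstract refines an initial landmark estimate stage by stage. The following is a minimal sketch of generic cascaded shape regression, not the authors' implementation: `run_cascade`, `toy_features` and `toy_stages` are hypothetical names, the stage regressors are plain callables rather than ensembles of regression trees, and the toy features stand in for image features sampled around the current landmarks.

```python
import numpy as np

def run_cascade(initial_shape, stages, extract_features):
    """Refine a landmark estimate by adding one residual update per stage.

    initial_shape: (L, 2) array of starting landmark coordinates.
    stages: sequence of callables mapping features to an (L, 2) update.
    extract_features: callable mapping the current shape to features.
    """
    shape = np.asarray(initial_shape, dtype=float).copy()
    for stage in stages:
        features = extract_features(shape)  # recomputed at the current estimate
        shape = shape + stage(features)     # additive shape update
    return shape

# Toy demonstration (assumption: the "features" are simply the remaining
# offset to a known target, which a real system would never have access to).
target = np.array([[10.0, 20.0], [30.0, 40.0]])

def toy_features(shape):
    return target - shape

# Each toy stage removes half of the remaining error.
toy_stages = [lambda f: 0.5 * f for _ in range(4)]

refined = run_cascade(np.zeros_like(target), toy_stages, toy_features)
```

In this toy setting each stage halves the residual, so four stages leave one sixteenth of the initial error. In a real cascade the starting shape would come from an initialization such as the 3D model fit mentioned above, and each stage would be a learned regressor.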
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| face-alignment-on-300w | 3DDE | NME_inter-ocular (%, Challenge): 4.92; NME_inter-ocular (%, Common): 2.69; NME_inter-ocular (%, Full): 3.13; NME_inter-pupil (%, Challenge): 7.10; NME_inter-pupil (%, Common): 3.73; NME_inter-pupil (%, Full): 4.39 |
| face-alignment-on-300w-split-2 | 3DDE | AUC@8 (inter-ocular): 53.94; FR@8 (inter-ocular): 2.33; NME (inter-ocular): 3.73 |
| face-alignment-on-cofw | 3DDE (Inter-pupil Norm) | NME (inter-pupil): 5.11%; Recall at 80% precision (Landmarks Visibility): 63.89 |
| face-alignment-on-wflw | 3DDE | AUC@10 (inter-ocular): 55.44; FR@10 (inter-ocular): 5.04; NME (inter-ocular): 4.68 |
| facial-landmark-detection-on-300w | 3DDE (Inter-ocular Norm) | NME: 3.13 |
| facial-landmark-detection-on-aflw-full | 3DDE (Box height Norm, 19 landmarks - no earlobes) | Mean NME: 2.01 |
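
The metrics in the table follow common face alignment conventions, which can be sketched as follows. This is a hedged illustration, not the evaluation code behind the reported numbers: the function names are hypothetical, the indices of the two normalization landmarks depend on each data set's annotation scheme, and the AUC is assumed to be the area under the cumulative error distribution up to the threshold, divided by the threshold.

```python
import numpy as np

def nme_per_image(pred, gt, left_idx, right_idx):
    """Normalized mean error per image, in percent.

    pred, gt: (N, L, 2) predicted and ground-truth landmarks.
    left_idx, right_idx: indices of the two ground-truth landmarks whose
    distance is the normalizer (outer eye corners for inter-ocular,
    pupil centers for inter-pupil); dataset-dependent assumptions.
    """
    pred = np.asarray(pred, dtype=float)
    gt = np.asarray(gt, dtype=float)
    per_landmark = np.linalg.norm(pred - gt, axis=2)                   # (N, L)
    norm = np.linalg.norm(gt[:, left_idx] - gt[:, right_idx], axis=1)  # (N,)
    return 100.0 * per_landmark.mean(axis=1) / norm

def failure_rate(nme, threshold):
    """Percentage of images whose NME exceeds the threshold (FR@threshold)."""
    return 100.0 * np.mean(np.asarray(nme, dtype=float) > threshold)

def auc(nme, threshold, steps=1000):
    """Area under the cumulative error distribution on [0, threshold], in percent."""
    nme = np.asarray(nme, dtype=float)
    xs = np.linspace(0.0, threshold, steps + 1)
    ced = np.array([(nme <= x).mean() for x in xs])  # fraction of images with NME <= x
    area = np.sum((ced[1:] + ced[:-1]) / 2.0 * np.diff(xs))  # trapezoidal rule
    return 100.0 * area / threshold

# Toy example: two landmarks 10 px apart, one of them predicted 1 px off.
gt = np.array([[[0.0, 0.0], [10.0, 0.0]]])
pred = np.array([[[1.0, 0.0], [10.0, 0.0]]])
toy_nme = nme_per_image(pred, gt, left_idx=0, right_idx=1)
```

For a box-height normalization, as in the AFLW row, the `norm` term would be replaced by the height of the face bounding box.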