HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Fiducial Focus Augmentation for Facial Landmark Detection

Kar Purbayan ; Chudasama Vishal ; Onoe Naoyuki ; Wasnik Pankaj ; Balasubramanian Vineeth

Fiducial Focus Augmentation for Facial Landmark Detection

Abstract

Deep learning methods have led to significant improvements in the performanceon the facial landmark detection (FLD) task. However, detecting landmarks inchallenging settings, such as head pose changes, exaggerated expressions, oruneven illumination, continue to remain a challenge due to high variability andinsufficient samples. This inadequacy can be attributed to the model'sinability to effectively acquire appropriate facial structure information fromthe input images. To address this, we propose a novel image augmentationtechnique specifically designed for the FLD task to enhance the model'sunderstanding of facial structures. To effectively utilize the newly proposedaugmentation technique, we employ a Siamese architecture-based trainingmechanism with a Deep Canonical Correlation Analysis (DCCA)-based loss toachieve collective learning of high-level feature representations from twodifferent views of the input images. Furthermore, we employ a Transformer +CNN-based network with a custom hourglass module as the robust backbone for theSiamese framework. Extensive experiments show that our approach outperformsmultiple state-of-the-art approaches across various benchmark datasets.

Benchmarks

BenchmarkMethodologyMetrics
face-alignment-on-300wFiFA
NME_inter-ocular (%, Challenge): 4.47
NME_inter-ocular (%, Common): 2.51
NME_inter-ocular (%, Full): 2.89
face-alignment-on-aflw-19FiFA
AUC_box@0.07 (%, Full): 81.8
NME_box (%, Full): 1.31
NME_diag (%, Frontal): 0.80
NME_diag (%, Full): 0.92
face-alignment-on-cofwFiFA
NME (inter-ocular): 2.96
face-alignment-on-wflwFiFA
AUC@10 (inter-ocular): 61.78
FR@10 (inter-ocular): 1.60
NME (inter-ocular): 3.89
facial-landmark-detection-on-300wFiFA
NME: 2.89
facial-landmark-detection-on-aflw-frontFiFA
Mean NME: 0.80
Mean NME : 0.80
NME: 0.80
facial-landmark-detection-on-aflw-fullFiFA
Mean NME: 0.92
Mean NME : 0.92
NME: 0.92
facial-landmark-detection-on-cofwFiFA
NME: 2.96
NME (inter-ocular): 2.96
facial-landmark-detection-on-wflw-1FiFA
AUC@10 (inter-ocular): 61.78
FR@10 (inter-ocular): 1.60
NME (inter-ocular): 3.89

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp