5 months ago

STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection

Zhou Zhenglin ; Li Huaxia ; Liu Hong ; Wang Nanyang ; Yu Gang ; Ji Rongrong

Abstract

Recently, deep learning-based facial landmark detection has achieved significant improvement. However, the semantic ambiguity problem degrades detection performance. Specifically, semantic ambiguity causes inconsistent annotation and negatively affects the model's convergence, leading to worse accuracy and unstable predictions. To solve this problem, we propose a Self-adapTive Ambiguity Reduction (STAR) loss by exploiting the properties of semantic ambiguity. We find that semantic ambiguity results in an anisotropic predicted distribution, which inspires us to use the predicted distribution to represent semantic ambiguity. Based on this, we design the STAR loss, which measures the anisotropism of the predicted distribution. Compared with the standard regression loss, STAR loss is encouraged to be small when the predicted distribution is anisotropic and thus adaptively mitigates the impact of semantic ambiguity. Moreover, we propose two kinds of eigenvalue restriction methods that avoid both abnormal changes in the distribution and premature convergence of the model. Finally, comprehensive experiments demonstrate that STAR loss outperforms the state-of-the-art methods on three benchmarks, i.e., COFW, 300W, and WFLW, with negligible computation overhead. Code is at https://github.com/ZhenglinZhou/STAR.
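To make the idea of an anisotropic predicted distribution concrete, the following is a minimal sketch in plain Python, not the authors' implementation. It treats a non-negative heatmap as the predicted distribution, computes its mean and 2x2 covariance, and decomposes the covariance into eigenvalues and principal directions. The function names, the toy heatmap, and in particular the weighting of each projected error by the inverse square root of the matching eigenvalue are assumptions made for illustration; the abstract does not state the loss formula, and the `eps` guard below is only a numerical safeguard, not one of the eigenvalue restriction methods mentioned above.

```python
import math


def heatmap_moments(heatmap):
    """Return the mean (x, y) and covariance (sxx, sxy, syy) of a 2D heatmap."""
    total = sum(sum(row) for row in heatmap)
    if total <= 0:
        raise ValueError("heatmap must have positive mass")
    mx = sum(x * v for row in heatmap for x, v in enumerate(row)) / total
    my = sum(y * v for y, row in enumerate(heatmap) for v in row) / total
    sxx = sxy = syy = 0.0
    for y, row in enumerate(heatmap):
        for x, v in enumerate(row):
            dx, dy = x - mx, y - my
            sxx += v * dx * dx
            sxy += v * dx * dy
            syy += v * dy * dy
    return (mx, my), (sxx / total, sxy / total, syy / total)


def eigen_2x2_symmetric(sxx, sxy, syy):
    """Eigenvalues (largest first) and unit eigenvectors of [[sxx, sxy], [sxy, syy]]."""
    half_trace = 0.5 * (sxx + syy)
    radius = math.hypot(0.5 * (sxx - syy), sxy)
    lam1 = half_trace + radius
    lam2 = max(half_trace - radius, 0.0)  # clamp tiny negative rounding error
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)  # angle of the major axis
    v1 = (math.cos(theta), math.sin(theta))
    v2 = (-math.sin(theta), math.cos(theta))
    return (lam1, lam2), (v1, v2)


def anisotropy_weighted_error(heatmap, target, eps=1e-6):
    """Illustrative error: project (target - mean) onto each principal axis and
    down-weight the axis with larger variance (assumed form, for illustration)."""
    (mx, my), (sxx, sxy, syy) = heatmap_moments(heatmap)
    (lam1, lam2), (v1, v2) = eigen_2x2_symmetric(sxx, sxy, syy)
    ex, ey = target[0] - mx, target[1] - my
    proj1 = abs(ex * v1[0] + ey * v1[1])
    proj2 = abs(ex * v2[0] + ey * v2[1])
    return proj1 / math.sqrt(lam1 + eps) + proj2 / math.sqrt(lam2 + eps)


# Toy heatmap that is more spread out along x than along y (hypothetical data).
toy_heatmap = [
    [0, 1, 2, 1, 0],
    [1, 4, 8, 4, 1],
    [0, 1, 2, 1, 0],
]
error_along_x = anisotropy_weighted_error(toy_heatmap, (3.0, 1.0))
error_along_y = anisotropy_weighted_error(toy_heatmap, (2.0, 2.0))
```

In this toy case the distribution is wider along x, so a one-pixel error along x is penalized less than a one-pixel error along y. That mirrors the behavior described in the abstract, where the loss is kept small along the direction in which the prediction is ambiguous.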

Code Repositories

zhenglinzhou/star (official, PyTorch)

Benchmarks

Benchmark: face-alignment-on-300w
Methodology: STAR
Metrics:
NME_inter-ocular (%, Challenge): 4.32
NME_inter-ocular (%, Common): 2.52
NME_inter-ocular (%, Full): 2.87
NME_inter-pupil (%, Challenge): 6.22
NME_inter-pupil (%, Common): 3.5
NME_inter-pupil (%, Full): 4.03

Benchmark: face-alignment-on-cofw
Methodology: STAR
Metrics:
NME (inter-ocular, %): 3.21
NME (inter-pupil, %): 4.62

Benchmark: face-alignment-on-wflw
Methodology: STAR
Metrics:
AUC@10 (inter-ocular): 60.5
FR@10 (inter-ocular): 2.32
NME (inter-ocular): 4.02
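
The metrics above are not defined on this page. As a rough guide, the sketch below follows the definitions commonly used in face alignment work: NME is the mean landmark error divided by a normalizing distance (inter-ocular or inter-pupil), FR@10 is the share of images whose NME exceeds 10%, and AUC@10 is the normalized area under the cumulative error distribution up to 10%. The function names and the sample values are hypothetical, and which landmarks define the normalizing distance depends on the dataset, so the caller passes it in.

```python
import math


def nme(pred, gt, norm_dist):
    """Mean Euclidean landmark error divided by a normalizing distance."""
    if len(pred) != len(gt) or not gt:
        raise ValueError("pred and gt must be non-empty and of equal length")
    if norm_dist <= 0:
        raise ValueError("norm_dist must be positive")
    total = sum(math.dist(p, g) for p, g in zip(pred, gt))
    return total / (len(gt) * norm_dist)


def failure_rate(nmes, threshold=0.10):
    """Fraction of images whose NME is above the threshold."""
    return sum(1 for e in nmes if e > threshold) / len(nmes)


def auc(nmes, threshold=0.10):
    """Area under the empirical cumulative error curve on [0, threshold],
    divided by threshold so that a perfect model scores 1.0."""
    area = sum(max(0.0, threshold - e) for e in nmes) / len(nmes)
    return area / threshold


# Hypothetical two-landmark example: errors of 0 and 5 pixels, normalizer 10.
example_nme = nme([(0, 0), (3, 4)], [(0, 0), (0, 0)], norm_dist=10.0)
example_fr = failure_rate([0.05, 0.25])
example_auc = auc([0.05, 0.25])
```

Multiplying these fractions by 100 gives percentage-style figures of the kind reported in the listing above.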
