Command Palette
Search for a command to run...
Unsupervised learning of object landmarks by factorized spatial embeddings
James Thewlis; Hakan Bilen; Andrea Vedaldi

Abstract
Learning automatically the structure of object categories remains an important open problem in computer vision. In this paper, we propose a novel unsupervised approach that can discover and learn landmarks in object categories, thus characterizing their structure. Our approach is based on factorizing image deformations, as induced by a viewpoint change or an object deformation, by learning a deep neural network that detects landmarks consistently with such visual effects. Furthermore, we show that the learned landmarks establish meaningful correspondences between different object instances in a category without having to impose this requirement explicitly. We assess the method qualitatively on a variety of object types, natural and man-made. We also show that our unsupervised landmarks are highly predictive of manually-annotated landmarks in face benchmark datasets, and can be used to regress these with a high degree of accuracy.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| unsupervised-facial-landmark-detection-on | FSE | NME: 7.97 |
| unsupervised-facial-landmark-detection-on-1 | FSE | NME: 6.67 |
| unsupervised-facial-landmark-detection-on-1 | Thewlis2017unsupervised | NME: 6.32 |
| unsupervised-facial-landmark-detection-on-3 | FSE | NME: 10.53 |
| unsupervised-facial-landmark-detection-on-5 | ULD | NME: 31.3 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.