Command Palette
Search for a command to run...
Filip Radenović; Giorgos Tolias; Ondřej Chum

Abstract
We cast shape matching as metric learning with convolutional networks. We break the end-to-end process of image representation into two parts. Firstly, well established efficient methods are chosen to turn the images into edge maps. Secondly, the network is trained with edge maps of landmark images, which are automatically obtained by a structure-from-motion pipeline. The learned representation is evaluated on a range of different tasks, providing improvements on challenging cases of domain generalization, generic sketch-based image retrieval or its fine-grained counterpart. In contrast to other methods that learn a different model per task, object category, or domain, we use the same network throughout all our experiments, achieving state-of-the-art results in multiple benchmarks.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| sketch-based-image-retrieval-on-chairs | EdgeMAC + whitening | R@1: 85.6 R@10: 97.9 |
| sketch-based-image-retrieval-on-handbags | EdgeMAC + whitening | R@1: 51.2 R@10: 85.7 |
| sketch-based-image-retrieval-on-shoes | EdgeMAC + whitening | R@1: 54.8 R@10: 92.2 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.