Command Palette
Search for a command to run...
{Chen-Change Loy Yi-Zhe Song Qian Yu Timothy M. Hospedales Tao Xiang Feng Liu}

Abstract
We investigate the problem of fine-grained sketch-based image retrieval (SBIR), where free-hand human sketches are used as queries to perform instance-level retrieval of images. This is an extremely challenging task because (i) visual comparisons not only need to be fine-grained but also executed cross-domain, (ii) free-hand (finger) sketches are highly abstract, making fine-grained matching harder, and most importantly (iii) annotated cross-domain sketch-photo datasets required for training are scarce, challenging many state-of-the-art machine learning techniques. In this paper, for the first time, we address all these challenges, providing a step towards the capabilities that would underpin a commercial sketch-based image retrieval application. We introduce a new database of 1,432 sketch-photo pairs from two categories with 32,000 fine-grained triplet ranking annotations. We then develop a deep triplet-ranking model for instance-level SBIR with a novel data augmentation and staged pre-training strategy to alleviate the issue of insufficient fine-grained training data. Extensive experiments are carried out to contribute a variety of insights into the challenges of data sufficiency and over-fitting avoidance when training deep networks for fine-grained cross-domain ranking tasks.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| sketch-based-image-retrieval-on-chairs | Sketch-a-Net + rankSVM | R@1: 47.4 R@10: 82.5 |
| sketch-based-image-retrieval-on-chairs | BoW-HOG + rankSVM | R@1: 28.9 R@10: 67.0 |
| sketch-based-image-retrieval-on-chairs | Shoes net + | R@1: 65.0 R@10: 92.8 |
| sketch-based-image-retrieval-on-chairs | Dense-HOG + rankSVM | R@1: 52.6 R@10: 93.8 |
| sketch-based-image-retrieval-on-chairs | Chairs net + | R@1: 72.2 R@10: 99.0 |
| sketch-based-image-retrieval-on-handbags | Chairs net + | R@1: 26.2 R@10: 58.3 |
| sketch-based-image-retrieval-on-handbags | Dense-HOG + rankSVM | R@1: 15.5 R@10: 40.5 |
| sketch-based-image-retrieval-on-handbags | Shoes net + | R@1: 23.2 R@10: 59.5 |
| sketch-based-image-retrieval-on-handbags | Sketch-a-Net + rankSVM | R@1: 9.5 R@10: 44.1 |
| sketch-based-image-retrieval-on-handbags | BoW-HOG + rankSVM | R@1: 2.4 R@10: 10.7 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.