HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

MixVPR: Feature Mixing for Visual Place Recognition

Amar Ali-bey Brahim Chaib-draa Philippe Giguère

MixVPR: Feature Mixing for Visual Place Recognition

Abstract

Visual Place Recognition (VPR) is a crucial part of mobile robotics and autonomous driving as well as other computer vision tasks. It refers to the process of identifying a place depicted in a query image using only computer vision. At large scale, repetitive structures, weather and illumination changes pose a real challenge, as appearances can drastically change over time. Along with tackling these challenges, an efficient VPR technique must also be practical in real-world scenarios where latency matters. To address this, we introduce MixVPR, a new holistic feature aggregation technique that takes feature maps from pre-trained backbones as a set of global features. Then, it incorporates a global relationship between elements in each feature map in a cascade of feature mixing, eliminating the need for local or pyramidal aggregation as done in NetVLAD or TransVPR. We demonstrate the effectiveness of our technique through extensive experiments on multiple large-scale benchmarks. Our method outperforms all existing techniques by a large margin while having less than half the number of parameters compared to CosPlace and NetVLAD. We achieve a new all-time high recall@1 score of 94.6% on Pitts250k-test, 88.0% on MapillarySLS, and more importantly, 58.4% on Nordland. Finally, our method outperforms two-stage retrieval techniques such as Patch-NetVLAD, TransVPR and SuperGLUE all while being orders of magnitude faster. Our code and trained models are available at https://github.com/amaralibey/MixVPR.

Code Repositories

amaralibey/mixvpr
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
visual-place-recognition-on-17-placesMixVPR
Recall@1: 63.79
visual-place-recognition-on-baidu-mallMixVPR
Recall@1: 64.44
visual-place-recognition-on-gardens-pointMixVPR
Recall@1: 91.50
visual-place-recognition-on-hawkinsMixVPR
Recall@1: 25.42
visual-place-recognition-on-laurel-cavernsMixVPR
Recall@1: 29.46
visual-place-recognition-on-mapillary-testMixVPR
Recall@1: 64.0
Recall@10: 80.6
Recall@5: 75.9
visual-place-recognition-on-mapillary-valMixVPR
Recall@1: 88.2
Recall@10: 94.3
Recall@5: 93.1
visual-place-recognition-on-mid-atlanticMixVPR
Recall@1: 25.74
visual-place-recognition-on-nardo-airMixVPR
Recall@1: 32.39
visual-place-recognition-on-nardo-air-rMixVPR
Recall@1: 76.06
visual-place-recognition-on-nordlandMixVPR
Recall@1: 76.0
Recall@5: 89.2
visual-place-recognition-on-oxford-robotcar-4MixVPR
Recall@1: 90.05
visual-place-recognition-on-pittsburgh-250kMixVPR
Recall@1: 94.6
Recall@10: 99.0
Recall@5: 98.3
visual-place-recognition-on-pittsburgh-30kMixVPR
Recall@1: 91.52
Recall@5: 95.9
visual-place-recognition-on-spedMixVPR
Recall@1: 85.2
Recall@10: 94.6
Recall@5: 92.1
visual-place-recognition-on-st-luciaMixVPR
Recall@1: 99.66
visual-place-recognition-on-vp-airMixVPR
Recall@1: 10.31

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp