Command Palette
Search for a command to run...
Shore Tavis ; Hadfield Simon ; Mendez Oscar

Abstract
Cross-view image matching for geo-localisation is a challenging problem dueto the significant visual difference between aerial and ground-levelviewpoints. The method provides localisation capabilities from geo-referencedimages, eliminating the need for external devices or costly equipment. Thisenhances the capacity of agents to autonomously determine their position,navigate, and operate effectively in GNSS-denied environments. Current researchemploys a variety of techniques to reduce the domain gap such as applying polartransforms to aerial images or synthesising between perspectives. However,these approaches generally rely on having a 360{\deg} field of view, limitingreal-world feasibility. We propose BEV-CV, an approach introducing two keynovelties with a focus on improving the real-world viability of cross-viewgeo-localisation. Firstly bringing ground-level images into a semanticBirds-Eye-View before matching embeddings, allowing for direct comparison withaerial image representations. Secondly, we adapt datasets into applicationrealistic format - limited Field-of-View images aligned to vehicle direction.BEV-CV achieves state-of-the-art recall accuracies, improving Top-1 rates of70{\deg} crops of CVUSA and CVACT by 23% and 24% respectively. Also decreasingcomputational requirements by reducing floating point operations to belowprevious works, and decreasing embedding dimensionality by 33% - togetherallowing for faster localisation capabilities.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| cross-view-geo-localisation-on-cvusa-70 | BEV-CV | Top-1: 27.4 Top-1%: 90.94 Top-10: 64.47 Top-5: 52.94 |
| cross-view-geo-localisation-on-cvusa-90 | GAL | Top-1: 22.54 |
| cross-view-geo-localisation-on-cvusa-90 | L2LTR | Top-1: 25.21 |
| cross-view-geo-localisation-on-cvusa-90 | L2LTR [Yang2021CrossviewGW] | R@5: 51.9 |
| cross-view-geo-localisation-on-cvusa-90 | CVFT | Top-1: 4.8 |
| cross-view-geo-localisation-on-cvusa-90 | DSM | Top-1: 33.66 |
| cross-view-geo-localisation-on-cvusa-90 | TransGeo | Top-1%: 86.8 Top-10: 56.49 Top-5: 45.35 |
| cross-view-geo-localisation-on-cvusa-90 | GeoDTR | Top-1: 15.21 Top-10: 52.27 Top-5: 39.32 |
| cross-view-geo-localisation-on-cvusa-90 | BEV-CV | Top-1: 32.11 Top-1%: 92.99 Top-10: 69.06 Top-5: 58.36 |
| cross-view-geo-localisation-on-cvusa-90 | TransGeo [Zhu2022TransGeoTI] | Top-1: 21.96 |
| cross-view-geo-localisation-on-cvusa-90 | CVM | Top-1: 2.76 |
| cross-view-geo-localisation-on-cvusa-90 | DSM [Shi2020WhereAI] | R@5: 51.7 |
| cross-view-geo-localisation-on-cvusa-90 | GeoDTR [zhang2023crossview] | Top-1%: 88.72 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.