HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation

Shore Tavis ; Hadfield Simon ; Mendez Oscar

BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation

Abstract

Cross-view image matching for geo-localisation is a challenging problem dueto the significant visual difference between aerial and ground-levelviewpoints. The method provides localisation capabilities from geo-referencedimages, eliminating the need for external devices or costly equipment. Thisenhances the capacity of agents to autonomously determine their position,navigate, and operate effectively in GNSS-denied environments. Current researchemploys a variety of techniques to reduce the domain gap such as applying polartransforms to aerial images or synthesising between perspectives. However,these approaches generally rely on having a 360{\deg} field of view, limitingreal-world feasibility. We propose BEV-CV, an approach introducing two keynovelties with a focus on improving the real-world viability of cross-viewgeo-localisation. Firstly bringing ground-level images into a semanticBirds-Eye-View before matching embeddings, allowing for direct comparison withaerial image representations. Secondly, we adapt datasets into applicationrealistic format - limited Field-of-View images aligned to vehicle direction.BEV-CV achieves state-of-the-art recall accuracies, improving Top-1 rates of70{\deg} crops of CVUSA and CVACT by 23% and 24% respectively. Also decreasingcomputational requirements by reducing floating point operations to belowprevious works, and decreasing embedding dimensionality by 33% - togetherallowing for faster localisation capabilities.

Code Repositories

tavisshore/BEV-CV
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
cross-view-geo-localisation-on-cvusa-70BEV-CV
Top-1: 27.4
Top-1%: 90.94
Top-10: 64.47
Top-5: 52.94
cross-view-geo-localisation-on-cvusa-90GAL
Top-1: 22.54
cross-view-geo-localisation-on-cvusa-90L2LTR
Top-1: 25.21
cross-view-geo-localisation-on-cvusa-90L2LTR [Yang2021CrossviewGW]
R@5: 51.9
cross-view-geo-localisation-on-cvusa-90CVFT
Top-1: 4.8
cross-view-geo-localisation-on-cvusa-90DSM
Top-1: 33.66
cross-view-geo-localisation-on-cvusa-90TransGeo
Top-1%: 86.8
Top-10: 56.49
Top-5: 45.35
cross-view-geo-localisation-on-cvusa-90GeoDTR
Top-1: 15.21
Top-10: 52.27
Top-5: 39.32
cross-view-geo-localisation-on-cvusa-90BEV-CV
Top-1: 32.11
Top-1%: 92.99
Top-10: 69.06
Top-5: 58.36
cross-view-geo-localisation-on-cvusa-90TransGeo [Zhu2022TransGeoTI]
Top-1: 21.96
cross-view-geo-localisation-on-cvusa-90CVM
Top-1: 2.76
cross-view-geo-localisation-on-cvusa-90DSM [Shi2020WhereAI]
R@5: 51.7
cross-view-geo-localisation-on-cvusa-90GeoDTR [zhang2023crossview]
Top-1%: 88.72

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp