Command Palette
Search for a command to run...
Geolocation Estimation of Photos using a Hierarchical Model and Scene Classification
{Kader Pustu-Iren Eric Muller-Budack Ralph Ewerth}

Abstract
While the successful estimation of a photo's geolocation enables a number of interesting applications, it is also a very challenging task. Due to the complexity of the problem, most existing approaches are restricted to specific areas, imagery, or worldwide landmarks. Only a few proposals predict GPS coordinates without any limitations. In this paper, we introduce several deep learning methods, which pursue the latter approach and treat geolocalization as a classification problem where the earth is subdivided into geographical cells. We propose to exploit hierarchical knowledge of multiple partitionings and additionally extract and take the photo's scene content into account, i.e., indoor, natural, or urban setting etc. As a result, contextual information at different spatial resolutions as well as more specific features for various environmental settings are incorporated in the learning process of the convolutional neural network. Experimental results on two benchmarks demonstrate the effectiveness of our approach outperforming the state of the art while using a significant lower number of training images and without relying on retrieval methods that require an appropriate reference dataset.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| photo-geolocation-estimation-on-gws15k | ISNs (M, f*, S3) | City level (25 km): 0.6 Continent level (2500 km): 38.5 Country level (750 km): 15.5 Region level (200 km): 4.2 Street level (1 km): 0.05 |
| photo-geolocation-estimation-on-im2gps | base (L, m) | City level (25 km): 35.0 Continent level (2500 km): 79.7 Country level (750 km): 64.1 Reference images: 0 Region level (200 km): 49.8 Street level (1 km): 13.5 Training images: 4.7M |
| photo-geolocation-estimation-on-im2gps | ISNs (M, f*, S3) | City level (25 km): 43.0 Continent level (2500 km): 80.2 Country level (750 km): 66.7 Reference images: 0 Region level (200 km): 51.9 Street level (1 km): 16.9 Training images: 4.7M |
| photo-geolocation-estimation-on-im2gps | base (M, f*) | City level (25 km): 40.9 Continent level (2500 km): 78.5 Country level (750 km): 65.4 Reference images: 0 Region level (200 km): 51.5 Street level (1 km): 15.2 Training images: 4.7M |
| photo-geolocation-estimation-on-im2gps3k | ISNs (M, f*, S3) | City level (25 km): 28.0 Continent level (2500 km): 66.0 Country level (750 km): 49.7 Region level (200 km): 36.6 Street level (1 km): 10.5 Training Images: 4.7M |
| photo-geolocation-estimation-on-im2gps3k | base (M, f*) | City level (25 km): 27.0 Continent level (2500 km): 66.0 Country level (750 km): 49.2 Region level (200 km): 35.6 Street level (1 km): 9.7 Training Images: 4.7M |
| photo-geolocation-estimation-on-im2gps3k | base (L, m) | City level (25 km): 24.9 Continent level (2500 km): 65.8 Country level (750 km): 48.8 Region level (200 km): 34.0 Street level (1 km): 8.3 Training Images: 4.7M |
| photo-geolocation-estimation-on-yfcc26k | ISNs (M, f*, S3) | City level (25 km): 12.3 Continent level (2500 km): 50.7 Country level (750 km): 31.9 Region level (200 km): 19.0 Street level (1 km): 5.3 Training Images: 4.7M |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.