Command Palette
Search for a command to run...
Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging
S. Mahdi H. Miangoleh Sebastian Dille Long Mai Sylvain Paris Yağız Aksoy

Abstract
Neural networks have shown great abilities in estimating depth from a single image. However, the inferred depth maps are well below one-megapixel resolution and often lack fine-grained details, which limits their practicality. Our method builds on our analysis on how the input resolution and the scene structure affects depth estimation performance. We demonstrate that there is a trade-off between a consistent scene structure and the high-frequency details, and merge low- and high-resolution estimations to take advantage of this duality using a simple depth merging network. We present a double estimation method that improves the whole-image depth estimation and a patch selection method that adds local details to the final result. We demonstrate that by merging estimations at different resolutions with changing context, we can generate multi-megapixel depth maps with a high level of detail using a pre-trained model.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| monocular-depth-estimation-on-ibims-1 | Miangoleh et al. (SGR) | D3R: 0.3222 ORD: 0.3938 RMSE: 0.1598 δ1.25: 0.6390 |
| monocular-depth-estimation-on-ibims-1 | Miangoleh et al. (MiDaS) | D3R: 0.4671 ORD: 0.5538 RMSE: 0.1965 δ1.25: 0.7460 |
| monocular-depth-estimation-on-middlebury-2014 | Miangoleh et al. (MiDaS) | D3R: 0.1578 ORD : 0.3467 RMSE: 0.1557 δ1.25: 0.7406 |
| monocular-depth-estimation-on-middlebury-2014 | Miangoleh et al. (SGR) | D3R: 0.2324 ORD : 0.3879 RMSE: 0.1973 δ1.25: 0.7891 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.