HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

iDisc: Internal Discretization for Monocular Depth Estimation

Luigi Piccinelli; Christos Sakaridis; Fisher Yu

iDisc: Internal Discretization for Monocular Depth Estimation

Abstract

Monocular depth estimation is fundamental for 3D scene understanding and downstream applications. However, even under the supervised setup, it is still challenging and ill-posed due to the lack of full geometric constraints. Although a scene can consist of millions of pixels, there are fewer high-level patterns. We propose iDisc to learn those patterns with internal discretized representations. The method implicitly partitions the scene into a set of high-level patterns. In particular, our new module, Internal Discretization (ID), implements a continuous-discrete-continuous bottleneck to learn those concepts without supervision. In contrast to state-of-the-art methods, the proposed model does not enforce any explicit constraints or priors on the depth output. The whole network with the ID module can be trained end-to-end, thanks to the bottleneck module based on attention. Our method sets the new state of the art with significant improvements on NYU-Depth v2 and KITTI, outperforming all published methods on the official KITTI benchmark. iDisc can also achieve state-of-the-art results on surface normal estimation. Further, we explore the model generalization capability via zero-shot testing. We observe the compelling need to promote diversification in the outdoor scenario. Hence, we introduce splits of two autonomous driving datasets, DDAD and Argoverse. Code is available at http://vis.xyz/pub/idisc .

Code Repositories

SysCV/idisc
Official
pytorch
Mentioned in GitHub
lpiccinelli-eth/unidepth
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
monocular-depth-estimation-on-kitti-eigeniDisc
Delta u003c 1.25: 0.977
Delta u003c 1.25^2: 0.997
Delta u003c 1.25^3: 0.999
RMSE: 2.067
RMSE log: 0.077
Sq Rel: 0.145
absolute relative error: 0.050
monocular-depth-estimation-on-nyu-depth-v2iDisc
Delta u003c 1.25^2: 0.993
Delta u003c 1.25^3: 0.999
absolute relative error: 0.086
surface-normals-estimation-on-nyu-depth-v2-1iDisc
% u003c 11.25: 63.8
% u003c 22.5: 79.8
% u003c 30: 85.6
Mean Angle Error: 14.6
RMSE: 22.8

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp