HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Digging Into Self-Supervised Monocular Depth Estimation

Clément Godard; Oisin Mac Aodha; Michael Firman; Gabriel Brostow

Digging Into Self-Supervised Monocular Depth Estimation

Abstract

Per-pixel ground-truth depth data is challenging to acquire at scale. To overcome this limitation, self-supervised learning has emerged as a promising alternative for training models to perform monocular depth estimation. In this paper, we propose a set of improvements, which together result in both quantitatively and qualitatively improved depth maps compared to competing self-supervised methods. Research on self-supervised monocular training usually explores increasingly complex architectures, loss functions, and image formation models, all of which have recently helped to close the gap with fully-supervised methods. We show that a surprisingly simple model, and associated design choices, lead to superior predictions. In particular, we propose (i) a minimum reprojection loss, designed to robustly handle occlusions, (ii) a full-resolution multi-scale sampling method that reduces visual artifacts, and (iii) an auto-masking loss to ignore training pixels that violate camera motion assumptions. We demonstrate the effectiveness of each component in isolation, and show high quality, state-of-the-art results on the KITTI benchmark.

Code Repositories

FangGet/tf-monodepth2
tf
Mentioned in GitHub
XXXVincent/MonoDepth2
pytorch
Mentioned in GitHub
TWJianNuo/panoptic-scene-understanding
pytorch
Mentioned in GitHub
minghanz/DepthC3D
pytorch
Mentioned in GitHub
isennkubilay/monodepth2_tf
tf
Mentioned in GitHub
nianticlabs/monodepth2
Official
pytorch
Mentioned in GitHub
rnlee1998/SRD
pytorch
Mentioned in GitHub
IcarusWizard/monodepth2-paddle
paddle
Mentioned in GitHub
CaptainEven/MonoDepthV2
pytorch
Mentioned in GitHub
qrzyang/pseudo-stereo
pytorch
Mentioned in GitHub
TanyaChutani/Monodepth-Tf2.x
tf
Mentioned in GitHub
jzwqaq/monodepth_jzw
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
camera-pose-estimation-on-kitti-odometryMonodepth2
Absolute Trajectory Error [m]: 93.04
Average Rotational Error er[%]: 20.72
Average Translational Error et[%]: 43.21
monocular-depth-estimation-on-kitti-eigenmonodepth2 M
absolute relative error: 0.106
monocular-depth-estimation-on-make3dMonodepth2
Abs Rel: 0.322
RMSE: 7.417
Sq Rel: 3.589
monocular-depth-estimation-on-mid-air-datasetMonodepth2
Abs Rel: 0.717
RMSE: 74.552
RMSE log: 0.882
SQ Rel: 37.164
monocular-depth-estimation-on-vaMonoDepth2
Absolute relative error (AbsRel): 0.203
Log root mean square error (RMSE_log): 0.251
Mean average error (MAE) : 0.295
Root mean square error (RMSE): 0.432

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp