HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Self-Supervised Multi-Frame Monocular Scene Flow

Junhwa Hur Stefan Roth

Self-Supervised Multi-Frame Monocular Scene Flow

Abstract

Estimating 3D scene flow from a sequence of monocular images has been gaining increased attention due to the simple, economical capture setup. Owing to the severe ill-posedness of the problem, the accuracy of current methods has been limited, especially that of efficient, real-time approaches. In this paper, we introduce a multi-frame monocular scene flow network based on self-supervised learning, improving the accuracy over previous networks while retaining real-time efficiency. Based on an advanced two-frame baseline with a split-decoder design, we propose (i) a multi-frame model using a triple frame input and convolutional LSTM connections, (ii) an occlusion-aware census loss for better accuracy, and (iii) a gradient detaching strategy to improve training stability. On the KITTI dataset, we observe state-of-the-art accuracy among monocular scene flow methods based on self-supervised learning.

Code Repositories

visinf/multi-mono-sf
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
scene-flow-estimation-on-kitti-2015-sceneMulti-Mono-SF
Runtime (s): 0.063
D1-all: 27.33
D2-all: 30.44
Fl-all: 18.92
SF-all: 39.82
scene-flow-estimation-on-kitti-2015-scene-1Multi-Mono-SF
D1-all: 30.78
D2-all: 34.41
Fl-all: 19.54
Runtime (s): 0.063
SF-all: 44.04

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp