5 months ago

End-to-End Learning of Geometry and Context for Deep Stereo Regression

Alex Kendall; Hayk Martirosyan; Saumitro Dasgupta; Peter Henry; Ryan Kennedy; Abraham Bachrach; Adam Bry

Abstract

We propose a novel deep learning architecture for regressing disparity from a rectified pair of stereo images. We leverage knowledge of the problem's geometry to form a cost volume using deep feature representations. We learn to incorporate contextual information using 3-D convolutions over this volume. Disparity values are regressed from the cost volume using a proposed differentiable soft argmin operation, which allows us to train our method end-to-end to sub-pixel accuracy without any additional post-processing or regularization. We evaluate our method on the Scene Flow and KITTI datasets and on KITTI we set a new state-of-the-art benchmark, while being significantly faster than competing approaches.

Code Repositories

laoreja/CS231A-project-Stereo-matching

Mentioned in GitHub

zyf12389/GC-Net

pytorch

Mentioned in GitHub

EnriqueSolarte/GC-Net-tensorflow

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
stereo-lidar-fusion-on-kitti-depth-completion	GCNet	RMSE: 1031.4

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

End-to-End Learning of Geometry and Context for Deep Stereo Regression

Alex Kendall; Hayk Martirosyan; Saumitro Dasgupta; Peter Henry; Ryan Kennedy; Abraham Bachrach; Adam Bry

Abstract

Code Repositories

Benchmarks

Build AI with AI

Hyper Newsletters