3 months ago

Is Pseudo-Lidar needed for Monocular 3D Object detection?

Dennis Park Rares Ambrus Vitor Guizilini Jie Li Adrien Gaidon

Abstract

Recent progress in 3D object detection from single images leverages monocular depth estimation as a way to produce 3D pointclouds, turning cameras into pseudo-lidar sensors. These two-stage detectors improve with the accuracy of the intermediate depth estimation network, which can itself be improved without manual labels via large-scale self-supervised learning. However, they tend to suffer from overfitting more than end-to-end methods, are more complex, and the gap with similar lidar-based detectors remains significant. In this work, we propose an end-to-end, single stage, monocular 3D object detector, DD3D, that can benefit from depth pre-training like pseudo-lidar methods, but without their limitations. Our architecture is designed for effective information transfer between depth estimation and 3D detection, allowing us to scale with the amount of unlabeled pre-training data. Our method achieves state-of-the-art results on two challenging benchmarks, with 16.34% and 9.28% AP for Cars and Pedestrians (respectively) on the KITTI-3D benchmark, and 41.5% mAP on NuScenes.

Code Repositories

tri-ml/vedet

pytorch

Mentioned in GitHub

tri-ml/dd3d

Official

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
monocular-3d-object-detection-on-kitti-1	DD3D	AP Hard: 8.05
monocular-3d-object-detection-on-kitti-3	DD3D	AP Easy: 13.91
monocular-3d-object-detection-on-kitti-4	DD3D	AP Medium: 9.30
monocular-3d-object-detection-on-kitti-cars	DD3D	AP Medium: 16.34
monocular-3d-object-detection-on-kitti-cars-1	DD3D	AP Hard: 14.20
monocular-3d-object-detection-on-kitti-cars-2	DD3D	AP Easy: 23.22

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette