Command Palette
Search for a command to run...
BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection
Yinhao Li Zheng Ge Guanyi Yu Jinrong Yang Zengran Wang Yukang Shi Jianjian Sun Zeming Li

Abstract
In this research, we propose a new 3D object detector with a trustworthy depth estimation, dubbed BEVDepth, for camera-based Bird's-Eye-View (BEV) 3D object detection. Our work is based on a key observation -- depth estimation in recent approaches is surprisingly inadequate given the fact that depth is essential to camera 3D detection. Our BEVDepth resolves this by leveraging explicit depth supervision. A camera-awareness depth estimation module is also introduced to facilitate the depth predicting capability. Besides, we design a novel Depth Refinement Module to counter the side effects carried by imprecise feature unprojection. Aided by customized Efficient Voxel Pooling and multi-frame mechanism, BEVDepth achieves the new state-of-the-art 60.9% NDS on the challenging nuScenes test set while maintaining high efficiency. For the first time, the NDS score of a camera model reaches 60%.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-object-detection-on-dair-v2x-i | BEVDepth | AP|R40(easy): 75.7 AP|R40(hard): 63.7 AP|R40(moderate): 63.6 |
| 3d-object-detection-on-nuscenes-camera-only | BEVDepth-pure | Future Frame: false NDS: 60.9 |
| 3d-object-detection-on-rope3d | BEVDepth | AP@0.7: 42.56 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.