Command Palette
Search for a command to run...
Zetong Yang; Yanan Sun; Shu Liu; Xiaoyong Shen; Jiaya Jia

Abstract
We present a novel 3D object detection framework, named IPOD, based on raw point cloud. It seeds object proposal for each point, which is the basic element. This paradigm provides us with high recall and high fidelity of information, leading to a suitable way to process point cloud data. We design an end-to-end trainable architecture, where features of all points within a proposal are extracted from the backbone network and achieve a proposal feature for final bounding inference. These features with both context information and precise point cloud coordinates yield improved performance. We conduct experiments on KITTI dataset, evaluating our performance in terms of 3D object detection, Bird's Eye View (BEV) detection and 2D object detection. Our method accomplishes new state-of-the-art , showing great advantage on the hard set.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-object-detection-on-kitti-cars-easy | IPOD | AP: 79.75% |
| 3d-object-detection-on-kitti-cars-hard | IPOD | AP: 66.33% |
| 3d-object-detection-on-kitti-cyclists | IPOD | AP: 53.46% |
| 3d-object-detection-on-kitti-cyclists-easy | IPOD | AP: 71.40% |
| 3d-object-detection-on-kitti-cyclists-hard | IPOD | AP: 48.34% |
| 3d-object-detection-on-kitti-pedestrians | IPOD | AP: 44.68% |
| 3d-object-detection-on-kitti-pedestrians-easy | IPOD | AP: 56.92% |
| 3d-object-detection-on-kitti-pedestrians-hard | IPOD | AP: 42.39% |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.