HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

PETR: Position Embedding Transformation for Multi-View 3D Object Detection

Yingfei Liu Tiancai Wang Xiangyu Zhang Jian Sun

PETR: Position Embedding Transformation for Multi-View 3D Object Detection

Abstract

In this paper, we develop position embedding transformation (PETR) for multi-view 3D object detection. PETR encodes the position information of 3D coordinates into image features, producing the 3D position-aware features. Object query can perceive the 3D position-aware features and perform end-to-end object detection. PETR achieves state-of-the-art performance (50.4% NDS and 44.1% mAP) on standard nuScenes dataset and ranks 1st place on the benchmark. It can serve as a simple yet strong baseline for future research. Code is available at \url{https://github.com/megvii-research/PETR}.

Code Repositories

megvii-research/petr
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-object-detection-on-3d-object-detection-onPETR
Average mAP: 17.6
3d-object-detection-on-truckscenesPETR
NDS: 12.1
mAP: 2.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp