HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

Liu Yingfei ; Yan Junjie ; Jia Fan ; Li Shuailin ; Gao Aqi ; Wang Tiancai ; Zhang Xiangyu ; Sun Jian

PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images

Abstract

In this paper, we propose PETRv2, a unified framework for 3D perception frommulti-view images. Based on PETR, PETRv2 explores the effectiveness of temporalmodeling, which utilizes the temporal information of previous frames to boost3D object detection. More specifically, we extend the 3D position embedding (3DPE) in PETR for temporal modeling. The 3D PE achieves the temporal alignment onobject position of different frames. A feature-guided position encoder isfurther introduced to improve the data adaptability of 3D PE. To support formulti-task learning (e.g., BEV segmentation and 3D lane detection), PETRv2provides a simple yet effective solution by introducing task-specific queries,which are initialized under different spaces. PETRv2 achieves state-of-the-artperformance on 3D object detection, BEV segmentation and 3D lane detection.Detailed robustness analysis is also conducted on PETR framework. We hopePETRv2 can serve as a strong baseline for 3D perception. Code is available at\url{https://github.com/megvii-research/PETR}.

Code Repositories

megvii-research/petr
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-lane-detection-on-openlanePETRv2-V∗ (VoVNetV2 with 400 anchor points)
Curve: -
Extreme Weather: -
F1 (all): 61.2
FPS (pytorch): -
Intersection: -
Merge u0026 Split: -
Night: -
Up u0026 Down: -
3d-lane-detection-on-openlanePETRv2-E (EfficientNet)
Curve: -
Extreme Weather: -
F1 (all): 51.9
FPS (pytorch): -
Intersection: -
Merge u0026 Split: -
Night: -
Up u0026 Down: -
3d-lane-detection-on-openlanePETRv2-V (VoVNetV2)
Curve: -
Extreme Weather: -
F1 (all): 57.8
FPS (pytorch): -
Intersection: -
Merge u0026 Split: -
Night: -
Up u0026 Down: -
3d-object-detection-on-nuscenes-camera-onlyPETRv2-pure
Future Frame: false
NDS: 59.2
bird-s-eye-view-semantic-segmentation-onPETRv2
IoU lane - 224x480 - 100x100 at 0.5: 44.8

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp