HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Where2comm: Communication-Efficient Collaborative Perception via Spatial Confidence Maps

Yue Hu Shaoheng Fang Zixing Lei Yiqi Zhong Siheng Chen

Where2comm: Communication-Efficient Collaborative Perception via Spatial Confidence Maps

Abstract

Multi-agent collaborative perception could significantly upgrade the perception performance by enabling agents to share complementary information with each other through communication. It inevitably results in a fundamental trade-off between perception performance and communication bandwidth. To tackle this bottleneck issue, we propose a spatial confidence map, which reflects the spatial heterogeneity of perceptual information. It empowers agents to only share spatially sparse, yet perceptually critical information, contributing to where to communicate. Based on this novel spatial confidence map, we propose Where2comm, a communication-efficient collaborative perception framework. Where2comm has two distinct advantages: i) it considers pragmatic compression and uses less communication to achieve higher perception performance by focusing on perceptually critical areas; and ii) it can handle varying communication bandwidth by dynamically adjusting spatial areas involved in communication. To evaluate Where2comm, we consider 3D object detection in both real-world and simulation scenarios with two modalities (camera/LiDAR) and two agent types (cars/drones) on four datasets: OPV2V, V2X-Sim, DAIR-V2X, and our original CoPerception-UAVs. Where2comm consistently outperforms previous methods; for example, it achieves more than $100,000 \times$ lower communication volume and still outperforms DiscoNet and V2X-ViT on OPV2V. Our code is available at https://github.com/MediaBrain-SJTU/where2comm.

Code Repositories

mediabrain-sjtu/where2comm
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-object-detection-on-dair-v2xWhere2comm
AP50: 63.71
3d-object-detection-on-v2x-simWhere2comm
mAOE: 0.310
mAP: 19.0
mASE: 0.275
mATE: 0.911
monocular-3d-object-detection-on-coperceptionWhere2comm
AP50: 65.71
monocular-3d-object-detection-on-opv2vWhere2comm
AP50: 47.14

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp