5 months ago

OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation

Huang Zhening ; Wu Xiaoyang ; Chen Xi ; Zhao Hengshuang ; Zhu Lei ; Lasenby Joan

Abstract

In this work, we introduce OpenIns3D, a new 3D-input-only framework for 3D open-vocabulary scene understanding. The OpenIns3D framework employs a "Mask-Snap-Lookup" scheme. The "Mask" module learns class-agnostic mask proposals in 3D point clouds, the "Snap" module generates synthetic scene-level images at multiple scales and leverages 2D vision-language models to extract interesting objects, and the "Lookup" module searches through the outcomes of "Snap" to assign category names to the proposed masks. This approach, though simple, achieves state-of-the-art performance across a wide range of 3D open-vocabulary tasks, including recognition, object detection, and instance segmentation, on both indoor and outdoor datasets. Moreover, OpenIns3D facilitates effortless switching between different 2D detectors without requiring retraining. When integrated with powerful 2D open-world models, it achieves excellent results in scene understanding tasks. Furthermore, when combined with LLM-powered 2D models, OpenIns3D exhibits an impressive capability to comprehend and process highly complex text queries that demand intricate reasoning and real-world knowledge. Project page: https://zheninghuang.github.io/OpenIns3D/
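
To make the "Lookup" idea concrete, here is a minimal sketch of one way category names could be assigned to mask proposals from 2D detections. It is not the authors' implementation: the abstract does not specify the matching rule, so this sketch assumes each 3D mask has already been projected to a 2D box in every snapshot and uses a simple IoU-weighted vote. The names `box_iou`, `lookup`, `projected`, `detections`, and `iou_threshold` are all hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Optional, Tuple

# Assumed box format: (x_min, y_min, x_max, y_max) in snapshot pixels.
Box = Tuple[float, float, float, float]


def box_iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned 2D boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def lookup(
    projected: Dict[int, Dict[str, Box]],
    detections: Dict[str, List[Tuple[str, Box]]],
    iou_threshold: float = 0.5,
) -> Dict[int, Optional[str]]:
    """Assign a category name to each mask proposal (hypothetical rule).

    projected:  mask id -> {snapshot id -> 2D box of the projected 3D mask}
    detections: snapshot id -> list of (category name, 2D box) from a 2D detector
    """
    labels: Dict[int, Optional[str]] = {}
    for mask_id, views in projected.items():
        votes: Dict[str, float] = defaultdict(float)
        for view_id, mask_box in views.items():
            # Find the detection that best overlaps this mask in this snapshot.
            best_label, best_iou = None, 0.0
            for label, det_box in detections.get(view_id, []):
                iou = box_iou(mask_box, det_box)
                if iou > best_iou:
                    best_label, best_iou = label, iou
            # Only sufficiently overlapping detections cast a vote.
            if best_label is not None and best_iou >= iou_threshold:
                votes[best_label] += best_iou
        # Masks that match nothing stay unlabeled.
        labels[mask_id] = max(votes, key=votes.get) if votes else None
    return labels


# Toy data: three mask proposals seen in two snapshots.
projected = {
    0: {"view_a": (10, 10, 50, 50), "view_b": (20, 20, 60, 60)},
    1: {"view_a": (100, 100, 140, 160)},
    2: {"view_b": (200, 200, 220, 220)},
}
detections = {
    "view_a": [("chair", (12, 12, 50, 52)), ("table", (95, 100, 140, 158))],
    "view_b": [("chair", (20, 18, 60, 60))],
}

result = lookup(projected, detections)
print(result)  # {0: 'chair', 1: 'table', 2: None}
```

Because the 2D detector only appears here as a dictionary of labeled boxes, replacing it with a different detector changes the input data and not the lookup code, which mirrors the abstract's point about switching 2D detectors without retraining.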

Code Repositories

Pointcept/OpenIns3D (Official, PyTorch, mentioned in GitHub)

Benchmarks

Benchmark | Methodology | Metrics
3d-open-vocabulary-instance-segmentation-on | OpenIns3D (3d only) | AP Common: 6.5; AP Head: 16.0; AP Tail: 4.2; AP25: 14.4; AP50: 10.3; mAP: 8.8
3d-open-vocabulary-instance-segmentation-on | OpenIns3D | AP Common: 14.2; AP Head: 19.2; AP Tail: 14.2; AP25: 23.3; AP50: 20.6; mAP: 15.9
3d-open-vocabulary-instance-segmentation-on-1 | OpenIns3D | mAP: 15.4
3d-open-vocabulary-instance-segmentation-on-1 | OpenIns3D (with rgbd) | mAP: 21.1
3d-open-vocabulary-instance-segmentation-on-2 | OpenIns3D | AP50 Novel B6/N6: 33.0; AP50 Novel B8/N4: 37.0
3d-open-vocabulary-instance-segmentation-on-3 | OPENINS3D | AP50: 13.3
