3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans

Hou Ji ; Dai Angela ; Nießner Matthias

Abstract

We introduce 3D-SIS, a novel neural network architecture for 3D semantic instance segmentation in commodity RGB-D scans. The core idea of our method is to jointly learn from both geometric and color signal, thus enabling accurate instance predictions. Rather than operate solely on 2D frames, we observe that most computer vision applications have multi-view RGB-D input available, which we leverage to construct an approach for 3D instance segmentation that effectively fuses together these multi-modal inputs. Our network leverages high-resolution RGB input by associating 2D images with the volumetric grid based on the pose alignment of the 3D reconstruction. For each image, we first extract 2D features for each pixel with a series of 2D convolutions; we then backproject the resulting feature vector to the associated voxel in the 3D grid. This combination of 2D and 3D feature learning allows significantly higher accuracy object detection and instance segmentation than state-of-the-art alternatives. We show results on both synthetic and real-world public benchmarks, achieving an improvement in mAP of over 13 on real-world data.
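
The backprojection step described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `backproject_features`, its parameters, the pinhole camera model, and the choice to average features that land in the same voxel are all assumptions made for illustration. It takes per-pixel features, a depth map, camera intrinsics `K`, and a camera-to-world pose, and scatters each pixel's feature vector into the voxel that its 3D point falls in.

```python
import numpy as np

def backproject_features(feats, depth, K, cam_to_world,
                         grid_origin, voxel_size, grid_dims):
    """Scatter per-pixel 2D features into a voxel grid (hypothetical helper).

    feats:        (H, W, C) per-pixel feature vectors
    depth:        (H, W) depth along the camera z-axis; <= 0 marks invalid pixels
    K:            (3, 3) pinhole intrinsics (assumed camera model)
    cam_to_world: (4, 4) camera pose from the 3D reconstruction
    Returns the (X, Y, Z, C) feature grid and the (X, Y, Z) hit counts.
    """
    H, W, C = feats.shape
    v, u = np.meshgrid(np.arange(H), np.arange(W), indexing="ij")
    valid = depth > 0

    # Lift every pixel to a 3D point in the camera frame.
    x = (u - K[0, 2]) * depth / K[0, 0]
    y = (v - K[1, 2]) * depth / K[1, 1]
    pts_cam = np.stack([x, y, depth, np.ones_like(depth)], axis=-1)[valid]

    # Move the points into the world frame using the camera pose.
    pts_world = (cam_to_world @ pts_cam.T).T[:, :3]

    # Convert world coordinates to integer voxel indices and drop outliers.
    idx = np.floor((pts_world - np.asarray(grid_origin)) / voxel_size).astype(int)
    inside = np.all((idx >= 0) & (idx < np.asarray(grid_dims)), axis=1)
    idx = idx[inside]
    pix_feats = feats[valid][inside]

    # Accumulate features per voxel, then average (assumption: mean pooling).
    grid = np.zeros((*grid_dims, C), dtype=np.float64)
    count = np.zeros(grid_dims, dtype=np.int64)
    where = (idx[:, 0], idx[:, 1], idx[:, 2])
    np.add.at(grid, where, pix_feats)
    np.add.at(count, where, 1)
    hit = count > 0
    grid[hit] /= count[hit][:, None]
    return grid, count
```

With several views, one would call this once per image and pool the resulting grids; how views are pooled is a design choice that the abstract does not specify.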

Code Repositories

Sekunde/3D-SIS (official; PyTorch; mentioned in GitHub)

Benchmarks

Benchmark                                 Methodology   Metrics
3d-instance-segmentation-on-scannetv2     3D-SIS        mAP @ 50: 38.2
3d-object-detection-on-scannetv2          3D-SIS        mAP@0.25: 40.2; mAP@0.5: 22.5
3d-semantic-instance-segmentation-on-1    3D-SIS        mAP@0.50: 38.2
