HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

MSTA3D: Multi-scale Twin-attention for 3D Instance Segmentation

Tran Duc Dang Trung ; Kang Byeongkeun ; Lee Yeejin

MSTA3D: Multi-scale Twin-attention for 3D Instance Segmentation

Abstract

Recently, transformer-based techniques incorporating superpoints have becomeprevalent in 3D instance segmentation. However, they often encounter anover-segmentation problem, especially noticeable with large objects.Additionally, unreliable mask predictions stemming from superpoint maskprediction further compound this issue. To address these challenges, we proposea novel framework called MSTA3D. It leverages multi-scale featurerepresentation and introduces a twin-attention mechanism to effectively capturethem. Furthermore, MSTA3D integrates a box query with a box regularizer,offering a complementary spatial constraint alongside semantic queries.Experimental evaluations on ScanNetV2, ScanNet200 and S3DIS datasetsdemonstrate that our approach surpasses state-of-the-art 3D instancesegmentation methods.

Benchmarks

BenchmarkMethodologyMetrics
3d-instance-segmentation-on-s3disMSTA3D
AP@50: 70.0
mPrec: 80.6
mRec: 70.1
3d-instance-segmentation-on-scannet200MSTA3D
mAP: 26.2
mAP@25: 40.1
mAP@50: 35.2
3d-instance-segmentation-on-scannetv2MSTA3D
mAP: 56.9
mAP @ 50: 79.5
mAP@25: 87.9
mRec: 74.1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
MSTA3D: Multi-scale Twin-attention for 3D Instance Segmentation | Papers | HyperAI