HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Mask Frozen-DETR: High Quality Instance Segmentation with One GPU

Zhanhao Liang Yuhui Yuan

Mask Frozen-DETR: High Quality Instance Segmentation with One GPU

Abstract

In this paper, we aim to study how to build a strong instance segmenter with minimal training time and GPUs, as opposed to the majority of current approaches that pursue more accurate instance segmenter by building more advanced frameworks at the cost of longer training time and higher GPU requirements. To achieve this, we introduce a simple and general framework, termed Mask Frozen-DETR, which can convert any existing DETR-based object detection model into a powerful instance segmentation model. Our method only requires training an additional lightweight mask network that predicts instance masks within the bounding boxes given by a frozen DETR-based object detector. Remarkably, our method outperforms the state-of-the-art instance segmentation method Mask DINO in terms of performance on the COCO test-dev split (55.3% vs. 54.7%) while being over 10X times faster to train. Furthermore, all of our experiments can be trained using only one Tesla V100 GPU with 16 GB of memory, demonstrating the significant efficiency of our proposed framework.

Benchmarks

BenchmarkMethodologyMetrics
instance-segmentation-on-cocoMask Frozen-DETR
AP50: 79.3
AP75: 61.4
APL: 70.4
APM: 58.4
APS: 37.8
mask AP: 55.3
instance-segmentation-on-coco-minivalMask Frozen-DETR
AP50: 78.9
AP75: 60.8
APL: 72.9
APM: 58.4
APS: 37.2
mask AP: 54.9

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Mask Frozen-DETR: High Quality Instance Segmentation with One GPU | Papers | HyperAI