HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes

Cai Zhi ; Gao Yingjie ; Zheng Yaoyan ; Zhou Nan ; Huang Di

Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded
  Scenes

Abstract

In computer vision, object detection is an important task that finds itsapplication in many scenarios. However, obtaining extensive labels can bechallenging, especially in crowded scenes. Recently, the Segment Anything Model(SAM) has been proposed as a powerful zero-shot segmenter, offering a novelapproach to instance segmentation tasks. However, the accuracy and efficiencyof SAM and its variants are often compromised when handling objects in crowdedand occluded scenes. In this paper, we introduce Crowd-SAM, a SAM-basedframework designed to enhance SAM's performance in crowded and occluded sceneswith the cost of few learnable parameters and minimal labeled images. Weintroduce an efficient prompt sampler (EPS) and a part-whole discriminationnetwork (PWD-Net), enhancing mask selection and accuracy in crowded scenes.Despite its simplicity, Crowd-SAM rivals state-of-the-art (SOTA)fully-supervised object detection methods on several benchmarks includingCrowdHuman and CityPersons. Our code is available athttps://github.com/FelixCaae/CrowdSAM.

Code Repositories

felixcaae/crowdsam
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
human-instance-segmentation-on-ochumanCrowd-SAM (ViT-L)
AP: 31.4

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp