HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

OmniCount: Multi-label Object Counting with Semantic-Geometric Priors

Anindya Mondal; Sauradip Nag; Xiatian Zhu; Anjan Dutta

OmniCount: Multi-label Object Counting with Semantic-Geometric Priors

Abstract

Object counting is pivotal for understanding the composition of scenes. Previously, this task was dominated by class-specific methods, which have gradually evolved into more adaptable class-agnostic strategies. However, these strategies come with their own set of limitations, such as the need for manual exemplar input and multiple passes for multiple categories, resulting in significant inefficiencies. This paper introduces a more practical approach enabling simultaneous counting of multiple object categories using an open-vocabulary framework. Our solution, OmniCount, stands out by using semantic and geometric insights (priors) from pre-trained models to count multiple categories of objects as specified by users, all without additional training. OmniCount distinguishes itself by generating precise object masks and leveraging varied interactive prompts via the Segment Anything Model for efficient counting. To evaluate OmniCount, we created the OmniCount-191 benchmark, a first-of-its-kind dataset with multi-label object counts, including points, bounding boxes, and VQA annotations. Our comprehensive evaluation in OmniCount-191, alongside other leading benchmarks, demonstrates OmniCount's exceptional performance, significantly outpacing existing solutions. The project webpage is available at https://mondalanindya.github.io/OmniCount.

Benchmarks

BenchmarkMethodologyMetrics
object-counting-on-fsc147Omnicount (Open vocabulary, multi-label, without training)
MAE(test): 18.63
RMSE(test): 112
object-counting-on-omnicount-191Omnicount
mRMSE: 0.0023
object-counting-on-pascal-voc-2007-count-testOmnicount
mRMSE: 0.0023
mRMSE-nz: 0.009
training-free-object-counting-on-fsc147Omnicount
MAE: 18.63
training-free-object-counting-on-omnicountOmnicount
mRMSE: 0.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp