HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Open-world Text-specified Object Counting

Niki Amini-Naieni Kiana Amini-Naieni Tengda Han Andrew Zisserman

Open-world Text-specified Object Counting

Abstract

Our objective is open-world object counting in images, where the target object class is specified by a text description. To this end, we propose CounTX, a class-agnostic, single-stage model using a transformer decoder counting head on top of pre-trained joint text-image representations. CounTX is able to count the number of instances of any class given only an image and a text description of the target object class, and can be trained end-to-end. In addition to this model, we make the following contributions: (i) we compare the performance of CounTX to prior work on open-world object counting, and show that our approach exceeds the state of the art on all measures on the FSC-147 benchmark for methods that use text to specify the task; (ii) we present and release FSC-147-D, an enhanced version of FSC-147 with text descriptions, so that object classes can be described with more detailed language than their simple class names. FSC-147-D and the code are available at https://www.robots.ox.ac.uk/~vgg/research/countx.

Code Repositories

niki-amini-naieni/countx
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
object-counting-on-carpkCounTX (uses arbitrary text input to specify object to count, used "the cars" for CARPK)
MAE: 8.13
RMSE: 10.87
object-counting-on-fsc147CounTX (uses text descriptions instead of visual exemplars)
MAE(test): 15.88
MAE(val): 17.10
RMSE(test): 106.29
RMSE(val): 65.61

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp