Command Palette
Search for a command to run...
Chi Xie Fangao Zeng Yue Hu Shuang Liang Yichen Wei

Abstract
Unlike most previous HOI methods that focus on learning better human-object features, we propose a novel and complementary approach called category query learning. Such queries are explicitly associated to interaction categories, converted to image specific category representation via a transformer decoder, and learnt via an auxiliary image-level classification task. This idea is motivated by an earlier multi-label image classification method, but is for the first time applied for the challenging human-object interaction classification task. Our method is simple, general and effective. It is validated on three representative HOI baselines and achieves new state-of-the-art results on two benchmarks.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| human-object-interaction-detection-on-hico | CQL+GEN-VLKT-L | mAP: 36.03 |
| human-object-interaction-detection-on-hico | CQL+GEN-VLKT-B | mAP: 35.36 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.