HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Fine-Grained Scene Graph Generation with Data Transfer

Ao Zhang; Yuan Yao; Qianyu Chen; Wei Ji; Zhiyuan Liu; Maosong Sun; Tat-Seng Chua

Fine-Grained Scene Graph Generation with Data Transfer

Abstract

Scene graph generation (SGG) is designed to extract (subject, predicate, object) triplets in images. Recent works have made a steady progress on SGG, and provide useful tools for high-level vision and language understanding. However, due to the data distribution problems including long-tail distribution and semantic ambiguity, the predictions of current SGG models tend to collapse to several frequent but uninformative predicates (e.g., on, at), which limits practical application of these models in downstream tasks. To deal with the problems above, we propose a novel Internal and External Data Transfer (IETrans) method, which can be applied in a plug-and-play fashion and expanded to large SGG with 1,807 predicate classes. Our IETrans tries to relieve the data distribution problem by automatically creating an enhanced dataset that provides more sufficient and coherent annotations for all predicates. By training on the enhanced dataset, a Neural Motif model doubles the macro performance while maintaining competitive micro performance. The code and data are publicly available at https://github.com/waxnkw/IETrans-SGG.pytorch.

Code Repositories

rlqja1107/torch-st-sgg
pytorch
Mentioned in GitHub
waxnkw/ietrans-sgg.pytorch
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
scene-graph-generation-on-visual-genomeIETrans
Recall@100: 27.2
Recall@50: 23.5
mean Recall @100: 18.0
unbiased-scene-graph-generation-on-visualIETrans (MOTIFS-ResNeXt-101-FPN backbone; PredCls mode)
F@100: 44.1
mR@20: 28.9
ng-mR@20: 36.0
unbiased-scene-graph-generation-on-visualIETrans (MOTIFS-ResNeXt-101-FPN backbone; SGDet mode)
F@100: 21.7
mR@20: 10.9
ng-mR@20: 13.4
unbiased-scene-graph-generation-on-visualIETrans (MOTIFS-ResNeXt-101-FPN backbone; SGCls mode)
F@100: 26.0
mR@20: 17.5
ng-mR@20: 21.8

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp