ZeroDiff: Solidified Visual-Semantic Correlation in Zero-Shot Learning

Ye Zihan; Gowda Shreyank N.; Huang Xiaowei; Xu Haotian; Jin Yaochu; Huang Kaizhu; Jin Xiaobo

Abstract

Zero-shot Learning (ZSL) aims to enable classifiers to identify unseen classes. This is typically achieved by generating visual features for unseen classes based on learned visual-semantic correlations from seen classes. However, most current generative approaches heavily rely on having a sufficient number of samples from seen classes. Our study reveals that a scarcity of seen class samples results in a marked decrease in performance across many generative ZSL techniques. We argue, quantify, and empirically demonstrate that this decline is largely attributable to spurious visual-semantic correlations. To address this issue, we introduce ZeroDiff, an innovative generative framework for ZSL that incorporates diffusion mechanisms and contrastive representations to enhance visual-semantic correlations. ZeroDiff comprises three key components: (1) Diffusion augmentation, which naturally transforms limited data into an expanded set of noised data to mitigate generative model overfitting; (2) Supervised-contrastive (SC)-based representations that dynamically characterize each limited sample to support visual feature generation; and (3) Multiple feature discriminators employing a Wasserstein-distance-based mutual learning approach, evaluating generated features from various perspectives, including pre-defined semantics, SC-based representations, and the diffusion process. Extensive experiments on three popular ZSL benchmarks demonstrate that ZeroDiff not only achieves significant improvements over existing ZSL methods but also maintains robust performance even with scarce training data. Our codes are available at https://github.com/FouriYe/ZeroDiff_ICLR25.
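To make component (1) concrete, the sketch below shows how a few feature vectors can be expanded into a larger set of noised copies. It assumes a generic DDPM-style forward process with a linear beta schedule, which the abstract does not specify. The function names, the schedule values, and the number of steps are all hypothetical, and this is not the authors' implementation (see their repository for that).

```python
import math
import random


def linear_alpha_bars(num_steps, beta_start=1e-4, beta_end=0.02):
    """Cumulative products of (1 - beta_t) for a linear beta schedule (assumed)."""
    alpha_bars = []
    running = 1.0
    for t in range(num_steps):
        beta = beta_start + (beta_end - beta_start) * t / max(num_steps - 1, 1)
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars


def noise_feature(x0, alpha_bar, rng):
    """Sample x_t = sqrt(alpha_bar) * x0 + sqrt(1 - alpha_bar) * eps, eps ~ N(0, I)."""
    keep = math.sqrt(alpha_bar)
    spread = math.sqrt(1.0 - alpha_bar)
    return [keep * v + spread * rng.gauss(0.0, 1.0) for v in x0]


def augment(features, num_steps=4, seed=0):
    """Return (timestep, noised feature) pairs: every sample at every noise level."""
    rng = random.Random(seed)
    alpha_bars = linear_alpha_bars(num_steps)
    return [
        (t, noise_feature(x0, alpha_bars[t], rng))
        for x0 in features
        for t in range(num_steps)
    ]


if __name__ == "__main__":
    # Two toy 3-dimensional "visual features" (hypothetical values).
    toy_features = [[1.0, 2.0, 3.0], [0.5, -0.5, 0.0]]
    for t, x_t in augment(toy_features):
        print(t, [round(v, 3) for v in x_t])
```

Each original sample yields `num_steps` noised variants, so the training set grows by that factor while remaining tied to the same class semantics.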

Benchmarks

Benchmark | Methodology | Metrics
generalized-zero-shot-learning-on-awa2 | ZeroDiff | Harmonic mean: 79.5
generalized-zero-shot-learning-on-cub-200 | ZeroDiff | Harmonic mean: 81.6
generalized-zero-shot-learning-on-sun | ZeroDiff | Harmonic mean: 59.8
zero-shot-learning-on-awa2 | ZeroDiff | Average top-1 classification accuracy: 86.4
zero-shot-learning-on-cub-200-2011 | ZeroDiff | Average top-1 classification accuracy: 87.5
zero-shot-learning-on-sun-attribute | ZeroDiff | Average top-1 classification accuracy: 77.3
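
For readers unfamiliar with the harmonic mean metric above: in generalized zero-shot learning it is conventionally computed from the per-class accuracy on seen classes (S) and on unseen classes (U) as H = 2SU / (S + U). The minimal sketch below illustrates that formula. The input numbers are hypothetical and are not taken from the ZeroDiff results, which this page reports only as the final H values.

```python
def harmonic_mean(seen_acc, unseen_acc):
    """Harmonic mean H = 2*S*U / (S + U) of seen and unseen class accuracies."""
    total = seen_acc + unseen_acc
    if total == 0:
        return 0.0  # both accuracies are zero, so H is defined as zero here
    return 2.0 * seen_acc * unseen_acc / total


if __name__ == "__main__":
    # Hypothetical accuracies (in percent), chosen only to illustrate the formula.
    print(harmonic_mean(90.0, 60.0))
```

Because the harmonic mean is dominated by the smaller of the two accuracies, a model cannot score well by favoring seen classes at the expense of unseen ones.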
