Image Generation on ImageNet 512x512
Metrics
FID
Inception score
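For reference, FID (Fréchet Inception Distance) is commonly defined as the Fréchet distance between two Gaussians fitted to Inception-network features of real and generated images. The formula below is the standard definition, not something stated on this page; the symbols $\mu_r, \Sigma_r$ (mean and covariance of real-image features) and $\mu_g, \Sigma_g$ (the same for generated images) are notation assumed here. Lower FID is better, while a higher Inception score is better.

```latex
\mathrm{FID} = \lVert \mu_r - \mu_g \rVert_2^2
  + \operatorname{Tr}\!\left( \Sigma_r + \Sigma_g - 2\,(\Sigma_r \Sigma_g)^{1/2} \right)
```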
Results
Performance results of various models on this benchmark
| Model Name | FID | Inception score | Paper Title | Repository |
|---|---|---|---|---|
| DiT-XL/2 | 3.04 | 240.82 | Scalable Diffusion Models with Transformers | - |
| MAR-L, Diff Loss | 1.73 | - | Autoregressive Image Generation without Vector Quantization | - |
| SIMS | 1.73 | - | Self-Improving Diffusion Models with Synthetic Data | - |
| EDM2-S Autoguidance (XS, T/16) | 1.34 | - | Guiding a Diffusion Model with a Bad Version of Itself | - |
| SiD-EDM2-M (498M) | 2.06 | - | Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step | - |
| MAGVIT-v2 (w/o guidance) | 3.07 | 213.1 | Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation | - |
| Poly-INR | 3.81 | - | Polynomial Implicit Neural Representations For Large Diverse Datasets | - |
| SiDA-EDM2-M (498M) | 1.488 | - | Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step | - |
| PaGoDA | 1.80 | - | PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher | - |
| SiDA-EDM2-L (777M) | 1.413 | - | Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step | - |
| SiD-EDM2-XS (125M) | 3.353 | - | Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step | - |
| MaskGIT (a=0.05) | 4.46 | 342.0 | MaskGIT: Masked Generative Image Transformer | - |
| MAGVIT-v2 | 1.91 | 324.3 | Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation | - |
| ADM-G | 7.72 | 172.71 | Diffusion Models Beat GANs on Image Synthesis | - |
| SiDA-EDM2-XL (1.1B) | 1.379 | - | Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step | - |
| SiD-EDM2-S (280M) | 2.707 | - | Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step | - |
| TiTok-L-64 | 2.49 | - | An Image is Worth 32 Tokens for Reconstruction and Generation | - |
| DPC-U | 3.54 | 350.2 | Discrete Predictor-Corrector Diffusion Models for Image Synthesis | - |
| GMem | 1.71 | - | GMem: A Modular Approach for Ultra-Efficient Generative Models | - |
| Latent Diffusion (LDM-4-G) | 3.60 | 247.67 | High-Resolution Image Synthesis with Latent Diffusion Models | - |
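The rows above appear in no particular order, so a quick way to read them is to sort by FID. The sketch below does that for the rows listed above only; it assumes the usual convention that lower FID is better, and the names `rows` and `rank_by_fid` are illustrative, not part of any HyperAI tooling.

```python
# (model name, FID) pairs copied from the results listed above.
rows = [
    ("DiT-XL/2", 3.04),
    ("MAR-L, Diff Loss", 1.73),
    ("SIMS", 1.73),
    ("EDM2-S Autoguidance (XS, T/16)", 1.34),
    ("SiD-EDM2-M (498M)", 2.06),
    ("MAGVIT-v2 (w/o guidance)", 3.07),
    ("Poly-INR", 3.81),
    ("SiDA-EDM2-M (498M)", 1.488),
    ("PaGoDA", 1.80),
    ("SiDA-EDM2-L (777M)", 1.413),
    ("SiD-EDM2-XS (125M)", 3.353),
    ("MaskGIT (a=0.05)", 4.46),
    ("MAGVIT-v2", 1.91),
    ("ADM-G", 7.72),
    ("SiDA-EDM2-XL (1.1B)", 1.379),
    ("SiD-EDM2-S (280M)", 2.707),
    ("TiTok-L-64", 2.49),
    ("DPC-U", 3.54),
    ("GMem", 1.71),
    ("Latent Diffusion (LDM-4-G)", 3.60),
]


def rank_by_fid(entries):
    """Return entries sorted by ascending FID (assumed: lower is better).

    sorted() is stable, so ties keep their original relative order.
    """
    return sorted(entries, key=lambda entry: entry[1])


ranked = rank_by_fid(rows)
for position, (model, fid) in enumerate(ranked, start=1):
    print(f"{position:2d}. {model:<32} FID {fid}")
```

Inception score is left out of the ranking because most of the listed rows do not report it.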