HyperAI超神经
首页
资讯
最新论文
教程
数据集
百科
SOTA
LLM 模型天梯
GPU 天梯
顶会
开源项目
全站搜索
关于
中文
HyperAI超神经
Toggle sidebar
全站搜索…
⌘
K
首页
SOTA
Image Generation
Image Generation On Imagenet 512X512
Image Generation On Imagenet 512X512
评估指标
FID
Inception score
评测结果
各个模型在此基准测试上的表现结果
Columns
模型名称
FID
Inception score
Paper Title
Repository
DiT-XL/2
3.04
240.82
Scalable Diffusion Models with Transformers
MAR-L, Diff Loss
1.73
-
Autoregressive Image Generation without Vector Quantization
SIMS
1.73
-
Self-Improving Diffusion Models with Synthetic Data
-
EDM2- S Autoguidance (XS, T /16)
1.34
-
Guiding a Diffusion Model with a Bad Version of Itself
SiD-EDM2-M (498M)
2.06
-
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step
MAGVIT-v2 (w/o guidance)
3.07
213.1
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Poly-INR
3.81
-
Polynomial Implicit Neural Representations For Large Diverse Datasets
SiDA-EDM2-M (498M)
1.488
-
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step
PaGoDA
1.80
-
PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher
SiDA-EDM2-L (777M)
1.413
-
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step
SiD-EDM2-XS (125M)
3.353
-
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step
MaskGIT (a=0.05)
4.46
342.0
MaskGIT: Masked Generative Image Transformer
MAGVIT-v2
1.91
324.3
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
ADM-G
7.72
172.71
Diffusion Models Beat GANs on Image Synthesis
SiDA-EDM2-XL (1.1B)
1.379
-
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step
SiD-EDM2-S (280M)
2.707
-
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step
TiTok-L-64
2.49
-
An Image is Worth 32 Tokens for Reconstruction and Generation
DPC-U
3.54
350.2
Discrete Predictor-Corrector Diffusion Models for Image Synthesis
-
GMem
1.71
-
Generative Modeling with Explicit Memory
Latent Diffusion (LDM-4-G)
3.60
247.67
High-Resolution Image Synthesis with Latent Diffusion Models
0 of 48 row(s) selected.
Previous
Next