HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Relay Diffusion: Unifying diffusion process across resolutions for image synthesis

Jiayan Teng Wendi Zheng Ming Ding Wenyi Hong Jianqiao Wangni Zhuoyi Yang Jie Tang

Relay Diffusion: Unifying diffusion process across resolutions for image synthesis

Abstract

Diffusion models achieved great success in image synthesis, but still face challenges in high-resolution generation. Through the lens of discrete cosine transformation, we find the main reason is that \emph{the same noise level on a higher resolution results in a higher Signal-to-Noise Ratio in the frequency domain}. In this work, we present Relay Diffusion Model (RDM), which transfers a low-resolution image or noise into an equivalent high-resolution one for diffusion model via blurring diffusion and block noise. Therefore, the diffusion process can continue seamlessly in any new resolution or model without restarting from pure noise or low-resolution conditioning. RDM achieves state-of-the-art FID on CelebA-HQ and sFID on ImageNet 256$\times$256, surpassing previous works such as ADM, LDM and DiT by a large margin. All the codes and checkpoints are open-sourced at \url{https://github.com/THUDM/RelayDiffusion}.

Code Repositories

THUDM/RelayDiffusion
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-generation-on-celeba-hq-256x256RDM
FID: 3.15
Precision: 0.77
Recall: 0.55
image-generation-on-imagenet-256x256RDM
FID: 1.99

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp