HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Dual Pyramid Generative Adversarial Networks for Semantic Image Synthesis

Shijie Li Ming-Ming Cheng Juergen Gall

Dual Pyramid Generative Adversarial Networks for Semantic Image Synthesis

Abstract

The goal of semantic image synthesis is to generate photo-realistic images from semantic label maps. It is highly relevant for tasks like content generation and image editing. Current state-of-the-art approaches, however, still struggle to generate realistic objects in images at various scales. In particular, small objects tend to fade away and large objects are often generated as collages of patches. In order to address this issue, we propose a Dual Pyramid Generative Adversarial Network (DP-GAN) that learns the conditioning of spatially-adaptive normalization blocks at all scales jointly, such that scale information is bi-directionally used, and it unifies supervision at different scales. Our qualitative and quantitative results show that the proposed approach generates images where small and large objects look more realistic compared to images generated by state-of-the-art methods.

Code Repositories

sj-li/dp_gan
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
image-to-image-translation-on-ade20k-labelsDP-GAN
FID: 26.1
mIoU: 52.7
image-to-image-translation-on-ade20k-outdoorDP-GAN
FID: 45.8
mIoU: 40.4
image-to-image-translation-on-cityscapesDP-GAN
FID: 44.1
mIoU: 73.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp