HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Unified Generative Adversarial Networks for Controllable Image-to-Image Translation

Tang Hao ; Liu Hong ; Sebe Nicu

Unified Generative Adversarial Networks for Controllable Image-to-Image
  Translation

Abstract

We propose a unified Generative Adversarial Network (GAN) for controllableimage-to-image translation, i.e., transferring an image from a source to atarget domain guided by controllable structures. In addition to conditioning ona reference image, we show how the model can generate images conditioned oncontrollable structures, e.g., class labels, object keypoints, human skeletons,and scene semantic maps. The proposed model consists of a single generator anda discriminator taking a conditional image and the target controllablestructure as input. In this way, the conditional image can provide appearanceinformation and the controllable structure can provide the structureinformation for generating the target result. Moreover, our model learns theimage-to-image mapping through three novel losses, i.e., color loss,controllable structure guided cycle-consistency loss, and controllablestructure guided self-content preserving loss. Also, we present the Fr\'echetResNet Distance (FRD) to evaluate the quality of the generated images.Experiments on two challenging image translation tasks, i.e., handgesture-to-gesture translation and cross-view image translation, show that ourmodel generates convincing results, and significantly outperforms otherstate-of-the-art methods on both tasks. Meanwhile, the proposed framework is aunified solution, thus it can be applied to solving other controllablestructure guided image translation tasks such as landmark guided facialexpression translation and keypoint guided person image generation. To the bestof our knowledge, we are the first to make one GAN framework work on all suchcontrollable structure guided image translation tasks. Code is available athttps://github.com/Ha0Tang/GestureGAN.

Code Repositories

Ha0Tang/GestureGAN
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
cross-view-image-to-image-translation-onUniGAN
KL: 2.16
LPIPS: 0.3817
PSNR: 23.3632
SD: 16.4788
SSIM: 0.5064
cross-view-image-to-image-translation-on-1UniGAN
LPIPS: 0.4527
cross-view-image-to-image-translation-on-2UniGAN
KL: 5.17
PSNR: 22.0273
SD: 17.6542
SSIM: 0.3357
cross-view-image-to-image-translation-on-4UniGAN
KL: 2.6
PSNR: 22.8223
SD: 19.8276
SSIM: 0.5366
gesture-to-gesture-translation-on-ntu-handUniGAN
AMT: 29.3
FID: 6.7493
FRD: 1.7401
IS: 2.3783
PSNR: 32.6574
gesture-to-gesture-translation-on-senz3dUniGAN
AMT: 27.6
FID: 12.4465
FRD: 2.2104
IS: 2.2159
PSNR: 31.542

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp