Command Palette
Search for a command to run...
Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks
Zhu Jun-Yan Park Taesung Isola Phillip Efros Alexei A.

Abstract
Image-to-image translation is a class of vision and graphics problems wherethe goal is to learn the mapping between an input image and an output imageusing a training set of aligned image pairs. However, for many tasks, pairedtraining data will not be available. We present an approach for learning totranslate an image from a source domain $X$ to a target domain $Y$ in theabsence of paired examples. Our goal is to learn a mapping $G: X \rightarrow Y$such that the distribution of images from $G(X)$ is indistinguishable from thedistribution $Y$ using an adversarial loss. Because this mapping is highlyunder-constrained, we couple it with an inverse mapping $F: Y \rightarrow X$and introduce a cycle consistency loss to push $F(G(X)) \approx X$ (and viceversa). Qualitative results are presented on several tasks where pairedtraining data does not exist, including collection style transfer, objecttransfiguration, season transfer, photo enhancement, etc. Quantitativecomparisons against several prior methods demonstrate the superiority of ourapproach.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-to-image-translation-on-cityscapes | CycleGAN | Class IOU: 0.11 Per-class Accuracy: 17% Per-pixel Accuracy: 52% |
| image-to-image-translation-on-cityscapes-1 | CycleGAN | Class IOU: 0.16 Per-class Accuracy: 22% Per-pixel Accuracy: 58% |
| image-to-image-translation-on-horse2zebra | CycleGAN | Frechet Inception Distance: 89.7 Number of params: 28.2M |
| image-to-image-translation-on-photo2vangogh | CycleGAN | Frechet Inception Distance: 151.4 Number of params: 28.2M |
| image-to-image-translation-on-rafd | CycleGAN | Classification Error: 5.99% |
| image-to-image-translation-on-vangogh2photo | CycleGAN | Frechet Inception Distance: 163.4 Number of Params: 28.2M |
| image-to-image-translation-on-zebra2horse | CycleGAN | Frechet Inception Distance: 110.5 Number of params: 28.2M |
| multimodal-unsupervised-image-to-image | CycleGAN | CIS: 0.076 IS: 0.813 |
| multimodal-unsupervised-image-to-image-1 | CycleGAN | Diversity: 0.012 Quality: 40.8% |
| multimodal-unsupervised-image-to-image-2 | CycleGAN | Diversity: 0.010 Quality: 36.0% |
| multimodal-unsupervised-image-to-image-3 | cycGAN | PSNR: 17.38 |
| unsupervised-image-to-image-translation-on-1 | cycGAN | PSNR: 18.57 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.