HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

LatentKeypointGAN: Controlling Images via Latent Keypoints

Xingzhe He; Bastian Wandt; Helge Rhodin

LatentKeypointGAN: Controlling Images via Latent Keypoints

Abstract

Generative adversarial networks (GANs) have attained photo-realistic quality in image generation. However, how to best control the image content remains an open challenge. We introduce LatentKeypointGAN, a two-stage GAN which is trained end-to-end on the classical GAN objective with internal conditioning on a set of space keypoints. These keypoints have associated appearance embeddings that respectively control the position and style of the generated objects and their parts. A major difficulty that we address with suitable network architectures and training schemes is disentangling the image into spatial and appearance factors without domain knowledge and supervision signals. We demonstrate that LatentKeypointGAN provides an interpretable latent space that can be used to re-arrange the generated images by re-positioning and exchanging keypoint embeddings, such as generating portraits by combining the eyes, nose, and mouth from different images. In addition, the explicit generation of keypoints and matching images enables a new, GAN-based method for unsupervised keypoint detection.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
unsupervised-facial-landmark-detection-on-1LatentKeypointGAN
NME: 5.85

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
LatentKeypointGAN: Controlling Images via Latent Keypoints | Papers | HyperAI