8 months ago

Zijian Zhou Shikun Liu Xiao Han Haozhe Liu Kam Woh Ng Tian Xie Yuren Cong Hang Li Mengmeng Xu Juan-Manuel Pérez-Rúa

Abstract

Controllable person image generation aims to generate a person image conditioned on reference images, allowing precise control over the person's appearance or pose. However, prior methods often distort fine-grained textural details from the reference image, despite achieving high overall image quality. We attribute these distortions to inadequate attention to corresponding regions in the reference image. To address this, we propose learning flow fields in attention (Leffa), which explicitly guides the target query to attend to the correct reference key in the attention layer during training. Specifically, it is realized via a regularization loss on top of the attention map within a diffusion-based baseline. Our extensive experiments show that Leffa achieves state-of-the-art performance in controlling appearance (virtual try-on) and pose (pose transfer), significantly reducing fine-grained detail distortion while maintaining high image quality. Additionally, we show that our loss is model-agnostic and can be used to improve the performance of other diffusion models.
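To make the idea of a regularization loss on the attention map more concrete, here is a minimal NumPy sketch. It is not the authors' implementation: the abstract does not give the loss formula, so the sketch assumes that the attention map between target queries and reference keys is read as a soft correspondence (a flow field of expected reference coordinates), that the reference is warped with those attention weights, and that the warped result is compared to the target with a mean squared error. All function names, the `temperature` parameter, and the token layout are hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def coordinate_grid(h, w):
    """Return an (h*w, 2) array of (x, y) positions normalized to [-1, 1]."""
    ys, xs = np.meshgrid(np.linspace(-1, 1, h), np.linspace(-1, 1, w), indexing="ij")
    return np.stack([xs.ravel(), ys.ravel()], axis=-1)


def attention_to_flow(query, key, h, w, temperature=1.0):
    """Turn query/key tokens into an attention map and a soft flow field.

    query: (h*w, d) target tokens; key: (h*w, d) reference tokens.
    The flow is the attention-weighted average of reference coordinates,
    i.e. where each target token "looks" in the reference image.
    """
    logits = query @ key.T / np.sqrt(query.shape[-1])
    attn = softmax(logits / temperature, axis=-1)  # each row sums to 1
    flow = attn @ coordinate_grid(h, w)            # (h*w, 2)
    return attn, flow


def flow_regularization_loss(attn, reference, target):
    """Assumed loss: soft-warp the reference with attention, compare to target.

    reference, target: (h*w, c) flattened images or feature maps.
    """
    warped = attn @ reference
    return float(np.mean((warped - target) ** 2))
```

Under these assumptions, the loss is small only when each target position puts most of its attention mass on the reference position that actually contains the matching content, which is the behavior the abstract describes as attending to the correct reference key. A real implementation would more likely warp with the flow field through bilinear sampling and operate on multi-head attention inside a diffusion model; this sketch leaves both out.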



Learning Flow Fields in Attention for Controllable Person Image Generation
