HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation

Cho Jaemin ; Li Linjie ; Yang Zhengyuan ; Gan Zhe ; Wang Lijuan ; Bansal Mohit

Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image
  Generation

Abstract

Spatial control is a core capability in controllable image generation.Advancements in layout-guided image generation have shown promising results onin-distribution (ID) datasets with similar spatial configurations. However, itis unclear how these models perform when facing out-of-distribution (OOD)samples with arbitrary, unseen layouts. In this paper, we propose LayoutBench,a diagnostic benchmark for layout-guided image generation that examines fourcategories of spatial control skills: number, position, size, and shape. Webenchmark two recent representative layout-guided image generation methods andobserve that the good ID layout control may not generalize well to arbitrarylayouts in the wild (e.g., objects at the boundary). Next, we proposeIterInpaint, a new baseline that generates foreground and background regionsstep-by-step via inpainting, demonstrating stronger generalizability thanexisting models on OOD layouts in LayoutBench. We perform quantitative andqualitative evaluation and fine-grained analysis on the four LayoutBench skillsto pinpoint the weaknesses of existing models. We show comprehensive ablationstudies on IterInpaint, including training task ratio, crop&paste vs. repaint,and generation order. Lastly, we evaluate the zero-shot performance ofdifferent pretrained layout-guided image generation models on LayoutBench-COCO,our new benchmark for OOD layouts with real objects, where our IterInpaintconsistently outperforms SOTA baselines in all four splits. Project website:https://layoutbench.github.io

Code Repositories

j-min/LayoutBench-COCO
pytorch
Mentioned in GitHub
j-min/IterInpaint
Official
pytorch
Mentioned in GitHub

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp