HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Elevating Flow-Guided Video Inpainting with Reference Generation

Suhwan Cho; Seoung Wug Oh; Sangyoun Lee; Joon-Young Lee

Elevating Flow-Guided Video Inpainting with Reference Generation

Abstract

Video inpainting (VI) is a challenging task that requires effective propagation of observable content across frames while simultaneously generating new content not present in the original video. In this study, we propose a robust and practical VI framework that leverages a large generative model for reference generation in combination with an advanced pixel propagation algorithm. Powered by a strong generative model, our method not only significantly enhances frame-level quality for object removal but also synthesizes new content in the missing areas based on user-provided text prompts. For pixel propagation, we introduce a one-shot pixel pulling method that effectively avoids error accumulation from repeated sampling while maintaining sub-pixel precision. To evaluate various VI methods in realistic scenarios, we also propose a high-quality VI benchmark, HQVI, comprising carefully generated videos using alpha matte composition. On public benchmarks and the HQVI dataset, our method demonstrates significantly higher visual quality and metric scores compared to existing solutions. Furthermore, it can process high-resolution videos exceeding 2K resolution with ease, underscoring its superiority for real-world applications.

Code Repositories

suhwan-cho/RGVI
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
video-inpainting-on-hqvi-240pRGVI w/o Ref.
LPIPS: 0.0390
PSNR: 31.60
SSIM: 0.9559
VFID: 0.1868
video-inpainting-on-hqvi-240pRGVI
LPIPS: 0.0335
PSNR: 30.66
SSIM: 0.9527
VFID: 0.1825
video-inpainting-on-hqvi-2kRGVI
LPIPS: 0.0357
PSNR: 30.10
SSIM: 0.9489
VFID: 0.0058
video-inpainting-on-hqvi-2kRGVI w/o Ref.
LPIPS: 0.0403
PSNR: 29.81
SSIM: 0.9501
VFID: 0.0101
video-inpainting-on-hqvi-480pRGVI w/o Ref.
LPIPS: 0.0403
PSNR: 31.19
SSIM: 0.9534
VFID: 0.0404
video-inpainting-on-hqvi-480pRGVI
LPIPS: 0.0342
PSNR: 30.90
SSIM: 0.9513
VFID: 0.0311

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp