Command Palette
Search for a command to run...
IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation
Lingtong Kong Boyuan Jiang Donghao Luo Wenqing Chu Xiaoming Huang Ying Tai Chengjie Wang Jie Yang

Abstract
Prevailing video frame interpolation algorithms, that generate the intermediate frames from consecutive inputs, typically rely on complex model architectures with heavy parameters or large delay, hindering them from diverse real-time applications. In this work, we devise an efficient encoder-decoder based network, termed IFRNet, for fast intermediate frame synthesizing. It first extracts pyramid features from given inputs, and then refines the bilateral intermediate flow fields together with a powerful intermediate feature until generating the desired output. The gradually refined intermediate feature can not only facilitate intermediate flow estimation, but also compensate for contextual details, making IFRNet do not need additional synthesis or refinement module. To fully release its potential, we further propose a novel task-oriented optical flow distillation loss to focus on learning the useful teacher knowledge towards frame synthesizing. Meanwhile, a new geometry consistency regularization term is imposed on the gradually refined intermediate features to keep better structure layout. Experiments on various benchmarks demonstrate the excellent performance and fast inference speed of proposed approaches. Code is available at https://github.com/ltkong218/IFRNet.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| video-frame-interpolation-on-middlebury | IFRNet | Interpolation Error: 4.216 |
| video-frame-interpolation-on-msu-video-frame | IFRNet_large | LPIPS: 0.037 MS-SSIM: 0.943 PSNR: 28.04 SSIM: 0.921 VMAF: 66.98 |
| video-frame-interpolation-on-msu-video-frame | IFRNet_small | LPIPS: 0.049 MS-SSIM: 0.931 PSNR: 27.45 SSIM: 0.908 VMAF: 63.43 |
| video-frame-interpolation-on-msu-video-frame | IFRNet_base | LPIPS: 0.048 MS-SSIM: 0.932 PSNR: 27.67 SSIM: 0.909 VMAF: 64.16 |
| video-frame-interpolation-on-ucf101-1 | IFRNet | PSNR: 35.42 SSIM: 0.9698 |
| video-frame-interpolation-on-vimeo90k | IFRNet | PSNR: 36.20 SSIM: 0.9808 Speed (ms/f): 16 (Tesla V100) |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.