Command Palette
Search for a command to run...
Tianfan Xue; Baian Chen; Jiajun Wu; Donglai Wei; William T. Freeman

Abstract
Many video enhancement algorithms rely on optical flow to register frames in a video sequence. Precise flow estimation is however intractable; and optical flow itself is often a sub-optimal representation for particular video processing tasks. In this paper, we propose task-oriented flow (TOFlow), a motion representation learned in a self-supervised, task-specific manner. We design a neural network with a trainable motion estimation component and a video processing component, and train them jointly to learn the task-oriented flow. For evaluation, we build Vimeo-90K, a large-scale, high-quality video dataset for low-level video processing. TOFlow outperforms traditional optical flow on standard benchmarks as well as our Vimeo-90K dataset in three video processing tasks: frame interpolation, video denoising/deblocking, and video super-resolution.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| video-frame-interpolation-on-middlebury | ToFlow | Interpolation Error: 5.49 |
| video-frame-interpolation-on-vimeo90k | ToFlow | PSNR: 33.73 |
| video-super-resolution-on-vid4-4x-upscaling-1 | TOFlow | PSNR: 25.85 SSIM: 0.7659 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.