3 months ago

Patch-VQ: 'Patching Up' the Video Quality Problem

Zhenqiang Ying Maniratnam Mandal Deepti Ghadiyaram Alan Bovik

Abstract

No-reference (NR) perceptual video quality assessment (VQA) is a complex, unsolved, and important problem to social and streaming media applications. Efficient and accurate video quality predictors are needed to monitor and guide the processing of billions of shared, often imperfect, user-generated content (UGC). Unfortunately, current NR models are limited in their prediction capabilities on real-world, "in-the-wild" UGC video data. To advance progress on this problem, we created the largest (by far) subjective video quality dataset, containing 39, 000 realworld distorted videos and 117, 000 space-time localized video patches ('v-patches'), and 5.5M human perceptual quality annotations. Using this, we created two unique NR-VQA models: (a) a local-to-global region-based NR VQA architecture (called PVQ) that learns to predict global video quality and achieves state-of-the-art performance on 3 UGC datasets, and (b) a first-of-a-kind space-time video quality mapping engine (called PVQ Mapper) that helps localize and visualize perceptual distortions in space and time. We will make the new database and prediction models available immediately following the review process.

Code Repositories

baidut/PatchVQ

pytorch

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
video-quality-assessment-on-konvid-1k	PVQ	PLCC: 0.770
video-quality-assessment-on-live-fb-lsvq	PVQ	PLCC: 0.827
video-quality-assessment-on-live-vqc	PVQ	PLCC: 0.791

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette