HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Event-Based Video Reconstruction Using Transformer

{Zhiwei Xiong Yueyi Zhang Wenming Weng}

Event-Based Video Reconstruction Using Transformer

Abstract

Event cameras, which output events by detecting spatio-temporal brightness changes, bring a novel paradigm to image sensors with high dynamic range and low latency. Previous works have achieved impressive performances on event-based video reconstruction by introducing convolutional neural networks (CNNs). However, intrinsic locality of convolutional operations is not capable of modeling long-range dependency, which is crucial to many vision tasks. In this paper, we present a hybrid CNN-Transformer network for event-based video reconstruction (ET-Net), which merits the fine local information from CNN and global contexts from Transformer. In addition, we further propose a Token Pyramid Aggregation strategy to implement multi-scale token integration for relating internal and intersected semantic concepts in the token-space. Experimental results demonstrate that our proposed method achieves superior performance over state-of-the-art methods on multiple real-world event datasets. The code is available at https://github.com/WarranWeng/ET-Net

Benchmarks

BenchmarkMethodologyMetrics
video-reconstruction-on-event-camera-datasetET-Net
LPIPS: 0.224
Mean Squared Error: 0.047
video-reconstruction-on-mvsecET-Net
LPIPS: 0.489
Mean Squared Error: 0.107

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Event-Based Video Reconstruction Using Transformer | Papers | HyperAI