HyperAIHyperAI

Command Palette

Search for a command to run...

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges

Abstract

In this report, we present our champion solutions to five tracks at Ego4Dchallenge. We leverage our developed InternVideo, a video foundation model, forfive Ego4D tasks, including Moment Queries, Natural Language Queries, FutureHand Prediction, State Change Object Detection, and Short-term ObjectInteraction Anticipation. InternVideo-Ego4D is an effective paradigm to adaptthe strong foundation model to the downstream ego-centric video understandingtasks with simple head designs. In these five tasks, the performance ofInternVideo-Ego4D comprehensively surpasses the baseline methods and thechampions of CVPR2022, demonstrating the powerful representation ability ofInternVideo as a video foundation model. Our code will be released athttps://github.com/OpenGVLab/ego4d-eccv2022-solutions


Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing

HyperAI Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp