8 months ago

Guo Chen Sen Xing Zhe Chen Yi Wang Kunchang Li Yizhuo Li Yi Liu Jiahao Wang Yin-Dong Zheng Bingkun Huang

Abstract

In this report, we present our champion solutions to five tracks at Ego4Dchallenge. We leverage our developed InternVideo, a video foundation model, forfive Ego4D tasks, including Moment Queries, Natural Language Queries, FutureHand Prediction, State Change Object Detection, and Short-term ObjectInteraction Anticipation. InternVideo-Ego4D is an effective paradigm to adaptthe strong foundation model to the downstream ego-centric video understandingtasks with simple head designs. In these five tasks, the performance ofInternVideo-Ego4D comprehensively surpasses the baseline methods and thechampions of CVPR2022, demonstrating the powerful representation ability ofInternVideo as a video foundation model. Our code will be released athttps://github.com/OpenGVLab/ego4d-eccv2022-solutions

Source PDF View Code

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

8 months ago

Video Understanding

Multimodal

Multimodal Representation

Multimodality

Computer Vision

Task/Problem

Guo Chen Sen Xing Zhe Chen Yi Wang Kunchang Li Yizhuo Li Yi Liu Jiahao Wang Yin-Dong Zheng Bingkun Huang

Abstract

Source PDF View Code

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

8 months ago

Video Understanding

Multimodal

Multimodal Representation

Multimodality

Computer Vision

Task/Problem

Guo Chen Sen Xing Zhe Chen Yi Wang Kunchang Li Yizhuo Li Yi Liu Jiahao Wang Yin-Dong Zheng Bingkun Huang

Abstract

Source PDF View Code

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges

Guo Chen Sen Xing Zhe Chen Yi Wang Kunchang Li Yizhuo Li Yi Liu Jiahao Wang Yin-Dong Zheng Bingkun Huang11 more

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges

Guo Chen Sen Xing Zhe Chen Yi Wang Kunchang Li Yizhuo Li Yi Liu Jiahao Wang Yin-Dong Zheng Bingkun Huang11 more

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges

Guo Chen Sen Xing Zhe Chen Yi Wang Kunchang Li Yizhuo Li Yi Liu Jiahao Wang Yin-Dong Zheng Bingkun Huang11 more

Abstract

Build AI with AI

HyperAI Newsletters

Guo Chen Sen Xing Zhe Chen Yi Wang Kunchang Li Yizhuo Li Yi Liu Jiahao Wang Yin-Dong Zheng Bingkun Huang

Guo Chen Sen Xing Zhe Chen Yi Wang Kunchang Li Yizhuo Li Yi Liu Jiahao Wang Yin-Dong Zheng Bingkun Huang

Guo Chen Sen Xing Zhe Chen Yi Wang Kunchang Li Yizhuo Li Yi Liu Jiahao Wang Yin-Dong Zheng Bingkun Huang