HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Baseline Method for the Sport Task of MediaEval 2022 with 3D CNNs using Attention Mechanisms

Pierre-Etienne Martin

Baseline Method for the Sport Task of MediaEval 2022 with 3D CNNs using Attention Mechanisms

Abstract

This paper presents the baseline method proposed for the Sports Video task part of the MediaEval 2022 benchmark. This task proposes two subtasks: stroke classification from trimmed videos, and stroke detection from untrimmed videos. This baseline addresses both subtasks. We propose two types of 3D-CNN architectures to solve the two subtasks. Both 3D-CNNs use Spatio-temporal convolutions and attention mechanisms. The architectures and the training process are tailored to solve the addressed subtask. This baseline method is shared publicly online to help the participants in their investigation and alleviate eventually some aspects of the task such as video processing, training method, evaluation and submission routine. The baseline method reaches 86.4% of accuracy with our v2 model for the classification subtask. For the detection subtask, the baseline reaches a mAP of 0.131 and IoU of 0.515 with our v1 model.

Code Repositories

ccp-eva/sporttaskme22
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
action-classification-on-ttstroke-21STCNN-V2 (Gaussian decision)
Acc: 0.864
action-detection-on-ttstroke-21STCNN-V2 (Vote decision)
IoU: 0.515
mAP: 0.131

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp