HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

A Spatio-Temporal Multilayer Perceptron for Gesture Recognition

Adrian Holzbock Alexander Tsaregorodtsev Youssef Dawoud Klaus Dietmayer Vasileios Belagiannis

A Spatio-Temporal Multilayer Perceptron for Gesture Recognition

Abstract

Gesture recognition is essential for the interaction of autonomous vehicles with humans. While the current approaches focus on combining several modalities like image features, keypoints and bone vectors, we present neural network architecture that delivers state-of-the-art results only with body skeleton input data. We propose the spatio-temporal multilayer perceptron for gesture recognition in the context of autonomous vehicles. Given 3D body poses over time, we define temporal and spatial mixing operations to extract features in both domains. Additionally, the importance of each time step is re-weighted with Squeeze-and-Excitation layers. An extensive evaluation of the TCG and Drive&Act datasets is provided to showcase the promising performance of our approach. Furthermore, we deploy our model to our autonomous vehicle to show its real-time capability and stable execution.

Code Repositories

holzbock/st_mlp
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
skeleton-based-action-recognition-on-drivestMLP
mean per-class accuracy: 34.61
skeleton-based-action-recognition-on-tcgstMLP
Acc: 85.99
F1-Score: 80.05
Jaccard Index: 67.88

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp