Human Interaction Learning on 3D Skeleton Point Clouds for Video Violence Recognition

Qingyao Wu, Yukun Su, Jinhui Zhu, Guosheng Lin

Abstract

This paper introduces a new method for recognizing violent behavior by learning contextual relationships between related people from human skeleton points. Unlike previous work, we first formulate 3D skeleton point clouds from human skeleton sequences extracted from videos and then perform interaction learning on these 3D skeleton point clouds. A novel Skeleton Points Interaction Learning (SPIL) module is proposed to model the interactions between skeleton points. Specifically, by constructing a weight distribution strategy between local regional points, SPIL selectively focuses on the most relevant of those points based on their features and spatial-temporal position information. To capture diverse types of relation information, a multi-head mechanism aggregates features from independent heads so that different types of relationships between points are handled jointly. Experimental results show that our model outperforms existing networks and achieves new state-of-the-art performance on video violence datasets.
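
The two ideas in the abstract, turning skeleton sequences into a 3D point cloud and weighting neighbouring points by feature and position, can be illustrated with a small NumPy sketch. This is not the authors' SPIL implementation: the function names, the (x, y, t) point layout, the k-nearest-neighbour definition of a "local region", and the scoring rule (scaled feature dot product minus squared distance) are all assumptions chosen for illustration.

```python
import numpy as np


def build_point_cloud(skeletons):
    """Stack 2D joints from every frame and person into (x, y, t) points.

    skeletons: array of shape (T, P, J, 2) with T frames, P people, J joints.
    Returns an array of shape (T * P * J, 3). The layout is an assumption.
    """
    T, P, J, _ = skeletons.shape
    # Frame index used as the third (temporal) coordinate of every joint.
    t = np.broadcast_to(
        np.arange(T, dtype=float)[:, None, None, None], (T, P, J, 1)
    )
    return np.concatenate([skeletons, t], axis=-1).reshape(-1, 3)


def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract max for stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def local_interaction(points, feats, k=4, heads=2):
    """Hypothetical multi-head weighting over each point's k nearest points.

    points: (N, 3) positions, feats: (N, C) features, C divisible by heads.
    Returns (aggregated features (N, C), weights (heads, N, k)).
    """
    n, c = feats.shape
    assert c % heads == 0, "feature size must be divisible by heads"
    # Pairwise squared distances in (x, y, t) space.
    d2 = ((points[:, None, :] - points[None, :, :]) ** 2).sum(-1)
    nbr = np.argsort(d2, axis=1)[:, :k]            # (N, k) neighbour indices
    nbr_d2 = np.take_along_axis(d2, nbr, axis=1)   # (N, k) their distances

    outputs, weights = [], []
    for chans in np.split(np.arange(c), heads):    # one channel group per head
        q = feats[:, chans]                        # (N, C/heads)
        kf = feats[nbr][:, :, chans]               # (N, k, C/heads)
        # Assumed score: feature similarity, penalised by distance.
        score = (q[:, None, :] * kf).sum(-1) / np.sqrt(len(chans)) - nbr_d2
        w = softmax(score, axis=1)                 # weights over neighbours
        outputs.append((w[:, :, None] * kf).sum(axis=1))
        weights.append(w)
    # Heads are combined by concatenating their outputs along channels.
    return np.concatenate(outputs, axis=1), np.stack(weights)
```

Each head sees a different slice of the feature channels, so the heads can assign different weights to the same neighbours; concatenation is one simple way to aggregate them, used here only as a stand-in for whatever aggregation the paper defines.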

Benchmarks

Benchmark: activity-recognition-on-rwf-2000
Methodology: SPIL Convolution
Metrics: Accuracy: 89.3
