HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Continual Spatio-Temporal Graph Convolutional Networks

Lukas Hedegaard Negar Heidari Alexandros Iosifidis

Continual Spatio-Temporal Graph Convolutional Networks

Abstract

Graph-based reasoning over skeleton data has emerged as a promising approach for human action recognition. However, the application of prior graph-based methods, which predominantly employ whole temporal sequences as their input, to the setting of online inference entails considerable computational redundancy. In this paper, we tackle this issue by reformulating the Spatio-Temporal Graph Convolutional Neural Network as a Continual Inference Network, which can perform step-by-step predictions in time without repeat frame processing. To evaluate our method, we create a continual version of ST-GCN, CoST-GCN, alongside two derived methods with different self-attention mechanisms, CoAGCN and CoS-TR. We investigate weight transfer strategies and architectural modifications for inference acceleration, and perform experiments on the NTU RGB+D 60, NTU RGB+D 120, and Kinetics Skeleton 400 datasets. Retaining similar predictive accuracy, we observe up to 109x reduction in time complexity, on-hardware accelerations of 26x, and reductions in maximum allocated memory of 52% during online inference.

Code Repositories

lukashedegaard/continual-skeletons
Official
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
skeleton-based-action-recognition-on-kineticsST-GCN (2-stream)
Accuracy: 34.4
GFLOPS per prediction: 24.09
skeleton-based-action-recognition-on-kineticsCoST-GCN* (1-stream)
Accuracy: 30.2
GFLOPS per prediction: 0.11
skeleton-based-action-recognition-on-kineticsCoAGCN* (1-stream)
Accuracy: 23.3
GFLOPS per prediction: 0.12
skeleton-based-action-recognition-on-kineticsCoST-GCN* (2-stream)
Accuracy: 32.2
GFLOPS per prediction: 0.22
skeleton-based-action-recognition-on-kineticsCoS-TR* (1-stream)
Accuracy: 27.4
GFLOPS per prediction: 0.11
skeleton-based-action-recognition-on-kineticsCoAGCN (1-stream)
Accuracy: 33
GFLOPS per prediction: 0.18
skeleton-based-action-recognition-on-kineticsCoST-GCN (1-stream)
Accuracy: 31.8
GFLOPS per prediction: 0.16
skeleton-based-action-recognition-on-kineticsCoAGCN (2-stream)
GFLOPS per prediction: 0.36
skeleton-based-action-recognition-on-kineticsCoS-TR* (2-stream)
Accuracy: 29.9
GFLOPS per prediction: 0.22
skeleton-based-action-recognition-on-kineticsCoST-GCN (2-stream)
Accuracy: 33.1
GFLOPS per prediction: 0.32
skeleton-based-action-recognition-on-kineticsCoS-TR (2-stream)
Accuracy: 32.7
GFLOPS per prediction: 0.31
skeleton-based-action-recognition-on-kineticsS-TR (1-stream)
Accuracy: 32
GFLOPS per prediction: 11.62
skeleton-based-action-recognition-on-kineticsCoS-TR (1-stream)
Accuracy: 29.7
skeleton-based-action-recognition-on-kineticsST-GCN (1-stream)
Accuracy: 33.4
GFLOPS per prediction: 12.04
skeleton-based-action-recognition-on-kineticsAGCN (2-stream)
Accuracy: 36.9
GFLOPS per prediction: 26.91
skeleton-based-action-recognition-on-kineticsCoAGCN* (2-stream)
Accuracy: 27.5
GFLOPS per prediction: 0.25
skeleton-based-action-recognition-on-kineticsAGCN (1-stream)
Accuracy: 35
GFLOPS per prediction: 13.45
skeleton-based-action-recognition-on-kineticsS-TR (2-stream)
Accuracy: 34.7
GFLOPS per prediction: 23.24
skeleton-based-action-recognition-on-ntu-rgbdCoAGCN* (2-stream)
Accuracy (CS): 86.0
Accuracy (CV): 93.1
GFLOPs per pred: 0.44
skeleton-based-action-recognition-on-ntu-rgbdCoST-GCN* (2-stream)
Accuracy (CS): 88.3
Accuracy (CV): 95
GFLOPs per pred: 0.32
skeleton-based-action-recognition-on-ntu-rgbdCoS-TR*
Accuracy (CS): 86.3
Accuracy (CV): 92.4
GFLOPs per pred: 0.15
skeleton-based-action-recognition-on-ntu-rgbdCoS-TR* (2-stream)
Accuracy (CS): 88.9
Accuracy (CV): 94.8
GFLOPs per pred: 0.3
skeleton-based-action-recognition-on-ntu-rgbdST-GCN
Accuracy (CS): 86
Accuracy (CV): 93.4
GFLOPs per pred: 16.73
skeleton-based-action-recognition-on-ntu-rgbdCoAGCN*
Accuracy (CS): 84.1
Accuracy (CV): 92.6
skeleton-based-action-recognition-on-ntu-rgbdCoST-GCN*
Accuracy (CS): 86.3
Accuracy (CV): 93.8
GFLOPs per pred: 0.16
skeleton-based-action-recognition-on-ntu-rgbd-1S-TR (1-stream)
Accuracy (Cross-Setup): 81.8
Accuracy (Cross-Subject): 80.2
GFLOPS per prediction: 16.2
skeleton-based-action-recognition-on-ntu-rgbd-1ST-GCN (1-stream)
Accuracy (Cross-Subject): 79
GFLOPS per prediction: 16.73
skeleton-based-action-recognition-on-ntu-rgbd-1AGCN (1-stream)
Accuracy (Cross-Setup): 80.7
Accuracy (Cross-Subject): 79.7
GFLOPS per prediction: 18.69
skeleton-based-action-recognition-on-ntu-rgbd-1CoS-TR* (2-stream)
Accuracy (Cross-Setup): 86.1
Accuracy (Cross-Subject): 84.8
GFLOPS per prediction: 0.3
skeleton-based-action-recognition-on-ntu-rgbd-1CoST-GCN* (1-stream)
Accuracy (Cross-Setup): 81.6
Accuracy (Cross-Subject): 79.4
GFLOPS per prediction: 0.16
skeleton-based-action-recognition-on-ntu-rgbd-1CoST-GCN* (2-stream)
Accuracy (Cross-Setup): 85.5
Accuracy (Cross-Subject): 84.0
GFLOPS per prediction: 0.32
skeleton-based-action-recognition-on-ntu-rgbd-1CoAGCN* (2-stream)
Accuracy (Cross-Setup): 82
Accuracy (Cross-Subject): 80.4
GFLOPS per prediction: 0.44
skeleton-based-action-recognition-on-ntu-rgbd-1CoS-TR* (1-stream)
Accuracy (Cross-Setup): 81.7
Accuracy (Cross-Subject): 79.7
GFLOPS per prediction: 0.15
skeleton-based-action-recognition-on-ntu-rgbd-1AGCN (2-stream)
Accuracy (Cross-Setup): 85.4
Accuracy (Cross-Subject): 84
GFLOPS per prediction: 37.38
skeleton-based-action-recognition-on-ntu-rgbd-1CoAGCN* (1-stream)
Accuracy (Cross-Setup): 79.1
Accuracy (Cross-Subject): 77.3
GFLOPS per prediction: 0.22
skeleton-based-action-recognition-on-ntu-rgbd-1ST-GCN (2-stream)
Accuracy (Cross-Setup): 85.1
Accuracy (Cross-Subject): 83.7
GFLOPS per prediction: 33.46
skeleton-based-action-recognition-on-ntu-rgbd-1S-TR (2-stream)
Accuracy (Cross-Setup): 86.2
Accuracy (Cross-Subject): 84.8
GFLOPS per prediction: 32.4

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp