Command Palette
Search for a command to run...
Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition
Yuxin Chen Ziqi Zhang Chunfeng Yuan Bing Li Ying Deng Weiming Hu

Abstract
Graph convolutional networks (GCNs) have been widely used and achieved remarkable results in skeleton-based action recognition. In GCNs, graph topology dominates feature aggregation and therefore is the key to extracting representative features. In this work, we propose a novel Channel-wise Topology Refinement Graph Convolution (CTR-GC) to dynamically learn different topologies and effectively aggregate joint features in different channels for skeleton-based action recognition. The proposed CTR-GC models channel-wise topologies through learning a shared topology as a generic prior for all channels and refining it with channel-specific correlations for each channel. Our refinement method introduces few extra parameters and significantly reduces the difficulty of modeling channel-wise topologies. Furthermore, via reformulating graph convolutions into a unified form, we find that CTR-GC relaxes strict constraints of graph convolutions, leading to stronger representation capability. Combining CTR-GC with temporal modeling modules, we develop a powerful graph convolutional network named CTR-GCN which notably outperforms state-of-the-art methods on the NTU RGB+D, NTU RGB+D 120, and NW-UCLA datasets.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| skeleton-based-action-recognition-on-n-ucla | CTR-GCN | Accuracy: 96.5 |
| skeleton-based-action-recognition-on-ntu-rgbd | CTR-GCN | Accuracy (CS): 92.4 Accuracy (CV): 96.8 Ensembled Modalities: 4 |
| skeleton-based-action-recognition-on-ntu-rgbd-1 | CTR-GCN | Accuracy (Cross-Setup): 90.6 Accuracy (Cross-Subject): 88.9 Ensembled Modalities: 4 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.