HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

A Large-scale Varying-view RGB-D Action Dataset for Arbitrary-view Human Action Recognition

Ji Yanli ; Xu Feixiang ; Yang Yang ; Shen Fumin ; Shen Heng Tao ; Zheng Wei-Shi

A Large-scale Varying-view RGB-D Action Dataset for Arbitrary-view Human
  Action Recognition

Abstract

Current researches of action recognition mainly focus on single-view andmulti-view recognition, which can hardly satisfies the requirements ofhuman-robot interaction (HRI) applications to recognize actions from arbitraryviews. The lack of datasets also sets up barriers. To provide data forarbitrary-view action recognition, we newly collect a large-scale RGB-D actiondataset for arbitrary-view action analysis, including RGB videos, depth andskeleton sequences. The dataset includes action samples captured in 8 fixedviewpoints and varying-view sequences which covers the entire 360 degree viewangles. In total, 118 persons are invited to act 40 action categories, and25,600 video samples are collected. Our dataset involves more participants,more viewpoints and a large number of samples. More importantly, it is thefirst dataset containing the entire 360 degree varying-view sequences. Thedataset provides sufficient data for multi-view, cross-view and arbitrary-viewaction analysis. Besides, we propose a View-guided Skeleton CNN (VS-CNN) totackle the problem of arbitrary-view action recognition. Experiment resultsshow that the VS-CNN achieves superior performance.

Benchmarks

BenchmarkMethodologyMetrics
skeleton-based-action-recognition-on-varyingVS-CNN
Accuracy (AV I): 57%
Accuracy (AV II): 75%
Accuracy (CS): 76%
Accuracy (CV I): 29%
Accuracy (CV II): 71%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
A Large-scale Varying-view RGB-D Action Dataset for Arbitrary-view Human Action Recognition | Papers | HyperAI