HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos

Serena Yeung; Olga Russakovsky; Ning Jin; Mykhaylo Andriluka; Greg Mori; Li Fei-Fei

Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos

Abstract

Every moment counts in action recognition. A comprehensive understanding of human activity in video requires labeling every frame according to the actions occurring, placing multiple labels densely over a video sequence. To study this problem we extend the existing THUMOS dataset and introduce MultiTHUMOS, a new dataset of dense labels over unconstrained internet videos. Modeling multiple, dense labels benefits from temporal relations within and across classes. We define a novel variant of long short-term memory (LSTM) deep networks for modeling these temporal relations via multiple input and output connections. We show that this model improves action labeling accuracy and further enables deeper understanding tasks ranging from structured retrieval to action prediction.

Code Repositories

lauradhatt/Interesting-Reads
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
action-detection-on-multi-thumosTwo-stream
mAP: 27.6
action-detection-on-multi-thumosTwo-stream + LSTM
mAP: 28.1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos | Papers | HyperAI