HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Deep Radial Embedding for Visual Sequence Learning

{Xilin Chen Xiujuan Chai Lei Lei Xiaotao Wang Yanan Li Peiqi Jiao Yuecong Min}

Deep Radial Embedding for Visual Sequence Learning

Abstract

Connectionist Temporal Classification (CTC) is a popularobjective function in sequence recognition, which provides supervisionfor unsegmented sequence data through aligning sequence and its corresponding labeling iteratively. The blank class of CTC plays a crucialrole in the alignment process and is often considered responsible for thepeaky behavior of CTC. In this study, we propose an objective functionnamed RadialCTC that constrains sequence features on a hyperspherewhile retaining the iterative alignment mechanism of CTC. The learnedfeatures of each non-blank class are distributed on a radial arc from thecenter of the blank class, which provides a clear geometric interpretationand makes the alignment process more efficient. Besides, RadialCTC cancontrol the peaky behavior by simply modifying the logit of the blankclass. Experimental results of recognition and localization demonstratethe effectiveness of RadialCTC on two sequence recognition applications.

Benchmarks

BenchmarkMethodologyMetrics
sign-language-recognition-on-rwth-phoenixRadialCTC
Word Error Rate (WER): 20.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Deep Radial Embedding for Visual Sequence Learning | Papers | HyperAI