A3S: Adversarial learning of semantic representations for Scene-Text Spotting

Masato Fujitake

Abstract

Scene-text spotting is the task of localizing text regions in natural scene images and recognizing the characters they contain at the same time. It has attracted much attention in recent years because of its wide range of applications. Existing research has focused mainly on improving text region detection rather than text recognition. As a result, detection accuracy has improved, but end-to-end accuracy remains insufficient. Text in natural scene images tends to be a meaningful string of characters, a word, rather than a random string. We therefore propose adversarial learning of semantic representations for scene-text spotting (A3S) to improve end-to-end accuracy, including text recognition. Rather than performing text recognition based only on existing visual features, A3S also predicts semantic features in the detected text area. Experimental results on publicly available datasets show that the proposed method achieves better accuracy than other methods.

Benchmarks

Benchmark                       Methodology   Metrics
text-spotting-on-icdar-2015     A3S           F-measure (%) - Generic Lexicon: 79.6
                                              F-measure (%) - Strong Lexicon: 84.8
                                              F-measure (%) - Weak Lexicon: 83.7
text-spotting-on-scut-ctw1500   A3S           F-measure (%) - Full Lexicon: 82.3
                                              F-measure (%) - No Lexicon: 64.4
text-spotting-on-total-text     A3S           F-measure (%) - Full Lexicon: 85.1
                                              F-measure (%) - No Lexicon: 79.4
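
The benchmark figures above are F-measures, that is, the harmonic mean of precision and recall expressed as a percentage. The following is a minimal sketch of that arithmetic only. The function name and its arguments are illustrative assumptions, and the rule for deciding when a predicted word counts as a true positive (which depends on each benchmark's evaluation protocol) is deliberately left out.

```python
def f_measure(true_positives: int, num_predictions: int, num_ground_truth: int) -> float:
    """Return the F-measure in percent from match counts.

    Hypothetical helper for illustration; how a prediction is matched to a
    ground-truth word is benchmark-specific and not modeled here.
    """
    # With no predictions or no ground truth, precision or recall is undefined;
    # this sketch reports 0.0 in that case.
    if num_predictions == 0 or num_ground_truth == 0:
        return 0.0

    precision = true_positives / num_predictions   # share of predictions that are correct
    recall = true_positives / num_ground_truth     # share of ground-truth words that were found

    if precision + recall == 0:
        return 0.0

    # Harmonic mean of precision and recall, scaled to a percentage.
    return 100.0 * 2 * precision * recall / (precision + recall)
```

Because the harmonic mean is pulled toward the lower of the two values, a spotter with strong detection but weak recognition still scores poorly end to end, since a word counts only when it is both found and read correctly.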
