FOTS: Fast Oriented Text Spotting with a Unified Network

Xuebo Liu; Ding Liang; Shi Yan; Dagui Chen; Yu Qiao; Junjie Yan

Abstract

Incidental scene text spotting is considered one of the most difficult and valuable challenges in the document analysis community. Most existing methods treat text detection and recognition as separate tasks. In this work, we propose a unified, end-to-end trainable Fast Oriented Text Spotting (FOTS) network for simultaneous detection and recognition, sharing computation and visual information between the two complementary tasks. Specifically, RoIRotate is introduced to share convolutional features between detection and recognition. Thanks to this convolution-sharing strategy, FOTS adds little computational overhead compared to a baseline text detection network, and the joint training method learns more generic features, which makes our method perform better than two-stage methods. Experiments on the ICDAR 2015, ICDAR 2017 MLT, and ICDAR 2013 datasets demonstrate that the proposed method significantly outperforms state-of-the-art methods. This further allows us to develop the first real-time oriented text spotting system, which surpasses all previous state-of-the-art results by more than 5% on the ICDAR 2015 text spotting task while running at 22.6 fps.
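The abstract names RoIRotate as the step that lets recognition reuse the detector's convolutional features but does not describe it further here. As a rough illustration only, and not the authors' implementation, the sketch below resamples one oriented box from a shared feature map into an axis-aligned, fixed-height patch using bilinear interpolation. The function name `sample_oriented_region`, its parameters, the fixed output height of 8, and the pixel-center coordinate convention are all assumptions made for this example.

```python
import numpy as np


def sample_oriented_region(feat, center, size, angle, out_h=8):
    """Resample a rotated box from feat (H, W, C) into an axis-aligned patch.

    Hypothetical helper for illustration; center is (x, y) in feature-map
    pixels, size is (width, height) of the box, angle is in radians.
    """
    cx, cy = center
    w, h = size
    # Fixed output height; width follows the box aspect ratio (assumption).
    out_w = max(1, int(round(out_h * w / h)))

    # Sampling grid in box-local coordinates, centered on the origin.
    xs = (np.arange(out_w) + 0.5) / out_w * w - w / 2
    ys = (np.arange(out_h) + 0.5) / out_h * h - h / 2
    gx, gy = np.meshgrid(xs, ys)

    # Rotate the grid by `angle` and shift it to the box center.
    cos_a, sin_a = np.cos(angle), np.sin(angle)
    sx = cx + gx * cos_a - gy * sin_a
    sy = cy + gx * sin_a + gy * cos_a

    # Clamp to the feature map, then interpolate bilinearly.
    H, W = feat.shape[:2]
    sx = np.clip(sx, 0, W - 1)
    sy = np.clip(sy, 0, H - 1)
    x0 = np.floor(sx).astype(int)
    y0 = np.floor(sy).astype(int)
    x1 = np.minimum(x0 + 1, W - 1)
    y1 = np.minimum(y0 + 1, H - 1)
    fx = (sx - x0)[..., None]
    fy = (sy - y0)[..., None]
    top = feat[y0, x0] * (1 - fx) + feat[y0, x1] * fx
    bottom = feat[y1, x0] * (1 - fx) + feat[y1, x1] * fx
    return top * (1 - fy) + bottom * fy


# Usage: one shared feature map, one oriented text box.
shared_features = np.random.rand(32, 64, 4)
patch = sample_oriented_region(shared_features, center=(32, 16), size=(24, 6), angle=0.3)
print(patch.shape)
```

Because the patch is cut from features the detector has already computed, a recognition branch fed with such patches would not need its own backbone pass, which is the kind of sharing the abstract credits for the low overhead.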

Code Repositories

Pay20Y/FOTS_TF (TensorFlow; mentioned in GitHub)
Masao-Taketani/FOTS_OCR (TensorFlow; mentioned in GitHub)
ArashJavan/FOTS (TensorFlow; mentioned in GitHub)
jiangxiluning/FOTS.PyTorch (PyTorch; mentioned in GitHub)
Kaushal28/FOTS-PyTorch (PyTorch; mentioned in GitHub)

Benchmarks

| Benchmark | Methodology | F-Measure (%) | Precision (%) | Recall (%) |
| --- | --- | --- | --- | --- |
| scene-text-detection-on-icdar-2015 | FOTS | 87.99 | 91 | 85.17 |
| scene-text-detection-on-icdar-2015 | FOTS MS | 89.84 | 91.85 | 87.92 |
| scene-text-detection-on-icdar-2017-mlt-1 | FOTS MS | 70.75 | 81.86 | 62.3 |
| scene-text-detection-on-icdar-2017-mlt-1 | FOTS | 67.25 | 80.95 | 57.51 |

| Benchmark | Methodology | F-measure (%) - Generic Lexicon | F-measure (%) - Strong Lexicon | F-measure (%) - Weak Lexicon |
| --- | --- | --- | --- | --- |
| text-spotting-on-icdar-2015 | FOTS | 62.2 | 83.6 | 74.5 |
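The detection F-measures listed above are consistent with the usual definition of F-measure as the harmonic mean of precision and recall, F = 2PR / (P + R). The short check below recomputes them from the listed precision and recall values; the function name `f_measure` and the `detection_rows` structure are illustrative choices for this sketch, not part of any benchmark tooling.

```python
def f_measure(precision, recall):
    """Harmonic mean of precision and recall (both given in percent)."""
    return 2 * precision * recall / (precision + recall)


# (benchmark, method, precision %, recall %, listed F-measure %)
detection_rows = [
    ("icdar-2015", "FOTS", 91.0, 85.17, 87.99),
    ("icdar-2015", "FOTS MS", 91.85, 87.92, 89.84),
    ("icdar-2017-mlt", "FOTS MS", 81.86, 62.3, 70.75),
    ("icdar-2017-mlt", "FOTS", 80.95, 57.51, 67.25),
]

for benchmark, method, p, r, listed in detection_rows:
    computed = f_measure(p, r)
    print(f"{benchmark:15s} {method:8s} computed={computed:.2f} listed={listed:.2f}")
```

The text spotting rows report only F-measures per lexicon, so the same check cannot be applied to them.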
