HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network

Pengfei Wang Chengquan Zhang Fei Qi Shanshan Liu Xiaoqiang Zhang Pengyuan Lyu Junyu Han Jingtuo Liu Errui Ding Guangming Shi

PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network

Abstract

The reading of arbitrarily-shaped text has received increasing research attention. However, existing text spotters are mostly built on two-stage frameworks or character-based methods, which suffer from either Non-Maximum Suppression (NMS), Region-of-Interest (RoI) operations, or character-level annotations. In this paper, to address the above problems, we propose a novel fully convolutional Point Gathering Network (PGNet) for reading arbitrarily-shaped text in real-time. The PGNet is a single-shot text spotter, where the pixel-level character classification map is learned with proposed PG-CTC loss avoiding the usage of character-level annotations. With a PG-CTC decoder, we gather high-level character classification vectors from two-dimensional space and decode them into text symbols without NMS and RoI operations involved, which guarantees high efficiency. Additionally, reasoning the relations between each character and its neighbors, a graph refinement module (GRM) is proposed to optimize the coarse recognition and improve the end-to-end performance. Experiments prove that the proposed method achieves competitive accuracy, meanwhile significantly improving the running speed. In particular, in Total-Text, it runs at 46.7 FPS, surpassing the previous spotters with a large margin.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
scene-text-detection-on-icdar-2015PGNet-A
Accuracy: 62.3
scene-text-detection-on-icdar-2015MCLAB_FCN
F-Measure: 53.6
Precision: 70.8
Recall: 43.0
text-spotting-on-icdar-2015PGNet
F-measure (%) - Generic Lexicon: 63.5
F-measure (%) - Strong Lexicon: 83.3
F-measure (%) - Weak Lexicon: 78.3

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp