3 months ago

FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation

Zhe Chen, Jiahao Wang, Wenhai Wang, Guo Chen, Enze Xie, Ping Luo, Tong Lu


Abstract

We propose an accurate and efficient scene text detection framework, termed FAST (i.e., faster arbitrarily-shaped text detector). Recent advanced text detectors rely on complicated post-processing and hand-crafted network architectures, which results in low inference speed. FAST differs from them through two new designs. (1) We design a minimalist kernel representation (with only a 1-channel output) to model text of arbitrary shape, together with a GPU-parallel post-processing step that assembles text lines with negligible time overhead. (2) We search for a network architecture tailored to text detection, which yields more powerful features than most networks searched for image classification. Benefiting from these two designs, FAST achieves an excellent trade-off between accuracy and efficiency on several challenging datasets, including Total-Text, CTW1500, ICDAR 2015, and MSRA-TD500. For example, FAST-T yields 81.6% F-measure at 152 FPS on Total-Text, outperforming the previous fastest method by 1.7 points in accuracy and 70 FPS in speed. With TensorRT optimization, the inference speed can be further accelerated to over 600 FPS. Code and models will be released at https://github.com/czczup/FAST.
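The kernel-based idea in design (1) can be illustrated with a toy example. The sketch below is not the authors' GPU implementation: it assumes that a "kernel" is a shrunken version of each text region in a single-channel binary map, that kernels are separated by connected-component labeling, and that full text regions are recovered by a max-pooling-style dilation of the label map. The function names `label_kernels` and `dilate_labels` are hypothetical, and everything runs on the CPU in plain Python for readability.

```python
from collections import deque


def label_kernels(mask):
    """Assign a distinct integer label to each 4-connected blob of 1s.

    `mask` is a list of rows of 0/1 values (an assumed 1-channel kernel map).
    Returns the label grid and the number of blobs found.
    """
    h, w = len(mask), len(mask[0])
    labels = [[0] * w for _ in range(h)]
    count = 0
    for y in range(h):
        for x in range(w):
            if mask[y][x] and labels[y][x] == 0:
                count += 1
                labels[y][x] = count
                queue = deque([(y, x)])
                while queue:  # breadth-first flood fill of one kernel
                    cy, cx = queue.popleft()
                    for ny, nx in ((cy - 1, cx), (cy + 1, cx),
                                   (cy, cx - 1), (cy, cx + 1)):
                        if (0 <= ny < h and 0 <= nx < w
                                and mask[ny][nx] and labels[ny][nx] == 0):
                            labels[ny][nx] = count
                            queue.append((ny, nx))
    return labels, count


def dilate_labels(labels, radius=1):
    """Grow every labeled kernel by taking the max label in a square window.

    This mimics a max-pooling pass with window size 2 * radius + 1.
    Where two grown regions overlap, the larger label wins (a simplification).
    """
    h, w = len(labels), len(labels[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            out[y][x] = max(
                labels[ny][nx]
                for ny in range(max(0, y - radius), min(h, y + radius + 1))
                for nx in range(max(0, x - radius), min(w, x + radius + 1))
            )
    return out


# Toy 1-channel kernel map containing two separate text kernels.
kernel_map = [
    [0, 0, 0, 0, 0, 0, 0],
    [0, 1, 0, 0, 0, 1, 0],
    [0, 1, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0, 0],
]
labels, num_kernels = label_kernels(kernel_map)
text_regions = dilate_labels(labels, radius=1)
```

Every pixel in the dilation step depends only on a fixed local window, which is why this kind of operation maps naturally onto parallel hardware. The labeling step here uses a sequential flood fill purely for brevity.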

Code Repositories

whai362/pan_pp.pytorch (PyTorch; mentioned in GitHub)
czczup/FAST (Official; PyTorch; mentioned in GitHub)

Benchmarks

| Benchmark | Method | F-Measure (%) | Precision (%) | Recall (%) | FPS |
|---|---|---|---|---|---|
| ICDAR 2015 | FAST-B-1280 | 87.1 | 89.7 | 84.6 | 15.7 |
| ICDAR 2015 | FAST-B-736 | 84.7 | 88.0 | 81.7 | 42.7 |
| ICDAR 2015 | FAST-T-736 | 81.7 | 86 | 77.9 | 60.9 |
| ICDAR 2015 | FAST-B-896 | 86.3 | 89.2 | 83.6 | 31.8 |
| ICDAR 2015 | FAST-S-736 | 82.9 | 86.3 | 79.8 | 53.9 |
| MSRA-TD500 | FAST-T-512 | 84.5 | 91.1 | 78.8 | 137.2 |
| MSRA-TD500 | FAST-B-736 | 87.3 | 92.1 | 83 | 56.8 |
| MSRA-TD500 | FAST-T-736 | 84.9 | 88.1 | 81.9 | 79.6 |
| MSRA-TD500 | FAST-S-736 | 86.4 | 91.6 | 81.7 | 72 |
| SCUT-CTW1500 | FAST-S-512 | 82 | 85.6 | 78.7 | 112.9 |
| SCUT-CTW1500 | FAST-B-640 | 84.2 | 87.8 | 80.9 | 66.5 |
| SCUT-CTW1500 | FAST-T-512 | 81.5 | 85.5 | 77.9 | 129.1 |
| SCUT-CTW1500 | FAST-B-512 | 82.9 | 85.7 | 80.2 | 92.6 |
| Total-Text | FAST-T-448 | 81.6 | 86.5 | 77.2 | 152.8 |
| Total-Text | FAST-B-512 | 85.8 | 89.6 | 82.4 | 93.2 |
| Total-Text | FAST-S-512 | 84.9 | 88.3 | 81.7 | 115.5 |
| Total-Text | FAST-B-800 | 87.5 | 90.0 | 85.2 | 46 |
| Total-Text | FAST-B-640 | 86.4 | 89.9 | 83.2 | 67.5 |
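The three accuracy metrics listed for each entry are related. Assuming the standard definition of F-measure as the harmonic mean of precision and recall (the page itself does not state the formula), the reported values can be cross-checked with a short sketch. The helper name `f_measure` is introduced here for illustration only.

```python
def f_measure(precision, recall):
    """Harmonic mean of precision and recall, both given in percent."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall copied from three of the benchmark entries.
icdar_b_1280 = round(f_measure(89.7, 84.6), 1)   # FAST-B-1280 on ICDAR 2015
total_text_t = round(f_measure(86.5, 77.2), 1)   # FAST-T-448 on Total-Text
msra_b_736 = round(f_measure(92.1, 83.0), 1)     # FAST-B-736 on MSRA-TD500
```

Rounded to one decimal place, these come out to 87.1, 81.6, and 87.3, which match the F-measure values reported for those three entries.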
