HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Shape Robust Text Detection with Progressive Scale Expansion Network

Li Xiang ; Wang Wenhai ; Hou Wenbo ; Liu Ruo-Ze ; Lu Tong ; Yang Jian

Shape Robust Text Detection with Progressive Scale Expansion Network

Abstract

The challenges of shape robust text detection lie in two aspects: 1) mostexisting quadrangular bounding box based detectors are difficult to locatetexts with arbitrary shapes, which are hard to be enclosed perfectly in arectangle; 2) most pixel-wise segmentation-based detectors may not separate thetext instances that are very close to each other. To address these problems, wepropose a novel Progressive Scale Expansion Network (PSENet), designed as asegmentation-based detector with multiple predictions for each text instance.These predictions correspond to different `kernels' produced by shrinking theoriginal text instance into various scales. Consequently, the final detectioncan be conducted through our progressive scale expansion algorithm whichgradually expands the kernels with minimal scales to the text instances withmaximal and complete shapes. Due to the fact that there are large geometricalmargins among these minimal kernels, our method is effective to distinguish theadjacent text instances and is robust to arbitrary shapes. The state-of-the-artresults on ICDAR 2015 and ICDAR 2017 MLT benchmarks further confirm the greateffectiveness of PSENet. Notably, PSENet outperforms the previous best recordby absolute 6.37\% on the curve text dataset SCUT-CTW1500. Code will beavailable in https://github.com/whai362/PSENet.

Code Repositories

DePengW/PSENet
tf
Mentioned in GitHub
A-ZHANG1/PSENet
tf
Mentioned in GitHub
li10141110/PSENet-tf2
tf
Mentioned in GitHub
SimonWang00/psenet.tf2
tf
Mentioned in GitHub
whai362/PSENet
Official
tf
Mentioned in GitHub
JiaquanYe/TableMASTER-mmocr
pytorch
Mentioned in GitHub
Mael-zys/PSENet
pytorch
Mentioned in GitHub
liuheng92/tensorflow_PSENet
tf
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
scene-text-detection-on-icdar-2015PSENet-1s
F-Measure: 87.1
Precision: 88.7
Recall: 85.5
scene-text-detection-on-icdar-2017-mlt-1PSENet-1s
F-Measure: 72.45%
Precision: 77.01
Recall: 68.4
scene-text-detection-on-scut-ctw1500PSENet-1s
F-Measure: 81.17
Precision: 82.5
Recall: 79.89

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp