HyperAIHyperAI

Command Palette

Search for a command to run...

a month ago

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Lyu Pengyuan Yao Cong Wu Wenhao Yan Shuicheng Bai Xiang

Multi-Oriented Scene Text Detection via Corner Localization and Region
  Segmentation

Abstract

Previous deep learning based state-of-the-art scene text detection methodscan be roughly classified into two categories. The first category treats scenetext as a type of general objects and follows general object detection paradigmto localize scene text by regressing the text box locations, but troubled bythe arbitrary-orientation and large aspect ratios of scene text. The second onesegments text regions directly, but mostly needs complex post processing. Inthis paper, we present a method that combines the ideas of the two types ofmethods while avoiding their shortcomings. We propose to detect scene text bylocalizing corner points of text bounding boxes and segmenting text regions inrelative positions. In inference stage, candidate boxes are generated bysampling and grouping corner points, which are further scored by segmentationmaps and suppressed by NMS. Compared with previous methods, our method canhandle long oriented text naturally and doesn't need complex post processing.The experiments on ICDAR2013, ICDAR2015, MSRA-TD500, MLT and COCO-Textdemonstrate that the proposed algorithm achieves better or comparable resultsin both accuracy and efficiency. Based on VGG16, it achieves an F-measure of84.3% on ICDAR2015 and 81.5% on MSRA-TD500.

Code Repositories

lvpengyuan/corner
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
scene-text-detection-on-icdar-2013Corner Localization (multi-scale)
F-Measure: 88%
Precision: 92
Recall: 84.4
scene-text-detection-on-icdar-2015Corner Localization (multi-scale)
F-Measure: 84.3
Precision: 89.5
Recall: 79.7
scene-text-detection-on-icdar-2017-mlt-1Corner Localization (single-scale)
F-Measure: 66.8%
Precision: 83.8
Recall: 55.6
scene-text-detection-on-icdar-2017-mlt-1Corner Localization (multi-scale)
F-Measure: 72.4%
Precision: 74.3
Recall: 70.6
scene-text-detection-on-msra-td500Corner Localization
F-Measure: 81.5
Precision: 87.6
Recall: 76.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp