HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Real-time Scene Text Detection with Differentiable Binarization

Minghui Liao Zhaoyi Wan Cong Yao Kai Chen Xiang Bai

Real-time Scene Text Detection with Differentiable Binarization

Abstract

Recently, segmentation-based methods are quite popular in scene text detection, as the segmentation results can more accurately describe scene text of various shapes such as curve text. However, the post-processing of binarization is essential for segmentation-based detection, which converts probability maps produced by a segmentation method into bounding boxes/regions of text. In this paper, we propose a module named Differentiable Binarization (DB), which can perform the binarization process in a segmentation network. Optimized along with a DB module, a segmentation network can adaptively set the thresholds for binarization, which not only simplifies the post-processing but also enhances the performance of text detection. Based on a simple segmentation network, we validate the performance improvements of DB on five benchmark datasets, which consistently achieves state-of-the-art results, in terms of both detection accuracy and speed. In particular, with a light-weight backbone, the performance improvements by DB are significant so that we can look for an ideal tradeoff between detection accuracy and efficiency. Specifically, with a backbone of ResNet-18, our detector achieves an F-measure of 82.8, running at 62 FPS, on the MSRA-TD500 dataset. Code is available at: https://github.com/MhLiao/DB

Code Repositories

jakeywu/ocr_torch
pytorch
Mentioned in GitHub
2023-MindSpore-1/ms-code-43
mindspore
Mentioned in GitHub
SURFZJY/Real-time-Text-Detection
pytorch
Mentioned in GitHub
huyhoang17/DB_text_minimal
pytorch
Mentioned in GitHub
Mushroomcat9998/DBNet
pytorch
Mentioned in GitHub
mindee/doctr
pytorch
Mentioned in GitHub
WenmuZhou/DBNet.pytorch
pytorch
Mentioned in GitHub
WenmuZhou/PytorchOCR
pytorch
Mentioned in GitHub
MhLiao/DB
Official
pytorch
Mentioned in GitHub
PaddlePaddle/PaddleOCR
paddle
Mentioned in GitHub
18520339/dbnet-tf2
tf
Mentioned in GitHub
yanan0122/dbnet-and-dbnet_pp-by-mind-spore
mindspore
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
scene-text-detection-on-icdar-2015DB-ResNet-50 (1152)
F-Measure: 87.3
Precision: 91.8
Recall: 83.2
scene-text-detection-on-msra-td500DB-ResNet-50 (736)
F-Measure: 84.9
Precision: 91.5
Recall: 79.2
scene-text-detection-on-scut-ctw1500DB-ResNet50 (1024)
F-Measure: 83.4
scene-text-detection-on-total-textDB-ResNet-50 (800)
F-Measure: 84.7%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp