4 months ago

Rotation-Sensitive Regression for Oriented Scene Text Detection

Minghui Liao; Zhen Zhu; Baoguang Shi; Gui-song Xia; Xiang Bai

Abstract

Text in natural images appears in arbitrary orientations and must therefore be detected with oriented bounding boxes. A multi-oriented text detector typically involves two key tasks: 1) text presence detection, a classification problem that disregards text orientation; 2) oriented bounding box regression, which depends on text orientation. Previous methods rely on shared features for both tasks, which degrades performance because the two tasks are incompatible. To address this issue, we propose to perform classification and regression on features of different characteristics, extracted by two network branches of different designs. Concretely, the regression branch extracts rotation-sensitive features by actively rotating the convolutional filters, while the classification branch extracts rotation-invariant features by pooling the rotation-sensitive features. The proposed method, named Rotation-sensitive Regression Detector (RRD), achieves state-of-the-art performance on oriented scene text benchmark datasets including ICDAR 2015, MSRA-TD500, RCTW-17, and COCO-Text. Furthermore, RRD achieves a significant improvement on a ship collection dataset, demonstrating its generality for oriented object detection.
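The two-branch idea in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation: the helper names are hypothetical, a single hand-written 3x3 filter stands in for learned filters, and rotation is restricted to multiples of 90 degrees so that `np.rot90` can be used. Under those assumptions, applying rotated copies of one filter gives an orientation-indexed (rotation-sensitive) response stack, and taking the maximum over the orientation axis gives a response that no longer depends on which orientation fired.

```python
import numpy as np


def correlate2d_valid(image, kernel):
    """Plain 'valid' cross-correlation of a 2-D image with a 2-D kernel."""
    kh, kw = kernel.shape
    out_h = image.shape[0] - kh + 1
    out_w = image.shape[1] - kw + 1
    out = np.empty((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out


def rotation_sensitive_features(image, kernel, n_orient=4):
    """Stack the responses of the kernel rotated by k * 90 degrees.

    Simplifying assumption: only 90-degree steps, so n_orient should be 4.
    The result has shape (n_orient, H_out, W_out); the first axis indexes
    filter orientation, which is what makes the features rotation-sensitive.
    """
    return np.stack(
        [correlate2d_valid(image, np.rot90(kernel, k)) for k in range(n_orient)]
    )


def rotation_invariant_features(sensitive):
    """Pool (max) over the orientation axis to discard orientation."""
    return sensitive.max(axis=0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    image = rng.integers(0, 10, size=(6, 6)).astype(float)
    # Hypothetical edge-like filter; any 2-D kernel works here.
    kernel = np.array([[1, 0, -1], [2, 0, -2], [1, 0, -1]], dtype=float)

    sens = rotation_sensitive_features(image, kernel)
    sens_rot = rotation_sensitive_features(np.rot90(image), kernel)

    # Rotating the input shifts responses along the orientation axis ...
    print(np.allclose(sens_rot[1], np.rot90(sens[0])))
    # ... while the pooled map only rotates spatially with the input.
    print(np.allclose(
        rotation_invariant_features(sens_rot),
        np.rot90(rotation_invariant_features(sens)),
    ))
```

In this toy setting the orientation-indexed stack keeps the information a box regressor would need about text direction, while the pooled map is what a presence classifier could consume without being affected by that direction.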

Benchmarks

Benchmark | Methodology | Metrics
scene-text-detection-on-msra-td500 | RRD∗ | F-Measure: 79; Precision: 87; Recall: 73
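As a quick consistency check on the reported metrics, the sketch below assumes the F-Measure is the standard harmonic mean of precision and recall (F1). The page does not state the definition, so this is an assumption, and `f_measure` is a hypothetical helper name.

```python
def f_measure(precision, recall):
    """Harmonic mean of precision and recall (assumed F1 definition)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Precision and recall as listed for RRD* on MSRA-TD500.
    print(round(f_measure(87, 73)))  # prints 79
```

Under that assumption, the listed precision and recall reproduce the listed F-Measure after rounding.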
