HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Center and Scale Prediction: Anchor-free Approach for Pedestrian and Face Detection

Wei Liu; Irtiza Hasan; Shengcai Liao

Center and Scale Prediction: Anchor-free Approach for Pedestrian and Face Detection

Abstract

Object detection generally requires sliding-window classifiers in tradition or anchor box based predictions in modern deep learning approaches. However, either of these approaches requires tedious configurations in boxes. In this paper, we provide a new perspective where detecting objects is motivated as a high-level semantic feature detection task. Like edges, corners, blobs and other feature detectors, the proposed detector scans for feature points all over the image, for which the convolution is naturally suited. However, unlike these traditional low-level features, the proposed detector goes for a higher-level abstraction, that is, we are looking for central points where there are objects, and modern deep models are already capable of such a high-level semantic abstraction. Besides, like blob detection, we also predict the scales of the central points, which is also a straightforward convolution. Therefore, in this paper, pedestrian and face detection is simplified as a straightforward center and scale prediction task through convolutions. This way, the proposed method enjoys a box-free setting. Though structurally simple, it presents competitive accuracy on several challenging benchmarks, including pedestrian detection and face detection. Furthermore, a cross-dataset evaluation is performed, demonstrating a superior generalization ability of the proposed method. Code and models can be accessed at (https://github.com/liuwei16/CSP and https://github.com/hasanirtiza/Pedestron).

Code Repositories

liuwei16/CSP
Official
tf
hasanirtiza/Pedestron
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
pedestrian-detection-on-caltechCSP + CityPersons dataset
Reasonable Miss Rate: 3.8
pedestrian-detection-on-caltechCSP
Reasonable Miss Rate: 4.5
pedestrian-detection-on-citypersonsCSP (with offset) + ResNet-50
Bare MR^-2: 7.3
Heavy MR^-2: 49.3
Large MR^-2: 6.5
Medium MR^-2: 3.7
Partial MR^-2: 10.4
Reasonable MR^-2: 11.0
Small MR^-2: 16.0
Test Time: 0.33s/img

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp