4 months ago

Center and Scale Prediction: Anchor-free Approach for Pedestrian and Face Detection

Wei Liu; Irtiza Hasan; Shengcai Liao

Abstract

Object detection generally requires sliding-window classifiers in tradition or anchor box based predictions in modern deep learning approaches. However, either of these approaches requires tedious configurations in boxes. In this paper, we provide a new perspective where detecting objects is motivated as a high-level semantic feature detection task. Like edges, corners, blobs and other feature detectors, the proposed detector scans for feature points all over the image, for which the convolution is naturally suited. However, unlike these traditional low-level features, the proposed detector goes for a higher-level abstraction, that is, we are looking for central points where there are objects, and modern deep models are already capable of such a high-level semantic abstraction. Besides, like blob detection, we also predict the scales of the central points, which is also a straightforward convolution. Therefore, in this paper, pedestrian and face detection is simplified as a straightforward center and scale prediction task through convolutions. This way, the proposed method enjoys a box-free setting. Though structurally simple, it presents competitive accuracy on several challenging benchmarks, including pedestrian detection and face detection. Furthermore, a cross-dataset evaluation is performed, demonstrating a superior generalization ability of the proposed method. Code and models can be accessed at (https://github.com/liuwei16/CSP and https://github.com/hasanirtiza/Pedestron).

Code Repositories

liuwei16/CSP

Official

hasanirtiza/Pedestron

Official

pytorch

Benchmarks

Benchmark	Methodology	Metrics
pedestrian-detection-on-caltech	CSP + CityPersons dataset	Reasonable Miss Rate: 3.8
pedestrian-detection-on-caltech	CSP	Reasonable Miss Rate: 4.5
pedestrian-detection-on-citypersons	CSP (with offset) + ResNet-50	Bare MR^-2: 7.3 Heavy MR^-2: 49.3 Large MR^-2: 6.5 Medium MR^-2: 3.7 Partial MR^-2: 10.4 Reasonable MR^-2: 11.0 Small MR^-2: 16.0 Test Time: 0.33s/img

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette