Command Palette
Search for a command to run...
Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting
Zhang Qi ; Gong Yunfei ; Chen Daijie ; Chan Antoni B. ; Huang Hui

Abstract
Recent deep learning-based multi-view people detection (MVD) methods haveshown promising results on existing datasets. However, current methods aremainly trained and evaluated on small, single scenes with a limited number ofmulti-view frames and fixed camera views. As a result, these methods may not bepractical for detecting people in larger, more complex scenes with severeocclusions and camera calibration errors. This paper focuses on improvingmulti-view people detection by developing a supervised view-wise contributionweighting approach that better fuses multi-camera information under largescenes. Besides, a large synthetic dataset is adopted to enhance the model'sgeneralization ability and enable more practical evaluation and comparison. Themodel's performance on new testing scenes is further improved with a simpledomain adaptation technique. Experimental results demonstrate the effectivenessof our approach in achieving promising cross-scene multi-view people detectionperformance. See code here: https://vcc.tech/research/2024/MVD.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| multiview-detection-on-citystreet | SVCW | F1_score (2m): 76.0 MODA (2m): 55.0 MODP (2m): 70.0 Precision (2m): 81.4 Recall (2m): 71.2 |
| multiview-detection-on-cvcs | SVCW | F1_score (0.5m): / F1_score (1m): 68.4 MODA (0.5m): / MODA (1m): 46.2 MODP (1m): 78.4 Precision (1m): 81.2 Recall (1m): 59.1 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.