Command Palette
Search for a command to run...
Bruno Artacho Andreas Savakis

Abstract
We propose OmniPose, a single-pass, end-to-end trainable framework, that achieves state-of-the-art results for multi-person pose estimation. Using a novel waterfall module, the OmniPose architecture leverages multi-scale feature representations that increase the effectiveness of backbone feature extractors, without the need for post-processing. OmniPose incorporates contextual information across scales and joint localization with Gaussian heatmap modulation at the multi-scale feature extractor to estimate human pose with state-of-the-art accuracy. The multi-scale representations, obtained by the improved waterfall module in OmniPose, leverage the efficiency of progressive filtering in the cascade architecture, while maintaining multi-scale fields-of-view comparable to spatial pyramid configurations. Our results on multiple datasets demonstrate that OmniPose, with an improved HRNet backbone and waterfall module, is a robust and efficient architecture for multi-person pose estimation that achieves state-of-the-art results.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| pose-estimation-on-coco | OmniPose (WASPv2) | AP: 79.5 AP50: 93.6 AP75: 85.9 APL: 84.6 APM: 76 AR: 81.9 |
| pose-estimation-on-coco-test-dev | OmniPose (WASPv2) | AP: 76.4 AP50: 92.6 AP75: 83.7 APL: 82.6 APM: 72.6 AR: 81.2 |
| pose-estimation-on-leeds-sports-poses | OmniPose | PCK: 99.5% |
| pose-estimation-on-mpii | OmniPose (WASPv2) | PCKh@0.2: 92.3 |
| pose-estimation-on-upenn-action | OmniPose | Mean PCK@0.2: 99.4 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.