Command Palette
Search for a command to run...
BAPose: Bottom-Up Pose Estimation with Disentangled Waterfall Representations
Artacho Bruno ; Savakis Andreas

Abstract
We propose BAPose, a novel bottom-up approach that achieves state-of-the-artresults for multi-person pose estimation. Our end-to-end trainable frameworkleverages a disentangled multi-scale waterfall architecture and incorporatesadaptive convolutions to infer keypoints more precisely in crowded scenes withocclusions. The multi-scale representations, obtained by the disentangledwaterfall module in BAPose, leverage the efficiency of progressive filtering inthe cascade architecture, while maintaining multi-scale fields-of-viewcomparable to spatial pyramid configurations. Our results on the challengingCOCO and CrowdPose datasets demonstrate that BAPose is an efficient and robustframework for multi-person pose estimation, achieving significant improvementson state-of-the-art accuracy.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| multi-person-pose-estimation-on-coco | BAPose | AP: 0.727 Test AP: 71.2 Validation AP: 72.7 |
| multi-person-pose-estimation-on-crowdpose | BAPose (W32) | AP Easy: 79.9 AP Hard: 61.3 AP Medium: 73.4 mAP @0.5:0.95: 72.2 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.