Command Palette
Search for a command to run...
Mask-aware IoU for Anchor Assignment in Real-time Instance Segmentation
Kemal Oksuz; Baris Can Cam; Fehmi Kahraman; Zeynep Sonat Baltaci; Sinan Kalkan; Emre Akbas

Abstract
This paper presents Mask-aware Intersection-over-Union (maIoU) for assigning anchor boxes as positives and negatives during training of instance segmentation methods. Unlike conventional IoU or its variants, which only considers the proximity of two boxes; maIoU consistently measures the proximity of an anchor box with not only a ground truth box but also its associated ground truth mask. Thus, additionally considering the mask, which, in fact, represents the shape of the object, maIoU enables a more accurate supervision during training. We present the effectiveness of maIoU on a state-of-the-art (SOTA) assigner, ATSS, by replacing IoU operation by our maIoU and training YOLACT, a SOTA real-time instance segmentation method. Using ATSS with maIoU consistently outperforms (i) ATSS with IoU by $\sim 1$ mask AP, (ii) baseline YOLACT with fixed IoU threshold assigner by $\sim 2$ mask AP over different image sizes and (iii) decreases the inference time by $25 \%$ owing to using less anchors. Then, exploiting this efficiency, we devise maYOLACT, a faster and $+6$ AP more accurate detector than YOLACT. Our best model achieves $37.7$ mask AP at $25$ fps on COCO test-dev establishing a new state-of-the-art for real-time instance segmentation. Code is available at https://github.com/kemaloksuz/Mask-aware-IoU
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| real-time-instance-segmentation-on-mscoco | maYOLACT-700 (ResNet-50) | AP50: 59.4 AP75: 39.9 APL: 52.5 APM: 40.8 APS: 18.1 Frame (fps): 25 (Tesla V100) mask AP: 37.7 |
| real-time-instance-segmentation-on-mscoco | maYOLACT-550 (ResNet-50) | AP50: 56.2 AP75: 37.1 APL: 51.4 APM: 38.0 APS: 14.7 Frame (fps): 30 (Tesla V100) mask AP: 35.2 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.