Evangelos Skartados; Konstantinos Georgiadis; Mehmet Kerim Yucel; Koskinas Ioannis; Armando Domi; Anastasios Drosou; Bruno Manganelli; Albert Saa-Garriga

Abstract
Space-time memory (STM) network methods have been dominant in semi-supervised video object segmentation (SVOS) due to their remarkable performance. In this work, we identify three key aspects where we can improve such methods: i) supervisory signal, ii) pretraining and iii) spatial awareness. We then propose TrickVOS, a generic, method-agnostic bag of tricks addressing each aspect with i) a structure-aware hybrid loss, ii) a simple decoder pretraining regime and iii) a cheap tracker that imposes spatial constraints on model predictions. Finally, we propose a lightweight network and show that, when trained with TrickVOS, it achieves results competitive with state-of-the-art methods on DAVIS and YouTube benchmarks, while being one of the first STM-based SVOS methods that can run in real time on a mobile device.
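To make the idea of imposing spatial constraints on predictions concrete, here is a minimal sketch. It is not the tracker proposed in the paper, whose details the abstract does not give. It assumes one simple form of constraint: the object is expected to stay near its previous location, so foreground probability outside an expanded bounding box of the previous mask is suppressed. The function names `box_from_mask` and `constrain_prediction` and the `margin` parameter are hypothetical.

```python
import numpy as np


def box_from_mask(mask, margin):
    """Bounding box (y0, y1, x0, x1) of a binary mask, grown by `margin`
    pixels and clipped to the frame. Returns None for an empty mask."""
    ys, xs = np.nonzero(mask)
    if ys.size == 0:
        return None
    h, w = mask.shape
    y0 = max(int(ys.min()) - margin, 0)
    y1 = min(int(ys.max()) + margin + 1, h)
    x0 = max(int(xs.min()) - margin, 0)
    x1 = min(int(xs.max()) + margin + 1, w)
    return y0, y1, x0, x1


def constrain_prediction(prev_mask, prob, margin=16):
    """Zero out foreground probability outside the expanded box of the
    previous frame's mask (an assumed, illustrative constraint)."""
    box = box_from_mask(prev_mask, margin)
    if box is None:
        # Object absent in the previous frame: leave the prediction as-is.
        return prob
    y0, y1, x0, x1 = box
    out = np.zeros_like(prob)
    out[y0:y1, x0:x1] = prob[y0:y1, x0:x1]
    return out
```

A constraint of this kind costs a few array operations per frame, which is in the spirit of a "cheap" post-hoc check, although the choice of `margin` trades robustness to fast motion against suppression of distant false positives.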
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| semi-supervised-video-object-segmentation-on-18 | Lightweight TrickVOS (PT) | F-Measure (Seen): 83.3; F-Measure (Unseen): 84; Jaccard (Unseen): 75.2; J&F: 80.5; Jaccard (Seen): 79.5 |
| semi-supervised-video-object-segmentation-on-18 | STCN + TrickVOS (PT) | F-Measure (Seen): 86.4; F-Measure (Unseen): 85.5; J&F: 82.8; Jaccard (Seen): 82.1; Jaccard (Unseen): 77.2 |
| semi-supervised-video-object-segmentation-on-2 | Lightweight TrickVOS (PT) | F-measure (Mean): 86; J&F: 82.7; Jaccard (Mean): 79.4; Speed (FPS): 76.4 |
| semi-supervised-video-object-segmentation-on-2 | STCN + TrickVOS (PT) | F-measure (Mean): 89.6; J&F: 86.1; Jaccard (Mean): 82.6; Speed (FPS): 35.1 |
| semi-supervised-video-object-segmentation-on-3 | STCN + TrickVOS (PT) | Speed (FPS): 45.4 |
| visual-object-tracking-on-davis-2016 | STCN + TrickVOS (PT) | F-measure (Mean): 93.1; J&F: 91.8; Jaccard (Mean): 90.5 |
| visual-object-tracking-on-davis-2016 | Lightweight TrickVOS (PT) | F-measure (Mean): 89.9; J&F: 89.3; Jaccard (Mean): 88.7; Speed (FPS): 86.4 |
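For readers unfamiliar with the metrics, the combined J&F score in the table is consistent with a plain average of its components: the mean of Jaccard and F-measure where only means are reported, and the mean of the four seen/unseen values where those are reported. The sketch below shows the region similarity J as intersection-over-union on binary masks, plus that averaging step. The function names are hypothetical, the empty-mask convention is an assumption, and the boundary-based F-measure is not implemented here.

```python
import numpy as np


def jaccard(pred, gt):
    """Region similarity J: intersection-over-union of two binary masks."""
    pred = np.asarray(pred).astype(bool)
    gt = np.asarray(gt).astype(bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0
    return float(np.logical_and(pred, gt).sum() / union)


def overall_score(*components):
    """Combined score as the arithmetic mean of its component metrics."""
    return sum(components) / len(components)


# Reproducing two rows of the table from their reported components.
two_way = round(overall_score(79.4, 86.0), 1)               # Jaccard (Mean), F-measure (Mean)
four_way = round(overall_score(79.5, 83.3, 75.2, 84.0), 1)  # J/F on seen and unseen
print(two_way, four_way)  # 82.7 80.5
```

The same averaging reproduces the remaining J&F entries from the per-metric values listed in each row.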