Command Palette
Search for a command to run...
AutoSAM: Adapting SAM to Medical Images by Overloading the Prompt Encoder
Tal Shaharabany; Aviad Dahan; Raja Giryes; Lior Wolf

Abstract
The recently introduced Segment Anything Model (SAM) combines a clever architecture and large quantities of training data to obtain remarkable image segmentation capabilities. However, it fails to reproduce such results for Out-Of-Distribution (OOD) domains such as medical images. Moreover, while SAM is conditioned on either a mask or a set of points, it may be desirable to have a fully automatic solution. In this work, we replace SAM's conditioning with an encoder that operates on the same input image. By adding this encoder and without further fine-tuning SAM, we obtain state-of-the-art results on multiple medical images and video benchmarks. This new encoder is trained via gradients provided by a frozen SAM. For inspecting the knowledge within it, and providing a lightweight segmentation solution, we also learn to decode it into a mask by a shallow deconvolution network.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| video-polyp-segmentation-on-sun-seg-easy | AutoSAM | Dice: 0.753 S measure: 0.815 Sensitivity: 0.672 mean E-measure: 0.855 mean F-measure: 0.774 weighted F-measure: 0.716 |
| video-polyp-segmentation-on-sun-seg-hard | AutoSAM | Dice: 0.759 S-Measure: 0.822 Sensitivity: 0.726 mean E-measure: 0.866 mean F-measure: 0.764 weighted F-measure: 0.714 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.