Zhuohao Li; Guoyang Xie; Guannan Jiang; Zhichao Lu

Abstract
Transformers have recently emerged as the de facto model for computer vision tasks and have also been successfully applied to shadow removal. However, existing methods rely heavily on intricate modifications to the attention mechanisms within the transformer blocks while using a generic patch embedding. As a result, they often lead to complex architectural designs that require additional computation resources. In this work, we explore the efficacy of incorporating shadow information at the early processing stage. Accordingly, we propose a transformer-based framework with a novel patch embedding tailored for shadow removal, dubbed ShadowMaskFormer. Specifically, we present a simple and effective mask-augmented patch embedding that integrates shadow information and encourages the model to focus on learning shadow regions. Extensive experiments conducted on the ISTD, ISTD+, and SRD benchmark datasets demonstrate the efficacy of our method against state-of-the-art approaches while using fewer model parameters. Our implementation is available at https://github.com/lizhh268/ShadowMaskFormer.
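The abstract describes injecting shadow information at the patch-embedding stage rather than inside the attention blocks. Below is a minimal, hypothetical PyTorch sketch of such a mask-augmented patch embedding, assuming the shadow mask is simply fused with the input image before the patch projection; the class name, channel layout, and fusion strategy are illustrative assumptions, not the paper's exact design (see the official repository for details).

```python
import torch
import torch.nn as nn


class MaskAugmentedPatchEmbed(nn.Module):
    """Hypothetical mask-augmented patch embedding sketch.

    Assumption: the binary shadow mask is concatenated with the image as an
    extra channel before the patch projection, so shadow cues are available
    to every downstream transformer block. The actual ShadowMaskFormer
    embedding may differ.
    """

    def __init__(self, in_chans=3, embed_dim=96, patch_size=4):
        super().__init__()
        # +1 input channel for the binary shadow mask
        self.proj = nn.Conv2d(in_chans + 1, embed_dim,
                              kernel_size=patch_size, stride=patch_size)

    def forward(self, image, shadow_mask):
        # image:       (B, 3, H, W) shadow-affected input
        # shadow_mask: (B, 1, H, W) binary mask, 1 = shadow region
        x = torch.cat([image, shadow_mask], dim=1)   # fuse mask with pixels
        x = self.proj(x)                             # (B, embed_dim, H/p, W/p)
        return x.flatten(2).transpose(1, 2)          # (B, N, embed_dim) tokens


if __name__ == "__main__":
    embed = MaskAugmentedPatchEmbed()
    img = torch.randn(1, 3, 256, 256)
    mask = torch.randint(0, 2, (1, 1, 256, 256)).float()
    tokens = embed(img, mask)
    print(tokens.shape)  # torch.Size([1, 4096, 96])
```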
Code Repositories
https://github.com/lizhh268/ShadowMaskFormer
Benchmarks
| Benchmark | Methodology | LPIPS | PSNR | RMSE | SSIM |
|---|---|---|---|---|---|
| shadow-removal-on-istd-1 | ShadowMaskFormer (arXiv 2024) (256x256) | 0.395 | 26.57 | 3.39 | 0.698 |
| shadow-removal-on-istd-1 | ShadowMaskFormer (arXiv 2024) (512x512) | 0.211 | 28.34 | 2.95 | 0.849 |
| shadow-removal-on-srd | ShadowMaskFormer (arXiv 2024) (256x256) | 0.386 | 23.85 | 4.69 | 0.671 |
| shadow-removal-on-srd | ShadowMaskFormer (arXiv 2024) (512x512) | 0.249 | 25.13 | 4.15 | 0.798 |