Pyramid Adversarial Training Improves ViT (Im21k) | 42.16 | Pyramid Adversarial Training Improves ViT Performance | |
ConvFormer-B36 | 48.9 | MetaFormer Baselines for Vision | |
CAR-FT (CLIP, ViT-L/14@336px) | 10.3 | Context-Aware Robust Fine-Tuning | - |
ConvFormer-B36 (384) | 47.8 | MetaFormer Baselines for Vision | |
FAN-Hybrid-L (IN-21K, 384) | 28.9 | Understanding The Robustness in Vision Transformers | |
ConvFormer-B36 (IN21K, 384) | 33.5 | MetaFormer Baselines for Vision | |