Discrete Adversarial Distillation (ResNet-50) | 7.7 | Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models | - |
CAFormer-B36 (IN-21K) | 69.4 | MetaFormer Baselines for Vision | - |
FAN-Hybrid-L (IN-21K, 384) | 74.5 | Understanding The Robustness in Vision Transformers | - |
ResNet-50 (300 Epochs) | 4.2 | Deep Residual Learning for Image Recognition | - |
TransNeXt-Base (IN-1K supervised, 224) | 50.6 | TransNeXt: Robust Foveal Visual Perception for Vision Transformers | - |
CAFormer-B36 (IN-21K, 384) | 79.5 | MetaFormer Baselines for Vision | - |
CutMix+MoEx (ResNet-50) | 8.4 | On Feature Normalization and Data Augmentation | - |
ConvFormer-B36 (384) | 55.3 | MetaFormer Baselines for Vision | - |