HyperAI超神经

Image Classification on iNaturalist 2019

Evaluation Metric

Top-1 Accuracy
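
The page names the metric without defining it. Top-1 accuracy is conventionally the share of test images whose single highest-scoring predicted class matches the ground-truth label, reported here as a percentage. The following is a minimal sketch of that computation, not the evaluation code used by any of the listed papers; the function name `top1_accuracy` and the toy scores are hypothetical and chosen only for illustration.

```python
from typing import Sequence


def top1_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int]) -> float:
    """Return the percentage of samples whose highest-scoring class equals the label.

    `scores` holds one row of per-class scores per sample (hypothetical layout);
    `labels` holds the ground-truth class index for each sample.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")

    correct = 0
    for row, label in zip(scores, labels):
        # Index of the largest score is the model's top-1 prediction.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Toy data (illustrative only): 4 samples, 3 classes.
toy_scores = [
    [0.1, 0.7, 0.2],  # predicts class 1
    [0.8, 0.1, 0.1],  # predicts class 0
    [0.3, 0.3, 0.4],  # predicts class 2
    [0.2, 0.5, 0.3],  # predicts class 1
]
toy_labels = [1, 0, 1, 1]
toy_result = top1_accuracy(toy_scores, toy_labels)
```

With the toy data above, three of the four predictions match their labels, so the function returns 75.0.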

Results

Performance of each model on this benchmark

| Model | Top-1 Accuracy (%) | Paper Title |
| --- | --- | --- |
| LeViT-192 | 70.8 | LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference |
| CeiT-S | 78.9 | Incorporating Convolution Designs into Visual Transformers |
| CeiT-S (384 finetune resolution) | 82.7 | Incorporating Convolution Designs into Visual Transformers |
| ResNet50 (A2) | 75.0 | ResNet strikes back: An improved training procedure in timm |
| RDNet-T (224 res, IN-1K pretrained) | 81.2 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs |
| Conviformer-B | 82.85 | Conviformers: Convolutionally guided Vision Transformer |
| MixMIM-L | 83.9 | MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers |
| MAE (ViT-H, 448) | 88.3 | Masked Autoencoders Are Scalable Vision Learners |
| RDNet-S (224 res, IN-1K pretrained) | 82.9 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs |
| Hiera-H (448px) | 88.5 | Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles |
| RDNet-L (224 res, IN-1K pretrained) | 83.7 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs |
| LeViT-256 | 72.3 | LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference |
| LeViT-128 | 68.4 | LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference |
| CeiT-T | 72.8 | Incorporating Convolution Designs into Visual Transformers |
| ResMLP-12 | 71.0 | ResMLP: Feedforward networks for image classification with data-efficient training |
| CeiT-T (384 finetune resolution) | 77.9 | Incorporating Convolution Designs into Visual Transformers |
| RDNet-B (224 res, IN-1K pretrained) | 83.5 | DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs |
| LeViT-384 | 74.3 | LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference |
| LeViT-128S | 66.5 | LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference |
| CaiT-M-36 U 224 | 81.8 | Going deeper with Image Transformers |