HyperAIHyperAI

Image Classification On Inaturalist 2019

Metrics

Top-1 Accuracy

Results

Performance results of various models on this benchmark

Model Name
Top-1 Accuracy
Paper TitleRepository
LeViT-19270.8LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference-
CeiT-S78.9Incorporating Convolution Designs into Visual Transformers-
CeiT-S (384 finetune resolution)82.7Incorporating Convolution Designs into Visual Transformers-
ResNet50 (A2)75.0ResNet strikes back: An improved training procedure in timm-
RDNet-T (224 res, IN-1K pretrained)81.2DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs-
Conviformer-B82.85Conviformers: Convolutionally guided Vision Transformer-
MixMIM-L83.9MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers-
MAE (ViT-H, 448)88.3Masked Autoencoders Are Scalable Vision Learners-
RDNet-S (224 res, IN-1K pretrained)82.9DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs-
Hiera-H (448px)88.5Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles-
RDNet-L (224 res, IN-1K pretrained)83.7DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs-
LeViT-25672.3LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference-
LeViT-12868.4LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference-
CeiT-T72.8Incorporating Convolution Designs into Visual Transformers-
ResMLP-1271.0ResMLP: Feedforward networks for image classification with data-efficient training-
CeiT-T (384 finetune resolution)77.9Incorporating Convolution Designs into Visual Transformers-
RDNet-B (224 res, IN-1K pretrained)83.5DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs-
LeViT-38474.3LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference-
LeViT-128S66.5LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference-
CaiT-M-36 U 22481.8--
0 of 22 row(s) selected.